Hi, has anyone successfully run ClickHouse? I’ve enabled platform-plugin-aspects, but it’s consuming too much memory. I’m running it on a Kubernetes setup with a dedicated 32GB node for ClickHouse.
Here are my user_config.xml settings:
We run ClickHouse often, in different configurations for different customers. Without knowing the specifics of your instance, in terms of the traffic it receives, it is difficult to judge, but I can share some experiences we have had.
First of all, we mostly install ClickHouse outside of the Kubernetes cluster, on a separate server (or a separate set of servers if we are running it as a cluster). The key limitation for ClickHouse is disk speed. Make sure you are reaching somewhere in the order of megabytes per second when running randomized synchronous inserts.
Running a command such as time dd if=/dev/zero of=/tmp/.wlog bs=4k count=50000 oflag=dsync status=progress && rm -f /tmp/.wlog should give you roughly 1.5 MB/s or better. Sometimes all the abstraction layers from k8s or from network storage cut significantly into that throughput.
Also, it looks like you are constraining the performance of the database in many dimensions; this might be creating more issues than it solves.
Hi @Felipe thanks for the detailed insights - especially around disk and avoiding over-constraining the setup.
We did some additional testing on our side and noticed something concerning around memory usage with the query below. I updated my pod limit to 9 GB and max_memory_usage to 5 GB. FYI, I'm using aspects version 3.0.3.
For example:
SELECT * FROM reporting.fact_pageview_engagement LIMIT 1;
This query processed ~217K rows (~42 MB), but:
Peak memory usage: ~4.96 GiB
Eventually failed with:
MEMORY_LIMIT_EXCEEDED (limit set to 5 GiB)
Failure occurred during JoiningTransform
This seems unusually high for such a small dataset, especially given that:
The overall pod memory limit is 9 GiB
So I’m trying to understand:
Why would such a query trigger ~5 GB of memory usage?
Could this be related to how joins are executed in ClickHouse (e.g., full in-memory joins)?
Is aspects generating queries or schemas that are particularly memory-intensive?
Are there recommended settings to control join memory usage more aggressively?
Also, your point about Kubernetes overhead and storage is interesting - we haven't yet benchmarked disk throughput inside the pod, so we'll look into it next.
We’ll also take a look at using Vector as the ingestion pipeline as suggested.
Any guidance on the join-related memory behavior (or tuning for aspects workloads specifically) would be really helpful.
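In case it's useful for comparing runs, per-query peak memory can be pulled from system.query_log — a sketch using the standard query_log columns (adjust the time window to taste):

```sql
-- Recent finished queries ranked by peak memory usage
SELECT
    event_time,
    formatReadableSize(memory_usage) AS peak_mem,
    read_rows,
    substring(query, 1, 80) AS query_head
FROM system.query_log
WHERE type = 'QueryFinish'
  AND event_time > now() - INTERVAL 1 HOUR
ORDER BY memory_usage DESC
LIMIT 10;
```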
In your example, fact_pageview_engagement is a view within ClickHouse, so when you run that select query, ClickHouse is actually querying fact_navigation_completion which is also a view selecting from navigation_events… We try to balance views vs tables so both inserts (when events are created) and selects (for the dashboards) are as efficient as possible.
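You can see what the view actually expands to (and why a LIMIT 1 can still read ~217K rows) with EXPLAIN — a quick sketch:

```sql
-- Show the rewritten query after view expansion:
EXPLAIN SYNTAX SELECT * FROM reporting.fact_pageview_engagement LIMIT 1;

-- Or the execution pipeline, which should include the JoiningTransform
-- step from your error message:
EXPLAIN PIPELINE SELECT * FROM reporting.fact_pageview_engagement LIMIT 1;
```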
We try to make sure the dashboards load in a reasonable amount of time with a large amount of data - are you seeing issues in Superset?
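On the join-memory question specifically: these are not Aspects-specific recommendations, just generic ClickHouse knobs worth experimenting with per session. The setting names are standard ClickHouse, but availability varies by version (grace_hash needs 22.12+), so treat this as a sketch:

```sql
-- Per-session knobs for constraining join memory:
SET join_algorithm = 'grace_hash';    -- spills join buckets to disk instead of
                                      -- building one big in-memory hash table
SET max_bytes_in_join = 1000000000;   -- ~1 GB cap on the right-hand side of a join
SET join_overflow_mode = 'break';     -- return a partial result instead of
                                      -- throwing when the cap above is hit
SET max_memory_usage = 5000000000;    -- hard per-query cap (~5 GB)
```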