Skip to content

Commit f4ebffa

Browse files
committed
Merge branch 'main' into visual_testing
2 parents fdf2431 + a9c14ad commit f4ebffa

File tree

15 files changed

+398
-8
lines changed

15 files changed

+398
-8
lines changed

docs/en/cloud/bestpractices/partitioningkey.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ sidebar_label: Choose a Low Cardinality Partitioning Key
44
title: Choose a Low Cardinality Partitioning Key
55
---
66

7-
When you send an insert statement (that should contain many rows - see [section above](#ingest-data-in-bulk)) to a table in ClickHouse Cloud, and that
7+
When you send an insert statement (that should contain many rows - see [section above](/docs/en/optimize/bulk-inserts)) to a table in ClickHouse Cloud, and that
88
table is not using a [partitioning key](/docs/en/engines/table-engines/mergetree-family/custom-partitioning-key.md) then all row data from that insert is written into a new part on storage:
99

1010
![compression block diagram](images/partitioning-01.png)

docs/en/cloud/manage/scaling.md

+2
Original file line numberDiff line numberDiff line change
@@ -86,6 +86,8 @@ In the **Settings** page, you can also choose whether or not to allow automatic
8686

8787
:::note
8888
In certain special cases, for instance when a service has a high number of parts, the service will not be idled automatically.
89+
90+
The service may enter an idle state where it suspends refreshes of [refreshable materialized views](/docs/en/materialized-view/refreshable-materialized-view), consumption from [S3Queue](/docs/en/engines/table-engines/integrations/s3queue), and scheduling of new merges. Existing merge operations will complete before the service transitions to the idle state. To ensure continuous operation of refreshable materialized views and S3Queue consumption, disable the idle state functionality.
8991
:::
9092

9193
:::danger When not to use automatic idling

docs/en/cloud/security/cloud-authentication.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -116,7 +116,7 @@ ClickHouse Cloud also supports security assertion markup language (SAML) single
116116
Use the SHA256_hash method when [creating user accounts](/docs/en/sql-reference/statements/create/user.md) to secure passwords.
117117

118118
**TIP:** Since users with less than administrative privileges cannot set their own password, ask the user to hash their password using a generator
119-
such as [this one](https://tools.keycdn.com/sha256-online-generator) before providing it to the admin to setup the account. Passwords should follow the [requirements](#establish-strong-passwords) listed above.
119+
such as [this one](https://tools.keycdn.com/sha256-online-generator) before providing it to the admin to setup the account. Passwords should follow the [requirements](#password-settings) listed above.
120120

121121
```
122122
CREATE USER userName IDENTIFIED WITH sha256_hash BY 'hash';

docs/en/cloud/security/gcp-private-service-connect.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -108,7 +108,7 @@ In the Google Cloud console, navigate to **Network services -> Private Service C
108108
Open the Private Service Connect creation dialog by clicking on the **Connect Endpoint** button.
109109

110110
- **Target**: use **Published service**
111-
- **Target service**: use **endpointServiceId** from [Obtain GCP service attachment for Private Service Connect](#obtain-gcp-service-attachment-for-private-service-connect) step.
111+
- **Target service**: use **endpointServiceId** from [Obtain GCP service attachment for Private Service Connect](#obtain-gcp-service-attachment-and-dns-name-for-private-service-connect) step.
112112
- **Endpoint name**: set a name for the PSC **Endpoint name**.
113113
- **Network/Subnetwork/IP address**: Choose the network you want to use for the connection. You will need to create an IP address or use an existing one for the Private Service Connect endpoint. In our example, we precreated an address with the name **your-ip-address** and assigned IP address `10.128.0.2`
114114
- To make the endpoint available from any region, you can enable the **Enable global access** checkbox.

docs/en/integrations/data-ingestion/clickpipes/object-storage.md

+5
Original file line numberDiff line numberDiff line change
@@ -145,6 +145,11 @@ Service Accounts for GCS aren't directly supported. HMAC (IAM) Credentials must
145145
The Service Account permissions attached to the HMAC credentials should be `storage.objects.list` and `storage.objects.get`.
146146

147147
## F.A.Q.
148+
148149
- **Does ClickPipes support GCS buckets prefixed with `gs://`?**
149150

150151
No. For interoprability reasons we ask you to replace your `gs://` bucket prefix with `https://storage.googleapis.com/`.
152+
153+
- **What permissions does a GCS public bucket require?**
154+
155+
`allUsers` requires appropriate role assignment. The `roles/storage.objectViewer` role must be granted at the bucket level. This role provides the `storage.objects.list` permission, which allows ClickPipes to list all objects in the bucket which is required for onboarding and ingestion. This role also includes the `storage.objects.get` permission, which is required to read or download individual objects in the bucket. See: [Google Cloud Access Control](https://cloud.google.com/storage/docs/access-control/iam-roles) for further information.

docs/en/integrations/data-ingestion/clickpipes/postgres/faq.md

+41-1
Original file line numberDiff line numberDiff line change
@@ -63,4 +63,44 @@ Please refer to the [ClickPipes for Postgres: Schema Changes Propagation Support
6363

6464
### What are the costs for ClickPipes for Postgres CDC?
6565

66-
During the preview, ClickPipes is free of cost. Post-GA, pricing is still to be determined. The goal is to make the pricing reasonable and highly competitive compared to external ETL tools.
66+
During the preview, ClickPipes is free of cost. Post-GA, pricing is still to be determined. The goal is to make the pricing reasonable and highly competitive compared to external ETL tools.
67+
68+
### My replication slot size is growing or not decreasing; what might be the issue?
69+
70+
If you're noticing that the size of your Postgres replication slot keeps increasing or isn’t coming back down, it usually means that **WAL (Write-Ahead Log) records aren’t being consumed (or “replayed”) quickly enough** by your CDC pipeline or replication process. Below are the most common causes and how you can address them.
71+
72+
1. **Sudden Spikes in Database Activity**
73+
- Large batch updates, bulk inserts, or significant schema changes can quickly generate a lot of WAL data.
74+
- The replication slot will hold these WAL records until they are consumed, causing a temporary spike in size.
75+
76+
2. **Long-Running Transactions**
77+
- An open transaction forces Postgres to keep all WAL segments generated since the transaction began, which can dramatically increase slot size.
78+
- Set `statement_timeout` and `idle_in_transaction_session_timeout` to reasonable values to prevent transactions from staying open indefinitely:
79+
```sql
80+
SELECT
81+
pid,
82+
state,
83+
age(now(), xact_start) AS transaction_duration,
84+
query AS current_query
85+
FROM
86+
pg_stat_activity
87+
WHERE
88+
xact_start IS NOT NULL
89+
ORDER BY
90+
age(now(), xact_start) DESC;
91+
```
92+
Use this query to identify unusually long-running transactions.
93+
94+
3. **Maintenance or Utility Operations (e.g., `pg_repack`)**
95+
- Tools like `pg_repack` can rewrite entire tables, generating large amounts of WAL data in a short time.
96+
- Schedule these operations during slower traffic periods or monitor your WAL usage closely while they run.
97+
98+
4. **VACUUM and VACUUM ANALYZE**
99+
- Although necessary for database health, these operations can create extra WAL traffic—especially if they scan large tables.
100+
- Consider using autovacuum tuning parameters or scheduling manual VACUUMs during off-peak hours.
101+
102+
5. **Replication Consumer Not Actively Reading the Slot**
103+
- If your CDC pipeline (e.g., ClickPipes) or another replication consumer stops, pauses, or crashes, WAL data will accumulate in the slot.
104+
- Ensure your pipeline is continuously running and check logs for connectivity or authentication errors.
105+
106+
For an excellent deep dive into this topic, check out our blog post: [Overcoming Pitfalls of Postgres Logical Decoding](https://blog.peerdb.io/overcoming-pitfalls-of-postgres-logical-decoding#heading-beware-of-replication-slot-growth-how-to-monitor-it).
Loading
Loading
Loading
Loading
Loading

0 commit comments

Comments
 (0)