In the [previous section](/use-cases/data-lake/getting-started/connecting-catalogs), you connected ClickHouse to a data catalog and queried open table formats directly. While querying data in place is convenient, open table formats are not optimized for the low-latency, high-concurrency workloads that power dashboards and operational reporting. For these use cases, loading data into ClickHouse's [MergeTree](/engines/table-engines/mergetree-family/mergetree) engine delivers dramatically better performance.
MergeTree offers several advantages over reading open table formats directly:
1 row in set. Elapsed: 1.265 sec.
```
## Query over the data lake table {#query-lakehouse}
Let's run a query that filters logs by thread name and instance type, searches the message text for errors, and groups results by logger:
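A sketch of such a query is shown below. The literal filter values are placeholders for illustration; adapt them to the values you want to search for:

```sql
-- Illustrative sketch: filter by thread name and instance type,
-- search message text for errors, and group by logger.
-- 'scheduler' and 'm5.xlarge' are placeholder filter values.
SELECT
    logger,
    count() AS error_count
FROM unity.`icebench.single_day_log`
WHERE thread_name = 'scheduler'
  AND instance_type = 'm5.xlarge'
  AND message ILIKE '%error%'
GROUP BY logger
ORDER BY error_count DESC
```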
### Insert data from the catalog {#insert-data}
Use `INSERT INTO SELECT` to load the ~300 million rows from the data lake table into our ClickHouse table:
```sql
INSERT INTO single_day_log SELECT * FROM icebench.`icebench.single_day_log`
```
In the previous guides, you queried open table formats in place and loaded data into MergeTree for fast analytics. In many architectures, data also needs to flow in the other direction - from ClickHouse back into open table formats. Two common scenarios drive this:
- **Offloading to long-term storage** - Data arrives in ClickHouse as a real-time analytics layer, powering dashboards and operational reporting. Once the data ages beyond its real-time window, it can be written out to Iceberg in object storage for durable, cost-effective retention in an interoperable format.
- **Reverse ETL** - Transformations, aggregations, and enrichment performed inside ClickHouse produce derived datasets that downstream tools and other teams need to consume. Writing these results to Iceberg tables makes them available across the broader data ecosystem.
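The offloading scenario could be sketched roughly as follows. The target table `logs_archive` and the 90-day cutoff are hypothetical, and Iceberg write support may be experimental or unavailable depending on your ClickHouse version:

```sql
-- Hypothetical offload: copy aged rows from a MergeTree table into an
-- Iceberg-backed table (logs_archive) that is assumed to already exist.
-- The 90-day retention window is illustrative.
INSERT INTO logs_archive
SELECT *
FROM single_day_log
WHERE event_time < now() - INTERVAL 90 DAY
```

After verifying the copy, the aged rows can be dropped from the MergeTree table to complete the offload.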
ClickHouse integrates with open table formats, including [Apache Iceberg](/engines/table-engines/integrations/iceberg), [Delta Lake](/engines/table-engines/integrations/deltalake), [Apache Hudi](/engines/table-engines/integrations/hudi), and [Apache Paimon](/sql-reference/table-functions/paimon). This allows users to connect ClickHouse to data already stored in these formats across object storage, combining the analytical power of ClickHouse with their existing data lake infrastructure.
## Why use ClickHouse with open table formats? {#why-clickhouse-uses-lake-formats}
### Query existing data in place {#querying-data-in-place}
ClickHouse can query open table formats directly in object storage without duplicating data. Organizations standardized on Iceberg, Delta Lake, Hudi, or Paimon can point ClickHouse at existing tables and immediately use its SQL dialect, analytical functions, and efficient native Parquet reader. At the same time, tools like [clickhouse-local](/operations/utilities/clickhouse-local) and [chDB](/chdb) enable exploratory, ad hoc analysis across more than 70 file formats in remote storage, allowing users to interactively explore data lake datasets with no infrastructure setup.
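As a minimal example of this kind of ad hoc exploration, a query such as the following could be passed to `clickhouse-local` to count rows across Parquet files in a bucket (the bucket path is a placeholder):

```sql
-- Run with: clickhouse-local --query "<this query>"
-- The bucket path is a placeholder; NOSIGN skips credentials
-- for a publicly readable bucket.
SELECT count()
FROM s3('https://mybucket.s3.amazonaws.com/logs/*.parquet', NOSIGN, 'Parquet')
```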
Users can achieve this either with direct reading, using [table functions and table engines](/use-cases/data-lake/getting-started/querying-directly), or by [connecting to a data catalog](/use-cases/data-lake/getting-started/connecting-catalogs).
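For direct reading, a sketch using the `icebergS3` table function might look like this; the bucket URL and credentials are placeholders:

```sql
-- Placeholder path and credentials; adjust for your bucket.
SELECT count()
FROM icebergS3('https://mybucket.s3.amazonaws.com/warehouse/logs/',
               '<access_key_id>', '<secret_access_key>')
```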
This page provides comprehensive support matrices for ClickHouse's data lake integrations. It covers the features available for each open table format, the catalogs ClickHouse can connect to, and the capabilities supported by each catalog.
## Open table format support {#format-support}
ClickHouse integrates with four open table formats: [Apache Iceberg](/engines/table-engines/integrations/iceberg), [Delta Lake](/engines/table-engines/integrations/deltalake), [Apache Hudi](/engines/table-engines/integrations/hudi), and [Apache Paimon](/sql-reference/table-functions/paimon). Select a format below to view its support matrix.