This chapter demonstrates how to create, manage, and query Apache Iceberg tables in the AWS ecosystem using AWS services and open source tools, including DuckDB, Daft, and the Iceberg Java API. It covers the following topics:
- AWS Glue Data Catalog - Catalog configuration and management
- Amazon Athena - Querying Iceberg tables with Athena
- Amazon S3 Tables - Using managed Iceberg tables
- Amazon Redshift - Redshift integration patterns
- AWS Lambda with PyIceberg - Serverless Iceberg operations
- Apache Spark - Spark integration and examples
- Apache Flink - Flink integration and examples
- Trino on EMR - Distributed queries with Trino
- DuckDB Integration - Local analytics with DuckDB
- Daft Integration - Distributed DataFrames with Daft
- S3 Tables Java Iceberg Implementation - Complete Java Iceberg API example for S3 Tables
- PyIceberg - See AWS Lambda with PyIceberg