Skip to content

AbdallaRabeaMabed/Databricks_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

17 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Databricks Data Lakehouse Project

This repository contains a complete, real-world Data Lakehouse implementation built on Databricks.

๐Ÿ—๏ธ Architecture

This project follows the Medallion Architecture:

๐Ÿฅ‰ Bronze Layer

  • Raw data ingestion
  • Schema inference and storage as Delta tables

๐Ÿฅˆ Silver Layer

  • Data cleaning and standardization
  • Type casting and validation

๐Ÿฅ‡ Gold Layer

  • Dimensional Data Model (Business Transformation)
  • Ready for BI and analysis

๐Ÿ› ๏ธ Technologies Used

  • Databricks
  • Apache Spark
  • PySpark
  • Spark SQL
  • Delta Lake
  • Unity Catalog

Pipeline

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages