A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
-
Updated
Aug 5, 2021 - Scala
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Design/Implement stream/batch architecture on NYC taxi data | #DE
Java Application, uses Apache Spark, handles batch as well as streaming processing
Various data stream/batch process demo with Apache Scala Spark 🚀
Build an end to end data application with Yelp review dataset. (data collect -> DB config -> data ETL -> data dashboard (analysis/ML)
The Road Monitoring System is a real-time software application that utilizes big data technologies to monitor and analyze vehicular data on Tunisian roads. It provides insights into vehicle locations, movements, and real-time analytics. The system offers an effective solution for monitoring cars conditions and detecting potential issues promptly.
Learning batch processing with Pyspark Interface for Apache Spark
Batch Data Processing Pipeline using MinIO, Spark, PostgreSQL, Great Expectations, DBT and Dbeaver
Add a description, image, and links to the spark-batch topic page so that developers can more easily learn about it.
To associate your repository with the spark-batch topic, visit your repo's landing page and select "manage topics."