π Passionate Data Scientist & ML Engineer with hands-on experience in Time Series Forecasting, Deep Learning, and Real-time Data Systems.
I love building end-to-end intelligent data pipelines β from ingestion to model deployment β using GCP, Python, and open-source frameworks.
π― Current Focus Areas:
- Building real-time streaming pipelines with Apache Airflow, Kafka, Spark, and Cassandra (Dockerized on GCP).
- Developing a Flask-based ERP System integrated with Google Sheets & Drive APIs for smart hub management and analytics.
βοΈ Cloud Skills:
- GCP Data Services: BigQuery, Dataflow, Dataproc, Pub/Sub, Cloud Composer (Airflow), Cloud Storage
- MLOps & AI Tools: Vertex AI, AI Platform Pipelines, and Cloud Run for model deployment
- Infrastructure: Docker, Cloud Build, IAM, VPC networking
π Learning Journey:
Exploring MLOps, Vector Databases, and Lightweight LLMs for Synthetic Data Generation.
π« Reach me: anuragmukati09@gmail.com
β‘ Fun Fact: I code with empathy β because great tech should make life easier for others π
- βοΈ GCP Data Pipeline Automation β Airflow + Dataflow + BigQuery workflow for automated data ETL and ML preprocessing.
- π Flask ERP System β Lightweight ERP connected with Google Sheets/Drive for delivery hub management.
- βοΈ Real-time Streaming Analytics β Kafka + Spark + Cassandra with Docker on GCP for high-throughput processing.
- π§ Synthetic Data Generator β Lightweight LLM-powered SQL β CSV data generation engine using Faker library.
