Skip to content

Latest commit

 

History

History
29 lines (19 loc) · 684 Bytes

File metadata and controls

29 lines (19 loc) · 684 Bytes

Playlist Data Pipeline

This project builds a small data pipeline that fetches playlist metadata from a public JSON dataset, performs transformations using Pandas, and stores cleaned data locally.

Dataset Source

https://raw.githubusercontent.com/rushi4git/spotify-playlist-data/refs/heads/main/spotify_playlist.json

Features

  • Fetch playlist data from external JSON
  • Convert JSON to Pandas DataFrame
  • Perform transformations
  • Save raw and processed data as CSV
  • Generate summary report

How to Run

Install dependencies:

pip install -r requirements.txt

Run pipeline:

python main.py

Output Files

playlist_raw.csv
playlist_transformed.csv
summary_report.txt