Sarajevo Flats Price Analysis

This project collects real estate data from online listings for Sarajevo apartments and uses regression analysis to predict prices from features such as size, location, and other available attributes.

Project Structure

sarajevo-flats/
│
├── README.md               # Project description and instructions
├── requirements.txt        # Python dependencies
├── .gitignore              # Files to ignore in Git
├── data/                   # Folder for scraped or cleaned datasets
│   └── sarajevo_flats.csv  # Scraped dataset
├── src/                    # Python scripts
│   ├── scrape.py           # Web scraping script using BeautifulSoup
│   ├── clean_data.py       # Optional: cleaning/preprocessing data
│   └── regression.py       # Regression model (train/test)
└── notebooks/              # Optional Jupyter notebooks for exploration
    └── exploration.ipynb

Installation

Clone the repository:

git clone https://github.com/EmreArapcicUevak/EE418-Introduction-to-Machine-Learning-Project
cd EE418-Introduction-to-Machine-Learning-Project

Create a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Usage

1. Scrape data

Run the scraping script to collect apartment listings:

python src/scrape.py

This will save the dataset as data/sarajevo_flats.csv.
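The extraction step can be sketched as follows. This is a minimal illustration, not the contents of src/scrape.py: the project lists BeautifulSoup as its parser, while this sketch swaps in the standard-library html.parser so it runs without dependencies or network access. The markup in SAMPLE_HTML, the class names (listing, title, price, size), and the names ListingParser and parse_listings are all hypothetical, since the README does not describe the listing site's page structure.

```python
from html.parser import HTMLParser

# Hypothetical markup: the real listing site's structure is not given in this README.
SAMPLE_HTML = """
<div class="listing">
  <span class="title">Two-room flat, Centar</span>
  <span class="price">250.000 KM</span>
  <span class="size">65 m2</span>
</div>
<div class="listing">
  <span class="title">Studio, Novo Sarajevo</span>
  <span class="price">120.000 KM</span>
  <span class="size">30 m2</span>
</div>
"""

FIELDS = {"title", "price", "size"}  # assumed class names of the fields to keep


class ListingParser(HTMLParser):
    """Collect one dict per <div class="listing"> block."""

    def __init__(self):
        super().__init__()
        self.rows = []       # listings seen so far
        self._field = None   # field name of the <span> currently being read

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "div" and css_class == "listing":
            self.rows.append({})  # start a new listing
        elif tag == "span" and css_class in FIELDS and self.rows:
            self._field = css_class

    def handle_data(self, data):
        # Text may arrive in several chunks, so append rather than overwrite.
        if self._field is not None:
            row = self.rows[-1]
            row[self._field] = row.get(self._field, "") + data

    def handle_endtag(self, tag):
        if tag == "span" and self._field is not None:
            row = self.rows[-1]
            row[self._field] = row.get(self._field, "").strip()
            self._field = None


def parse_listings(html):
    """Return a list of dicts with the raw text of each listing's fields."""
    parser = ListingParser()
    parser.feed(html)
    parser.close()
    return parser.rows


rows = parse_listings(SAMPLE_HTML)
for row in rows:
    print(row)
```

In a real run the HTML would come from an HTTP request to the listing site, and the rows would be written to a CSV file. Values are kept as raw strings here so that formatting can be handled in a separate cleaning step.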

2. Clean / preprocess data (optional)

python src/clean_data.py

This script formats prices and sizes and handles missing values.
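A minimal sketch of what such cleaning might look like, using only the standard library. It is an illustration under stated assumptions, not the logic of src/clean_data.py: the raw formats ("250.000 KM", "65,5 m2"), the convention that "." separates thousands and "," marks decimals, and the helper names parse_number and clean_rows are all assumed here. Dropping incomplete rows is just one simple way to handle missing values.

```python
import re


def parse_number(text):
    """Parse '250.000 KM' or '65,5 m2' into a float; return None if no number is found.

    Assumes '.' is a thousands separator and ',' a decimal separator.
    """
    if text is None:
        return None
    match = re.search(r"\d[\d.]*(?:,\d+)?", text)
    if match is None:
        return None
    return float(match.group().replace(".", "").replace(",", "."))


def clean_rows(rows):
    """Convert price and size to floats and drop rows where either is missing."""
    cleaned = []
    for row in rows:
        price = parse_number(row.get("price"))
        size = parse_number(row.get("size"))
        if price is None or size is None:
            continue  # missing or unparseable value: skip this listing
        cleaned.append({**row, "price": price, "size": size})
    return cleaned


# Made-up rows for illustration, not real listings.
raw_rows = [
    {"title": "Flat A", "price": "250.000 KM", "size": "65 m2"},
    {"title": "Flat B", "price": "Po dogovoru", "size": "48 m2"},  # price on request
    {"title": "Flat C", "price": "1.250.000,50 KM", "size": "120,5 m2"},
    {"title": "Flat D", "price": "99.000 KM"},  # size missing entirely
]

cleaned_rows = clean_rows(raw_rows)
for row in cleaned_rows:
    print(row)
```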

3. Train regression model

python src/regression.py

This will train a regression model to predict apartment prices and display performance metrics.
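The train/test workflow can be sketched in plain Python as below. This is not the model in src/regression.py, which presumably relies on scikit-learn per the dependency list. It is a hand-written one-feature least-squares fit (price against size) so that the example runs without dependencies. The data is synthetic and noise-free, generated from price = 3000 * size + 20000, and the names fit_line and r_squared are hypothetical helpers.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(ys, preds):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Synthetic (size in m2, price) pairs, not real listings.
data = [(size, 3000 * size + 20000) for size in range(30, 130, 5)]

random.Random(0).shuffle(data)      # fixed seed so the split is repeatable
split = len(data) * 4 // 5          # 80% train, 20% test
train_set, test_set = data[:split], data[split:]

# Fit on the training set only.
slope, intercept = fit_line([s for s, _ in train_set], [p for _, p in train_set])

# Evaluate on the held-out test set.
test_prices = [p for _, p in test_set]
test_preds = [slope * s + intercept for s, _ in test_set]
test_r2 = r_squared(test_prices, test_preds)
test_mae = sum(abs(p - q) for p, q in zip(test_prices, test_preds)) / len(test_set)

print(f"slope={slope:.1f} intercept={intercept:.1f} R2={test_r2:.3f} MAE={test_mae:.1f}")
```

Because the synthetic data has no noise, a correct fit recovers the generating line almost exactly. Real listings would give a lower R² and a nonzero error, and would typically use more features than size alone.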

Dependencies

Python 3.8+
BeautifulSoup4
Requests
Pandas
Scikit-learn
Lxml (parser for BeautifulSoup)

Install via:

pip install -r requirements.txt

Notes

Scraped data is intended for educational and research purposes only.
Web page structure may change; scraping scripts may need updates accordingly.
CSV files are listed in .gitignore so that large datasets are not committed to Git.
