The model uses the 'healthcare-dataset-stroke-data.csv' file, which includes the following features:
- id
- gender
- age
- hypertension
- heart_disease
- ever_married
- work_type
- Residence_type
- avg_glucose_level
- bmi
- smoking_status
- stroke
- Data Preprocessing:
- Removal of unnecessary columns
- Handling missing data
- Standardization of numerical features
- Clustering:
- K-means clustering is applied to identify patterns in the data
- The optimal number of clusters is determined using the elbow method
- Analysis:
- Visualization of clusters
- Interpretation of results in the context of stroke prediction
[Briefly describe the key findings and insights from your model. Include any relevant visualizations or metrics.]
Contributions to this project are welcome. Please follow these steps:
- Fork the repository
- Create a new branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License.