Skip to content

Commit 252ffa2

Browse files
committed
Merge branch 'refactoring' of github.com:aimclub/FEDOT.LLM into refactoring
2 parents 73e126c + ba2d7fe commit 252ffa2

1 file changed

Lines changed: 7 additions & 7 deletions

File tree

kaggle_comp/obesity_risks/output/report.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ Here’s a breakdown of the most critical sections of the code:
5151

5252
1. **Data Preprocessing**
5353
Converts the raw dataset to a clean format usable by the models.
54-
```python
54+
```python
5555
from sklearn.impute import SimpleImputer
5656
from sklearn.preprocessing import MinMaxScaler
5757

@@ -67,11 +67,11 @@ def transform_data(dataset: pd.DataFrame):
6767
features = scaler.fit_transform(features)
6868

6969
return features, target
70-
```
70+
```
7171

7272
2. **Model Training and AutoML**
7373
Automated the model training process using the **FEDOT AutoML framework** to efficiently explore high-quality predictive models.
74-
```python
74+
```python
7575
from fedot.api.main import Fedot
7676
from fedot.core.repository.tasks import Task, TaskTypesEnum
7777

@@ -80,16 +80,16 @@ def train_model(features, target):
8080
pipeline = Fedot(problem='classification', preset='best_quality', timeout=1.0, metric='accuracy')
8181
pipeline.fit(features, target)
8282
return pipeline
83-
```
83+
```
8484

8585
3. **Evaluation and Predictions**
8686
Evaluated model performance and prepared submission files.
87-
```python
87+
```python
8888
def create_submission():
8989
predictions = model.predict(test_features)
9090
submission_df = pd.DataFrame({'id': test_data['id'], 'NObeyesdad': predictions})
9191
submission_df.to_csv("submission.csv", index=False)
92-
```
92+
```
9393

9494
## 📊 Metrics
9595

@@ -105,4 +105,4 @@ This means the model correctly identifies the obesity risk category for 9 out of
105105

106106
This model effectively predicts obesity risk with **90.7% accuracy**, a promising result for public health applications. By leveraging advanced AutoML techniques and robust preprocessing, it demonstrates a scalable, efficient approach to tackle similar classification problems. While this is a synthetic competition dataset, the pipeline could easily be adapted for real-world use cases like predicting cardiovascular risk or targeting dietary interventions.
107107

108-
Modeling obesity risk is not just about prediction—it's about enabling preventive healthcare measures that could save lives. This competition shows how machine learning can make serious strides in addressing global health challenges.
108+
Modeling obesity risk is not just about prediction—it's about enabling preventive healthcare measures that could save lives. This competition shows how machine learning can make serious strides in addressing global health challenges.

0 commit comments

Comments
 (0)