This repository provides a modular, extensible framework for data preparation, fine-tuning, evaluation, and compression of large language models (LLMs).
- Modular Python package: `llm_training`
- Unified CLI: `llm-train` for data prep, training, and evaluation
- Configuration via JSON/YAML files with OmegaConf
- Support for tensorized model compression (Llama, Mixtral)
- Integrated WandB logging for experiment tracking
- Unit tests with pytest
- MIT licensed with community guidelines
- Install dependencies:

  ```bash
  pip install -e .
  ```

- Run data preparation:

  ```bash
  llm-train data-prep --config configs/data_prep_config.json
  ```

- Fine-tune a model:

  ```bash
  llm-train train --config configs/sft_config.json
  ```

- Evaluate a model:

  ```bash
  llm-train eval --config configs/evaluate_config.json
  ```
For model compression, see the scripts in `scripts/compression/`.
All workflows are configured via JSON or YAML files using OmegaConf. See `configs/` for examples. The config files specify model paths, hyperparameters, dataset paths, and training arguments.
- `data_prep_config.json` - Dataset preparation configuration
- `sft_config.json` - Supervised fine-tuning parameters
- `evaluate_config.json` - Evaluation benchmark settings
- `accelerate_config.yaml` - Multi-GPU training with Accelerate
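A supervised fine-tuning config might look roughly like the following. This is a hypothetical sketch: every key shown is an assumption about a typical SFT setup, not the exact schema in `configs/sft_config.json`.

```json
{
  "model_name_or_path": "meta-llama/Llama-2-7b-hf",
  "dataset_path": "data/sft_dataset.jsonl",
  "output_dir": "outputs/sft-run",
  "learning_rate": 2e-5,
  "num_train_epochs": 3,
  "per_device_train_batch_size": 4,
  "report_to": "wandb"
}
```

Consult the actual files in `configs/` for the supported keys.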
- `llm_training/` - Main Python package with core functionality
- `scripts/` - Standalone scripts for data prep and compression workflows
- `configs/` - Configuration files for different workflows
- `tests/` - Unit tests
Run all tests with:
```bash
pytest
```

Build and run in a reproducible environment:

```bash
docker build -t llm_training .
docker run -it llm_training
```

See CONTRIBUTING.md for guidelines. All contributions and issues are welcome!
See CHANGELOG.md for release history.
If you find this work useful, please cite it as follows:
```bibtex
@misc{sarkar2024llmtraining,
  author       = {Abhijoy Sarkar},
  title        = {LLM Training: A Modular Framework for Fine-tuning Large Language Models},
  year         = {2024},
  publisher    = {GitHub},
  journal      = {GitHub Repository},
  howpublished = {\url{https://github.com/acebot712/llm_training}},
}
```