Add Mixtral-8x22B-v0.1 model support #286

VinitP1102 · 2024-09-24T10:23:49Z

Summary

This PR introduces the integration of the Mixtral-8x22B model into the codebase. Specifically, the following changes have been made:

Added the Mixtral model in mixtral.py to support the 8x22B architecture.
Introduced a new engine for the Mixtral model in mixtral_engine.py
Updated __init__.py files in the models and engines directories to register the new Mixtral model.
Modified generation_config.yaml to include parameters for the Mixtral model's generation tasks.
Updated finetuning_config.yaml to configure Mixtral model-specific parameters for finetuning.
Updated documentation, including README.md and supported_models.md, to reflect the addition of the Mixtral-8x22B model, with its identifier key as "mixtral".

Checklist

Tested the integration of the Mixtral model with both generation and finetuning tasks.
Updated documentation files to reflect the changes.

Additional Information

A new example script, mixtral.py, has been added in the examples directory. This script demonstrates how to use the Mixtral model with xTuring and provides instructions for testing the model.
These changes enable the Mixtral-8x22B model's functionality for both generation and finetuning tasks within the xTuring project. The model's parameters have been incorporated into the respective configuration files.

Added information checks and typo fixes for README.md

Added information about VRAM to main README.md

Added licence information to main README.md

Fixed licence line in README.md

Added important installation step

Added fixes for information, removed docker dependancy

int4 docker

…ription int4 description

Added batch size control to docker

Changed comment to be more informative

int4 docker

Changed README.md both for main and INT4 example

…nges docs: refined readme changes for INT4

Run Docker container INT4 from the shell

Co-authored-by: Sarthak Langde <[email protected]>

…_int4 feat: run Docker container INT4 from the shell

changed README.md

…ription docs: add new feature highlighted section

removed new feature in README.md

…ription docs: change casing on README.md

feat: add INT4 demo support

Explained why we're getting the numbers we're getting and what the optimal configuratio could be

Adding int4 finetuning pipeline

fix: added explanatory paragraph and more benchmarks

docs: int4 readme changes

release 0.1.8

Docs Back

Fixed docs links and naming

Docs update

Signed-off-by: yiliu30 <[email protected]>

Integrate ITREX to support int8 model on the CPU-only devices

Signed-off-by: yiliu30 <[email protected]>

Update the CPU inference doc

Dev

feat: add PR template

…rver fixed a minor typo in heading

adding import

Modified README.md with better formatting and replaced link in the getting started example

docs: update README.md

- load via MambaForCausalLM - upgrade Transformers - add mamba to yamls

Add Mamba to available LLMs

- Added Mixtral engine to support the model. - Added example of Mixtral model. - Edited `README.md` to display the latest added model. - Edited config files for Mixtral model.

StochasticRomanAgeev and others added 30 commits April 6, 2023 23:36

fix: information check

626be64

Added information checks and typo fixes for README.md

feat: information about vram

f28390b

Added information about VRAM to main README.md

feat: added licence info

59fd949

Added licence information to main README.md

fix: licence line

ea3ff6e

Fixed licence line in README.md

fix: more installation info

7625f9e

Added important installation step

fix: typos and info fix

5cc00fa

Added fixes for information, removed docker dependancy

Merge pull request stochasticai#114 from stochasticai/roman/int4_docker

0fa6934

int4 docker

Merge pull request stochasticai#113 from stochasticai/roman/int4_desc…

26669eb

…ription int4 description

updated INT4 readme about LLaMA access right

a5e1c32

small changes to int4 readme

ded0a97

feat: batch size

b76aca6

Added batch size control to docker

fix: more informative name

7990e7f

Changed comment to be more informative

Merge pull request stochasticai#117 from stochasticai/roman/int4_docker

2a80a09

int4 docker

docs: refined readme changes for INT4

101f54b

Changed README.md both for main and INT4 example

Merge pull request stochasticai#118 from stochasticai/int4/readme_cha…

ffe530e

…nges docs: refined readme changes for INT4

feat: run Docker container INT4 from the shell

9b6285f

Run Docker container INT4 from the shell

Update examples/int4_finetuning/LLaMA_lora_int4.ipynb

85f9df2

Co-authored-by: Sarthak Langde <[email protected]>

Update examples/int4_finetuning/LLaMA_lora_int4.ipynb

030657c

Co-authored-by: Sarthak Langde <[email protected]>

Merge pull request stochasticai#119 from stochasticai/marcos/notebook…

56f46a3

…_int4 feat: run Docker container INT4 from the shell

added changed in README.md

e423ded

docs: merged upstream dev

c09b4b1

changed README.md

Merge pull request stochasticai#120 from stochasticai/roman/int4_desc…

b52fb03

…ription docs: add new feature highlighted section

docs: removed duplicate section

aad9bed

removed new feature in README.md

docs: fixed casing

a096de5

removed new feature in README.md

Merge pull request stochasticai#121 from stochasticai/roman/int4_desc…

4779732

…ription docs: change casing on README.md

Merge pull request stochasticai#122 from stochasticai/dev

75f89fb

feat: add INT4 demo support

feat: added explanatory paragraph and more benchmarks

8d78ab8

Explained why we're getting the numbers we're getting and what the optimal configuratio could be

feat: int4 finetuning

d57b18b

Adding int4 finetuning pipeline

Merge pull request stochasticai#126 from stochasticai/john/int4

62520d1

fix: added explanatory paragraph and more benchmarks

Merge pull request stochasticai#130 from stochasticai/dev

b52fd4e

docs: int4 readme changes

MarcosRiveraMartinez and others added 28 commits September 6, 2023 20:23

Merge pull request stochasticai#254 from stochasticai/dev

fbeea1a

release 0.1.8

Add Semgrep CI

7d64132

Merge pull request stochasticai#258 from stochasticai/main

ba3b365

Docs Back

fix: doc links and naming

688307f

Fixed docs links and naming

Merge pull request stochasticai#259 from stochasticai/dev

55eda97

Docs update

add itrex for cpu

442a67d

Signed-off-by: yiliu30 <[email protected]>

add ut

c53c79b

Signed-off-by: yiliu30 <[email protected]>

revert change for qlora

6d563f7

Signed-off-by: yiliu30 <[email protected]>

add more log

fbc3558

Signed-off-by: yiliu30 <[email protected]>

remove comments

a9dbb28

Merge pull request stochasticai#268 from yiliu30/itrex_woq

54d1ec3

Integrate ITREX to support int8 model on the CPU-only devices

fixed a minor typo in heading

18952c2

update docs

26cd3c8

Signed-off-by: yiliu30 <[email protected]>

update docs

671b324

Signed-off-by: yiliu30 <[email protected]>

fix typo

9129ef3

Signed-off-by: yiliu30 <[email protected]>

add sample code

7b4ff6e

Signed-off-by: yiliu30 <[email protected]>

Merge pull request stochasticai#271 from yiliu30/cpu_infer_doc

a733f36

Update the CPU inference doc

Merge pull request stochasticai#272 from stochasticai/dev

5736d16

Dev

Update supported_models.md

5ee479c

feat: add PR template

a197334

Merge pull request stochasticai#275 from stochasticai/marcos/pr_template

d47f9f5

feat: add PR template

Merge pull request stochasticai#270 from shashankshet/fix/typo_api_se…

35368f2

…rver fixed a minor typo in heading

Merge pull request stochasticai#274 from xiaoranzhou/patch-2

afc00ba

adding import

docs: update README.md

9e7cd2c

Modified README.md with better formatting and replaced link in the getting started example

Merge pull request stochasticai#280 from stochasticai/glenn/mixtral

6a0c18d

docs: update README.md

mamba model and engine test

dabb5d1

- load via MambaForCausalLM - upgrade Transformers - add mamba to yamls

Merge pull request stochasticai#284 from mapmeld/mamba

570a0d6

Add Mamba to available LLMs

feat(model): add Mixtral-8x22B-v0.1 support

4bed468

- Added Mixtral engine to support the model. - Added example of Mixtral model. - Edited `README.md` to display the latest added model. - Edited config files for Mixtral model.

glennko closed this Sep 28, 2025

glennko force-pushed the main branch from 2fee13b to dd8e0dc Compare September 28, 2025 17:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Mixtral-8x22B-v0.1 model support #286

Add Mixtral-8x22B-v0.1 model support #286

Uh oh!

VinitP1102 commented Sep 24, 2024

Uh oh!

Uh oh!

Add Mixtral-8x22B-v0.1 model support #286

Add Mixtral-8x22B-v0.1 model support #286

Uh oh!

Conversation

VinitP1102 commented Sep 24, 2024

Summary

Checklist

Additional Information

Uh oh!

Uh oh!