A scalable asynchronous reinforcement learning implementation with in-flight weight updates. Designed to maximize GPU utilization while staying as on-policy as possible.
Clone the repository and change directory into pipelinerl:
git clone git@github.com:ServiceNow/PipelineRL.git
cd pipelinerl
Create the environments with dependencies.
conda create -n pipeline-rl -y python=3.11
conda run --no-capture-output -n pipeline-rl pip install torch==2.5.1 --index-url https://download.pytorch.org/whl/cu121
conda run --no-capture-output -n pipeline-rl pip install -r requirements.txt --no-build-isolation
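Optionally, you can sanity-check that the CUDA build of PyTorch installed correctly (this is just a quick verification, not part of the required setup):
conda run --no-capture-output -n pipeline-rl python -c "import torch; print(torch.__version__, torch.cuda.is_available())"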
By default, Pipeline-RL uses the file system as the medium for streaming generated data to the trainer processes. This works on a single node, but the files can get quite large. To use Redis instead, install the Redis server in the same conda environment:
conda install redis-server==7.4.0 -c conda-forge
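You can optionally confirm that the Redis server binary is available in the environment:
conda run --no-capture-output -n pipeline-rl redis-server --version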
First, activate the conda environment:
conda activate pipeline-rl
Single node with 8 H100 GPUs:
python -m pipelinerl.launch output_dir=results/base1
If you only have 4 H100 GPUs:
python -m pipelinerl.launch --config-name base_4gpu output_dir=results/base1
To use Redis instead of the filesystem for data streaming:
python -m pipelinerl.launch streams=redis output_dir=results/base1
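Hydra-style overrides can be combined. For example, assuming the two options above compose, a 4-GPU run that streams through Redis could look like this (the output directory name is just an example):
python -m pipelinerl.launch --config-name base_4gpu streams=redis output_dir=results/base1_redis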
Multi-node: coming soon.