FaultAlarm_RL

Official code for the paper "Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy".

Installation

Create Conda Environment

conda create -n faultAlarm_RL python=3.8
source activate faultAlarm_RL
pip install torch==1.7.1+cu110 -f https://download.pytorch.org/whl/torch_stable.html

Install requirements

git clone https://github.com/DMIRLAB-Group/FaultAlarm_RL.git
cd ./FaultAlarm_RL
pip install -r requirements.txt

Install FaultAlarm RL Environment

cd ./FaultAlarm_RL  # skip this if you are already in the repository root
pip install -e .

This will install the custom faultAlarm environment so it can be accessed via standard Python imports.
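As a quick sanity check after the editable install, the following sketch reports whether the packages used in the usage example can be found by the current interpreter. The helper name `is_importable` is introduced here for illustration only and is not part of the repository.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


# Report on the two packages that the usage example imports.
for name in ("gym", "faultAlarm"):
    status = "found" if is_importable(name) else "MISSING"
    print(f"{name}: {status}")
```

If `faultAlarm` is reported as missing, re-run `pip install -e .` from the repository root inside the activated conda environment.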

Usage

Below is a simple example demonstrating how to interact with the faultAlarm environment via gym:

Usage for FaultAlarm RL Environment

import gym
import faultAlarm  # imported so that 'FaultAlarm-v0' is available to gym.make

env = gym.make('FaultAlarm-v0')
print("state space dimension : ", env.observation_space.shape[0])
print("action space dimension : ", env.action_space.n)

# Run one episode with uniformly random actions.
state = env.reset()
while True:
    action = env.action_space.sample()
    next_state, reward, done, info = env.step(action)
    state = next_state
    if done:
        break
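The loop above discards the reward and runs until the environment reports `done`. A small variation, sketched below, accumulates the episode return and caps the number of steps. So that it runs without the package installed, the sketch uses `ToyEnv`, a hypothetical stand-in that only mimics the `reset()` / `step()` shape used above; it is not the FaultAlarm environment, and `run_episode` is likewise an illustrative helper rather than repository code.

```python
class ToyEnv:
    """Hypothetical stand-in that follows the 4-tuple step() convention."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return [0.0]

    def step(self, action):
        # Every step costs -1; the episode ends after `horizon` steps.
        self.t += 1
        done = self.t >= self.horizon
        return [float(self.t)], -1.0, done, {}


def run_episode(env, choose_action, max_steps=100):
    """Roll out one episode and return (total_reward, steps_taken)."""
    state = env.reset()
    total_reward, steps = 0.0, 0
    for _ in range(max_steps):
        action = choose_action(state)
        state, reward, done, _info = env.step(action)
        total_reward += reward
        steps += 1
        if done:
            break
    return total_reward, steps


total, steps = run_episode(ToyEnv(horizon=5), lambda state: 0)
print(total, steps)
```

With the real environment, the same helper could presumably be called as `run_episode(env, lambda s: env.action_space.sample())`, assuming `env.step` returns the same four values as in the example above.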

Run

Logging

This codebase includes TensorBoard logging. To visualize training logs, run the following command (this assumes you used the default log_dir):

tensorboard --logdir=logs_tensorboard --port 6008

RL method

An example of training a standard PPO agent on the FaultAlarm environment:

python train_ppo.py --max_episodes 2000 --max_ep_len 100

Where:

--max_episodes: Maximum number of training episodes.
--max_ep_len: Maximum length of each episode.
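To make the two flags concrete, here is a minimal `argparse` sketch of how such options might be declared. The actual parser in train_ppo.py is not shown in this README, so the defaults below simply copy the values from the example command and are assumptions, as are the names `build_parser` and `step_budget`.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the two flags from the example command (defaults are assumed)."""
    parser = argparse.ArgumentParser(description="Illustrative PPO training flags")
    parser.add_argument("--max_episodes", type=int, default=2000,
                        help="Maximum number of training episodes.")
    parser.add_argument("--max_ep_len", type=int, default=100,
                        help="Maximum length of each episode.")
    return parser


def step_budget(args: argparse.Namespace) -> int:
    """Upper bound on environment steps: every episode runs to its full length."""
    return args.max_episodes * args.max_ep_len


args = build_parser().parse_args(["--max_episodes", "2000", "--max_ep_len", "100"])
print(step_budget(args))  # 200000
```

The product is only an upper bound: episodes that finish early use fewer steps than `--max_ep_len` allows.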

CausalRL method

An example of training a Causal PPO agent on the FaultAlarm environment:

python train_c_ppo.py --subset_size 8 --random_g 0 --reg_parm 0.005

Where:

--subset_size: Specifies the size of the causal action subsets.
--random_g: Determines whether to initialize the agent with a random causal graph (1 for random, 0 for predefined).
--reg_parm: Regularization parameter for causal pruning during causal structure learning.
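This README does not describe how the causal action subsets enter the policy. One generic way to restrict a discrete-action policy to a subset of actions is to mask its logits before the softmax, so that actions outside the subset get zero probability. The sketch below shows only that generic idea; `masked_softmax` and the example subset are hypothetical and are not taken from c_ppo.py.

```python
import math


def masked_softmax(logits, allowed):
    """Softmax over `logits` with zero probability outside the `allowed` indices."""
    allowed = set(allowed)
    if not allowed:
        raise ValueError("allowed must contain at least one action index")
    # Subtract the largest allowed logit for numerical stability.
    peak = max(logits[i] for i in allowed)
    weights = [math.exp(x - peak) if i in allowed else 0.0
               for i, x in enumerate(logits)]
    total = sum(weights)
    return [w / total for w in weights]


# Four actions, of which only actions 1 and 3 are in the (made-up) subset.
probs = masked_softmax([1.0, 2.0, 3.0, 4.0], allowed=[1, 3])
print(probs)
```

Under this reading, a smaller subset concentrates the probability mass on fewer actions, which is one plausible motivation for exposing a flag such as `--subset_size`.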

Code Structure

The main components of the causal reinforcement learning framework are organized as follows:

└── CausalReinforcementLearning
    ├── causal_learner.py
    │   ├── THP: Learns the initial causal structure.
    │   └── CausalLearner: Handles online causal structure learning, including orientation and pruning stages.
    └── c_ppo.py
        └── Implements the Causal PPO algorithm.
