-
Set up the Python environment
First, create and activate a Python virtual environment using Conda (Python 3.11 is recommended), then install the dependencies:
conda create -n cr python=3.11 conda activate cr pip install -r requirements.txt
-
Run the evaluation script
Execute the following command to start evaluation:
./run_cyclicreflex.sh
-
--model_choicesupports:qwen1.5b: DeepSeek-R1-Distill-Qwen-1.5Bqwen7b: DeepSeek-R1-Distill-Qwen-7Bllama8b: DeepSeek-R1-Distill-Llama-8B
-
--datasetssupports:math500aime2024aime2025amc2023
-
-
Evaluation results will be stored under:
results/evaluation/cyclicreflex. You can also monitor detailed logging via wandb.
Base
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
parent directory.. | ||||