Toolkit to robustify CBRN MCQA benchmarks: consensus/shortcut detection, verified cloze variants, statistical bias battery; deterministic and fail‑graceful.
Practical toolkit to evaluate and improve robustness of AI models on CBRN (Chemical, Biological, Radiological, Nuclear) multiple‑choice QA. Implements choices‑only consensus screens, verified cloze scoring, and a heuristics battery (position bias, longest‑answer), with reproducible, fail‑graceful execution.
- Install uv:
curl -LsSf https://astral.sh/uv/install.sh | sh - Create venv and install deps:
uv venv && uv pip install -r requirements.txt
- Run a sample:
make setup && make sample
- Full pipeline (recommended):
make pipeline
More pipeline options are documented in scripts/PIPELINE_README.md.
- Overview and rationale:
overview.md - Getting started and CLI usage:
docs/getting-started/usage.md - Architecture:
docs/architecture/architecture.md - Security & release policy:
docs/safety/security-considerations.md - Results/report templates:
docs/results/ - Full docs index:
docs/README.md
- Lint:
.venv/bin/ruff check robustcbrn tests - Tests:
.venv/bin/pytest -q - Pre‑commit hooks:
pip install pre-commitpre-commit install(orbash scripts/install-hooks.sh)
- Cross‑platform pipeline (Windows/macOS/Linux) with robust error handling.
- Public artifacts are sanitized (no raw questions/choices or per‑item exploit labels). See
scripts/validate_release.sh.
- Release Checklist:
docs/safety/release-checklist.md
We welcome contributions! Please see our Contributing Guidelines for details on:
- Code of conduct
- Development workflow
- Commit message conventions
- Pull request process
- Testing requirements
This project is licensed under the MIT License - see the LICENSE file for details.
If you use this toolkit in your research, please cite:
@software{robustcbrn-eval,
title = {RobustCBRN Eval: Toolkit for Robustifying CBRN AI Benchmarks},
author = {[Authors]},
year = {2024},
url = {https://github.com/apart-research/robustcbrn-eval}
}See docs/safety/security-considerations.md for anonymization and public artifact rules.