👉 New to ML Agents? Check out the BEGINNER_GUIDE.md for a step-by-step walkthrough!
- ML Agents Community Program - Main hub for Cohere Labs' community-driven initiative on open-source agent research, focusing on agentic frameworks, applications, evaluations, and benchmarks
- Project Documentation - Detailed specifications and roadmap for the ZeroHPO (Zero-shot Hyperparameter Optimization) project for agentic tasks
- Project Tracker - Community project tracking, task assignments, and progress monitoring
- Discord Community - Join the #ml-agents channel for discussions, meetings, and collaboration with the community
This project investigates how different reasoning approaches affect AI model performance across a variety of tasks. It provides a comprehensive framework for comparing multiple reasoning techniques across different language models.
🎉 Phase 15 Complete: The platform now includes structured answer extraction with the Instructor library, providing clean separation of reasoning text from final answers across all reasoning approaches, with provider-aware optimization!
- Universal Benefit: Do all tasks benefit from reasoning?
- Model Variability: Do different models show varying benefits from reasoning?
- Approach Comparison: How do different reasoning approaches (CoT, PoT, etc.) compare?
- Task-Approach Fit: Do certain tasks benefit more from specific reasoning methods?
- Cost-Benefit Analysis: What is the tradeoff for each approach and task?
- Predictive Reasoning: Can we predict the need for reasoning based on the input prompt alone?
The platform currently supports 8 production-ready reasoning approaches with structured answer extraction:
- None - Baseline direct prompting without reasoning
- Chain-of-Thought (CoT) - Step-by-step reasoning process
- Program-of-Thought (PoT) - Code-based problem solving
- Reasoning-as-Planning - Strategic planning with goal decomposition
- Reflection - Self-evaluation and iterative improvement
- Chain-of-Verification - Systematic verification with follow-up questions
- Skeleton-of-Thought - Hierarchical outline-first reasoning
- Tree-of-Thought - Multiple reasoning path exploration and synthesis
Additional approaches planned: Graph-of-Thought, ReWOO, Buffer-of-Thoughts (Phase 6)
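Conceptually, each approach wraps the same underlying question in a different prompting strategy before it is sent to the model. The sketch below illustrates the idea for the None baseline and Chain-of-Thought; it is a simplified illustration with hypothetical class names, not the project's actual base class in src/reasoning/base.py.

```python
# Simplified illustration of how reasoning approaches differ at the prompt level.
# NOT the project's implementation (see src/reasoning/base.py); names are hypothetical.
from abc import ABC, abstractmethod


class ReasoningApproach(ABC):
    @abstractmethod
    def build_prompt(self, question: str) -> str:
        """Wrap the raw question in an approach-specific prompt."""


class NoneBaseline(ReasoningApproach):
    def build_prompt(self, question: str) -> str:
        return question  # direct prompting, no added reasoning scaffold


class ChainOfThought(ReasoningApproach):
    def build_prompt(self, question: str) -> str:
        return f"{question}\n\nLet's think step by step."


print(ChainOfThought().build_prompt("What is 17 + 25?"))
```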
All reasoning approaches now include clean answer extraction that:
- Separates reasoning from answers: Preserves full reasoning traces while extracting clean final answers
- Removes common prefixes: Converts "The answer is 42" → "42" automatically
- Provider-optimized: Uses ANTHROPIC_TOOLS, TOOLS, or JSON modes based on provider capabilities
- Reliable fallback: TOOLS → JSON fallback ensures compatibility across all providers
- Type-safe extraction: Pydantic models with validation and confidence scoring
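Under the hood this pattern relies on Instructor's structured outputs with a Pydantic response model. The snippet below is a minimal sketch of that pattern only; the field names, example model, and client setup are illustrative assumptions rather than the project's exact code.

```python
# Minimal sketch of structured answer extraction with Instructor + Pydantic.
# Field names, the example model, and the client setup are assumptions,
# not the project's exact implementation.
import instructor
from openai import OpenAI
from pydantic import BaseModel, Field


class ExtractedAnswer(BaseModel):
    """Clean final answer, separated from the full reasoning trace."""
    final_answer: str = Field(description="Answer only, without prefixes like 'The answer is'")
    confidence: float = Field(ge=0.0, le=1.0, description="Extraction confidence score")


# TOOLS mode where the provider supports it; Instructor also offers JSON mode as a fallback.
client = instructor.from_openai(OpenAI(), mode=instructor.Mode.TOOLS)

result = client.chat.completions.create(
    model="gpt-4o-mini",  # hypothetical choice for this example
    response_model=ExtractedAnswer,
    messages=[{"role": "user", "content": "Reasoning: 6 * 7 = 42. The answer is 42."}],
)
print(result.final_answer)  # expected: "42"
```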
- Python 3.9+
- uv (for virtual environment management)
- API keys for at least one provider (Anthropic, Cohere, or OpenRouter)
Install the latest stable version from PyPI:
# Install globally
pip install ml-agents-reasoning
# Or install with development dependencies
pip install ml-agents-reasoning[dev]
# Verify installation
ml-agents --version
ml-agents --help

With uv (fastest):
# Install with uv
uv tool install ml-agents-reasoning
# Run without installing (recommended for trying out)
uvx ml-agents-reasoning eval run LOCAL_TEST ChainOfThought --samples 10
# Add to project dependencies
uv add ml-agents-reasoning

For contributors or advanced users:
# Clone and install in development mode
git clone https://github.com/thompsonson/c4ai-ml-agents
cd c4ai-ml-agents
pip install -e .[dev]
# Or with uv (recommended)
uv sync --all-extras

After installation, configure your API keys:
# Create configuration file
cp .env.example .env
# Edit .env with your actual API keys
# Or set environment variables directly
export ANTHROPIC_API_KEY="your-key-here"
export OPENROUTER_API_KEY="your-key-here"

The ML Agents CLI includes two types of commands:
- Stable Commands (✅ Production Ready): `setup`, `db`, `preprocess` - Well-tested, stable API, suitable for production use
- Pre-Alpha Commands (⚠️ Experimental): `eval`, `results` - Experimental features that may be unstable or have breaking changes
For production use or getting started, we recommend using only the stable commands first.
Once installed, you can use the ML Agents CLI:
# Validate your environment
ml-agents setup validate-env
# List available reasoning approaches
ml-agents setup list-approaches
# Discover available datasets (⚠️ PRE-ALPHA)
ml-agents eval list
# Get dataset information (⚠️ PRE-ALPHA)
ml-agents eval info LOCAL_TEST
# Run a simple experiment (⚠️ PRE-ALPHA)
ml-agents eval run LOCAL_TEST ChainOfThought --samples 10
# Run with repository benchmark (⚠️ PRE-ALPHA)
ml-agents eval run BENCHMARK-01-GPQA.csv TreeOfThought --samples 50
# Compare multiple approaches (⚠️ PRE-ALPHA)
ml-agents eval compare --config examples/configs/comparison_study.yaml

To use the original Jupyter notebook interface:
jupyter notebook Reasoning_LLM.ipynb

- Anthropic: Claude Opus 4, Claude Sonnet 4, Claude 3.5 Haiku
- Cohere: Command R+, Command R, Command Light
- OpenRouter: GPT-5, GPT-5 Mini, GPT OSS-120B, Gemini 2.5 Flash Lite
- Temperature: 0.0 - 2.0 (controls randomness)
- Max Tokens: 64 - 4096 (output length limit)
- Top P: 0.0 - 1.0 (nucleus sampling parameter)
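To make the three parameters above concrete, here is what they control on a raw API call. The sketch uses a generic OpenAI-compatible client pointed at OpenRouter purely for illustration; it is not the project's internal wrapper in src/utils/api_clients.py.

```python
# Illustration of temperature / max_tokens / top_p on a plain OpenAI-compatible call
# routed through OpenRouter. This is NOT the project's api_clients.py wrapper.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

response = client.chat.completions.create(
    model="openai/gpt-oss-120b",
    messages=[{"role": "user", "content": "What is 17 + 25? Think step by step."}],
    temperature=0.3,  # 0.0-2.0: lower = more deterministic output
    max_tokens=512,   # 64-4096: hard cap on generated tokens
    top_p=1.0,        # 0.0-1.0: nucleus sampling cutoff
)
print(response.choices[0].message.content)
```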
The platform includes SQLite database persistence for all experiment results and supports Claude Code MCP server integration for direct database access during conversations.
- Real-time persistence: All experiment results are automatically saved to `ml_agents_results.db`
- Read-only MCP access: Query the database directly from Claude Code conversations
- Rich export formats: CSV, JSON, and Excel with advanced formatting
- Advanced analytics: Approach comparisons, failure analysis, and cost tracking
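Because results live in a plain SQLite file, you can also inspect them directly with any SQLite client. The snippet below is only a sketch: the table and column names it assumes are illustrative, so check the actual schema first (for example via `ml-agents db stats`).

```python
# Sketch: ad-hoc inspection of the results database with the standard library.
# WARNING: the table/column names below ("runs", "approach", "is_correct") are
# assumptions for illustration -- inspect the real schema before querying.
import sqlite3

conn = sqlite3.connect("ml_agents_results.db")
conn.row_factory = sqlite3.Row

# List the tables that actually exist before assuming any names.
tables = [r["name"] for r in conn.execute("SELECT name FROM sqlite_master WHERE type='table'")]
print("tables:", tables)

# Hypothetical accuracy-per-approach query, assuming a 'runs' table exists.
for row in conn.execute(
    "SELECT approach, COUNT(*) AS n, AVG(is_correct) AS accuracy "
    "FROM runs GROUP BY approach ORDER BY accuracy DESC"
):
    print(f"{row['approach']:<25} n={row['n']:<5} accuracy={row['accuracy']:.2%}")

conn.close()
```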
# Database management (Stable Commands)
ml-agents db init --db-path ./results.db # Initialize database
ml-agents db backup --source ./results.db # Create backup
ml-agents db stats --db-path ./results.db # Show statistics
ml-agents db migrate --db-path ./results.db # Migrate database schema
# Export and analysis (⚠️ PRE-ALPHA)
ml-agents results export EXPERIMENT_ID --format excel # Export to Excel
ml-agents results compare "exp1,exp2,exp3" # Compare experiments
ml-agents results analyze EXPERIMENT_ID --type accuracy # Generate reports
ml-agents results list --status completed                # List experiments

Explore available datasets before running experiments:
# List all available datasets
ml-agents eval list
# Get detailed information about a dataset
ml-agents eval info LOCAL_TEST
ml-agents eval info BENCHMARK-01-GPQA.csv
# List datasets from custom repository
ml-agents eval list --repo your-org/your-benchmarks

Run one reasoning approach on a dataset:
# Basic usage with LOCAL_TEST
ml-agents eval run LOCAL_TEST ChainOfThought --samples 10
# Use repository benchmark
ml-agents eval run BENCHMARK-01-GPQA.csv TreeOfThought --samples 50
# With specific model provider
ml-agents eval run LOCAL_TEST Reflection --provider anthropic --model claude-3-5-haiku-20241022
# With advanced reasoning settings
ml-agents eval run LOCAL_TEST ChainOfVerification --multi-step-verification --max-reasoning-calls 5
# With custom repository
ml-agents eval run my-dataset.csv ChainOfThought --repo your-org/benchmarks --samples 100

Compare multiple approaches using configuration files:
# Basic comparison with config file
ml-agents eval compare --config examples/configs/comparison_study.yaml
# Override config settings
ml-agents eval compare --config examples/configs/comparison_study.yaml --samples 200 --parallel

Note: The compare command uses YAML configuration files to specify multiple approaches. See the Configuration Files section below for details.
For complex experiments, use YAML configuration files:
# Run single experiment with config (⚠️ PRE-ALPHA)
ml-agents eval run LOCAL_TEST ChainOfThought --config examples/configs/single_experiment.yaml
# Run comparison with config (⚠️ PRE-ALPHA)
ml-agents eval compare --config examples/configs/comparison_study.yaml
# Override config parameters (⚠️ PRE-ALPHA)
ml-agents eval compare --config examples/configs/comparison_study.yaml --samples 200 --parallel

Example configuration (config.yaml):
experiment:
  name: "reasoning_comparison_study"
  sample_count: 100
  output_dir: "./results"

model:
  provider: "openrouter"
  name: "openai/gpt-oss-120b"
  temperature: 0.3
  max_tokens: 512

reasoning:
  approaches:
    - ChainOfThought
    - ReasoningAsPlanning
    - TreeOfThought
  multi_step_verification: true
  max_reasoning_calls: 5

execution:
  parallel: true
  max_workers: 4
  save_checkpoints: true

Resume interrupted experiments:
# List available checkpoints
ml-agents eval checkpoints
# Resume from specific checkpoint
ml-agents eval resume checkpoint_exp_20250818_123456.json

# Set reasoning limits to control costs
ml-agents eval run LOCAL_TEST ChainOfVerification --max-reasoning-calls 3 --samples 50
# Monitor costs with verbose output
ml-agents eval compare --config examples/configs/comparison_study.yaml --samples 100 --verbose

# Enable multi-step reflection
ml-agents eval run LOCAL_TEST Reflection --multi-step-reflection --max-reflection-iterations 3
# Enable multi-step verification
ml-agents eval run LOCAL_TEST ChainOfVerification --multi-step-verification --max-reasoning-calls 5

# Parallel execution with config file (approaches defined in YAML)
ml-agents eval compare --config examples/configs/comparison_study.yaml --parallel --max-workers 2
# Override config for large experiments
ml-agents eval compare --config examples/configs/comparison_study.yaml --samples 500 --parallel --max-workers 8

Results are organized by dataset with full preprocessing-evaluation traceability:
./outputs/
├── {dataset_name}/
│   ├── preprocessing/
│   │   ├── {timestamp}/
│   │   │   ├── analysis.json            # Dataset schema analysis
│   │   │   ├── rules.json               # Transformation rules
│   │   │   ├── processed.json           # Standardized dataset
│   │   │   └── metadata.json            # Preprocessing metadata
│   │   └── latest → {most_recent}/      # Symlink to latest preprocessing
│   └── eval/
│       ├── {exp_timestamp}/
│       │   ├── experiment_config.json   # Experiment configuration
│       │   ├── experiment_results.csv   # Detailed results per approach
│       │   ├── experiment_summary.json  # Performance summary
│       │   └── experiment_errors.json   # Any processing errors
│       └── latest → {most_recent}/      # Symlink to latest experiment
Each result file contains:
- Input prompts and model responses
- Complete reasoning traces
- Performance metrics (accuracy, time, cost)
- Configuration details
- Error information
- Preprocessing lineage: Complete traceability to preprocessing rules used
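For quick programmatic post-processing, the `latest` symlink makes the newest run easy to find. A minimal sketch follows (the path layout matches the tree above; the dataset folder name is hypothetical, and no assumptions are made about the keys inside the summary JSON):

```python
# Sketch: load the most recent evaluation summary for a dataset via the `latest` symlink.
import json
from pathlib import Path

dataset = "LOCAL_TEST"  # hypothetical dataset folder name
summary_path = Path("outputs") / dataset / "eval" / "latest" / "experiment_summary.json"

with summary_path.open() as f:
    summary = json.load(f)

# Print the top-level keys rather than assuming their names.
print(sorted(summary.keys()))
```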
# Validate your setup first
ml-agents setup validate-env
ml-agents setup list-approaches
ml-agents setup version

# Initialize and manage experiment database
ml-agents db init
ml-agents db stats
ml-agents db backup --source ./ml_agents_results.db

# Preprocess datasets for evaluation
ml-agents preprocess list
ml-agents preprocess inspect MilaWang/SpatialEval --samples 100
ml-agents preprocess batch --max 5

# 1. Preprocess a custom dataset (creates organized folder structure)
ml-agents preprocess inspect MilaWang/SpatialEval --config tqa
ml-agents preprocess generate-rules MilaWang/SpatialEval --config tqa
ml-agents preprocess transform MilaWang/SpatialEval rules.json --config tqa
# 2. Run evaluation with preprocessed data (auto-detects latest preprocessing) (⚠️ PRE-ALPHA)
ml-agents eval run MilaWang_SpatialEval_tqa ChainOfThought --samples 50
# 3. Compare approaches on same preprocessed dataset (⚠️ PRE-ALPHA)
ml-agents eval run MilaWang_SpatialEval_tqa TreeOfThought --samples 50
ml-agents eval run MilaWang_SpatialEval_tqa Reflection --samples 50
# 4. View organized results
ml-agents results list

# Test with small sample size
ml-agents eval run LOCAL_TEST ChainOfThought --samples 5 --verbose

# Comprehensive comparison study
ml-agents eval compare \
--approaches "None,ChainOfThought,ReasoningAsPlanning,TreeOfThought,Reflection" \
--samples 200 \
--parallel \
--max-workers 4 \
--multi-step-verification \
  --output "./studies/comprehensive_study"

Stable commands (✅ production ready):

| Command | Description |
|---|---|
| `ml-agents setup validate-env` | Check environment setup |
| `ml-agents setup list-approaches` | Show available reasoning methods |
| `ml-agents setup version` | Show version information |
| `ml-agents db init` | Initialize experiment database |
| `ml-agents db backup` | Create database backup |
| `ml-agents db stats` | Show database statistics |
| `ml-agents db migrate` | Migrate database schema |
| `ml-agents preprocess list` | List unprocessed datasets |
| `ml-agents preprocess inspect` | Inspect dataset schema |
| `ml-agents preprocess batch` | Batch process datasets |
Pre-alpha commands (⚠️ experimental):

| Command | Description |
|---|---|
| `ml-agents eval run` | Single reasoning experiment |
| `ml-agents eval compare` | Multi-approach comparison |
| `ml-agents eval resume` | Resume from checkpoint |
| `ml-agents eval checkpoints` | Show available checkpoints |
| `ml-agents results export` | Export experiment results |
| `ml-agents results compare` | Compare experiments |
| `ml-agents results analyze` | Analyze experiment patterns |
For detailed help on any command:
ml-agents setup --help
ml-agents eval run --help
ml-agents db --help

For users who prefer the notebook interface:
- Setup: Ensure dependencies are installed via `./setup.sh`
- Configuration: Use the interactive widgets to select models and approaches
- Data: Defaults to the "bbeh-eval" dataset; customizable
- Execute: Run the experiment cells to process your dataset
- Results: Tables and CSV files named `{model}_{approach}_{timestamp}.csv`
Your dataset should include:
- `input` column: The question/problem to solve
- `answer` column (optional): Expected output for evaluation
- `task` column (optional): Task category for analysis
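As a concrete illustration, the sketch below builds a tiny dataset in that shape with pandas; the file name and example rows are hypothetical.

```python
# Hypothetical two-row dataset in the expected column format.
import pandas as pd

df = pd.DataFrame(
    {
        "input": ["What is 17 + 25?", "Name the capital of France."],
        "answer": ["42", "Paris"],            # optional: expected output for evaluation
        "task": ["arithmetic", "geography"],  # optional: task category for analysis
    }
)
df.to_csv("my_dataset.csv", index=False)
print(df)
```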
The notebook generates CSV files containing:
- Input prompts
- Model outputs
- Full reasoning traces
- Execution time
- Cost estimates
- Configuration details
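If you want to aggregate those CSVs outside the notebook, something like the following works; it is a sketch that only relies on the file-naming convention above, and it does not assume specific column names.

```python
# Sketch: collect and concatenate notebook result CSVs named {model}_{approach}_{timestamp}.csv.
import glob
import pandas as pd

files = glob.glob("*_ChainOfThought_*.csv")  # approach name here is just an example
df = pd.concat([pd.read_csv(f) for f in files])
print(df.head())
print("rows:", len(df))
```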
ml-agents/
├── src/                           # Main source code
│   ├── cli/                       # CLI interface (Phase 5)
│   │   ├── main.py                # CLI entry point
│   │   ├── commands.py            # Run/compare commands
│   │   ├── config_loader.py       # Configuration management
│   │   ├── display.py             # Rich output formatting
│   │   └── validators.py          # Input validation
│   ├── core/                      # Core experiment logic
│   │   ├── experiment_runner.py   # Experiment orchestration
│   │   ├── dataset_loader.py      # Dataset loading
│   │   └── reasoning_inference.py # Inference engine
│   ├── reasoning/                 # Reasoning approaches
│   │   ├── base.py                # Base reasoning class
│   │   ├── chain_of_thought.py    # CoT implementation
│   │   ├── tree_of_thought.py     # ToT implementation
│   │   └── ...                    # Other approaches
│   └── utils/                     # Utilities
│       ├── api_clients.py         # API wrappers
│       ├── rate_limiter.py        # Rate limiting
│       └── logging_config.py      # Logging setup
├── examples/                      # Usage examples
│   ├── configs/                   # Configuration templates
│   ├── scripts/                   # Batch processing scripts
│   └── README.md                  # Examples documentation
├── tests/                         # Test suite
├── outputs/                       # Organized experiment and preprocessing results
│   └── {dataset_name}/            # Dataset-centric organization
│       ├── preprocessing/         # Preprocessing runs with timestamps
│       └── eval/                  # Evaluation runs with timestamps
├── Reasoning_LLM.ipynb            # Original Jupyter notebook
├── config.py                      # Environment configuration
├── requirements.txt               # Python dependencies
├── setup.sh                       # Automated setup script
├── Makefile                       # Development commands
└── README.md                      # This file
- Start Small: Begin with `--samples 10` to test approaches quickly
- Use Baselines: Always include the `None` approach for comparison
- Cost Control: Monitor costs with `--verbose` and set `--max-reasoning-calls`
- Parallel Processing: Use `--parallel` for faster comparison studies
- Reproducibility: Save configuration files and use checkpoints
- Temperature Settings: Lower values (0.1-0.3) for consistent, cost-effective results
- Token Limits: Set appropriate `--max-tokens` based on your task complexity
- Sample Sizing: Use smaller samples for initial exploration
- Provider Selection: Compare costs across different providers
- Multi-step Limits: Control `--max-reasoning-calls` for approaches like Chain-of-Verification
- Parallel Execution: Use `--parallel --max-workers N` for comparison studies
- Checkpoint Usage: Enable checkpoints for long-running experiments
- Rate Limiting: Adjust `--max-workers` based on provider rate limits
- Batch Processing: Use configuration files and scripts for multiple experiments
# Check environment
ml-agents setup validate-env
# Fix dependency issues
make clean && make install-dev
# Verify imports
make debug-imports

# Check .env file exists and has keys
cat .env
# Validate specific provider
ml-agents setup validate-env

Error messages will guide you to set missing keys:
export OPENROUTER_API_KEY="your_key_here"
export ANTHROPIC_API_KEY="your_key_here"

If you encounter rate limits:
# Reduce parallel workers for comparison experiments
ml-agents eval compare --config examples/configs/comparison_study.yaml --max-workers 1
# Disable parallel processing for single experiments
ml-agents eval run LOCAL_TEST ChainOfThought --samples 50 --parallel false

For large experiments:
# Reduce sample size for comparison experiments
ml-agents eval compare --config examples/configs/comparison_study.yaml --samples 50
# Disable parallel processing for comparison experiments
ml-agents eval compare --config examples/configs/comparison_study.yaml --parallel false

The warning about NumPy 1.x vs 2.x is cosmetic and doesn't affect functionality:
A module that was compiled using NumPy 1.x cannot be run in NumPy 2.3.2...
This is a known PyTorch compatibility issue and can be ignored.
# Reinstall the package
make install-dev
# Check entry point
which ml-agents

# Activate virtual environment
source .venv/bin/activate
# Test imports
make debug-imports

For configuration errors, check:
- YAML/JSON syntax is valid
- All required fields are present
- Approach names match available options (`ml-agents setup list-approaches`)
- Provider/model combinations are supported
- Command Help: Use `--help` with any command:
  ml-agents --help
  ml-agents eval run --help
  ml-agents eval compare --help
- Verbose Output: Add `--verbose` to see detailed execution logs:
  ml-agents eval run LOCAL_TEST ChainOfThought --samples 5 --verbose
- Check Status: Validate your setup:
  ml-agents setup validate-env
  ml-agents setup list-approaches
  make validate-env
- Community Support: Join the Discord #ml-agents channel for help
For developers working on the codebase:
# Run test suite
make test
# Check code quality
make lint
# Type checking
make type-check
# Full development check
make check

For developers using Claude Code, enable direct database queries in conversations:
# Configure MCP server (one-time setup)
make configure-mcp
# Or run the script directly
./scripts/install-sqlite-mcp-server.sh

Available MCP Tools:
- `read_query`: Execute validated SELECT queries
- `list_tables`: Show all database tables
- `describe_table`: Show table schemas
Note: The installed server may not appear in `claude mcp list` due to a known bug. Use `claude mcp get sqlite-read-only` to verify installation.
Feel free to extend the notebook with:
- Additional reasoning approaches
- New evaluation metrics
- Support for more models/providers
- Performance optimizations
This project is licensed under the Creative Commons Attribution 4.0 International License. This means:
- ✅ Share - Copy and redistribute the material in any medium or format
- ✅ Adapt - Remix, transform, and build upon the material for any purpose, even commercially
- ✅ Attribution - You must give appropriate credit, provide a link to the license, and indicate if changes were made
This license is chosen because:
- Open Science: Aligns with Cohere Labs' open science mission
- Maximum Impact: Allows both academic and commercial use, accelerating AI research
- Community Growth: Enables derivatives while ensuring original work is credited
- Simplicity: Easy to understand and implement
Note: For the code components specifically, you may want to consider dual-licensing with MIT or Apache 2.0 for better software compatibility.
- CC BY-SA 4.0: Adds "ShareAlike" requirement - derivatives must use same license (more restrictive but ensures openness)
- CC BY-NC 4.0: Adds "NonCommercial" restriction - prevents commercial use (limits industry collaboration)
- CC0: Public domain dedication - no attribution required (maximum freedom but no credit requirement)
