The `models.yaml` file centralizes all model configurations used by your agents. This includes default model choices, a comprehensive library of available APIs and models, and optional embedding libraries. Agents can either use these defaults or override them in their own YAML files.
```
your_project_root/.agentforge/settings/models.yaml
```
A typical `models.yaml` might look like this:
```yaml
# Default model settings for all agents unless overridden
default_model:
  api: gemini_api
  model: gemini_flash

# Library of models and parameter defaults
model_library:
  openai_api:
    GPT:
      models:
        omni_model:
          identifier: gpt-4o
          params:
            max_new_tokens: 15000
        fast_model:
          identifier: gpt-3.5-turbo
          params:
            temperature: 0.8
            max_tokens: 10000
            n: 1
  gemini_api:
    Gemini:
      models:
        gemini_pro:
          identifier: gemini-1.5-pro
        gemini_flash:
          identifier: gemini-1.5-flash
          params:
            temperature: 0.8
            top_k: 40
  # ... Other API entries ...

# Embedding library
embedding_library:
  library: sentence_transformers
```
In broad strokes, AgentForge loads these configurations at startup and merges them into a single dictionary accessible at `agent_data['settings']['models']`.
```yaml
default_model:
  api: gemini_api
  model: gemini_flash
```
- `api`: The name of the API your agents will use if they don't specify an override (e.g., `gemini_api`, `openai_api`, etc.).
- `model`: The default model within that API library.

Any agent that doesn't declare a `model_overrides` block will use this combination of `api` and `model`.
Under the `model_library` key, you'll find one or more sections for each API your project supports: for instance, `openai_api`, `anthropic_api`, `gemini_api`, and so on. Each API section typically has one or more classes (like `GPT` or `Gemini`) that define multiple models.
```yaml
model_library:
  openai_api:
    GPT:
      models:
        fast_model:
          identifier: gpt-3.5-turbo
          params:
            max_new_tokens: 4000
        omni_model:
          identifier: gpt-4o
      params:
        temperature: 0.8
```
- **API Name** (e.g., `openai_api`): Corresponds to a Python module under `agentforge/apis/openai_api.py`.
- **Class Name** (e.g., `GPT`): Typically the class used to instantiate or connect to the API from within that module.
- **`models`**: A dictionary where each key is a model name (like `fast_model` or `omni_model`) pointing to a configuration that includes:
  - `identifier`: The actual string your code uses to refer to the LLM (like `gpt-3.5-turbo`).
  - `params`: (Optional) Parameter overrides specific to this model (e.g., a custom `max_new_tokens`).
- **`params`**: A dictionary of default parameters for all models in this class. For example, if you define `temperature: 0.8` here, each model under `GPT` inherits it unless it's overridden by the model's own `params`.
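As an illustration of how this nested structure can be walked programmatically, here is a small sketch. The `lookup_model` helper and the dictionary literal are hypothetical stand-ins for illustration, not part of AgentForge's API:

```python
# Hypothetical excerpt of the merged settings dictionary described above.
models_config = {
    "model_library": {
        "openai_api": {
            "GPT": {
                "models": {
                    "fast_model": {
                        "identifier": "gpt-3.5-turbo",
                        "params": {"max_new_tokens": 4000},
                    },
                },
                "params": {"temperature": 0.8},
            },
        },
    },
}

def lookup_model(config, api, model):
    """Return (identifier, class_params, model_params) for a named model."""
    for class_name, section in config["model_library"][api].items():
        entry = section.get("models", {}).get(model)
        if entry is not None:
            return (entry["identifier"],
                    section.get("params", {}),
                    entry.get("params", {}))
    raise KeyError(f"{model!r} not found under {api!r}")

identifier, class_params, model_params = lookup_model(
    models_config, "openai_api", "fast_model"
)
# identifier == "gpt-3.5-turbo"
```

Note how the model's `identifier` and its own `params` live under the class's `models` key, while the class-wide defaults sit in a sibling `params` block.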
### Merging Parameters
When AgentForge resolves which model to use, it merges parameters in the following order:

1. API-level `params` (if present).
2. Class-level `params` (like the `params` block under `GPT`).
3. Model-level `params` (the block nested under each specific model).
4. Agent-level overrides, if the agent's YAML includes a `model_overrides` section.
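The layered merge above amounts to successive dictionary updates where later layers win. A rough sketch in plain Python (the function name is illustrative, not AgentForge's internal API):

```python
def merge_params(api_params=None, class_params=None,
                 model_params=None, overrides=None):
    """Merge parameter layers; later layers win: API < class < model < agent."""
    final = {}
    for layer in (api_params, class_params, model_params, overrides):
        if layer:
            final.update(layer)
    return final

# Class-level temperature survives unless a later layer redefines it;
# here the agent override wins for temperature.
merged = merge_params(
    class_params={"temperature": 0.8},
    model_params={"max_new_tokens": 4000},
    overrides={"temperature": 0.5},
)
# merged == {"temperature": 0.5, "max_new_tokens": 4000}
```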
```yaml
embedding_library:
  library: sentence_transformers
```
This section often appears in `models.yaml` to specify which embedding toolkit is used system-wide. In most cases:

- `library`: The name of the Python module or method used to generate text embeddings.
- Additional details can appear here if your embedding approach requires more nuance (though typically that is handled in `storage.yaml`, which specifies how embeddings are used to store data).
Agents can override the API, model, or params defined in `default_model` and `model_library`. Here's how:
```yaml
# In your agent's .yaml
prompts:
  system: "You are a specialized summarizer."
  user: "Summarize: {text}"

model_overrides:
  api: openai_api
  model: fast_model
  params:
    temperature: 0.5
    max_new_tokens: 5000
```
When this agent loads:

1. It starts with the global `default_model` (e.g., `api: gemini_api`, `model: gemini_flash`).
2. It sees the override: `api: openai_api` and `model: fast_model`.
3. It merges parameters from `model_library.openai_api.GPT.params`, plus `fast_model.params`, plus agent-level overrides (`temperature: 0.5`, `max_new_tokens: 5000`).
This final merged set of parameters is then used to instantiate the underlying model class.
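Written out by hand, the merge for this agent looks like the following sketch (assuming the class-level and model-level values from the earlier `model_library` example):

```python
class_params = {"temperature": 0.8}      # model_library.openai_api.GPT.params
model_params = {"max_new_tokens": 4000}  # ...GPT.models.fast_model.params
agent_overrides = {"temperature": 0.5, "max_new_tokens": 5000}  # model_overrides.params

# Later layers win, so both agent-level values replace the library defaults.
final_params = {**class_params, **model_params, **agent_overrides}
# final_params == {"temperature": 0.5, "max_new_tokens": 5000}
```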
When your agent is instantiated, the system merges the chosen model’s config and creates a model object. For instance:
```python
from agentforge.agent import Agent

class SummarizeAgent(Agent):
    def run_llm(self):
        # Optionally inspect final parameters
        final_params = self.agent_data['params']
        self.logger.log(f"Using model params: {final_params}", "debug", "model_io")

        # The 'model' field is already an instantiated class from the relevant API module
        self.result = self.model.generate(self.prompt, **final_params)
```
Here, `self.model` is the Python object created from your `model_library` definitions, and `self.agent_data['params']` is the fully merged parameter dictionary.
See the APIs Guide for more detailed information on how to create your own custom API interface.
- Leverage `default_model` for common usage, and only override in agent YAML when needed.
- Keep parameter definitions minimal at each level. If you only need to change `temperature`, specify that alone in your override.
- Use clear naming in your `model_library`. For example, call the class `GPT` or `Gemini` to reflect the actual Python class used.
- Test new models thoroughly by enabling debug logs at the `model_io` level (`model_io: debug`) to confirm you're sending the correct parameters.
By splitting model configuration into a `default_model` and a `model_library`, AgentForge offers both convenience and flexibility. Most agents can rely on a universal default, while specialized agents can override exactly what they need. Meanwhile, new APIs or models can be added by simply creating a Python module and updating the `models.yaml` file accordingly.
For more details about how these model settings interact with system or storage configurations, see the related settings guides.
### Need Help?
If you have questions or need assistance, feel free to reach out:
- Email: [email protected]
- Discord: Join our Discord Server