Unified Provider Architecture

This document specifies the unified provider architecture in dartantic_ai (dartantic) that supports both chat and embeddings operations through a single, consistent interface.

Overview

The unified provider architecture enables:

Single provider interface for both chat and embeddings models
Consistent API key and configuration management
Provider capability discovery and filtering
Clean separation of concerns between Agent, Provider, and Model layers

Core Architecture

Architecture Overview

graph TD
    A[User Code] --> B[Agent Layer]
    B --> C[Provider Layer]
    C --> D[Model Layer]
    D --> E[External LLM APIs]
    
    B -.-> B1[Parses model strings]
    B -.-> B2[Orchestrates tool execution]
    B -.-> B3[Manages conversation state]
    B -.-> B4[Handles streaming UX]
    
    C -.-> C1[Resolves API keys]
    C -.-> C2[Selects default models]
    C -.-> C3[Creates model instances]
    C -.-> C4[Declares capabilities]
    
    D -.-> D1[Direct API communication]
    D -.-> D2[Request/response formatting]
    D -.-> D3[Protocol implementation]
    D -.-> D4[Error handling]
    
    E -.-> E1[OpenAI]
    E -.-> E2[Anthropic]
    E -.-> E3[Google]
    E -.-> E4[Others...]

Provider Base Class

The Provider class serves as the unified interface for all LLM providers (see lib/src/providers/provider.dart).

Key characteristics:

Generic type parameters for chat and embeddings options
Required properties: name, displayName, defaultModelNames, capabilities
Optional properties: apiKey, baseUrl, apiKeyName, aliases
Factory methods: createChatModel() and createEmbeddingsModel()
Discovery method: listModels() stream

The Provider base class unifies what were previously separate ChatProvider and EmbeddingsProvider types into a single interface that can create both model types.

Model Kinds

The ModelKind enum (see lib/src/providers/model_kind.dart) defines the types of models a provider can support. Currently implemented kinds are chat and embeddings, with placeholders for future expansion (tts, image, audio, etc.).

This enum is used in the defaultModelNames map to specify different default models for each capability.

Provider Capabilities

ProviderCaps Enum

The ProviderCaps enum (see lib/src/provider_caps.dart) enables dynamic discovery and filtering of providers based on their features:

chat: Basic chat completion capability
embeddings: Text embeddings generation
multiToolCalls: Support for multiple function/tool calls in one response
typedOutput: Structured JSON output generation
typedOutputWithTools: Combination of typed output + tools
vision: Multi-modal input support (images, etc.)

Note: Streaming is not a capability because all chat providers support it by default.

Capability Declaration

Each provider explicitly declares its capabilities during instantiation. See provider implementations in lib/src/providers/ for examples. Capabilities should accurately reflect what the provider's API actually supports, not what we wish it supported.

Capability Querying

The Provider class offers static methods for capability-based discovery:

Providers.allWith(Set<ProviderCaps>) - Get providers with all specified capabilities
Direct capability checking via provider.caps.contains(capability)

This enables test suites to run feature-specific tests only on supporting providers.

Important: Capabilities are Informational

Capabilities are informational metadata only:

Developers can choose to use or ignore capabilities
No enforcement at the Agent or Provider level
Models themselves decide whether to throw errors
Allows experimentation with undocumented features

Provider Registry

The Provider class maintains a static registry of all available providers (see lib/src/providers/provider.dart):

classDiagram
    class ProviderRegistry {
        +static openai : Provider
        +static google : Provider
        +static anthropic : Provider
        +static mistral : Provider
        +static cohere : Provider
        +static ollama : Provider
        +static forName(name) Provider
        +static all List~Provider~
        +static allWith(caps) List~Provider~
    }
    
    class Provider {
        +name : String
        +aliases : List~String~
        +displayName : String
        +createChatModel()
        +createEmbeddingsModel()
        +listModels()
    }
    
    ProviderRegistry ..> Provider : manages
    
    note for ProviderRegistry "Alias lookup:\n'claude' → Anthropic\n'gemini' → Google"

Lazy instances: Each provider has a lazy getter (e.g., Providers.openai) to avoid initialization errors
Name-based lookup: Providers.get(String) with case-insensitive matching
Alias support: Providers can have alternative names (e.g., 'claude' → 'anthropic')
Discovery methods: Providers.all lists all providers, Providers.allWith() filters by capabilities

Providers are created lazily on first access to avoid initialization errors when API keys are missing. This allows users to use specific providers without needing all API keys configured.

Separation of Concerns

Request Flow Diagram

sequenceDiagram
    participant User
    participant Agent
    participant Provider
    participant Model
    participant API
    
    User->>Agent: Agent('openai').send('Hello!')
    
    rect rgb(200, 220, 240)
        note over Agent: AGENT LAYER
        Agent->>Agent: Parse "openai" → provider name
        Agent->>Provider: Providers.get("openai")
        Provider-->>Agent: OpenAIProvider instance
        Agent->>Agent: Lazy model creation
    end
    
    rect rgb(220, 240, 200)
        note over Provider: PROVIDER LAYER
        Agent->>Provider: createChatModel()
        Provider->>Provider: Resolve API key
        note right of Provider: 1. Check provider.apiKey<br/>2. Try Agent.environment<br/>3. Try Platform.environment
        Provider->>Provider: Select model "gpt-4o"
        Provider->>Model: new ChatModel(config)
        Model-->>Provider: ChatModel instance
        Provider-->>Agent: ChatModel instance
    end
    
    rect rgb(240, 220, 200)
        note over Model: MODEL LAYER
        Agent->>Model: sendStream(messages)
        Model->>Model: Initialize API client
        Model->>Model: Format per OpenAI spec
        Model->>API: POST /chat/completions
        API-->>Model: Stream response
        Model-->>Agent: Stream ChatResult
        Agent-->>User: Stream output
    end

1. Agent Layer (lib/src/agent/agent.dart)

Responsibilities:

Parse model strings via ModelStringParser
Look up providers from registry
Orchestrate tool execution
Manage conversation state and message accumulation
Handle streaming UX through orchestrators

NOT Responsible For:

API key resolution
Base URL configuration
Model instantiation details
Direct API communication

The Agent creates models lazily when needed, allowing the Provider to handle all configuration concerns.

2. Provider Layer (lib/src/providers/)

Responsibilities:

API key resolution from environment (via tryGetEnv helper - allows lazy initialization)
Default model selection from defaultModelNames map
Base URL configuration and defaults
Model factory operations (createChatModel, createEmbeddingsModel)
Capability declaration
API key validation at model creation time (not constructor time)

Key Pattern: Providers use tryGetEnv() in constructors to allow lazy initialization without throwing errors. API key validation happens when creating models, not when creating providers. This allows users to access provider metadata and use specific providers without needing all API keys configured.

3. Model Layer (lib/src/chat_models/, lib/src/embeddings_models/)

Responsibilities:

Direct API communication via provider-specific clients
Request/response formatting per API specification
Error handling for unsupported features
Stream processing and message consolidation
Protocol-specific implementation details

Requirements:

Models receive non-null, non-empty configuration from providers
Models validate their own capabilities and throw appropriate errors
Models handle their underlying API client lifecycle (dispose pattern)

Implementation Patterns

Provider Implementation Pattern

See actual implementations in lib/src/providers/:

anthropic_provider.dart - Example of chat-only provider
openai_provider.dart - Full-featured provider with OpenAI-compatible pattern
ollama_provider.dart - Local provider without API keys
google_provider.dart - Native API provider with custom protocol

Key patterns:

Providers extend Provider<TChatOptions, TEmbeddingsOptions>
Constructor calls super with all required metadata, using tryGetEnv() for API keys
Factory methods validate API keys and throw if required but missing
Unsupported operations throw UnsupportedError
Providers are instantiated lazily via getters to avoid initialization errors

OpenAI-Compatible Pattern

Many providers use OpenAI's API format. The OpenAIProvider class can be instantiated with different configurations to support multiple providers (OpenRouter, Together, etc.). See how Providers.openrouter and others are defined as configured OpenAIProvider instances.

Custom Provider Pattern

For implementing new providers, follow the pattern in existing implementations:

Define provider-specific option classes
Extend Provider with appropriate type parameters
Implement factory methods with proper configuration resolution
Add static instance to Provider registry

Usage Patterns

Agent Creation Patterns

Provider name: Agent('openai') - Uses all defaults
Model specification: Agent('openai?chat=gpt-4') - Override specific models
Provider instance: Agent.forProvider(customProvider) - Full control

See example/bin/ for working examples of all patterns.

Capability-Based Testing

Tests use capability filtering to ensure feature compatibility. See test files for patterns like:

Running tool tests only on providers with multiToolCalls
Testing embeddings only on providers with embeddings capability
Validating typed output on supporting providers

Direct Model Access

While Agent is the primary interface, direct model creation is supported for advanced use cases. Providers expose their factory methods for this purpose.

Design Principles

1. Single Provider Interface

One provider supports both chat and embeddings
Consistent configuration across model types
Simplified API surface

2. Deferred Validation Philosophy

API key validation deferred to model creation time
Allows provider access without all API keys configured
Clear error messages when actually trying to use a provider
No silent fallbacks that hide configuration errors

3. Capability as Information

Capabilities inform but don't restrict
Models enforce their own limitations
Allows experimentation and discovery

4. Clean Separation

Each layer has clear responsibilities
No cross-layer dependencies
Easy to test and maintain

5. Extensibility

Easy to add new providers
Support for custom implementations
Future model kinds already considered

Known Provider Limitations

Provider Capability Matrix

Provider	Chat	Embeddings	Tools	Typed Output	Tools+Typed	Vision
OpenAI	✅	✅	✅	✅	✅	✅
Google	✅	✅	✅	✅	❌	✅
Anthropic	✅	❌	✅	✅	✅	✅
Mistral	✅	✅	❌	❌	❌	✅
Cohere	✅	✅	✅	✅	❌	❌
Ollama	✅	❌	✅	✅	✅	✅
OpenRouter	✅	❌	✅	✅	❌	✅
Together	✅	❌	❌	✅	❌	✅

Legend:

Tools = multiToolCalls capability
Typed Output = typedOutput capability
Tools+Typed = typedOutputWithTools capability

Chat-Only Providers

Anthropic: No embeddings support
Ollama: No embeddings in native API (use OpenAI-compatible endpoint)
Together: No embeddings support

Limited Tool Support

Mistral: No tool calling support
Cohere: Cannot use typed output with tools simultaneously

Full-Featured Providers

OpenAI: Supports all capabilities
Google: Supports all capabilities except typedOutputWithTools

Summary

The unified provider architecture simplifies the dartantic_ai API while maintaining flexibility and extensibility. By consolidating chat and embeddings support into a single provider interface, the system becomes easier to use and understand while still supporting the full range of LLM capabilities across 15+ providers.

Key benefits:

Simplified API: One provider, multiple model types
Consistent Configuration: Same patterns across all providers
Clear Architecture: Well-defined separation of concerns
Capability Discovery: Easy to find and filter providers
Future-Proof: Ready for new model types and capabilities

Uh oh!

Unified Provider Architecture

Overview

Core Architecture

Architecture Overview

Provider Base Class

Model Kinds

Provider Capabilities

ProviderCaps Enum

Capability Declaration

Capability Querying

Important: Capabilities are Informational

Provider Registry

Separation of Concerns

Request Flow Diagram

1. Agent Layer (lib/src/agent/agent.dart)

2. Provider Layer (lib/src/providers/)

3. Model Layer (lib/src/chat_models/, lib/src/embeddings_models/)

Implementation Patterns

Provider Implementation Pattern

OpenAI-Compatible Pattern

Custom Provider Pattern

Usage Patterns

Agent Creation Patterns

Capability-Based Testing

Direct Model Access

Design Principles

1. Single Provider Interface

2. Deferred Validation Philosophy

3. Capability as Information

4. Clean Separation

5. Extensibility

Known Provider Limitations

Provider Capability Matrix

Chat-Only Providers

Limited Tool Support

Full-Featured Providers

Summary

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Core Architecture

Features

Configuration

Implementation

Clone this wiki locally