
Interactive Model Selection, Custom Prompts, and Expanded Model Support #247


Open
wants to merge 49 commits into main

Conversation


@malah-code commented Jul 7, 2025

Version v2.0.15 (Latest) Release Summary

New Features:

  • Centralized Model Management: All model configurations are now managed in a single file (operate/models/model_configs.py), making it easier to add, remove, and manage models; a sketch of what such a registry might look like follows this list.
  • Expanded Ollama Model Support: Added support for qwen2.5vl:3b and gemma3:4b.
  • Enhanced Debugging: Added a -d flag (alias for --verbose) that provides detailed debugging information, including the full prompt sent to the AI and the raw response received.
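
For illustration, a centralized registry of this kind might look like the minimal sketch below. The file path matches the PR (operate/models/model_configs.py), but the dictionary layout, field names, and helper function are assumptions rather than the PR's actual code.

```python
# Hypothetical layout for operate/models/model_configs.py.
# Field names ("provider", "supports_vision", "default_host") are illustrative
# assumptions, not the schema this PR actually ships.

MODEL_CONFIGS = {
    "gpt-4.1": {"provider": "openai", "supports_vision": True},
    "qwen2.5vl:3b": {
        "provider": "ollama",
        "supports_vision": True,
        "default_host": "http://localhost:11434",  # assumed Ollama default
    },
    "gemma3:4b": {
        "provider": "ollama",
        "supports_vision": True,
        "default_host": "http://localhost:11434",
    },
}


def get_model_config(name: str) -> dict:
    """Look up a model's configuration, failing with a clear message for unknown names."""
    try:
        return MODEL_CONFIGS[name]
    except KeyError:
        known = ", ".join(sorted(MODEL_CONFIGS))
        raise ValueError(f"Unknown model '{name}'. Available models: {known}") from None
```

Keeping every model behind one lookup like this is what lets the selection screen and provider dispatch stay consistent whenever a model is added or removed.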

Improvements:

  • Improved System Prompt: The system prompt now uses a more structured format, explicit JSON schema definitions, and clear examples to improve model accuracy and reliability; an illustrative schema is sketched below.
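
To make the "explicit JSON schema" point concrete, the kind of action schema such a prompt spells out might resemble the sketch below; the operation names and fields are illustrative assumptions, not the PR's exact prompt text.

```python
# Illustrative (assumed) excerpt of a structured system prompt. The real prompt
# in this PR may use different operation names, fields, or examples.
ACTION_SCHEMA_EXAMPLE = """
Respond ONLY with a JSON array of actions. Each action must be one of:

  {"operation": "click", "x": "0.50", "y": "0.25"}      # coordinates as fractions of screen size
  {"operation": "write", "content": "hello world"}
  {"operation": "press", "keys": ["command", "space"]}
  {"operation": "done", "summary": "what was accomplished"}

Do not include any text outside the JSON array.
"""
```

Spelling out the schema alongside worked examples leaves smaller local models less room to return malformed or free-form output.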

Bug Fixes:

  • Fixed an issue where the model selection screen was not correctly displaying all available models.
  • Resolved an IndentationError in the model configuration file.

Using the same inputs and outputs as a human operator, the model views the screen and decides on a series of mouse and keyboard actions to reach an objective. Released in November 2023, the Self-Operating Computer Framework was one of the first examples of using a multimodal model to view the screen and operate a computer.
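
As a rough illustration of that loop, the control flow looks something like the sketch below; capture_screen, ask_model, and perform_action are stand-in stubs, not the project's actual API.

```python
# Minimal sketch of the screenshot -> model -> action loop the framework implements.
# The three helpers are stubs that only illustrate the control flow.
import time
from typing import Any


def capture_screen() -> bytes:
    """Stub: the real framework returns a screenshot image here."""
    return b""


def ask_model(model: str, objective: str, screenshot: bytes) -> list[dict[str, Any]]:
    """Stub: the real framework sends the prompt and screenshot to the selected model."""
    return [{"operation": "done", "summary": "stub"}]


def perform_action(action: dict[str, Any]) -> None:
    """Stub: the real framework executes a mouse or keyboard action."""
    print("would perform:", action)


def operate(objective: str, model: str, max_steps: int = 10) -> None:
    """Repeatedly show the screen to the model and execute the actions it returns."""
    for _ in range(max_steps):
        actions = ask_model(model, objective, capture_screen())
        for action in actions:
            if action["operation"] == "done":
                print(action.get("summary", "Objective reached."))
                return
            perform_action(action)
        time.sleep(1)  # let the UI settle before taking the next screenshot
```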

Key Features

  • Compatibility: Designed for various multimodal models.
  • Expanded Model Support: Now integrated with the latest OpenAI o3, o4-mini, GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano models, Gemini 2.5 Pro and Gemini 2.5 Flash, and the Gemma 3n models (including the e2b and e4b variants) and Gemma 3:12b, alongside existing support for GPT-4o, Claude 3, Qwen-VL, and LLaVa.
  • Enhanced Ollama Integration: Improved handling for Ollama models, including default host configuration and more informative error messages; a sketch of the host resolution appears after this list.
  • Future Plans: Support for additional models.
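
For the Ollama default-host handling mentioned above, one hedged way to resolve the host is sketched below; the environment variable and fallback value follow common Ollama conventions, but the PR's actual implementation may differ.

```python
# Assumed approach to resolving the Ollama host; not necessarily this PR's exact code.
import os

DEFAULT_OLLAMA_HOST = "http://localhost:11434"  # Ollama's standard local endpoint


def resolve_ollama_host() -> str:
    """Use an explicit OLLAMA_HOST environment variable if set, else the local default."""
    return os.environ.get("OLLAMA_HOST", DEFAULT_OLLAMA_HOST)
```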

telmalah and others added 30 commits July 3, 2025 15:43
@malah-code closed this Jul 7, 2025
@malah-code reopened this Jul 7, 2025