Agent Workflow (Reason–Act–Observe)

This project implements an intelligent agent that follows a Reason–Act–Observe (ReAct) loop. Instead of responding directly to user input, the agent dynamically decides whether to answer or to use external tools.

Workflow Overview

User Input
   ↓
Memory stores input
   ↓
Gemini receives prompt + tools
   ↓
Gemini decides:
   - respond directly
   - OR call a tool
   ↓
If tool is called:
   ToolRegistry executes tool
   ↓
Tool result returned to Gemini
   ↓
Gemini generates final response
   ↓
Memory stores the final response

Setup Instructions

Install dependencies:

pip install -r requirements.txt

requirements.txt includes:

requests Used to send HTTP requests to external web services or APIs. In this project, it is useful for tools that need to communicate with online services, such as weather APIs or other external endpoints.
- google-genai
  Official Google Gen AI SDK for accessing Gemini models from Python. In this project, it is used by the TranslatorTool to send text to the Gemini API and receive translated output.
tzdata. Provides time zone database support. This is required on some systems (especially Windows), where Python’s zoneinfo module depends on external timezone data.

Gemini API Key Setup (Windows PowerShell)

You must set the GEMINI_API_KEY environment variable before running the main program.

Set the API key permanently

Run this once in PowerShell:

setx GEMINI_API_KEY "YOUR_API_KEY"

Notes: "YOUR_API_KEY" can get Gemini API key from Google AI Studio

Then do all of the following

Close all VS Code windows.
Open VS Code again.
Open a new terminal.

Check whether the key is available:

echo $env:GEMINI_API_KEY

If the terminal prints your API key, then the key is now available for future terminals.

Running the CLI Application

To start the Personal Assistant Agent, run:

python main.py

If everything is set up correctly, you will see:

Personal Assistant Agent started. Type 'exit' to quit.
You:

This means the application is running and waiting for your input.

How to Interact with the Agent

You can type messages directly into the terminal after the You: prompt.

The agent will process your input and respond accordingly.

Example interaction

You: Hello
Agent: Hello! How can I help you today?

You: What is 15 * 9?
Agent: 15 * 9 is 135.

You: What time is it in Asia/Tokyo?
Agent: The current time in Tokyo is ...

You: Translate "How are you?" to Vietnamese.
Agent: Bạn khỏe không?

You: Read tests/notes.txt
Agent: <file contents here>

Supported Capabilities

The agent can automatically decide when to use tools:

Mathematical calculations → CalculatorTool
Time queries → TimeTool
Weather queries → WeatherTool
Translation → TranslatorTool
File reading → FileReaderTool

You do not need to call tools manually - just type natural language.

How to Exit the Application

To stop the application, type:

exit

Then press Enter.

Expected output:

You: exit
Agent: Goodbye!

Notes

The application runs in a continuous loop and will not stop automatically.
It only exits when the user types exit.
API rate limits may apply when using the Gemini API (free tier).

The program will then terminate and return to the terminal.

Step-by-Step Explanation

1. User Input

The process begins when the user sends a message to the agent.

user_input = input("You: ")
response = agent.handle_user_input(user_input)

2. Memory Storage

The agent stores the user message to maintain conversation context.

self.memory.add_user_message(user_input)

This allows the agent to handle multi-turn conversations.

3. Gemini Processing

The agent sends:

conversation history
available tool declarations to Gemini for reasoning.

response = self.model.generate_content(self.memory.get_history())

4. Decision Making (Reason)

Gemini decides between:

A. Direct Response

If no tool is needed:

User: Hello
→ Gemini: Hi! How can I help you?

B. Tool Invocation

If external information is needed:

User: What is 45 * 12?
→ Gemini decides to call calculator

5. Tool Execution (Act)

The agent uses the ToolRegistry to execute the tool:

tool_result = self.registry.execute_tool(tool_name, tool_args)

This design avoids hardcoded logic and follows the Registry Pattern.

6. Observation (Observe)

The tool result is sent back to Gemini as part of the conversation:

{
  "function_response": {
    "name": tool_name,
    "response": {"result": tool_result}
  }
}

This allows Gemini to continue reasoning.

7. Final Response Generation

Gemini is called again to generate the final natural-language answer:

final_response = self.model.generate_content(self.memory.get_history())

8. Memory Update

The final response is stored:

self.memory.add_model_message(final_text)

This ensures context is preserved for future interactions.

Why Gemini May Be Called Twice

When tools are used:

First call → decide whether to use a tool
Tool executes
Second call → generate final answer

This process may increase response time slightly when tools are involved.

Example: Weather Query

User:

What is the weather in Riga?

Flow:

Gemini decides to call weather(city="Riga")
Tool returns weather data
Gemini generates final answer using tool result

Running tests

This project includes simple manual test files for checking whether the implemented tools work correctly.

Manual Test: Run MemoryManager checks

python -m tests.test_memory

This test checks:

initial memory state (should be empty)
adding user messages
adding model messages
preserving conversation order
clearing memory correctly

Example output:

=== Test 1: Initial history ===
[]

=== Test 2: Add user and model messages ===
[{'role': 'user', 'parts': ['Hello']}, {'role': 'model', 'parts': ['Hi there!']}, {'role': 'user', 'parts': ['What is my name?']}, {'role': 'model', 'parts': ['I do not know your name yet.']}]

=== Test 3: Clear memory ===
[]

Notes:

The conversation history is stored as a list of message objects.
Each message follows the format: {"role": "...", "parts": ["..."]}.
The order of messages must be preserved.
The goal of this test is to verify correct memory behavior inside the agent.

Manual Test: Run ToolRegistry checks

python -m tests.test_registry

This test checks:

registering tools correctly
retrieving tools by name
executing tools by name
handling non-existent tools

Example output:

=== Test 1: Register tools ===
Registered calculator and time tools.

=== Test 2: Get tool by name ===
calculator -> <CalculatorTool object>
time -> <TimeTool object>
weather -> None

=== Test 3: Execute tool by name === 
15 
Current time in Europe/Riga: <dynamic>

=== Test 4: Execute non-existent tool ===
Error: requested tool 'weather' not found.

Notes:

The object memory addresses (e.g., <CalculatorTool object at ...>) may differ each run.
The time value is dynamic and will change depending on when the test is executed.
The goal of this test is to verify correct tool management behavior inside the registry.

Manual Test: Run all basic tool checks

python -m tests.test_tools

This test runs sample checks for:

CalculatorTool
FileReaderTool
TimeTool
WeatherTool
TranslatorTool

Example output:

Contents of 'tests/notes.txt': 
Hello, this txt is for test_tools.

Current time in Europe/Riga: <dynamic>

Current weather in Riga, Latvia: <dynamic>

Xin chào

Note: the time and weather values may change depending on when the test is executed.

Manual Test: Run calculator-only checks

python -m tests.test_calculator_tool

This test checks:

valid expression handling
missing argument handling
empty expression handling
invalid mathematical expression handling

Example output:

Error: invalid arguments for calculator.

Error: invalid arguments for calculator.

Calculator error: invalid syntax (<string>, line 1) - invalid mathematical expression.

Manual Test: Run file reader checks

python -m tests.test_file_reader_tool

This test checks:

reading a valid text file
handling non-existing file paths
handling empty file path input
handling invalid data types
rejecting unsupported file extensions

Example output:

Contents of 'tests/notes.txt': 
Hello, this txt is for test_tools.

Error: file 'tests/abcxyz.txt' does not exist.

Error: invalid arguments for file reader.

Error: invalid arguments for file reader.

Error: unsupported file type. Supported types are .csv, .json, .md, .txt.

Note: make sure the file tests/notes.txt exists and contains sample text.

Manual Test: Run time tool checks

python -m tests.test_time_tool

This test checks:

valid timezone handling
default timezone behavior (UTC)
handling of invalid timezone input

Example output:

Current time in Europe/Riga: <dynamic>

Current time in UTC: <dynamic>

Time error: unknown timezone 'Mars/Unknown'. Make sure 'tzdata' is installed (pip install tzdata).

Note: the time values will change depending on when the test is executed.

Manual Test: Run weather tool checks

python -m tests.test_weather_tool

This test checks:

valid city input (e.g., "Riga")
handling of empty city input
handling of non-existent or invalid city names

Example output:

Current weather in Riga, Latvia: <dynamic>

Weather error: 'city' is required.

Weather error: could not find location 'asdkjasdkjasd'.

Note: the weather values (temperature, wind speed, and description) may change depending on when the test is executed and API responses.

Manual Test: Run TranslatorTool test

python -m tests.test_translator_tool

Test cases

This test checks:

Valid translation (English → Vietnamese)
Missing target language
Empty text input
English → Latvian translation
Latvian → English translation

Example output:

=== Test 1: Valid translation (English to Vietnamese) === 
Chào buổi sáng, hôm nay bạn thế nào?

=== Test 2: Missing target language ===
Translation error: 'target_lang' is required.

=== Test 3: Empty text ===
Translation error: 'text' is required.

=== Test 4: English to Latvian ===
Labrīt

=== Test 5: Latvian to English ===
Good morning

Notes: The translation results may vary depending on the external API. Minor differences in translated text are expected. The main goal is to verify:

The tool runs without errors
Proper error handling is implemented**
Valid translations return non-empty results

Manual Test: Run the full agent flow

python -m tests.test_agent

This manual test checks whether the complete assistant architecture works together, including:

Agent initialization
MemoryManager integration
ToolRegistry integration
Gemini direct response generation
tool calling through the registered tools
final response generation after tool execution

The test runs the following example prompts:

Hello
What is 12 * 8 + 1?
What time is it in Europe/Riga?
Translate 'Good night' to Vietnamese.
Read tests/notes.txt

Example output may include:

a direct greeting response
the result of a calculator query
the current time in a requested timezone
a translation result
the contents of a local text file

Note:

The exact responses may vary depending on the model output.
Response speed may be slower than individual tool tests because the full agent may perform multiple model calls and tool-calling steps.

Key Design Concepts

ReAct Pattern (Reason–Act–Observe)
Tool-based architecture
Registry Pattern (ToolRegistry)
Memory-based context handling
Separation of concerns

Summary

Input → Memory → Gemini → Tool? → Tool Execution → Gemini → Final Answer → Memory

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
__pycache__		__pycache__
agent		agent
tests		tests
tools		tools
utils		utils
README.md		README.md
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Agent Workflow (Reason–Act–Observe)

Workflow Overview

Setup Instructions

Install dependencies:

Gemini API Key Setup (Windows PowerShell)

Set the API key permanently

Running the CLI Application

How to Interact with the Agent

Example interaction

Supported Capabilities

How to Exit the Application

Step-by-Step Explanation

1. User Input

2. Memory Storage

3. Gemini Processing

4. Decision Making (Reason)

A. Direct Response

B. Tool Invocation

5. Tool Execution (Act)

6. Observation (Observe)

7. Final Response Generation

8. Memory Update

Why Gemini May Be Called Twice

Example: Weather Query

Running tests

Manual Test: Run MemoryManager checks

Manual Test: Run ToolRegistry checks

Manual Test: Run all basic tool checks

Manual Test: Run calculator-only checks

Manual Test: Run file reader checks

Manual Test: Run time tool checks

Manual Test: Run weather tool checks

Manual Test: Run TranslatorTool test

Manual Test: Run the full agent flow

Key Design Concepts

Summary

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages