Multi-Agent Homework System

This repository showcases a Python-based multi-agent system that uses multiple specialized agents to process and complete homework assignments. The system breaks down assignments into subtasks and delegates them to specialized agents for code generation, Google Slides presentation generation, and voiceover audio generation for presentation via ElevenLabs.

Two versions of the multi-agent system were built. Version 1 (LlamaIndex) is modeled on LlamaIndex's multi-agent example. It follows a strict workflow in which each agent takes its turn performing its individual task. The downside of this strict workflow is the lack of user interaction and the lack of room for the agents to deviate or act autonomously.
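The strict, turn-by-turn workflow of Version 1 can be pictured as a fixed list of steps run in order. The following is a minimal sketch of that idea only, not the repository's LlamaIndex code: `PipelineState`, the three step functions, and `run_pipeline` are hypothetical names, and each step is a stub standing in for a real agent.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class PipelineState:
    """Shared state handed from one step to the next (hypothetical)."""
    assignment: str
    artifacts: dict[str, str] = field(default_factory=dict)


# Each function is a stub standing in for one agent; real agents would call an LLM.
def code_step(state: PipelineState) -> PipelineState:
    state.artifacts["code"] = f"code for: {state.assignment}"
    return state


def presentation_step(state: PipelineState) -> PipelineState:
    # The presentation step depends on the output of the code step.
    state.artifacts["slides"] = f"slides about: {state.artifacts['code']}"
    return state


def voiceover_step(state: PipelineState) -> PipelineState:
    state.artifacts["audio"] = f"narration for: {state.artifacts['slides']}"
    return state


STEPS: list[Callable[[PipelineState], PipelineState]] = [
    code_step,
    presentation_step,
    voiceover_step,
]


def run_pipeline(assignment: str, steps=STEPS) -> PipelineState:
    """Run every step exactly once, in a fixed order, with no branching."""
    state = PipelineState(assignment)
    for step in steps:
        state = step(state)
    return state
```

Because the order is hard-coded, no step can ask the user a question or hand work back to an earlier step, which is the limitation noted above.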

Version 2 (AutoGen) is built using Microsoft's AutoGen framework. The goal is to give the agents more autonomy. This worked to some extent, but we ran into many issues (as you can see in the demo below), such as rate limiting and malformed or failed API calls. However, AutoGen did seem to draw more creativity from the agents, and they seemed better able to resolve the errors they faced, both on their own and with the help of the group chat.

Both versions use the following OpenAI LLMs:

- GPT-4o
- GPT-4.1 nano
- GPT-4o mini

This project serves as a proof of concept (POC) that a multi-agent system can take a coding-related task and break it down into subtasks to create a full presentation with a slide show and voiceover. While we used homework assignments as the basis of this POC, such a system could also help automate the creation of sample integration walkthroughs or other small MVP projects in industry roles such as solution architect or implementation engineer. Generating not only the code but also a presentation and voiceover audio is a powerful combination for conveying information to new audiences, and it opens new opportunities for knowledge sharing and collaboration.

Features

Command-line interface for initializing the multi-agent system
Multiple specialized agents:
- Docker Code Agent: Generates Python code implementations within a Docker environment for full isolation of the coding agent; uploads completed code to GitHub via GitHub MCP Server
- Documentation Agent (optional): Creates comprehensive documentation
- Presentation Agent: Generates Google Slides presentation via Google Slides MCP Server
- Voiceover Agent: Generates script and audio file for presentation via ElevenLabs MCP Server

Installation

Clone the repository:

git clone <repository-url>
cd <repository>

Create and activate a virtual environment:

python3.12 -m venv genai-mas-venv
source genai-mas-venv/bin/activate  # On Windows: genai-mas-venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Version 1 Usage

Place your assignment description in a text file within the assignment_vault directory.
Run the orchestrator with your assignment file:

python orchestrator_merged.py --assignment <assignment file>
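For readers curious how a `--assignment` flag like the one above might be handled, here is a minimal sketch using `argparse`. It is an assumption about the general shape, not the contents of `orchestrator_merged.py`: `parse_args` and `resolve_assignment` are hypothetical helpers, and resolving a bare file name against `assignment_vault/` is a guess based on the usage steps.

```python
import argparse
from pathlib import Path

# Directory named in the usage steps; treating it as the default lookup
# location is an assumption made for this sketch.
VAULT = Path("assignment_vault")


def parse_args(argv=None) -> argparse.Namespace:
    """Parse the command line; argv=None means 'use sys.argv'."""
    parser = argparse.ArgumentParser(
        description="Run the multi-agent system on one assignment file."
    )
    parser.add_argument(
        "--assignment",
        required=True,
        help="assignment text file, e.g. project_3.txt",
    )
    return parser.parse_args(argv)


def resolve_assignment(name: str, vault: Path = VAULT) -> Path:
    """Look up bare file names in the vault; keep explicit paths as given."""
    path = Path(name)
    if path.parent == Path("."):
        return vault / path
    return path
```

With this sketch, `resolve_assignment(parse_args(["--assignment", "project_3.txt"]).assignment)` yields a path inside `assignment_vault`, while a path that already names a directory is left untouched.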

Three outputs are generated:
- a repository under the GitHub user robot-coder
- a Google Slides presentation in robot-coder's Google Drive account
- a voiceover MP3 file in output/audio

Version 2 Usage

Place your assignment description in a text file within the assignment_vault directory.
Run the AutoGen execution file with your assignment file:

python local_autogen/main.py --assignment assignment_vault/<assignment file>

Project Structure

GenAI-Multi-Agent-System-Demo/
├── assignment_vault/               # directory for assignment text files
├── orchestrator_merged.py          # Main coordination logic
├── agents/                         # Specialized agents
│   ├── __init__.py
│   ├── documentation_agent.py      # Creates documentation
│   ├── presentation_agent.py       # Prepares presentations
│   └── voiceover_agent.py          # Generates script and audio file
├── sandbox/                        # Directory is mapped as volume into Docker environment
│   ├── docker_code_agent.py        # coding agent
│   ├── Dockerfile                  # Docker file for Docker environment
│   ├── instructions                # Assignment instructions are copied here
│   └── run_docker_agent.sh         # Runs the coding agent in a docker container
├── output/audio                    # Directory where audio files are generated
├── requirements.txt
├── PRESENTATION.md
├── .env.example
└── README.md

Agent Capabilities

Code Agent

Analyzes coding requirements
Generates and executes Python code in a sandboxed environment
Uploads completed code to a GitHub repository
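The sandboxing idea (a host directory mapped as a volume into a container that runs the coding agent) can be sketched as follows. This is not the contents of `run_docker_agent.sh`: the image name `code-agent-sandbox`, the mount point `/sandbox`, and the helper `build_docker_command` are assumptions for illustration. Only standard `docker run` flags (`--rm`, `-v`, `-w`) are used, and the function only builds the command rather than running it.

```python
from pathlib import Path


def build_docker_command(
    sandbox_dir: str,
    image: str = "code-agent-sandbox",  # hypothetical image name
    mount_point: str = "/sandbox",      # hypothetical path inside the container
) -> list[str]:
    """Build a `docker run` command that mounts the sandbox directory."""
    # Docker bind mounts need an absolute host path.
    host_dir = Path(sandbox_dir).resolve()
    return [
        "docker", "run",
        "--rm",                             # remove the container when it exits
        "-v", f"{host_dir}:{mount_point}",  # map the host sandbox into the container
        "-w", mount_point,                  # run from the mounted directory
        image,
        "python", "docker_code_agent.py",
    ]
```

Passing the resulting list to `subprocess.run(cmd, check=True)` would start the container, assuming Docker is installed and the image has been built. Keeping the agent's file access limited to the mounted directory is what provides the isolation.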

Documentation Agent

Generates comprehensive documentation for a given repository
Creates structured Markdown content
Includes metadata and formatting

Presentation Agent

Creates Google Slides presentation
Generates slide content and speaker notes
Provides estimated duration and structure
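One simple way to arrive at an estimated duration is to count the words in the speaker notes and divide by a speaking rate. The sketch below illustrates that approach under stated assumptions; it is not taken from `presentation_agent.py`. The function name `estimate_duration_seconds` and the default rate of 150 words per minute are both assumptions.

```python
def estimate_duration_seconds(
    speaker_notes: list[str],
    words_per_minute: int = 150,  # assumed speaking rate, adjust to taste
) -> float:
    """Estimate narration time from the word count of all speaker notes."""
    if words_per_minute <= 0:
        raise ValueError("words_per_minute must be positive")
    # Whitespace splitting is a rough but serviceable word count.
    total_words = sum(len(notes.split()) for notes in speaker_notes)
    return total_words * 60 / words_per_minute
```

For example, notes totalling 150 words at the default rate give an estimate of 60 seconds. A real agent might instead ask the LLM for an estimate or measure the generated audio file.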

Voiceover Agent

Creates audio file for presentation and code using ElevenLabs
Outputs audio file locally

Future Enhancements

Dynamic agent creation
- Create specialized agents based on task requirements
- Support additional programming languages
More user interaction
- Implement chat-based behavior prior to initiating workflows so the user can further customize and define behavior
Presentation and voiceover integration
- Enable automated presentation with voiceover via Playwright or alternative solutions
Interactive voice agent for presentations, with real-time user-agent interaction
- Enable users to ask the voice agent questions in real time and engage in conversation about the produced output

Demo

Version 1 (LlamaIndex)

The following link shows how Version 1 of this multi-agent system works. The command python orchestrator_merged.py --assignment project_3.txt is run, and it kicks off all the necessary agents to complete the task one by one, in order. The GitHub repository, the Google Slides presentation (minus the title slide), and the voiceover recording are all generated automatically.

Version 2 (AutoGen)

2025-05-10.20-37-52.mp4

License

This project is licensed under the MIT License - see the LICENSE file for details.
