Generative Computer is an interactive desktop inspired by Claude Imagine and built on top of Gemini CLI. You can talk to the computer to generate any content you want.
This is a proof of concept which will be iterated quickly as the project develops.
A generative computer runs an AI agent on the backend that generates code to render what the user requests in real time.
- You type a request in the computer (rendered in the browser).
- The backend forwards the request to the Gemini agent.
- Gemini edits
GeneratedContent.tsx
(and only that file right now). - Vite hot-reloads, so the desktop windows morph in real time.
Upstream Gemini CLI powers this project; notable capabilities include:
- π― Free tier: 60 requests/min and 1,000 requests/day with personal Google account.
- π§ Powerful Gemini 2.5 Pro: Access to 1M token context window.
- π§ Built-in tools: Google Search grounding, file operations, shell commands, web fetching.
- π Extensible: MCP (Model Context Protocol) support for custom integrations.
- π» Terminal-first: Designed for developers who live in the command line.
- π‘οΈ Open source: Apache 2.0 licensed.
- Node.js 20 or newer
- npm 9+
- Gemini CLI credentials on the machine (OAuth login, API key, or Vertex AI)
Clone and boot the desktop:
git clone https://github.com/joshbickett/generative-computer
cd generative-computer
./computer
The helper installs dependencies, builds the Gemini CLI bundle, checks authentication, and launches both backend and frontend. If cached credentials are missing, it guides you through login before continuing. Future runs are the sameβjust type ./computer
from the repo root.
Prefer a global command? Run npm link
(or symlink computer
somewhere on your $PATH
) to start the Generative Computer from any directory with a plain computer
.
./computer
The orchestrator will:
- Double-check Gemini authentication (and reopen
npm start
if credentials are missing) - Install dependencies for the repo, backend, and frontend when
node_modules/
folders are absent - Build the Gemini CLI bundle if
bundle/gemini.js
is missing - Start the backend on
http://localhost:3001
withUSE_REAL_AGENT=true
- Start the Vite dev server on
http://localhost:5173
Press Ctrl+C
to shut everything down.
DEBUG_AGENT=true ./computer
β surface Gemini stdout/stderr and write transcripts tologs/agent/
generative-computer/
βββ runtime/
β βββ AGENTS.md # Overview of the agent stack and workflows
β βββ frontend/
β β βββ src/components/
β β β βββ CommandInput.tsx # Always-on prompt box (protected from edits)
β β β βββ Desktop.tsx # Desktop layout and window manager
β β β βββ Window.tsx # Draggable window wrapper
β β β βββ GeneratedContent.tsx # File Gemini rewrites on every command
β β βββ package.json
β βββ backend/
β βββ server.js # Express API (POST /api/command)
β βββ gemini-agent.js # Spawns the Gemini CLI bundle with guardrails
β βββ smart-simulator.js # Deterministic fallback content generator
βββ bundle/ # Output of `npm run build` (gemini.js lives here)
βββ computer # Thin wrapper that forwards to start.sh
βββ start.sh # Launch script for auth + both services
βββ logs/agent/ # Populated when DEBUG_AGENT=true
The nested ./runtime/
folder keeps the frontend and backend agents grouped together so the top-level ./computer
launcher can keep its original name.
π Follow me on Twitter for updates on the project: Follow me on Twitter
- Missing bundle β run
npm run build
ifbundle/gemini.js
is absent before calling./computer
(ornpm start
). - Authentication loop β rerun
npm start
and choose βLogin with Googleβ (or your preferred auth mode) to refresh credentials. - Ports 3001/5173 busy β stop conflicting processes;
./computer
attempts to free both ports automatically. - Node version warnings β switch to Node 20 (
nvm use 20
,fnm use 20
, etc.) to avoid runtime issues.
Enjoy building on the generative desktop! β¨
The sections below summarize the latest upstream Gemini CLI information for reference.
See Releases for more details.
New preview releases will be published each week at UTC 2359 on Tuesdays. These releases will not have been fully vetted and may contain regressions or other outstanding issues. Please help us test and install with preview
tag.
npm install -g @google/gemini-cli@preview
- New stable releases will be published each week at UTC 2000 on Tuesdays, this will be the full promotion of last week's
preview
release + any bug fixes and validations. Uselatest
tag.
npm install -g @google/gemini-cli@latest
- New releases will be published each week at UTC 0000 each day, This will be all changes from the main branch as represented at time of release. It should be assumed there are pending validations and issues. Use
nightly
tag.
npm install -g @google/gemini-cli@nightly
- Query and edit large codebases
- Generate new apps from PDFs, images, or sketches using multimodal capabilities
- Debug issues and troubleshoot with natural language
- Automate operational tasks like querying pull requests or handling complex rebases
- Use MCP servers to connect new capabilities, including media generation with Imagen, Veo or Lyria
- Run non-interactively in scripts for workflow automation
- Ground your queries with built-in Google Search for real-time information
- Conversation checkpointing to save and resume complex sessions
- Custom context files (GEMINI.md) to tailor behavior for your projects
Integrate Gemini CLI directly into your GitHub workflows with Gemini CLI GitHub Action:
- Pull Request Reviews: Automated code review with contextual feedback and suggestions
- Issue Triage: Automated labeling and prioritization of GitHub issues based on content analysis
- On-demand Assistance: Mention
@gemini-cli
in issues and pull requests for help with debugging, explanations, or task delegation - Custom Workflows: Build automated, scheduled and on-demand workflows tailored to your team's needs
Choose the authentication method that best fits your needs:
β¨ Best for: Individual developers as well as anyone who has a Gemini Code Assist License. (see quota limits and terms of service for details)
Benefits:
- Free tier: 60 requests/min and 1,000 requests/day
- Gemini 2.5 Pro with 1M token context window
- No API key management - just sign in with your Google account
- Automatic updates to latest models
Start Gemini CLI, then choose Login with Google and follow the browser authentication flow when prompted
gemini
If you are using a paid Code Assist License from your organization, remember to set the Google Cloud Project
# Set your Google Cloud Project
export GOOGLE_CLOUD_PROJECT="YOUR_PROJECT_ID"
gemini
β¨ Best for: Developers who need specific model control or paid tier access
Benefits:
- Free tier: 100 requests/day with Gemini 2.5 Pro
- Model selection: Choose specific Gemini models
- Usage-based billing: Upgrade for higher limits when needed
# Get your key from https://aistudio.google.com/apikey
export GEMINI_API_KEY="YOUR_API_KEY"
gemini
β¨ Best for: Enterprise teams and production workloads
Benefits:
- Enterprise features: Advanced security and compliance
- Scalable: Higher rate limits with billing account
- Integration: Works with existing Google Cloud infrastructure
# Get your key from Google Cloud Console
export GOOGLE_API_KEY="YOUR_API_KEY"
export GOOGLE_GENAI_USE_VERTEXAI=true
gemini
For Google Workspace accounts and other authentication methods, see the authentication guide.
gemini
gemini --include-directories ../lib,../docs
gemini -m gemini-2.5-flash
Get a simple text response:
gemini -p "Explain the architecture of this codebase"
For more advanced scripting, including how to parse JSON and handle errors, use the --output-format json
flag to get structured output:
gemini -p "Explain the architecture of this codebase" --output-format json
cd new-project/
gemini
> Write me a Discord bot that answers questions using a FAQ.md file I will provide
git clone https://github.com/google-gemini/gemini-cli
cd gemini-cli
gemini
> Give me a summary of all of the changes that went in yesterday
- Quickstart Guide - Get up and running quickly.
- Authentication Setup - Detailed auth configuration.
- Configuration Guide - Settings and customization.
- Keyboard Shortcuts - Productivity tips.
- Commands Reference - All slash commands (
/help
,/chat
, etc). - Custom Commands - Create your own reusable commands.
- Context Files (GEMINI.md) - Provide persistent context to Gemini CLI.
- Checkpointing - Save and resume conversations.
- Token Caching - Optimize token usage.
- Built-in Tools Overview
- MCP Server Integration - Extend with custom tools.
- Custom Extensions - Build and share your own commands.
- Headless Mode (Scripting) - Use Gemini CLI in automated workflows.
- Architecture Overview - How Gemini CLI works.
- IDE Integration - VS Code companion.
- Sandboxing & Security - Safe execution environments.
- Trusted Folders - Control execution policies by folder.
- Enterprise Guide - Deploy and manage in a corporate environment.
- Telemetry & Monitoring - Usage tracking.
- Tools API Development - Create custom tools.
- Troubleshooting Guide - Common issues and solutions.
- FAQ - Frequently asked questions.
- Use
/bug
command to report issues directly from the CLI.
Configure MCP servers in ~/.gemini/settings.json
to extend Gemini CLI with custom tools:
> @github List my open pull requests
> @slack Send a summary of today's commits to #dev channel
> @database Run a query to find inactive users
See the MCP Server Integration guide for setup instructions.
Gemini CLI is fully open source (Apache 2.0), and the community is encouraged to:
- Report bugs and suggest features.
- Improve documentation.
- Submit code improvements.
- Share MCP servers and extensions.
See the upstream Contributing Guide for development setup, coding standards, and how to submit pull requests.
Check the Official Roadmap for planned features and priorities.
- Official Roadmap - See what's coming next.
- Changelog - See recent notable updates.
- NPM Package - Package registry.
- GitHub Issues - Report bugs or request features.
- Security Advisories - Security updates.
See the Uninstall Guide for removal instructions.
- License: Apache License 2.0
- Terms of Service: Terms & Privacy
- Security: Security Policy
Built with β€οΈ by Google and the open source community