The world's first all-seeing, all-hearing, (almost) all-doing AI agent
The only AI agent with Claude Code living in its brain. A cross-platform, multimodal agent that sees, hears, thinks, speaks, and acts on your machine. Free & open source.
Seven integrated subsystems working in concert — perception, cognition, voice, action, knowledge, memory, and collaboration.
HAL 9000 Control Panel — live dashboard
Webcam vision with HUD overlay — scanlines, corner brackets, real-time analysis. HAL observes your environment.
Click-to-speak mic with live waveform visualization, browser-native recording, silence detection, and Whisper transcription.
Multi-provider LLM with real-time token streaming — GPT-4o, Claude, Gemini, or Ollama (free). Parallel tool execution, 43-tool agent layer.
Chunked sentence-level TTS — HAL speaks as it thinks. Three engines: Edge TTS (free), ElevenLabs (premium), XTTS (local cloning).
43 OS-level tools — shell commands, file operations, app control, web search, clipboard, screenshots, AppleScript automation.
Drag-and-drop PDFs, docs, code, images — HAL extracts, chunks, and indexes them locally. BM25 search recalls knowledge on demand.
Background task runner, artifact workspace, multi-agent orchestration, and cross-agent context handoff with Claude Code.
Knowledge upload system, drag-and-drop learning, chat cursor animations, and a cleaner codebase.
Drag-and-drop files onto the chat window or use the Knowledge panel. HAL extracts text from PDFs, Word docs, spreadsheets, code, and images.
Large files are chunked with overlap and indexed using BM25 — a probabilistic ranking function. HAL recalls the most relevant passages on demand.
Choose per file: Deep Read chunks and indexes for full recall. Skim loads small files directly into context for instant access. You decide.
Collapsible sidebar listing all uploaded files with type, size, mode, and storage usage. Delete files, upload new ones — all from the dashboard.
HAL's responses show a blinking terminal cursor that pulses as text appears, then fades out — giving the chat a live, cinematic feel.
Rotating hints at the bottom of chat teach you features as you use HAL — slash commands, voice input, knowledge uploads, and more.
Seven subsystems connected through a multi-provider brain with a 43-tool agent layer and local knowledge index.
HAL coordinates work across itself, Claude Code CLI, and Claude Desktop — with shared memory, background tasks, and multi-agent orchestration.
Memories are categorized — facts, decisions, preferences, session summaries. When a Claude Code session ends, context is distilled and pushed to HAL's persistent store. Next session loads where you left off.
Submit long-running coding tasks that execute asynchronously via Claude Code. Real-time progress, configurable concurrency, and cancellation. HAL keeps working while tasks run.
HAL generates interactive artifacts — code, Mermaid diagrams, rendered HTML — in a tabbed workspace panel alongside the chat. Copy, close, or iterate vocally.
Spawn multiple Claude Code agents on parallel tasks — one does frontend, another backend. HAL monitors all agents and detects file conflicts when they overlap.
Run HAL entirely for free with local AI. One toggle in your .env file switches everything to open-source alternatives.
brew install ollamacurl -fsSL https://ollama.com/install.sh | shollama pull llama3.1echo "FREE_MODE=true" > .envpython server.py| Layer | Free Provider | Paid Alternative |
|---|---|---|
| Brain (LLM) | Ollama — Llama 3.1, Mistral, Phi-3 | GPT-4o, Claude, Gemini |
| Speech-to-Text | faster-whisper (local) | OpenAI Whisper API |
| Text-to-Speech | Edge TTS (always free) | ElevenLabs, XTTS |
One codebase, auto-detects your OS. No #ifdef, no separate builds — HAL uses the right system commands on every platform.
| Feature | macOS | Windows | Linux |
|---|---|---|---|
| Volume | AppleScript | PowerShell | pactl |
| Notifications | osascript | Toast API | notify-send |
| Clipboard | pbcopy | Get-Clipboard | xclip |
| Screenshot | screencapture | PIL.ImageGrab | scrot / grim |
| Apps | open -a | Start Menu | .desktop files |
| Terminal | Terminal.app | Windows Terminal | gnome-terminal |
| Embedded Terminal | ✓ xterm.js + PTY | External only | ✓ xterm.js + PTY |
Register HAL as an MCP server — Claude Code gains access to webcam, voice, macOS control, and shared memory.
Two tiers — run HAL for free with local AI, or connect to cloud providers for premium quality.
| OS | macOS 12+ / Windows 10+ / Ubuntu 20.04+ |
| CPU | 4 cores (i5 / M1 / Ryzen 5+) |
| RAM | 8 GB minimum |
| Disk | 6 GB free (models + HAL) |
| Python | 3.10+ |
| Network | Not required (fully offline) |
| Mic | Required (built-in or USB) |
| Webcam | Optional (vision features only) |
Ollama llama3.1 8B uses ~5 GB RAM. Use phi3 for lighter footprint (~3 GB).
| OS | macOS 12+ / Windows 10+ / Ubuntu 20.04+ |
| CPU | 2 cores (any modern CPU) |
| RAM | 4 GB |
| Disk | 500 MB free |
| Python | 3.10+ |
| Network | Required (API calls) |
| API Key | OpenAI, Anthropic, or Gemini |
Lower resource usage — models run in the cloud. Faster responses.
| Mode | Brain | STT | TTS | RAM |
|---|---|---|---|---|
| Free (llama3.1) | 2-5s CPU / 1-2s Apple Silicon | 1-3s | 0.7s | ~6 GB |
| Free (phi3) | 1-2s CPU | 1-3s | 0.7s | ~3 GB |
| Paid (GPT-4o) | 1-2s API | 0.5s | 0.7s | ~200 MB |
| Paid (Claude) | 1-3s API | 0.5s | 0.7-1.2s | ~200 MB |
Apple Silicon (M1-M4) runs Ollama 2-3x faster via Metal. NVIDIA GPUs (6 GB+ VRAM) get similar acceleration via CUDA.
HAL runs on macOS, Windows, and Linux. Pick your platform — same features everywhere.
Python 3.10+, Homebrew (recommended). For free mode: Ollama. For paid mode: an OpenAI API key.
brew install python@3.12brew install ollama && ollama pull llama3.1git clone https://github.com/shandar/HAL9000.gitcd HAL9000python3 -m venv .venvsource .venv/bin/activatepip install -r requirements.txtecho "FREE_MODE=true" > .envcp .env.example .envnano .env| Key | Required | Purpose |
|---|---|---|
| FREE_MODE=true | Free | Skip all keys — uses Ollama + faster-whisper |
| OPENAI_API_KEY | Paid | GPT-4o brain + Whisper STT |
| ANTHROPIC_API_KEY | Paid | Claude as brain |
| GEMINI_API_KEY | Paid | Gemini as brain |
python server.pyopen http://localhost:9000Click the power button to activate HAL. Grant microphone + camera permissions when prompted.
Requires Claude Code installed on your machine.
claude mcp add hal-9000 -- python /path/to/HAL9000/hal_mcp_server.pyPython 3.10+ from python.org (check "Add to PATH" during install). For free mode: Ollama for Windows.
python --versionollama pull llama3.1pip install pipwinpipwin install pyaudiogit clone https://github.com/shandar/HAL9000.gitcd HAL9000python -m venv .venv.venv\Scripts\activatepip install -r requirements.txtNote: If pyaudio fails, install the Visual C++ Build Tools or use pipwin install pyaudio.
echo FREE_MODE=true > .envcopy .env.example .envnotepad .envpython server.pyOpen http://localhost:9000 in your browser. Click power button to activate.
Windows may prompt for firewall access — allow it for localhost only.
Note: The embedded terminal (xterm.js) is not available on Windows (no PTY support). Use an external terminal instead.
Requires Claude Code installed on your machine.
claude mcp add hal-9000 -- python \path\to\HAL9000\hal_mcp_server.pyPython 3.10+, system audio libraries, and optionally Ollama for free mode.
sudo apt install python3 python3-venv python3-pip portaudio19-devsudo dnf install python3 python3-pip portaudio-develcurl -fsSL https://ollama.com/install.sh | shollama pull llama3.1sudo apt install libnotify-bin xclip scrotgit clone https://github.com/shandar/HAL9000.gitcd HAL9000python3 -m venv .venvsource .venv/bin/activatepip install -r requirements.txtecho "FREE_MODE=true" > .envcp .env.example .envnano .envpython server.pyxdg-open http://localhost:9000Click the power button to activate HAL. Ensure PulseAudio/PipeWire is running for audio.
Requires Claude Code installed on your machine.
claude mcp add hal-9000 -- python /path/to/HAL9000/hal_mcp_server.pyInteract via voice or text. HAL responds with speech and can control your system, run commands, search the web, manage files, and delegate tasks to Claude Code.
Explore real conversation flows, tool chains, and workflows. Click each scenario to see HAL in action — with animated step-by-step breakdowns.
python server.py.
Ask HAL to launch any installed app with AppleScript automation.
HAL reads, writes, searches, and manages files on your Mac.
Volume, brightness, clipboard, notifications, Wi-Fi, battery status.
Search the web and fetch page content in real time.
Run whitelisted shell commands with security guardrails.
Ask about your webcam, screen, or surroundings.
frontend and backend. Both are working. I'll flag any file conflicts.api.py that was resolved.Pick a capability — watch HAL process real requests from input through tool execution to response.
This covers only the basics. Download and run HAL to experience the full magic.
Nothing is recorded, stored, or sent. No telemetry, no tracking, no cloud. Vision and voice can each be toggled off independently. Mic is click-only — never listens without your action.
Ollama + faster-whisper + Edge TTS. Zero API keys, zero cost. Paid providers optional for premium quality.
macOS, Windows, Linux — one codebase. Auto-detects your OS and uses native system commands.
Not just an assistant — an operations platform. Background tasks, multi-agent orchestration, shared memory with Claude Code.
MIT licensed. Fork it, extend it, make it yours. Every line of code is readable and documented.
Shell, files, apps, web search, clipboard, screenshots, memory, Claude Code delegation — all from voice or chat.
The full agent — 43 tools, 4 LLM providers, cross-platform — is free and open source. Pro tiers unlock power features for serious builders.
Same codebase, same repo. Pro is unlocked with a license key in your .env.
Keys are offline-verified — no phone-home, no telemetry, no tracking.
Clone, install, run. HAL handles the rest.
git clone https://github.com/shandar/HAL9000.git && cd HAL9000 && pip install -r requirements.txt
Click to copy
Learn to build AI agents, multimodal systems, and production-grade tools from scratch — the same architecture behind HAL 9000.
Join the Bootcamp →By Affordance Design — Design Engineering Studio