Bonobo Terminal pairs an xterm.js terminal emulator with a chat interface. It runs any .gguf model via ik_llama.cpp or llama.cpp, or talks to any OpenAI-compatible endpoint. Written in TypeScript on Bun. Linux-first; ships as a Docker Compose bundle.
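To make "OpenAI-compatible" concrete, here is a minimal sketch of the request shape such endpoints generally accept: a POST to `/v1/chat/completions` with a `model` and a `messages` array. The helper names `buildChatRequest` and `chat`, the base URL, and the model name are assumptions for illustration and are not taken from Bonobo Terminal's code.

```typescript
// Illustrative only: these names are hypothetical, not Bonobo Terminal's API.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Build (but do not send) a chat completion request for an OpenAI-compatible server.
function buildChatRequest(
  baseUrl: string,
  model: string,
  messages: ChatMessage[],
  apiKey?: string,
): ChatRequest {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  // Local servers often need no key; hosted endpoints usually expect a bearer token.
  if (apiKey) headers["Authorization"] = `Bearer ${apiKey}`;
  return {
    // Strip trailing slashes so the path is not doubled up.
    url: `${baseUrl.replace(/\/+$/, "")}/v1/chat/completions`,
    init: { method: "POST", headers, body: JSON.stringify({ model, messages }) },
  };
}

// Send the request and return the first choice's text (empty string if none).
async function chat(
  baseUrl: string,
  model: string,
  messages: ChatMessage[],
  apiKey?: string,
): Promise<string> {
  const { url, init } = buildChatRequest(baseUrl, model, messages, apiKey);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Request failed with HTTP ${res.status}`);
  const data = (await res.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0]?.message.content ?? "";
}
```

Under these assumptions, pointing the client at a local server versus a hosted one only changes `baseUrl` and `apiKey`, for example `chat("http://localhost:8080", "local-model", [{ role: "user", content: "hi" }])`, where the URL and model name are placeholders.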
Look around.
Chat, a live context graph, a knowledge base, projects, models, and skills, all in one workspace.
Architecture
How it works
Two moving parts do the heavy lifting on every turn: a decision flow that assembles the right prompt and tools for your message, and adaptive context that keeps the conversation lean.
Left untrimmed, the full message history, every tool result, and the recalled memories together overflow the token budget.
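One common way to keep a conversation within a token budget is to always keep system messages and then add the most recent items until the budget is spent. The sketch below illustrates that general idea only; it is not Bonobo Terminal's actual adaptive-context algorithm. The `ContextItem` type, the `fitToBudget` function, and the four-characters-per-token estimate are all assumptions made for this example.

```typescript
// Hypothetical data model, not Bonobo Terminal's actual types.
type Role = "system" | "user" | "assistant" | "tool";

interface ContextItem {
  role: Role;
  content: string;
}

// Rough heuristic (assumption): about 4 characters per token.
// A real tokenizer for the loaded model would replace this.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Keep every system message, then add the newest remaining items
// until the next one would exceed the budget.
function fitToBudget(items: ContextItem[], budget: number): ContextItem[] {
  const system = items.filter((item) => item.role === "system");
  let used = system.reduce((sum, item) => sum + estimateTokens(item.content), 0);

  const kept: ContextItem[] = [];
  // Walk from newest to oldest so recent turns win over old ones.
  for (const item of [...items].reverse()) {
    if (item.role === "system") continue;
    const cost = estimateTokens(item.content);
    if (used + cost > budget) break;
    kept.unshift(item); // restore chronological order
    used += cost;
  }
  return [...system, ...kept];
}
```

This version simply drops the oldest items once the budget is reached. A fuller approach might summarize dropped turns or rank recalled memories by relevance instead, which this sketch does not attempt.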
Open source
Bonobo Terminal is self-hosted, open-source, and transparent. Build on it, break it, send PRs.