Configuration
Botholomew reads its settings from config/config.json inside the project directory. The full schema lives in src/config/schemas.ts.
{
"llm": {
"provider": "anthropic",
"model": "claude-opus-4-6",
"api_key": ""
},
"chunker_llm": {
"provider": "anthropic",
"model": "claude-haiku-4-5-20251001",
"api_key": ""
},
"embedding_model": "Xenova/bge-small-en-v1.5",
"embedding_dimension": 384,
"tick_interval_seconds": 300,
"max_tick_duration_seconds": 120,
"system_prompt_override": "",
"max_turns": 0,
"worker_heartbeat_interval_seconds": 15,
"worker_dead_after_seconds": 60,
"worker_reap_interval_seconds": 30,
"worker_stopped_retention_seconds": 3600,
"schedule_min_interval_seconds": 60,
"schedule_claim_stale_seconds": 300,
"tui_idle_timeout_seconds": 180,
"dream_lookback_hours": 24,
"log_level": "",
"membot_scope": "global",
"mcpx_scope": "global",
"approvals": {
"enabled": true,
"allowed_tools": [],
"auto_allow_read_only": false
}
}LLM providers
Botholomew talks to language models through the Vercel AI SDK. Three providers ship today: Anthropic (Claude), Ollama (local), and OpenAI-compatible (LM Studio, llama.cpp's HTTP server, OpenRouter, vLLM, Groq, Together, etc.).
Whatever provider you pick, the model must support tool/function calling — the agent's entire surface depends on structured tool calls. Botholomew probes for tool support at startup and refuses to run with a non-tool-capable model.
llm (main agent) and chunker_llm (auxiliary)
Both blocks have the same shape:
| Field | Default | Purpose |
|---|---|---|
provider | "anthropic" | One of anthropic, ollama, openai-compatible. |
model | claude-opus-4-6 / …-haiku-… | Model id. Provider-specific. |
base_url | "" | Required for openai-compatible; optional for ollama (default http://localhost:11434); ignored for anthropic. |
api_key | "" | Required for anthropic. Optional for openai-compatible and ollama. |
max_input_tokens | 0 | Override the context window. 0 falls back to a lookup table and provider defaults. |
supports_tools | true | Override the tool-capability probe (only relevant for openai-compatible). |
llm governs chat and worker turns. chunker_llm governs auxiliary calls (schedule evaluation, thread titles, capability summarization) and can point at a smaller/cheaper model.
Anthropic example
{
"llm": { "provider": "anthropic", "model": "claude-opus-4-6", "api_key": "sk-ant-..." },
"chunker_llm": { "provider": "anthropic", "model": "claude-haiku-4-5-20251001", "api_key": "sk-ant-..." }
}Ollama (local) example
Run ollama serve and ollama pull llama3.1:8b first.
{
"llm": { "provider": "ollama", "model": "llama3.1:8b" },
"chunker_llm": { "provider": "ollama", "model": "qwen2.5:3b" }
}Known-good tool-capable Ollama models: llama3.1:8b, llama3.1:70b, qwen2.5:7b, mistral-nemo, command-r. Smaller models without the tools capability (e.g. gemma:2b) will be refused at startup.
OpenAI-compatible example
{
"llm": {
"provider": "openai-compatible",
"model": "gpt-4o",
"base_url": "https://openrouter.ai/api/v1",
"api_key": "sk-or-..."
}
}Any OpenAI-compatible chat-completions endpoint that supports tool calling works. Set supports_tools: false to opt out of the probe assumption.
Keys
| Key | Default | Purpose |
|---|---|---|
llm | see above | Provider + model for the main agent loop (workers + chat). See "LLM providers". |
chunker_llm | see above | Provider + model for auxiliary calls (schedule eval, thread titles, capability summarization). |
embedding_model | Xenova/bge-small-en-v1.5 | A local @huggingface/transformers feature-extraction model. Weights are downloaded on first use and cached under the project's models/ directory. Any feature-extraction model in the Xenova/* namespace works — e.g. Xenova/multilingual-e5-small (also 384-dim) for non-English content. |
embedding_dimension | 384 | Vector dimension. Must match the model. Changing model + dimension requires running botholomew membot reembed to recompute every stored vector — old and new vectors aren't comparable. |
tick_interval_seconds | 300 | Seconds a --persist worker sleeps between ticks when there's no work. It ticks back-to-back while a backlog exists. |
max_tick_duration_seconds | 120 | Soft cap per tick. Stale-task reset fires at 3× this value. |
system_prompt_override | "" | Appended to the built-in system prompt. Use this for project-specific instructions that should be always-loaded without editing the files under prompts/. |
max_turns | 0 | Maximum tool-use turns per agent loop (0 = unlimited). Safety net against runaway loops. |
worker_heartbeat_interval_seconds | 15 | How often a running worker writes to workers.last_heartbeat_at. Runs on its own setInterval, independent of the tick loop, so long LLM calls don't starve the heartbeat. |
worker_dead_after_seconds | 60 | A worker whose heartbeat is older than this is considered dead. The reaper flips its status to dead and releases every task/schedule claim it held. |
worker_reap_interval_seconds | 30 | How often a --persist worker scans for dead peers to reap and prunes old cleanly-stopped workers. One-shot workers don't run the reaper. |
worker_stopped_retention_seconds | 3600 | Cleanly-stopped workers older than this are deleted from the workers table. Dead workers are kept as forensic evidence and not auto-pruned. |
schedule_min_interval_seconds | 60 | Minimum gap between successive evaluations of the same schedule. A schedule that ran less than this many seconds ago is skipped. |
schedule_claim_stale_seconds | 300 | If a worker claimed a schedule but never released it (crash), another worker may steal the claim after this many seconds. |
tui_idle_timeout_seconds | 180 | Seconds of inactivity (no keystrokes, no streamed agent tokens, no tool events) before the chat TUI freezes its visible animations and pauses the status-bar count refresh. Animations resume on the next activity. Set to 0 to disable (always animate — useful for demo recordings). |
dream_lookback_hours | 24 | Default recall window for botholomew dream / /dream when no --since is given — the agent reviews threads from the last this-many hours. See reflection.md. |
log_level | "" | Verbosity for botholomew CLI logs. One of silent, error, warn, info, debug. Empty string falls back to the runtime default (info normally, error under NODE_ENV=test). BOTHOLOMEW_LOG_LEVEL env var overrides this. |
membot_scope | "global" | Where this project's knowledge store lives. "global" → ~/.membot/index.duckdb (shared across every Botholomew project on the machine). "project" → <projectDir>/index.duckdb (isolated). Affects both the agent and the botholomew membot … CLI passthrough. |
mcpx_scope | "global" | Where this project's MCP server config lives. "global" → ~/.mcpx/ (shared). "project" → <projectDir>/mcpx/ (isolated). Affects both the agent and the botholomew mcpx … CLI passthrough. |
approvals | see below | Human-in-the-loop gate for outbound mcpx tool calls. See Approvals. |
approvals
| Field | Default | Meaning |
|---|---|---|
enabled | true | Master switch. When true, every mcpx tool call requires approval unless allowlisted. false disables the gate (same as running --unsafe). |
allowed_tools | [] | Patterns for mcpx tools that run without approval. Match <server>/<tool>: exact (gmail/send), wildcard (gmail/*, */search), bare token (search), or /regex/ on the tool name. Empty ⇒ gate everything. |
auto_allow_read_only | false | Also skip the gate for tools the server annotates readOnlyHint: true. Annotations are untrusted hints, so off by default. |
A run started with --unsafe (on chat, worker run, worker start) bypasses the gate regardless of enabled.
Environment variables
| Var | Effect |
|---|---|
ANTHROPIC_API_KEY | Fills in llm.api_key / chunker_llm.api_key when the provider is anthropic. Always wins. |
OPENAI_API_KEY | Fills in api_key when the provider is openai-compatible and the field is empty. |
OLLAMA_HOST | Fills in base_url when the provider is ollama and the field is empty. |
BOTHOLOMEW_LOG_LEVEL | Overrides log_level in config. One of silent, error, warn, info, debug. |
BOTHOLOMEW_NO_UPDATE_CHECK | Disable the background "new version available" check. |
Tuning guidance
For personal/low-volume use: defaults are fine. One tick every five minutes is plenty when tasks are mostly "every morning, summarize my email".
For bursty workloads: lower tick_interval_seconds to 30–60. A persist worker only sleeps when the queue is empty, so this is safe — it just reduces latency between the last item landing and the next tick firing. Alternatively, spawn more one-shot workers (via cron or chat) and leave the interval alone.
For multi-worker setups: if you routinely run more than a handful of workers, consider lowering worker_reap_interval_seconds (so dead ones are cleaned quickly) and raising worker_dead_after_seconds (so a temporary DB-lock hiccup doesn't flip a live worker to dead). The defaults (30s reap, 60s threshold) are conservative.
For model-cost sensitivity:
- Switch
llm.modeltoclaude-sonnet-4-*orclaude-haiku-*. Opus is the default because quality on complex knowledge work matters more than per-token cost for most users, but Sonnet handles the majority of tasks well. - The default
chunker_llmalready targets Haiku — leave it there. - Lower
max_turns(e.g., 15) to hard-cap tool-use budgets. - For zero-API-cost runs, switch both blocks to a tool-capable Ollama model. Note: cache-token reporting is Anthropic-only, so the TUI will show all input tokens as fresh on local providers.
For prompt-sensitive workflows: use system_prompt_override to add instructions without touching prompts/goals.md. This keeps the default personality intact while layering on project-specific rules ("always respond in British English", "never call mcp_exec on the slack server without confirmation", …).
Per-project vs. global
config.json itself is always per-project — different projects have different goals, beliefs, and tuning. But the two data stores it points at (membot for knowledge, mcpx for MCP servers) default to shared global locations, because reusing a personal knowledge base and a set of authenticated MCP servers across every project is almost always what you want.
Defaults for new projects:
| Concern | Default scope | Resolves to |
|---|---|---|
membot_scope | "global" | ~/.membot/ |
mcpx_scope | "global" | ~/.mcpx/ |
To opt one (or both) into per-project isolation, set the key to "project" in config/config.json, or pass --membot-scope=project / --mcpx-scope=project to botholomew init. The agent loop, chat session, TUI, and CLI passthroughs (botholomew membot …, botholomew mcpx …) all honour the scope on every invocation.
Migrating between scopes:
- Global → project:
botholomew membot import-global(copies~/.membot/into the project) orbotholomew mcpx import-global(copies~/.mcpx/), then flip the scope key to"project". - Project → global: copy
<projectDir>/index.duckdbto~/.membot/index.duckdb(or<projectDir>/mcpx/*.jsonto~/.mcpx/), then flip the scope key to"global".
Projects initialized before the scope settings existed have no membot_scope / mcpx_scope keys; both default to "global", so the agent reads the shared store. Any pre-existing project-local index.duckdb or mcpx/servers.json is left in place but unused until you flip the scope back.