Clanker

LLM-powered smart home assistant on top of Home Assistant.

Clanker is a self-hosted Python service that adds a brain, memory, vision reasoning, proactive announcements, and voice control on top of your existing Home Assistant setup. HA remains the source of truth for devices and automations — Clanker adds the intelligence layer.

Architecture

┌──────────────────────────────────────────────────────────────────────┐
│                        Voice Surfaces                                │
│  ESP32-S3 satellites · HA Voice PE · Mobile app · Browser            │
│  (HA Assist pipeline handles STT/wake-word/TTS — Clanker is the      │
│   conversation agent behind it)                                      │
└───────────────────────────────┬──────────────────────────────────────┘
                                │
┌───────────────────────────────▼──────────────────────────────────────┐
│                        Clanker Core                                  │
│                                                                      │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐  ┌──────────┐              │
│  │  Brain   │  │  Memory  │  │  Vision  │  │ Announce │              │
│  │ Router   │  │ Struct.  │  │ Frigate  │  │  Router  │              │
│  │          │  │ Semantic │  │  VLM     │  │Occupancy │              │
│  │ Anthropic│  │          │  │  Faces   │  │  Quiet   │              │
│  │ OpenAI   │  │  SQLite  │  │          │  │  Hours   │              │
│  │ Ollama   │  │ Markdown │  │          │  │          │              │
│  │ Generic  │  │ ChromaDB │  │          │  │          │              │
│  └──────────┘  └──────────┘  └──────────┘  └──────────┘              │
│                                                                      │
│  ┌───────────┐  ┌──────────┐  ┌────────┐  ┌──────────┐               │
│  │ Proactive │  │  Remote  │  │  MCP   │  │  Convo   │               │
│  │ Scheduler │  │ Telegram │  │ Server │  │  Agent   │               │
│  │ Briefing  │  │  SMS     │  │ Tools  │  │ Sessions │               │
│  │ Handlers  │  │  Push    │  │        │  │  HTTP API│               │
│  └───────────┘  └──────────┘  └────────┘  └──────────┘               │
│                                                                      │
│  Tools exposed to brain via MCP:                                     │
│  ha_call_service · ha_get_state · ha_find_entities                   │
│  memory_read · memory_write · memory_search · notify_user            │
└───────────────────────────────┬──────────────────────────────────────┘
                                │ WebSocket + REST
                                │ (long-lived access token)
┌───────────────────────────────▼──────────────────────────────────────┐
│                     Home Assistant (substrate)                       │
│                                                                      │
│  Devices · Entities · Automations · Frigate · Occupancy sensors      │
│  TTS targets · Notify services · Mobile app · Exposed entities       │
│                                                                      │
│  HA's exposed-entities allowlist = hard safety gate                  │
└──────────────────────────────────────────────────────────────────────┘

Features

Brain

Pluggable LLM providers — Anthropic (Claude), OpenAI, Ollama, and any OpenAI-compatible endpoint. Config-driven task routing (vision → Claude, quick intents → local Ollama).
Conversation agent — Tool-calling loop with HA device control, memory search, and entity discovery. Multi-turn sessions with persistence (SQLite).
Intent fast-path — Simple commands (turn on/off, brightness, weather, timers) handled by HA's built-in intent matcher in <50ms, bypassing the LLM entirely.
Streaming TTS — Sentence-by-sentence delivery while the LLM is still generating. First sentence speaks ~0.3s after generation begins.
Context compaction — Token-aware summarization of old messages via the LLM. Keeps context bounded without losing continuity.
Auto-RAG — Relevant memory is automatically injected into the system prompt before each brain call. No explicit tool call needed.

Voice

Full voice pipeline — "Hey Clanker" wake word → Whisper STT → brain → Piper TTS. All local by default.
Custom wake word — Train "Hey Clanker" via openWakeWord (synthetic speech, no recording needed).
HA custom component — Registers Clanker as a conversation agent in HA so all voice surfaces (Assist, ESP32 satellites, mobile app) work automatically.

Vision

Frigate integration — Event subscription, snapshot fetching, deduplication with configurable cooldown.
VLM pipeline — Camera snapshots described by vision-capable LLMs (Claude, GPT-4o, LLaVA).
Face recognition — Double Take integration with structured memory lookup. Unknown faces described via VLM.

Proactive Automation

Morning briefing — Motion-triggered daily summary (weather, home state) delivered via TTS.
Critical alerts — Smoke, CO, flood, glass break → deterministic fast path (no LLM), all speakers + push with 911/Safe/False Alarm actions.
Doorbell — Person detected → snapshot → VLM description → face lookup → contextual announcement + push with Talk/Ignore.
Appliance completion — Washer/dryer/dishwasher done → announces to occupied rooms.
Unknown person — VLM description + time/location threat assessment.

Memory

Structured (SQLite) — Faces, people, rooms, appliances, preferences.
Semantic (Markdown + ChromaDB) — Human-readable files with vector search via Ollama embeddings.
Session persistence — Conversations survive restarts. Stale sessions evicted by TTL.

Notifications

Telegram — Bidirectional chat, image attachments, inline keyboard actions.
SMS via Twilio — Text alerts + inbound commands. MMS for images.
HA mobile push — Fallback via HA notify services.
Announcement router — Occupancy-aware TTS delivery, quiet hours, priority-based routing.

Setup & Deployment

HA Add-on — One-click install from HA's app store. Auto-configures token, component, and config.
Setup wizards — CLI and browser-based, with HA auto-discovery, connection testing, entity discovery, and config validation.
Ollama auto-setup — Cross-platform install (Windows/Mac/Linux), model pulling, TTFT optimizations.
Pre-built OS images — Flash to SD/USB for dedicated hardware. First boot launches the web wizard.
CI/CD — GitHub Actions: test matrix (3.11/3.12/3.13), lint, Docker image published to GHCR.
Identity verification — Telegram chat ID confirmation + SMS code verification. Unverified messages silently dropped.

Security

Entity allowlisting — HA's exposed-entities feature is the hard safety gate.
Prompt injection defense — System prompt instructs the brain to treat tool results as data, not instructions.
Deterministic critical paths — Life-safety events bypass the LLM for immediate, reliable response.
Secrets in env vars — API keys never stored in config YAML.

See docs/safety.md for the full safety model.

Install

Path 1: HA Add-on (recommended for most users)

Already have Home Assistant? Install Clanker as an add-on in 30 seconds:

In HA: Settings → Apps → ⋮ (top-right) → Repositories
Add https://github.com/nolankramer/clanker
Find Clanker in the app store → Install
Configure API keys in the add-on settings → Start

The add-on auto-configures everything — HA token, custom component, config files. No manual setup needed.

Path 2: Standalone Install (any machine)

Run Clanker on your own machine (PC, NUC, server) alongside or separate from HA.

Linux/Mac:

curl -fsSL https://raw.githubusercontent.com/nolankramer/clanker/main/install.sh | bash

Windows (PowerShell):

irm https://raw.githubusercontent.com/nolankramer/clanker/main/install.ps1 | iex

Docker:

docker pull ghcr.io/nolankramer/clanker:latest

All methods launch a setup wizard that auto-discovers HA, configures LLM providers, and sets up voice/notifications. No manual file editing.

Path 3: Flash an Image (start from scratch)

For dedicated hardware with no existing OS. See the Hardware Guide for what to buy.

Download from Releases: clanker-x86_64-*.img.gz or clanker-arm64-*.img.gz
Flash with Balena Etcher
Boot → open http://clanker.local → setup wizard

Voice Pipeline

Two speed tiers for voice responses:

"Turn off the kitchen lights"
  → openWakeWord → Whisper STT → HA intent fast-path (~50ms) → Piper TTS

"Set the house to movie mode"
  → openWakeWord → Whisper STT → LLM brain (streaming TTS)
  → first sentence spoken in ~0.3s → rest streams while speaking

All processing is local by default. Cloud LLM providers (Anthropic, OpenAI) are optional — route conversation tasks to Ollama for a fully offline setup.

Custom Wake Word

Train "Hey Clanker" using openWakeWord (synthetic speech generation, no voice recording needed):

pip install openwakeword tensorflow
python -m clanker.setup.wakeword
python -m clanker.setup.wakeword --deploy /share/openwakeword

Configuration

All config lives in config/clanker.yaml. Secrets go in .env or environment variables.

See config/clanker.yaml.example for the full schema with comments.

Development

# Run tests (180 tests)
uv run pytest

# Lint
uv run ruff check .

# Type check
uv run mypy clanker/

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
addon		addon
clanker		clanker
config		config
docs		docs
ha_component/custom_components/clanker		ha_component/custom_components/clanker
image		image
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
install.ps1		install.ps1
install.sh		install.sh
pyproject.toml		pyproject.toml
repository.yaml		repository.yaml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clanker

Architecture

Features

Brain

Voice

Vision

Proactive Automation

Memory

Notifications

Setup & Deployment

Security

Install

Path 1: HA Add-on (recommended for most users)

Path 2: Standalone Install (any machine)

Path 3: Flash an Image (start from scratch)

Voice Pipeline

Custom Wake Word

Configuration

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clanker

Architecture

Features

Brain

Voice

Vision

Proactive Automation

Memory

Notifications

Setup & Deployment

Security

Install

Path 1: HA Add-on (recommended for most users)

Path 2: Standalone Install (any machine)

Path 3: Flash an Image (start from scratch)

Voice Pipeline

Custom Wake Word

Configuration

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages