LIVE TAPE
OpenClaw 88,412 stars·CVE-2026-25898 disclosed (HIGH, Hermes)·Hermes Agent v2026.4.7 published·Hermes Agent +182 stars (last hour)·OpenClaw v2026.4.6 — credential vault hardening·CVE-2026-26133 patched (NanoClaw)·Pi 5 16GB rumoured for Q3 — recheck guidance·Nanobot +47 stars (last hour)·ZeroClaw v0.4.2 — Apple container fixes·Mac Mini M4 wins quarterly hardware survey·OpenClaw 88,412 stars·CVE-2026-25898 disclosed (HIGH, Hermes)·Hermes Agent v2026.4.7 published·Hermes Agent +182 stars (last hour)·OpenClaw v2026.4.6 — credential vault hardening·CVE-2026-26133 patched (NanoClaw)·Pi 5 16GB rumoured for Q3 — recheck guidance·Nanobot +47 stars (last hour)·ZeroClaw v0.4.2 — Apple container fixes·Mac Mini M4 wins quarterly hardware survey·
PocketClawvol. 1 · 2026

Document RAG over personal notes

Agent searches your notes, documents, codebases via vector embeddings. Answers questions with references to source files.

The problem

You have years of accumulated notes, READMEs, internal wikis, code comments. Useful information lives in there. Finding it costs you 5-10 minutes per lookup. RAG turns those minutes into seconds.

Recommended setup

AgentZeroClaw (bundled offline RAG) or Hermes Agent + Postgres pgvector
HardwareMac Mini M4 (24 GB+) for fast local indexing, or any mini PC if you index occasionally
LLMLocal Mistral 7B Q4 or Llama 3 8B Q4 via Ollama for the response generation

How it works

Point ZeroClaw at your notes folder. It chunks documents, generates embeddings using BGE-small (local, no API call), stores in SQLite. On query: retrieves top-K relevant chunks, sends to local LLM with your question. Returns answer + source citations.

Reality check

I run this over 800+ notes (Obsidian vault), 12,000 chunks, indexed in 14 minutes on a Mac Mini M4. Query latency: ~2 seconds end-to-end. Answers are useful when the relevant content is in the corpus; useless when it isn't (the model won't make stuff up — that's the point of RAG over fine-tuning). Re-indexing on file changes: incremental, ~5 seconds for normal edit cadence.

What breaks

  • Documents in obscure formats (proprietary databases, scanned PDFs without OCR)
  • Queries that need synthesis across many chunks (RAG is good at retrieval, average at synthesis)
  • Outdated content if you don't refresh the index

Alternative setups

Hermes Agent + Postgres with pgvector if you want the vector store separately for other tools to query. Nanobot + manual SQLite implementation if you want to read every line.

Other use cases
€5/month VPS + €15-30/month LLM API
Inbox triage
Self-hosted agent reads incoming email, classifies, drafts replies, flags actions. Saves 30-60 minut…
€0 if you already have the hardware. €5-30/month LLM if calling cloud.
Pre-PR code review
Agent reviews your PRs locally before you push, catches obvious bugs, flags security issues, suggest…
€38/month VPS + €30-100/month LLM (varies with volume)
First-line customer support
Self-hosted agent answers tier-1 customer questions from your docs/knowledge base. Hands off to huma…
€5/month Pi 5 power + €5-15/month LLM (light usage)
Security monitoring & alert triage
Agent watches your security feeds (CVE alerts, GitHub advisories, log anomalies), summarises and pri…