Skip to content

codecoradev/uteke

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

304 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Uteke โ€” Give your AI a memory

Uteke

Give your AI a memory that never leaves your machine.

Offline-first semantic memory engine โ€” single binary, zero config, 30ms recall.

CI License: Apache 2.0 Rust 1.75+ v0.6.0

๐Ÿ‡ฌ๐Ÿ‡ง English ยท ๐Ÿ‡ฎ๐Ÿ‡ฉ Bahasa Indonesia


Quick Start

# Install (macOS, Linux, Windows)
curl -sSL codecora.dev/install | sh

# Store a memory with metadata
uteke remember "Deploy v2.1 to staging" --tags deploy,staging \
  --entity staging-server --category infrastructure

# Hybrid search (vector + FTS5, ranked by RRF)
uteke recall "when do we deploy?"

# Stats
uteke stats

That's it. No API keys. No Docker. No Python. First run downloads the embedding model (~188MB) and you're good to go.

๐Ÿ“– Install options ยท Pre-built binaries ยท Docker ยท Full docs

Docker

Listens on localhost only by default. See Docker docs for auth setup.

# One-liner (model pre-baked in image)
docker run -d --name uteke -p 127.0.0.1:8767:8767 -v uteke-data:/data \
  ghcr.io/codecoradev/uteke:latest

# Or docker compose
docker compose up -d

๐Ÿ“– Docker docs ยท Compose file


Why Uteke?

AI agents forget everything between sessions. Uteke gives them persistent, searchable memory โ€” entirely offline, in one binary.

Uteke Mem0 Letta Zep
Setup Single binary pip + Docker + Qdrant pip + Docker + Postgres pip + Docker + Neo4j
API keys needed โŒ None โœ… OpenAI/LLM key โœ… LLM key โœ… LLM key
Offline โœ… Fully โŒ Cloud embedding โŒ Needs LLM server โŒ Needs LLM + vector DB
Semantic search โœ… Local ONNX + FTS5 hybrid โœ… Cloud embedding โš ๏ธ Keyword + archival โœ… GraphRAG
Full-text search โœ… FTS5 built-in โŒ โš ๏ธ Keyword only โŒ
Recall speed ~30ms (library) Network round-trip Network round-trip Network round-trip
Privacy โœ… Data never leaves machine โš ๏ธ Data sent to LLM โš ๏ธ Data sent to LLM โš ๏ธ Data sent to LLM
License Apache 2.0 Apache 2.0 Apache 2.0 Apache 2.0

Uteke vs cloud alternatives


Key Features

  • ๐Ÿง  Hybrid Search โ€” Vector similarity + FTS5 full-text search, merged by Reciprocal Rank Fusion (RRF)
  • ๐Ÿ  Rooms โ€” Group memories by context (meetings, projects) with author attribution
  • โณ Time-travel queries โ€” Recall memories as they existed at any point in time
  • ๐Ÿ”Œ Pluggable embeddings โ€” Swap ONNX/OpenAI/Ollama backends via config
  • ๐Ÿท๏ธ Metadata Enrichment โ€” Tag, entity, category, and key:value metadata on every memory
  • ๐Ÿ”— Relationship graph โ€” Link memories with typed edges (supersedes, contradicts, references)
  • ๐Ÿ“‰ Smart decay โ€” Composite importance scoring, pin critical memories
  • โšก Recall cache โ€” LRU cache eliminates redundant embedding for repeated queries
  • ๐Ÿ“Š Benchmarks โ€” Built-in uteke bench for perf testing + LongMemEval retrieval harness
  • ๐Ÿ‘ฅ Multi-Agent Namespaces โ€” Fully isolated memory per agent, zero overhead
  • ๐Ÿ–ฅ๏ธ Server Mode โ€” Persistent daemon with ~42ms warm recall (75x faster than CLI)
  • ๐Ÿ”ฅ Tiered Memory โ€” Hot/Warm/Cold tracking with auto-cleanup of stale memories
  • ๐Ÿ”’ Fully Offline โ€” Local ONNX embeddings (768d), no telemetry, no cloud, no API calls
  • ๐Ÿ”„ Embed Fallback โ€” Gracefully falls back to cloud API if local embedder fails; MockEmbedder for testing
  • ๐Ÿ“‚ Batch Import โ€” Import entire directories (--batch-dir) with auto-strategy routing (document vs. memory extraction)
  • ๐Ÿ“ฆ Single Binary โ€” Zero dependencies. No Docker, no database server, no Python, no API keys
  • ๐Ÿ“ฅ Import/Export โ€” JSONL-based backup and restore
  • ๐Ÿงฉ Memory Types โ€” Typed categories (fact, procedure, decision, etc.) with auto-inference
  • ๐Ÿ”— Backlinks โ€” Bidirectional memory edges โ€” references are automatically reciprocal
  • ๐Ÿ“œ Timeline Events โ€” Chronological audit log per memory (created, updated, superseded)
  • ๐Ÿ“ˆ Salience + Recency โ€” Dual-axis recall boost by memory type and age
  • ๐ŸŒ™ Dream Cycle โ€” One-command maintenance pipeline (lint โ†’ backlinks โ†’ dedup โ†’ orphans)
  • ๐Ÿ” Orphan Detection โ€” Find disconnected, low-importance memories for cleanup
  • ๐Ÿ“Ž Citations โ€” Source attribution on every memory (URL, file, user, import)
  • ๐Ÿ”Œ MCP Server โ€” JSON-RPC over stdio + Streamable HTTP transport
  • ๐Ÿ“ Document Engine โ€” Wiki/knowledge base with uteke doc create/get/list and auto-chunking
  • ๐Ÿค– Cosine Auto-Linking โ€” Automatically creates similar_to edges between related memories
  • ๐ŸŒ Graph API โ€” GET /graph endpoint returns nodes + edges JSON for visualization
  • ๐Ÿ”‘ View-Only API Keys โ€” Read-only tokens for safe GET-only access to the server
  • ๐Ÿ“„ Markdown Chunker โ€” Splits documents by headings, respects code blocks and token limits

๐Ÿ“– Full documentation ยท CLI reference ยท Configuration


Architecture

Uteke architecture

Hybrid search pipeline:

  • HNSW (usearch) โ€” vector similarity, finds by meaning
  • FTS5 (SQLite) โ€” full-text keyword search, finds by exact terms
  • Reciprocal Rank Fusion (k=60) โ€” merges both ranked lists โ†’ best of both worlds
  • Local ONNX (EmbeddingGemma Q4, 768d) โ€” embeddings computed on-device, no API calls

Everything runs in-process. No network. No cloud. No server required (unless you want server mode).


Development

cargo build --workspace        # Build
cargo test --workspace         # Test (327 unit tests)
cargo clippy -- -D warnings    # Lint
cargo fmt                      # Format

See CONTRIBUTING.md for the full contribution guide.


License

Apache License 2.0 โ€” use it, fork it, ship it.


Star History

Star History Chart

About

๐Ÿง  The Brain for Your AI โ€” Local-first memory engine for AI agents. Store, recall, and search memories with semantic embeddings. Single Rust binary, zero config, fully offline.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors