ESC
Start typing to search...

Memory System

Teleton uses a hybrid RAG architecture combining vector embeddings (sqlite-vec) with full-text search (FTS5) for intelligent context retrieval.

Hybrid Architecture

The memory system combines two complementary search methods:

ComponentTechnologyPurpose
Vector Searchsqlite-vecSemantic similarity matching
Full-Text SearchFTS5Exact keyword matching
Databasebetter-sqlite3WAL mode for concurrent access

Embeddings

By default, Teleton uses local ONNX embeddings for privacy and offline operation:

config.yaml
memory:
  embeddings:
    provider: local          # local (ONNX) | voyage
    model: all-MiniLM-L6-v2  # Lightweight, fast model

For higher quality embeddings, you can use Voyage AI:

config.yaml
memory:
  embeddings:
    provider: voyage
    model: voyage-2
    apiKey: ${VOYAGE_API_KEY}

Auto-Compaction

AI-powered memory compaction keeps the context relevant by summarizing and consolidating old entries:

config.yaml
memory:
  autoCompact: true        # Enable AI compaction
  compactThreshold: 1000   # Trigger after N entries
  • Summarizes repetitive conversations
  • Extracts key facts and preferences
  • Removes outdated information
  • Maintains conversation continuity

Storage Location

All memory data is stored in a single SQLite database:

Path
~/.teleton/memory.db