Knowledge & RAG › MUXI Documentation

MUXI's knowledge system lets agents answer questions from your documents - PDFs, markdown, spreadsheets, images, and more. Automatic indexing, semantic search, and multimodal support built in.

How It Works

sequenceDiagram
    participant U as User
    participant A as Agent
    participant K as Knowledge Index
    participant D as Your Documents

    U->>A: "What's our refund policy?"
    A->>K: Semantic search
    K->>D: Retrieve relevant chunks
    D-->>K: "Refunds within 30 days..."
    K-->>A: Context injected
    A->>U: "Our refund policy allows..."

sequenceDiagram
    participant U as User
    participant A as Agent
    participant K as Knowledge Index
    participant D as Your Documents

    U->>A: "What's our refund policy?"
    A->>K: Semantic search
    K->>D: Retrieve relevant chunks
    D-->>K: "Refunds within 30 days..."
    K-->>A: Context injected
    A->>U: "Our refund policy allows..."

You add documents to knowledge/ directory
MUXI indexes them at startup (embeddings + chunking)
User asks a question related to your docs
Agent searches the index semantically
Relevant context injected into the prompt
Agent answers using your domain knowledge

Supported Formats

MUXI uses MarkItDown for document conversion - any format MarkItDown supports works with MUXI.

Common Formats

Category	Examples	Notes
Text & Documents	`.md`, `.txt`, `.pdf`, `.docx`, `.pptx`, `.xlsx`	Headers, structure, and formatting preserved
Data	`.csv`, `.json`, `.html`	Structure-aware chunking
Multimodal	`.jpg`, `.png`, `.gif`, and more	OCR + vision model analysis

MUXI handles images natively. Vision models (GPT-4V, Claude, Gemini) can analyze screenshots, diagrams, charts, and photos in your knowledge base.

Source rules & validation

Paths must stay within the formation (relative paths only; no ..); keep sources under knowledge/.
Large/recursive sources can be limited with max_files_per_source, file_limit, allowed_extensions, and max_file_size.
Files are chunked and cached with MD5; unchanged files are skipped on restart, only deltas are re-embedded.

Multimodal Support

MUXI understands images, not just extracts text from them:

knowledge/
├── architecture-diagram.png    ← Vision model analyzes
├── product-screenshot.jpg      ← UI elements recognized
├── chart.png                   ← Data extracted
└── handwritten-notes.jpg       ← OCR + interpretation

When a user asks about your architecture, the agent can reference the diagram directly:

User:  "How does data flow through our system?"
Agent: "Based on the architecture diagram, data flows from..."

Agent-Specific Knowledge

Different agents can access different knowledge bases:

# agents/support.afs
schema: "1.0.0"
id: support
name: Support Agent
description: Customer support

system_message: You're a customer support specialist...

knowledge:
  enabled: true
  sources:
    - path: knowledge/faq/
    - path: knowledge/troubleshooting/

# agents/sales.afs
schema: "1.0.0"
id: sales
name: Sales Agent
description: Sales advisor

system_message: You're a sales advisor....

knowledge:
  enabled: true
  sources:
    - path: knowledge/pricing/
    - path: knowledge/features/

Support agent knows troubleshooting; sales agent knows pricing. No overlap, no confusion.

Keep knowledge files focused. Smaller, topic-specific files retrieve better than large catch-all documents. Split your FAQ into sections rather than one giant file.

How Indexing Works

Automatic at Startup

muxi dev
# Indexing knowledge...
# ✓ 47 documents indexed
# ✓ 1,234 chunks created
# ✓ Embeddings cached

Incremental Updates

When files change:

MD5 hash detects modifications
Only changed files re-indexed
Cache preserved for unchanged content

Force full reindex:

muxi dev --reindex

Lazy Loading

First query triggers loading - startup stays fast even with large knowledge bases.

Why This Matters

Without MUXI	With MUXI
Fine-tune models for domain knowledge	Just add documents
Manually chunk and embed	Automatic indexing
Text-only RAG	Multimodal (images, charts, diagrams)
Same knowledge for all agents	Agent-specific expertise
Manual index management	Automatic incremental updates

The result: agents that know your domain, not generic chatbots that need everything explained.

Quick Setup

agents:
  - id: assistant
    knowledge:
      enabled: true
      sources:
        - path: knowledge/docs/
          description: Product documentation

Add your docs to knowledge/docs/, restart, done.

Learn More

Configure knowledge - YAML syntax
Add Knowledge Guide - Step-by-step tutorial

We use cookies