Knowledge & RAG
How agents access domain expertise from your documents
MUXI's knowledge system lets agents answer questions from your documents - PDFs, markdown, spreadsheets, images, and more. Automatic indexing, semantic search, and multimodal support built in.
How It Works
sequenceDiagram
participant U as User
participant A as Agent
participant K as Knowledge Index
participant D as Your Documents
U->>A: "What's our refund policy?"
A->>K: Semantic search
K->>D: Retrieve relevant chunks
D-->>K: "Refunds within 30 days..."
K-->>A: Context injected
A->>U: "Our refund policy allows..."
- You add documents to
knowledge/directory - MUXI indexes them at startup (embeddings + chunking)
- User asks a question related to your docs
- Agent searches the index semantically
- Relevant context injected into the prompt
- Agent answers using your domain knowledge
Supported Formats
MUXI uses MarkItDown for document conversion - any format MarkItDown supports works with MUXI.
Common Formats
| Category | Examples | Notes |
|---|---|---|
| Text & Documents | .md, .txt, .pdf, .docx, .pptx, .xlsx
| Headers, structure, and formatting preserved |
| Data | .csv, .json, .html
| Structure-aware chunking |
| Multimodal | .jpg, .png, .gif, and more
| OCR + vision model analysis |
MUXI handles images natively. Vision models (GPT-4V, Claude, Gemini) can analyze screenshots, diagrams, charts, and photos in your knowledge base.
Source rules & validation
- Paths must stay within the formation (relative paths only; no
..); keep sources underknowledge/. - Large/recursive sources can be limited with
max_files_per_source,file_limit,allowed_extensions, andmax_file_size. - Files are chunked and cached with MD5; unchanged files are skipped on restart, only deltas are re-embedded.
Multimodal Support
MUXI understands images, not just extracts text from them:
knowledge/
├── architecture-diagram.png ← Vision model analyzes
├── product-screenshot.jpg ← UI elements recognized
├── chart.png ← Data extracted
└── handwritten-notes.jpg ← OCR + interpretation
When a user asks about your architecture, the agent can reference the diagram directly:
User: "How does data flow through our system?"
Agent: "Based on the architecture diagram, data flows from..."
Agent-Specific Knowledge
Different agents can access different knowledge bases:
# agents/support.afs
schema: "1.0.0"
id: support
name: Support Agent
description: Customer support
system_message: You're a customer support specialist...
knowledge:
enabled: true
sources:
- path: knowledge/faq/
- path: knowledge/troubleshooting/
# agents/sales.afs
schema: "1.0.0"
id: sales
name: Sales Agent
description: Sales advisor
system_message: You're a sales advisor....
knowledge:
enabled: true
sources:
- path: knowledge/pricing/
- path: knowledge/features/
Support agent knows troubleshooting; sales agent knows pricing. No overlap, no confusion.
Keep knowledge files focused. Smaller, topic-specific files retrieve better than large catch-all documents. Split your FAQ into sections rather than one giant file.
How Indexing Works
Automatic at Startup
muxi dev
# Indexing knowledge...
# ✓ 47 documents indexed
# ✓ 1,234 chunks created
# ✓ Embeddings cached
Incremental Updates
When files change:
- MD5 hash detects modifications
- Only changed files re-indexed
- Cache preserved for unchanged content
Force full reindex:
muxi dev --reindex
Lazy Loading
First query triggers loading - startup stays fast even with large knowledge bases.
Why This Matters
| Without MUXI | With MUXI |
|---|---|
| Fine-tune models for domain knowledge | Just add documents |
| Manually chunk and embed | Automatic indexing |
| Text-only RAG | Multimodal (images, charts, diagrams) |
| Same knowledge for all agents | Agent-specific expertise |
| Manual index management | Automatic incremental updates |
The result: agents that know your domain, not generic chatbots that need everything explained.
Quick Setup
agents:
- id: assistant
knowledge:
enabled: true
sources:
- path: knowledge/docs/
description: Product documentation
Add your docs to knowledge/docs/, restart, done.
Learn More
- Configure knowledge - YAML syntax
- Add Knowledge Guide - Step-by-step tutorial