Agentic RAG Transformer for intelligent knowledge retrieval
Semantic search powered by FAISS and sentence transformers. Ask questions, run calculations, search the web, and more.
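To illustrate the embed, score, and rank flow behind this kind of search, here is a minimal stdlib-only sketch. It is not rag's implementation: the bag-of-words `embed` function stands in for a sentence-transformer model, the linear scan stands in for a FAISS index, and the function names and sample documents are made up for the example.

```python
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    """Toy stand-in for a sentence-transformer: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0

def search(query: str, documents: list, k: int = 2) -> list:
    """Return the k (score, document) pairs most similar to the query.

    A vector index such as FAISS replaces this linear scan in a real system.
    """
    q = embed(query)
    scored = [(cosine(q, embed(doc)), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]

# Hypothetical sample corpus, for illustration only.
DOCS = [
    "FAISS builds an index over dense vectors for fast similarity search.",
    "Ollama runs language models locally without a network connection.",
    "Environment variables select the backend and the embedding model.",
]

if __name__ == "__main__":
    for score, doc in search("similarity search over vectors", DOCS, k=1):
        print(f"{score:.2f}  {doc}")
```

A real embedding model matches on meaning rather than shared words, so this sketch shows only the shape of the pipeline, not its retrieval quality.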
Set up rag in seconds
rag works out of the box with minimal configuration. Choose quick setup or manual installation.
Run the minimal interactive TUI:
Or with rag-tui:
Choose your preferred model
rag supports multiple backends: use OpenAI for the best quality, Cerebras for speed, or Ollama for offline use.
GPT
API
Local
HF
Built-in utilities
Extend rag with built-in commands. Run calculations, search Wikipedia, execute shell commands, and more.
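As an illustration of how a calculation utility can evaluate user input without calling `eval`, the sketch below walks the parsed expression tree and allows only basic arithmetic. The `calculate` function is a hypothetical name for this example and is not taken from rag's code.

```python
import ast
import operator

# Whitelisted operators; anything else in the expression is rejected.
_BINARY = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Mod: operator.mod,
}
_UNARY = {ast.UAdd: operator.pos, ast.USub: operator.neg}

def calculate(expression: str) -> float:
    """Evaluate +, -, *, /, % and parentheses over numeric literals."""

    def _eval(node):
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BINARY:
            return _BINARY[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY:
            return _UNARY[type(node.op)](_eval(node.operand))
        # Names, calls, attribute access, etc. are never evaluated.
        raise ValueError("unsupported expression")

    return _eval(ast.parse(expression, mode="eval").body)

if __name__ == "__main__":
    print(calculate("2 * (3 + 4)"))
```

Exponentiation is left out on purpose, since an input like `9**9**9` could consume unbounded time and memory.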
Customize rag behavior
Configure rag via environment variables. Set the LLM backend, embedding model, memory mode, and API keys.
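A minimal sketch of environment-driven configuration follows. The variable names (`RAG_LLM_BACKEND`, `RAG_EMBEDDING_MODEL`, `RAG_MEMORY_MODE`, `RAG_API_KEY`) and the default values are assumptions chosen for illustration; check rag's own documentation for the names it actually reads.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

@dataclass(frozen=True)
class Settings:
    llm_backend: str
    embedding_model: str
    memory_mode: str
    api_key: Optional[str]

def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build settings from environment variables, falling back to defaults.

    All variable names and defaults here are hypothetical.
    """
    env = os.environ if env is None else env
    return Settings(
        llm_backend=env.get("RAG_LLM_BACKEND", "ollama"),
        embedding_model=env.get("RAG_EMBEDDING_MODEL", "all-MiniLM-L6-v2"),
        memory_mode=env.get("RAG_MEMORY_MODE", "session"),
        api_key=env.get("RAG_API_KEY"),  # None when unset
    )

if __name__ == "__main__":
    print(load_settings())
```

Passing a plain dictionary instead of `os.environ` keeps the loader easy to test without touching the real environment.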