
RAG

RAG indexing and search in packages/nikcli/src/rag.

Overview

RAG combines local chunking and vector storage with a configurable embedding model.

Indexing

Rag.index() discovers files, chunks text, and stores vectors in RagStorage.
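The chunking step can be illustrated with a minimal sketch. It assumes chunks are consecutive, non-overlapping runs of at most chunkLines lines; the source does not say whether the actual implementation overlaps chunks or splits on other boundaries. The `Chunk` interface and the `chunkByLines` function are hypothetical names used only for illustration and are not part of the documented API.

```typescript
// Hypothetical shape of one chunk; the real RagStorage record may differ.
interface Chunk {
  path: string;
  startLine: number; // 1-based, inclusive
  endLine: number; // 1-based, inclusive
  text: string;
}

// Split file text into consecutive, non-overlapping chunks of at most
// `chunkLines` lines. Assumption: no overlap between neighboring chunks.
function chunkByLines(path: string, text: string, chunkLines = 200): Chunk[] {
  if (!Number.isInteger(chunkLines) || chunkLines < 1) {
    throw new RangeError("chunkLines must be a positive integer");
  }
  if (text === "") {
    return []; // nothing to index for an empty file
  }
  const lines = text.split(/\r?\n/);
  const chunks: Chunk[] = [];
  for (let start = 0; start < lines.length; start += chunkLines) {
    const slice = lines.slice(start, start + chunkLines);
    chunks.push({
      path,
      startLine: start + 1,
      endLine: start + slice.length,
      text: slice.join("\n"),
    });
  }
  return chunks;
}
```

With the default of 200 lines per chunk, a 450-line file would produce three chunks under this scheme, covering lines 1-200, 201-400, and 401-450.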

Defaults

Option          Default
chunkLines      200
maxFiles        200
maxChunks       5000
maxFileBytes    1,000,000
model           nvidia/llama-embed-nemotron-8b
provider        nvidia
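As a rough sketch of how these defaults might be represented and combined with caller overrides, the following uses the option names and values listed above. The `RagIndexOptions` interface and the `resolveOptions` and `selectFiles` helpers are hypothetical, and the limit semantics (skip files larger than maxFileBytes, stop after maxFiles files) are an assumption rather than confirmed behavior of Rag.index().

```typescript
// Hypothetical options type; field names and values mirror the documented defaults.
interface RagIndexOptions {
  chunkLines: number;
  maxFiles: number;
  maxChunks: number;
  maxFileBytes: number;
  model: string;
  provider: string;
}

const DEFAULT_OPTIONS: RagIndexOptions = {
  chunkLines: 200,
  maxFiles: 200,
  maxChunks: 5000,
  maxFileBytes: 1000000,
  model: "nvidia/llama-embed-nemotron-8b",
  provider: "nvidia",
};

// Merge caller overrides on top of the defaults.
function resolveOptions(overrides: Partial<RagIndexOptions> = {}): RagIndexOptions {
  return { ...DEFAULT_OPTIONS, ...overrides };
}

// Hypothetical file descriptor used only for this sketch.
interface FileInfo {
  path: string;
  bytes: number;
}

// Assumed limit semantics: drop oversized files, then keep the first maxFiles.
function selectFiles(files: FileInfo[], opts: RagIndexOptions): FileInfo[] {
  return files
    .filter((file) => file.bytes <= opts.maxFileBytes)
    .slice(0, opts.maxFiles);
}
```

For example, `resolveOptions({ chunkLines: 50 })` would keep every default except the chunk size.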