interlocute.ai beta

Knowledge Retrieval

Plug in your docs — we handle the vector search and injection. Give your AI nodes grounded, accurate answers from your own data without building a retrieval pipeline.

What is RAG?

Retrieval-Augmented Generation (RAG) lets your AI node look up relevant information from your documents before answering a question. Instead of relying solely on the LLM's training data, the node retrieves real context from your knowledge base — producing grounded, accurate, and up-to-date responses.
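The retrieve-then-generate loop described above can be sketched in a few lines of Python. Everything here is an illustrative stand-in, not Interlocute's implementation: the embeddings are hand-picked toy vectors (real systems produce them with an embedding model), and `build_prompt` is a hypothetical helper.

```python
import math

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norms = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norms if norms else 0.0

# Toy knowledge base: (chunk text, precomputed embedding).
# The vectors are hand-picked for the demo, not model outputs.
kb = [
    ("Refunds are processed within 5 business days.", [0.9, 0.1, 0.0]),
    ("Our office is closed on public holidays.",      [0.1, 0.9, 0.0]),
]

def retrieve(query_vec, k=1):
    """Return the k chunks most similar to the query embedding."""
    ranked = sorted(kb, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

def build_prompt(question, query_vec):
    """Inject retrieved context ahead of the user's question."""
    context = "\n".join(retrieve(query_vec))
    return f"Context:\n{context}\n\nQuestion: {question}"

prompt = build_prompt("How long do refunds take?", [0.85, 0.15, 0.0])
```

The LLM then answers from the injected context rather than from its training data alone, which is what makes the response grounded.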

Why it matters

Without RAG, LLMs can only answer from their training data, which may be stale or missing your domain knowledge entirely. RAG bridges this gap by retrieving your proprietary data and injecting it into the model's context at answer time. This means fewer hallucinations, more relevant answers, and an AI that actually knows your business.


How Interlocute helps

You upload your documents — PDFs, text files, or structured data — and Interlocute handles everything else: chunking the content, generating vector embeddings, storing them in a managed index, and retrieving the most relevant chunks at query time. There are no vector databases to provision, no embedding pipelines to build, and no retrieval logic to maintain.
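Chunking is the first step in that pipeline. A fixed-size sliding window with overlap is one common strategy; the sizes below are arbitrary, and Interlocute's actual chunker may use a different approach.

```python
def chunk(text, size=200, overlap=50):
    """Split text into fixed-size character windows that overlap,
    so a sentence cut at one boundary still appears whole in a
    neighboring chunk."""
    if size <= overlap:
        raise ValueError("size must exceed overlap")
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

chunks = chunk("A" * 500, size=200, overlap=50)
# Each chunk is at most 200 chars; consecutive chunks share 50 chars.
```

Each chunk is then embedded and stored in the index, so retrieval operates on passages small enough to fit alongside the user's question in the prompt.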

Built for production

Interlocute's RAG engine is designed for real workloads. It supports incremental document updates, automatic re-indexing, configurable similarity thresholds, and per-node knowledge isolation. Every retrieval operation is metered and logged so you have full visibility into what context the LLM sees.

Frequently Asked Questions

RAG (Knowledge Retrieval)

What is RAG and how does it work with Interlocute?
RAG (Retrieval-Augmented Generation) is a technique that enhances LLM responses by retrieving relevant context from your own documents before generating an answer. In Interlocute, you upload documents to your node's knowledge base, and the platform automatically chunks, embeds, and indexes the content. When a query arrives, the most relevant chunks are retrieved and injected into the LLM prompt, producing grounded and accurate responses.
What document formats does Interlocute RAG support?
Interlocute supports common document formats including plain text, PDF, and structured data files. Documents are processed through an automated chunking and embedding pipeline. You can upload documents through the dashboard or programmatically via the API.
Do I need to set up a vector database to use RAG?
No. Interlocute manages the entire vector storage and retrieval infrastructure for you. There are no databases to provision, no connection strings to configure, and no embedding models to host. RAG works out of the box once you upload your documents.
How does Interlocute handle document updates and re-indexing?
Interlocute supports incremental document updates. When you add, replace, or remove documents, the platform automatically re-chunks and re-indexes the affected content. Your node's knowledge base stays current without manual intervention or full re-indexing.
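One way incremental re-indexing can be implemented (a conceptual sketch, not a description of Interlocute's internals): fingerprint each chunk by content hash, then re-embed only chunks whose hash is new and delete entries whose hash has disappeared.

```python
import hashlib

def fingerprint(chunk: str) -> str:
    """Stable content hash used as the chunk's identity in the index."""
    return hashlib.sha256(chunk.encode("utf-8")).hexdigest()

def diff_chunks(old_index: dict, new_chunks: list):
    """Compare stored fingerprints against the updated document's chunks.
    Returns (to_embed, to_delete) so only changed content is re-processed."""
    new_index = {fingerprint(c): c for c in new_chunks}
    to_embed = [c for h, c in new_index.items() if h not in old_index]
    to_delete = [h for h in old_index if h not in new_index]
    return to_embed, to_delete

old = {fingerprint("Refund policy: 5 days."): "Refund policy: 5 days."}
to_embed, to_delete = diff_chunks(old, ["Refund policy: 7 days."])
```

Unchanged chunks keep their existing embeddings, which is what makes updates cheap compared to a full re-index.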
Can I control how many chunks are retrieved per query?
Yes. You can configure the number of nearest chunks retrieved (k-nearest) and adjust similarity thresholds to fine-tune the relevance of retrieved context. These settings are configurable per node, so different use cases can have different retrieval strategies.
Is each node's knowledge base isolated from other nodes?
Yes. Every Interlocute node has its own isolated knowledge partition. Documents uploaded to one node are never visible to or retrievable by another node. This isolation is enforced at the infrastructure level, making it safe for multi-tenant and multi-use-case deployments.
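Conceptually, per-node isolation behaves like a namespaced index: every write and every query is scoped to one node's partition. The class below is a toy model of that behavior, not the infrastructure-level enforcement Interlocute actually uses.

```python
class PartitionedStore:
    """Each node id maps to its own private partition; a query can
    only ever see chunks written under the same node id."""

    def __init__(self):
        self._partitions = {}

    def add(self, node_id: str, chunk: str) -> None:
        self._partitions.setdefault(node_id, []).append(chunk)

    def query(self, node_id: str) -> list:
        # A node with no documents sees an empty partition,
        # never another node's data.
        return list(self._partitions.get(node_id, []))

store = PartitionedStore()
store.add("node-a", "internal pricing sheet")
```

Because the partition key is part of every operation, there is no code path through which one node's retrieval can touch another node's chunks.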
How is RAG usage tracked and billed?
RAG operations are metered as part of your node's computation usage. Each retrieval operation — including the embedding of the query and the similarity search — is logged and included in your usage ledger. There are no separate charges for the vector storage; it is included in the platform's usage-based pricing.
Can I use RAG alongside other Interlocute features like memory and scheduling?
Absolutely. RAG, long-term memory, scheduling, streaming, and tool use are all composable features of an Interlocute node. You can enable RAG on a node that also uses persistent memory and scheduled tasks — the features work together without conflicts.

Ready to build with RAG (Knowledge Retrieval)?

Deploy your node in seconds and start grounding its answers in your own data today.