Karpathy’s Markdown Alternative To RAG

A proposed LLM knowledge-base architecture discards the usual vector-database RAG pipeline and centers everything on a living markdown library. Instead of embedding documents and querying a separate retrieval stack, an AI system continuously edits and organizes markdown files that serve as the model’s evolving source of truth.

The design treats the markdown corpus as a kind of high‑level memory consolidation layer, while the language model handles pattern recognition and text generation. Rather than tuning cosine similarity thresholds or managing embedding drift, the system focuses on version control, file granularity, and how the AI agent decides what to summarize, merge, or delete. Concepts like information entropy and marginal cost of retrieval shift from the vector index to the structure of the markdown graph itself.
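The consolidation operations described above — summarize, merge, delete — can be sketched as plain file operations over a directory of markdown notes. This is a hypothetical illustration, not Karpathy's actual code: the class name, file layout, and the decision of *what* to merge (here hard-coded, in practice made by the LLM agent) are all assumptions.

```python
# Hypothetical sketch of a markdown-first knowledge store. In a real system
# an LLM agent would decide which notes to summarize, merge, or delete;
# here that decision is hard-coded for illustration.
from pathlib import Path
import tempfile

class MarkdownStore:
    """A directory of .md files treated as the model's source of truth."""

    def __init__(self, root):
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)

    def write(self, name, text):
        (self.root / f"{name}.md").write_text(text, encoding="utf-8")

    def read(self, name):
        return (self.root / f"{name}.md").read_text(encoding="utf-8")

    def merge(self, sources, target):
        """Consolidate several notes into one file, then delete the originals."""
        merged = "\n\n".join(self.read(s) for s in sources)
        self.write(target, merged)
        for s in sources:
            (self.root / f"{s}.md").unlink()

    def files(self):
        return sorted(p.stem for p in self.root.glob("*.md"))

store = MarkdownStore(tempfile.mkdtemp())
store.write("rag-notes", "# RAG\nEmbedding pipelines add operational cost.")
store.write("md-notes", "# Markdown memory\nPlain files are diffable.")
store.merge(["rag-notes", "md-notes"], "memory")
print(store.files())  # only the consolidated note remains
```

Because every operation is an ordinary file write or delete, the "index" is simply the directory structure, and granularity is controlled by how the agent splits or merges files.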

Karpathy presents the approach as looser and more forgiving than enterprise RAG stacks, which often accumulate complexity in orchestration logic, ranking heuristics, and monitoring. The markdown‑first pattern favors transparency, since every change is visible and diffable, and it aligns naturally with developer workflows built around repositories, pull requests, and documentation review.
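The transparency claim above comes down to plain text being diffable. A minimal illustration, with invented file contents, of how any agent edit to a note surfaces as an ordinary unified diff:

```python
# Illustrative only: the note contents and file name are invented.
# Any change an agent makes to a markdown note can be reviewed as a
# standard unified diff, exactly as in a pull request.
import difflib

before = "# Retrieval notes\nUse a vector DB for lookup.\n"
after = "# Retrieval notes\nKeep notes in markdown; retrieve by file structure.\n"

text = "".join(difflib.unified_diff(
    before.splitlines(keepends=True),
    after.splitlines(keepends=True),
    fromfile="notes.md",
    tofile="notes.md",
))
print(text)
```

Under version control, the same edit would appear in `git diff` and could go through normal pull-request review, which is what aligns the pattern with existing developer workflows.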
