Agentic and Applied AI / Course

RAG Systems: From Naive to Agentic

Every level of RAG — from naive cosine-on-embeddings to hybrid + reranking + multi-hop + agentic + production DevOps. Each module is the next move when the previous one breaks.

Free preview

Certificate: 1 of 5 capstones

Ten modules, ~100 challenges that take you from the 30-line naive RAG everyone ships first, through hybrid search, reranking, query rewriting, multi-hop, evaluation, citations, agentic RAG, scale, and the production DevOps that makes it real. Code in Python (the AI default) with Node alternatives where it matters. Built on what real production RAG looks like at 2026: Anthropic's Citations API, prompt caching, Contextual Retrieval, MCP, and a properly layered eval discipline.

Built by Lakshya Kumar

rag

retrieval

llm

embeddings

vector-db

agentic

evaluation

Before you start4 items

You're comfortable in Python (or Node.js) — every demo runs in Python with examples for Node where relevant.
You've called an LLM API before (OpenAI, Anthropic, or any local model).
You can stand up a Postgres + Docker locally OR have access to a managed Postgres (Supabase, Neon, RDS).
You're OK reading a paper now and then — a few modules link to the seminal papers behind the technique.

Is this course for you?Ask an AI

Get access to RAG Systems: From Naive to Agentic

$3.99

30-day access

Prefer the whole catalog? See all-access membership.

Ask for access

We grant free access case-by-case — students, career-switchers, builders on a tight budget. Sign in to send us a note.

Capstone projects

Submit any 1 of 5 to earn the certificate

Complete all modules, then submit the required number of capstone projects. Each must earn a passing rating from an admin reviewer.

capstoneProduction RAG over a real corpus

Pick a real corpus (your company docs, a public dataset like Wikipedia or arXiv, or your own knowledge base). Ship a production-quality RAG with: chunking dispatcher, hybrid + rerank, query rewriting, agentic mode for complex queries, citations, eval gate in CI, live monitoring, and the full production checklist. Submit the live URL + the metrics + the checklist.

Submit RAG systemMinimum rating for approval: 3/5

rag-eval-packEval pack for any RAG

Build an eval framework that another team could drop in: golden-set tooling, retrieval + generation metrics, LLM-as-judge with calibration, CI gate, online signal integration, dashboard. Submit the framework as a small npm/pypi package or repo.

Further reading & study material7 sources

Prompt

I'm taking a "RAG Systems" course that runs from naive RAG through hybrid + rerank, query rewriting, multi-hop, evals, citations, agentic RAG, scale, and production DevOps. It uses Python (with Node alternatives) and lots of real 2026 production tricks (Anthropic Citations API, prompt caching, Contextual Retrieval, MCP).

Here's my context:
1. My current product/project is: [describe]
2. My current RAG state: [haven't built one / naive prototype / shipping / scaling]
3. My corpus: [size, doc types, growth rate]
4. Where I think RAG is failing me: [my guess]

Given that, answer:
- Which module should I prioritize, and why?
- Name 3 concrete wins this course would unlock for my situation.
- Name 1 thing the course won't help me with so I don't have wrong expectations.
- If I only had 2 hours this week, which single technique gives me the biggest lift? How would I measure that it worked?

RAG Systems: From Naive to Agentic

Naive RAG: the floor you build from

Chunking strategies

Better retrieval: BM25, hybrid, rerank

Query rewriting & expansion

Multi-hop retrieval

RAG evaluation metrics

Citations & faithfulness enforcement

Agentic RAG: agents decide what and when

RAG at scale

RAG DevOps & production