learn_rag

A RAG service that answers questions about FastAPI by searching the FastAPI documentation. It is intentionally built without LangChain, to keep direct control over chunking, retrieval, and prompt construction.

What it does

Fetches FastAPI docs from GitHub, chunks them by section, embeds them with sentence-transformers, and stores them in PostgreSQL via pgvector.
On a query, it embeds the question, finds the closest chunks by vector similarity, passes them to Claude as context, and returns the answer with source references.
Every query is traced in LangFuse: token usage, retrieval scores, and latency.
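
The prompt-construction step of the query flow above could look roughly like the sketch below. It is illustrative only, not the code in this repository: `RetrievedChunk`, its field names, and the prompt wording are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class RetrievedChunk:
    """One retrieval hit. The class and all field names are hypothetical."""
    source: str    # docs page the chunk came from
    section: str   # heading of the section the chunk was cut from
    text: str
    score: float   # similarity score reported by the retriever


def build_prompt(question: str, chunks: list[RetrievedChunk]) -> tuple[str, list[str]]:
    """Return (prompt, sources), numbering chunks so the answer can cite them."""
    # Best-scoring chunk first, so it gets citation number 1.
    ranked = sorted(chunks, key=lambda c: c.score, reverse=True)
    blocks: list[str] = []
    sources: list[str] = []
    for i, chunk in enumerate(ranked, start=1):
        blocks.append(f"[{i}] {chunk.source} / {chunk.section}\n{chunk.text}")
        sources.append(f"[{i}] {chunk.source}#{chunk.section}")
    context = "\n\n".join(blocks)
    prompt = (
        "Answer the question using only the numbered context below. "
        "Cite the context entries you used, e.g. [1].\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    return prompt, sources
```

Returning the source list alongside the prompt lets the service attach references to the response without parsing the model's output.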

Stack

API — FastAPI + asyncpg (no ORM)
Vector search — PostgreSQL + pgvector (all-MiniLM-L6-v2, 384 dims)
LLM — Anthropic (claude-sonnet-4-6)
Observability — LangFuse Cloud
Local infra — Docker Compose (DB only)
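
As a rough illustration of how asyncpg and pgvector fit together without an ORM, here is a minimal nearest-neighbour query. The `chunks` table and its `source`, `content`, and `embedding` columns are hypothetical names chosen for this sketch; see app/db/ for the real schema. `<=>` is pgvector's cosine-distance operator.

```python
# Hypothetical schema: chunks(source text, content text, embedding vector(384)).
# The parameter is sent as text and cast on the server, so this sketch does not
# depend on a vector codec being registered on the connection.
SEARCH_SQL = """
SELECT source, content, 1 - (embedding <=> $1::text::vector) AS similarity
FROM chunks
ORDER BY embedding <=> $1::text::vector
LIMIT $2
"""

EMBEDDING_DIMS = 384  # output size of all-MiniLM-L6-v2


def to_vector_literal(embedding: list[float]) -> str:
    """Format floats in pgvector's text form, e.g. '[0.1,0.2]'."""
    return "[" + ",".join(repr(float(x)) for x in embedding) + "]"


async def search(conn, embedding: list[float], k: int = 5):
    """Fetch the k closest chunks; `conn` is an asyncpg connection (or pool)."""
    if len(embedding) != EMBEDDING_DIMS:
        raise ValueError(f"expected {EMBEDDING_DIMS} dimensions, got {len(embedding)}")
    return await conn.fetch(SEARCH_SQL, to_vector_literal(embedding), k)
```

An alternative is the `register_vector` helper from the `pgvector` Python package (`pgvector.asyncpg`), which lets lists or NumPy arrays be passed as parameters directly.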

Setup

cp .env.example .env      # add ANTHROPIC_API_KEY and LANGFUSE_* keys
make install              # install dependencies + set up pre-commit hooks
make db-up                # start PostgreSQL with pgvector
make ingest               # fetch docs, embed, store — run once
make dev                  # API at http://localhost:8000

Query example:

curl -s -X POST http://localhost:8000/api/v1/query \
  -H "Content-Type: application/json" \
  -d '{"question": "How do I declare path parameters?"}' | python -m json.tool
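
The same request can be sent from Python using only the standard library. This is a sketch: the endpoint and payload are taken from the curl command above, while the function names are made up for the example and the response is assumed to be a JSON object whose fields are not documented here.

```python
import json
import urllib.request

API_URL = "http://localhost:8000/api/v1/query"


def build_query_request(question: str, url: str = API_URL) -> urllib.request.Request:
    """Build the POST request that the curl example sends."""
    body = json.dumps({"question": question}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(question: str, timeout: float = 60.0) -> dict:
    """Send the question to a running API and return the decoded JSON body."""
    request = build_query_request(question)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With the API running (`make dev`), `ask("How do I declare path parameters?")` returns the parsed response.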

Evaluation

make eval

Runs the 15 questions in eval/golden_dataset.json through the pipeline and has an LLM judge score each answer for faithfulness and relevance on a 1–5 scale.
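
To show how per-question judge scores might be rolled up into a report, here is a small sketch. The `JudgeScore` record, the `summarize` helper, and the threshold of 4 are assumptions for illustration. The actual output format of `make eval` is not described here.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class JudgeScore:
    """Judge verdict for one golden question (hypothetical record shape)."""
    question: str
    faithfulness: int  # 1 to 5
    relevance: int     # 1 to 5

    def __post_init__(self) -> None:
        # Reject scores outside the 1 to 5 scale the judge is asked to use.
        for name in ("faithfulness", "relevance"):
            value = getattr(self, name)
            if not 1 <= value <= 5:
                raise ValueError(f"{name} must be in 1..5, got {value}")


def summarize(scores: list[JudgeScore], threshold: int = 4) -> dict:
    """Average both metrics and list questions where either falls below threshold."""
    if not scores:
        raise ValueError("no scores to summarize")
    weak = [s.question for s in scores if min(s.faithfulness, s.relevance) < threshold]
    return {
        "n": len(scores),
        "mean_faithfulness": mean(s.faithfulness for s in scores),
        "mean_relevance": mean(s.relevance for s in scores),
        "below_threshold": weak,
    }
```

Listing the weak questions next to the averages makes regressions easier to spot than a single aggregate number.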

Project structure

app/
  api/routes/    routers — HTTP only, call services and return
  services/      business logic — owns the RAG flow
  core/          infrastructure — embedder, retriever, LLM client, tracing
  db/            asyncpg pool + schema
  schemas/       Pydantic models
ingestion/       one-shot pipeline: fetch → chunk → embed → upsert
eval/            offline evaluation: golden dataset + LLM-as-judge
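
The chunk stage of the ingestion pipeline splits docs by section. A minimal sketch of heading-based splitting for Markdown is shown below. It is an assumption about one reasonable approach, not the code in ingestion/, and the `Chunk` type and `chunk_by_section` name are hypothetical.

```python
import re
from dataclasses import dataclass

HEADING = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")
FENCE = "`" * 3  # Markdown code-fence marker


@dataclass
class Chunk:
    section: str  # heading text, or the default for text before the first heading
    text: str


def chunk_by_section(markdown: str, default_section: str = "") -> list[Chunk]:
    """Split a Markdown document into one chunk per heading-delimited section."""
    chunks: list[Chunk] = []
    section = default_section
    lines: list[str] = []
    in_fence = False

    def flush() -> None:
        body = "\n".join(lines).strip()
        if body:  # skip sections that contain only a heading
            chunks.append(Chunk(section, body))

    for line in markdown.splitlines():
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
        # Lines starting with '#' inside code fences are comments, not headings.
        match = None if in_fence else HEADING.match(line)
        if match:
            flush()
            section, lines = match.group(2), []
        else:
            lines.append(line)
    flush()
    return chunks
```

Keeping the heading with each chunk gives the retriever a natural source reference to return with the answer.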

See DESIGN.md for architecture decisions and data flow details.

