RAG Chatbot

Retrieval-Augmented Generation with pgvector + OpenAI

Upload File

Welcome to RAG Chatbot!

Upload documents and ask questions about them.

Upload documents to build your custom knowledge base

Vector similarity search finds relevant context

GPT-4 generates answers based on your data

Tech Stack & Architecture

PDF/TXT Upload

→

Vercel Blob

→

Extract Text

→

Chunk (500 tokens)

→

Generate Embeddings

→

Store in Neon

User Question

→

Embed Query

→

Vector Search (pgvector)

→

Retrieve Top 5 Chunks

→

GPT-4 + Context

→

AI Response

✓HNSW Indexing: Fast approximate nearest neighbor search

✓Semantic Chunking: 500 tokens with 50-token overlap

✓Cosine Similarity: <=> operator for vector distance

✓Source Citations: Track which documents answered questions