What is RAG?... and how to get it right
Amit Kohli
Access Social Care - Head of Data
& freelance consultant
What you’ll learn today
- What is Retrieval Augmented Generation (RAG)
- The traps you’ll probably fall into
- Tangible things you can implement right away
I WANT...
- MY STUFF
- DOCS I CAN TRUST
- BALANCE
and serious use-cases:
- Find information faster
- Bespoke chatbots
- Generate 'on-brand' content faster
- Control the corpus!
- chaos / one source dominate
- You need metadata!
- Evaluate outputs against clear criteria
- Human in the loop and internal until you're sure
- Don't trust simple solutions!
- "everything should be made as simple as possible, but no simpler"
What did I do?
- Data sources and metadata are stored in Monday.com
- Built an app: Ingest Docs / Ask Questions
Tech stack
Streamlit app, chromadb vector store, ChatGPT LLM, Langsmith eval
Key takeaways
- Start with your corpus: What docs do you trust?
- Metadata systems for document storage are key
- Similarity scores aren’t the whole story
- Chunk re-ranking
- Smarter chunking / multimodal digestion
- Knowledge graphs / graphRAG
What is RAG?... and how to get it right Amit Kohli Access Social Care - Head of Data & freelance consultant