<noscript />

kodeco.com uses JavaScript extensively to offer the best possible user experience. JavaScript is currently disabled in your browser, and so we are unable to display all of our wonderful content. Please enable JavaScript in your browser and refresh this page.

Lessons

Retrieval-Augmented Generation with LangChain

5 lessons · 2 hrs, 3 mins

Lesson 1: Introduction to Retrieval-Augmented Generation (RAG)

7 parts · 21 minutes

Reading
Introduction
Reading · 1 min
Reading
Introduction to Retrieval-Augmented Generation
Reading · 6 mins
Video
Basic RAG Application Demo
Video · 3 mins
Reading
Introducing Embeddings & Vector Databases
Reading · 4 mins
Video
Embeddings & Vector Databases Demo
Video · 6 mins
Reading
Conclusion
Reading · 1 min

Lesson 2: Working with Embeddings & Vector Databases

8 parts · 22 minutes

Locked
Introduction
Reading · 1 min
Locked
Vector Databases in RAG Applications
Reading · 3 mins
Locked
Vector Dimensions & Embeddings
Reading · 4 mins
Locked
Vector Embeddings Demo
Video · 4 mins
Locked
Introducing Chroma Database
Reading · 6 mins
Locked
Chroma Demo
Video · 5 mins
Locked
Conclusion
Reading · 1 min

Lesson 3: Building a Basic RAG System with LangChain

7 parts · 25 minutes

Locked
Introduction
Reading · 1 min
Locked
Introducing SportsBuddy
Reading · 11 mins
Locked
Building a Basic RAG App Demo
Video · 4 mins
Locked
Enhancing a RAG App
Reading · 4 mins
Locked
Conversational RAG App Demo
Video · 4 mins
Locked
Conclusion
Reading · 1 min

Lesson 4: Advanced RAG Techniques

7 parts · 17 minutes

Locked
Introduction
Reading · 1 min
Locked
Advanced RAG Techniques
Reading · 5 mins
Locked
OpenAI & LangChain Demo
Video · 4 mins
Locked
Enhancing a Basic RAG App
Reading · 4 mins
Locked
Enhancing a Basic RAG App Demo
Video · 3 mins
Locked
Conclusion
Reading · 1 min

Lesson 5: Evaluating & Optimizing RAG Systems

8 parts · 35 minutes

Locked
Introduction
Reading · 1 min
Locked
Assessing a RAG Pipeline
Reading · 12 mins
Locked
Assessing a RAG Pipeline Demo
Video · 5 mins
Locked
Understanding Query Analysis
Reading · 7 mins
Locked
Understanding Query Analysis Demo
Video · 5 mins
Locked
Improving Conversational Traits
Reading · 5 mins
Locked
Conclusion
Reading · 1 min

Retrieval-Augmented Generation with LangChain

Nov 12 2024 · Python 3.12, LangChain 0.3.x, JupyterLab 4.2.4

Lesson 01: Introduction to Retrieval-Augmented Generation (RAG)

Basic RAG Application Demo

Episode complete

Play next episode

Transcript

Demo

In this demo, you’ll see a simple RAG application in action. The community has done a great job at abstracting away a lot of the internals. In basically three steps, you can have a complete RAG for yourself. And you can do all these in fewer than 50 lines of code.

In this JupyterLab session, this notebook contains a full RAG application. Here’s a description and demonstration of the RAG at work.

The first cell contains simply an environment variable to help keep record of operations that pertain to this application. Nothing fancy. It’s worth noting the power of notebooks here. With notebooks, you get to build and test your application one cell at a time. Every cell can be executed independently, and subsequent cells retain memory of data from previous ones.

This cell contains the initializing code for the external data source. Remember that a RAG application first retrieves data. In this cell, the data is a Wikipedia page of the 2024 Summer Olympics. The model for this RAG has information on events up to 2021. It knows nothing about an event that happened in 2024. But in this RAG application, you’re feeding it with a more recent event. This is one of the key features that makes RAG super useful.

After loading the data source, the next step is to split the data into manageable chunks. This improves organization and facilitates efficient search and retrieval in the RAG system.

This isn’t crucial to the operations of a RAG but is certainly important. It’s the persistence feature of the RAG. In this cell, the Chroma vector database stores the previously retrieved Wikipedia data. It also includes an existing model from Ollama, the model used for this RAG application.

This next cell tests the database to make sure it’s good. This is a notebook, so it’s perfectly fine to test portions of the app, cell by cell.

This cell sets up the Ollama model. The specific model used here is the llama3.1:latest. There are many other models with different parameters, features, and size.

Next is a quick test of the model. Here, you can see that it doesn’t have any information about events beyond 2021.

Finally, this cell puts everything together. It starts with a prompt, which guides the model into giving a more accurate response.

It then creates a chain of processes with the source data, the prompt, the model, and a class that converts the output into a string format.

The result is a valid, natural response to an event that happened years after the model was trained. This is just awesome, isn’t it? Well, there you have it.

In the upcoming sections, you’ll learn more about all the things you’ve seen in this demo. See you there.

Retrieval-Augmented Generation with LangChain

Lesson 01: Introduction to Retrieval-Augmented Generation (RAG)

Basic RAG Application Demo

Episode complete

Demo

Sign up/Sign in

All videos. All books. One low price.

All videos. All books.
One low price.