Preparing and Using Data for AI with LangChain and OpenSearch^®

Learn how to prepare your existing content for AI using LangChain and store it in OpenSearch®, so it can be used with an LLM in the Retrieval Augmented Generation (RAG) pattern.

This workshop is 1,5 hours long.

See all workshops

What's in the Workshop Recipe?

We’ll work together to generate embeddings for podcast transcriptions and load that data into OpenSearch. Then we’ll search the documents using similarity search and use those results to improve our responses from an LLM (Large Language Model). Along the way we’ll explain the Retrieval Augmented Generation (RAG) pattern and show how it’s possible to try different LLMs without having to completely rewrite your code.

This workshop is particularly useful for determining what would be required to make your data usable with the RAG pattern.

Prepare for a Brain Upgrade

You will learn how to:

Find and generate embeddings for existing content
Ingest that content and its embeddings into OpenSearch
Use OpenSearch and LangChain to implement the RAG (Retrieval Augmented Generation) pattern with an LLM (Large Language Model)

Prerequisities

A web browser.
An Aiven account, using our free trial. We will lead you through setting that up in the workshop, if you don’t already have one.
A GitHub account. Here’s the GitHub repository we’ll be using.

Workshop host

Jay Miller portrait — Jay Miller
Staff Product Advocate, Aiven
Jay Miller is a Staff Product Advocate and long term advocate for building developer communities. Jay has been a contributor to the Python Community ecosystem and has participated in many user groups and conferences.
Learn more

Boost your tech skills with our developer workshops

Live and interactive sessions to upgrade your skills with expert guidance covering a range of open source technologies.

Explore all workshops

Related resources

Developer Center

Retrieval augmented generation with OpenAI and OpenSearch^®

Use OpenSearch® as a vector database to generate responses to user queries using AI

Developer Center

TensorFlow, PostgreSQL^®, PGVector & Next.js: building a movie recommender

Leveraging TensorFlow, PostgreSQL®, PGVector, and Next.js for vector search with this step-by-step video guide.

Developer Center

Find your perfect movie with ClickHouse^®, vector search, Hugging Face API, and Next.js

Get an introduction to machine learning using Aiven services and the Hugging Face API to recommend movies based on Wikipedia synopses.

Preparing and Using Data for AI with LangChain and OpenSearch®

What's in the Workshop Recipe?

Prepare for a Brain Upgrade

Prerequisities

Workshop host

Boost your tech skills with our developer workshops

Related resources

Retrieval augmented generation with OpenAI and OpenSearch®

TensorFlow, PostgreSQL®, PGVector & Next.js: building a movie recommender

Find your perfect movie with ClickHouse®, vector search, Hugging Face API, and Next.js

Preparing and Using Data for AI with LangChain and OpenSearch^®

Retrieval augmented generation with OpenAI and OpenSearch^®

TensorFlow, PostgreSQL^®, PGVector & Next.js: building a movie recommender

Find your perfect movie with ClickHouse^®, vector search, Hugging Face API, and Next.js