Preparing and Using Data for AI with LangChain and OpenSearch®

Learn how to prepare your existing content for AI using LangChain and store it in OpenSearch®, so it can be used with an LLM in the Retrieval Augmented Generation (RAG) pattern.

This workshop is 1,5 hours long.

See all workshops

What's in the Workshop Recipe?


We’ll work together to generate embeddings for podcast transcriptions and load that data into OpenSearch. Then we’ll search the documents using similarity search and use those results to improve our responses from an LLM (Large Language Model). Along the way we’ll explain the Retrieval Augmented Generation (RAG) pattern and show how it’s possible to try different LLMs without having to completely rewrite your code.

This workshop is particularly useful for determining what would be required to make your data usable with the RAG pattern.

Prepare for a Brain Upgrade

You will learn how to:

  • Find and generate embeddings for existing content
  • Ingest that content and its embeddings into OpenSearch
  • Use OpenSearch and LangChain to implement the RAG (Retrieval Augmented Generation) pattern with an LLM (Large Language Model)

Prerequisities

  • A web browser.
  • An Aiven account, using our free trial. We will lead you through setting that up in the workshop, if you don’t already have one.
  • A GitHub account. Here’s the GitHub repository we’ll be using.

Workshop host

Jay Miller

Staff Developer Advocate, Aiven

Jay Miller is a Staff Developer Advocate and long term advocate for building developer communities. Jay has been a contributor to the Python Community ecosystem and has participated in many user groups and conferences.

Learn more
Boost your tech skills with our developer workshops

Live and interactive sessions to upgrade your skills with expert guidance covering a range of open source technologies.

Explore all workshops