We'll walk you through the creation of a Retrieval-Augmented Generation (RAG) chatbot using open-source tools and AWS services like LangChain, Hugging Face, Amazon SageMaker, and Amazon OpenSearch Serverless.
Part 1: • Retrieval-Augmented Ge... - LangChain, Hugging Face, FAISS, Amazon SageMaker, and Amazon TextTract.
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos. Follow me on Medium at / julsimon or Substack at julsimon.substack.com. ⭐️⭐️⭐️
We start by deploying Mistral 7B, a cutting-edge open-source LLM, onto a SageMaker endpoint. Following this, we work with the Reuters dataset, a Hugging Face dataset comprising 20,000 news articles. We break down these articles into smaller sections and apply bge-small, a compact open-source embedding model, to them.
Next, we proceed to index these sections into an Amazon OpenSearch Serverless vector index, which we then query through LangChain.
Additionally, aside from the RAG demonstration, we delve into some vital yet often overlooked steps related to authentication and security for OpenSearch Serverless.
- Notebook: gitlab.com/juliensimon/huggin...
- LangChain: www.langchain.com/
- Amazon OpenSearch Serverless: docs.aws.amazon.com/opensearc...
- Embedding leaderboard: huggingface.co/spaces/mteb/le...
- Embedding model: huggingface.co/BAAI/bge-small...
- LLM: huggingface.co/mistralai/Mist...
Негізгі бет Retrieval-Augmented Generation chatbot, part 2 - LangChain, Hugging Face, OpenSearch, AWS
Пікірлер: 16