RAG-Driven Generative AI

Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone

Denis Rothman (Autor)

Buch | Softcover

334 Seiten

2024
Packt Publishing Limited (Verlag)
978-1-83620-091-8 (ISBN)

Titel nicht im Sortiment

Artikel merken

Minimize AI hallucinations and build accurate, custom generative AI pipelines with RAG using embedded vector databases and integrated human feedback

Purchase of the print or Kindle book includes a free eBook in PDF format

Key Features

Implement RAG’s traceable outputs, linking each response to its source document to build reliable multimodal conversational agents
Deliver accurate generative AI models in pipelines integrating RAG, real-time human feedback improvements, and knowledge graphs
Balance cost and performance between dynamic retrieval datasets and fine-tuning static data

Book DescriptionRAG-Driven Generative AI provides a roadmap for building effective LLM, computer vision, and generative AI systems that balance performance and costs.

This book offers a detailed exploration of RAG and how to design, manage, and control multimodal AI pipelines. By connecting outputs to traceable source documents, RAG improves output accuracy and contextual relevance, offering a dynamic approach to managing large volumes of information. This AI book shows you how to build a RAG framework, providing practical knowledge on vector stores, chunking, indexing, and ranking. You’ll discover techniques to optimize your project’s performance and better understand your data, including using adaptive RAG and human feedback to refine retrieval accuracy, balancing RAG with fine-tuning, implementing dynamic RAG to enhance real-time decision-making, and visualizing complex data with knowledge graphs.

You’ll be exposed to a hands-on blend of frameworks like LlamaIndex and Deep Lake, vector databases such as Pinecone and Chroma, and models from Hugging Face and OpenAI. By the end of this book, you will have acquired the skills to implement intelligent solutions, keeping you competitive in fields from production to customer service across any project.What you will learn

Scale RAG pipelines to handle large datasets efficiently
Employ techniques that minimize hallucinations and ensure accurate responses
Implement indexing techniques to improve AI accuracy with traceable and transparent outputs
Customize and scale RAG-driven generative AI systems across domains
Find out how to use Deep Lake and Pinecone for efficient and fast data retrieval
Control and build robust generative AI systems grounded in real-world data
Combine text and image data for richer, more informative AI responses

Who this book is forThis book is ideal for data scientists, AI engineers, machine learning engineers, and MLOps engineers. If you are a solutions architect, software developer, product manager, or project manager looking to enhance the decision-making process of building RAG applications, then you’ll find this book useful.

Denis Rothman graduated from Sorbonne University and Paris-Diderot University, and as a student, he wrote and registered a patent for one of the earliest word2vector embeddings and word piece tokenization solutions. He started a company focused on deploying AI and went on to author one of the first AI cognitive NLP chatbots, applied as a language teaching tool for Moët et Chandon (part of LVMH) and more. Denis rapidly became an expert in explainable AI, incorporating interpretable, acceptance-based explanation data and interfaces into solutions implemented for major corporate projects in the aerospace, apparel, and supply chain sectors. His core belief is that you only really know something once you have taught somebody how to do it.

Table of Contents

Why Retrieval Augmented Generation?
RAG Embedding Vector Stores with Deep Lake and OpenAI
Building Index-Based RAG with LlamaIndex, Deep Lake, and OpenAI
Multimodal Modular RAG for Drone Technology
Boosting RAG Performance with Expert Human Feedback
Scaling RAG Bank Customer Data with Pinecone
Building Scalable Knowledge-Graph-Based RAG with Wikipedia API and LlamaIndex
Dynamic RAG with Chroma and Hugging Face Llama
Empowering AI Models: Fine-Tuning RAG Data and Human Feedback
RAG for Video Stock Production with Pinecone and OpenAI

Erscheinungsdatum	23.08.2024
Verlagsort	Birmingham
Sprache	englisch
Maße	191 x 235 mm
Themenwelt	Mathematik / Informatik ► Informatik ► Netzwerke
	Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik
	Informatik ► Weitere Themen ► Hardware
ISBN-10	1-83620-091-9 / 1836200919
ISBN-13	978-1-83620-091-8 / 9781836200918
Zustand	Neuware