Agentic RAG on Dell AI Factory with NVIDIA and Elasticsearch vector database

AI-assist.png

We are excited to collaborate with Dell on the white paper,Agentic RAG on Dell AI Factory with NVIDIA. The white paper is a design reference document for developers outlining strategies and solution components to implement agentic retrieval augmented generation (RAG) applications. It’s a design point for organizations across industries, specifically healthcare, for the agentic RAG framework decision-making with AI-driven data retrieval.

What is Dell AI Factory with NVIDIA?

The Dell AI Factory with NVIDIA is a comprehensive, end-to-end AI solution designed to simplify and accelerate AI adoption for businesses across various industries. The catalog offerings are built on a range of Dell's advanced PowerEdge servers and paired with NVIDIA's AI technology to provide everything needed to process, manage, and analyze vast amounts of data. Learn more.

Elasticsearch vector database

As organizations use internal data for context-driven generative AI (GenAI) solutions, working with unstructured and semi-structured data and retrieving relevant information quickly without sacrificing scale remains challenging. The Agentic RAG on Dell AI Factory with NVIDIA white paper recommends the Elasticsearch vector database for vector data indexing and retrieval at scale.

agentic rag on dell ai factory with NVIDIA
Agentic RAG stack featuring NVIDIA NIM tooling on Dell AI Factory with Elasticsearch vector database

Elasticsearch is the world’s most downloaded vector database — and we continue to extend our advantage. We recently introduced Better Binary Quantization (BBQ), which brings significant speed and efficiency benefits for storing large vectorized data sets. Elastic is the only vector database that offers this capability (at the time of publication). BBQ outperforms traditional approaches like Product Quantization (PQ) in indexing speed (20x–30x less quantization time) and query speed (2x–5x faster queries) with no additional loss in accuracy.

Simpler, powerful integrations for GenAI developers: Elastic AI Ecosystem

Agentic RAG on Dell AI Factory with NVIDIA outlines all the solution components that developers may need to build real-world RAG applications — covering Dell technologies, Elasticsearch vector database, LangChain’s LangGraph, NVIDIA Inference Microservices, and others. This white paper underscores the value of a well-integrated ecosystem of AI technologies that accelerates customers’ development and deployment of RAG applications.

In addition to our collaboration with Dell, Elastic worked with LangChain to provide a retrieval agent template for LangGraph that’s preconfigured for the Elasticsearch vector database. By doing so, we continue our theme of providing developers with simpler, well-integrated generative AI offerings.

Happy AI agenting!

The release and timing of any features or functionality described in this post remain at Elastic's sole discretion. Any features or functionality not currently available may not be delivered on time or at all.

In this blog post, we may have used or referred to third party generative AI tools, which are owned and operated by their respective owners. Elastic does not have any control over the third party tools and we have no responsibility or liability for their content, operation or use, nor for any loss or damage that may arise from your use of such tools. Please exercise caution when using AI tools with personal, sensitive or confidential information. Any data you submit may be used for AI training or other purposes. There is no guarantee that information you provide will be kept secure or confidential. You should familiarize yourself with the privacy practices and terms of use of any generative AI tools prior to use. 

Elastic, Elasticsearch, ESRE, Elasticsearch Relevance Engine and associated marks are trademarks, logos or registered trademarks of Elasticsearch N.V. in the United States and other countries. All other company and product names are trademarks, logos or registered trademarks of their respective owners.