Exploring RAG Pipelines with Private AI Foundation and NVIDIA

November 25, 2024

Virtually Speaking Podcast

In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval-augmented generation (RAG), an approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components such as LLMs, NVIDIA Inference Microservices, and a vector database. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!

More episodes of "Virtually Speaking Podcast"

Revolutionizing Private Cloud: Broadcom’s VAO Partnerships with Dell, HPE, and Lenovo

The Lost Art of Blogging: Why it Still matters

Optimizing Storage Costs with vSAN and VMware Cloud Foundation

Mastering Cloud Costs with FinOps in VMware Cloud Foundation

VMware Live Recovery and New Innovations in Cyber Resilience

Revolutionizing Workload Performance with NVMe-based Memory Tiering

Achieving Zero Trust with VMware NSX Security and vDefend

ITQ’s POC Service: Accelerating Private AI Adoption

Exploring RAG Pipelines with Private AI Foundation and NVIDIA

Unpacking Explore Barcelona: Cutting-edge developments in private cloud, AI, and edge computing