Unlocking Reliable GenAI: Strategies for Assessing LLMs in Real-World Applications

Dive into the world of LLM reliability with HoneyHive. In this video, Dhruv uncovers the shortcomings of current evaluation methods and offers practical solutions to improve your GenAI application's performance. Learn strategies for rapid iteration and for using human feedback to make your application safer to operate. Dhruv also explains why using other LLMs as evaluators in your pipeline is necessary to scale your evaluation framework.
👉 Sign up for our "No BS" Newsletter to get the latest technical data & AI content: hubs.li/Q02vz6xC0
ABOUT THE SPEAKER:
Dhruv Singh, Co-founder & CTO, HoneyHive (ex- Microsoft)
ABOUT DATA COUNCIL:
Data Council brings together the brightest minds in data to share industry knowledge, technical architectures and best practices in building cutting edge data & AI systems and tools.
FIND US:
Twitter: /datacouncilai
LinkedIn: /datacouncil-ai
Website: www.datacounci...

