Dive into the world of LLM reliability with HoneyHive. In this video, Dhruv uncovers the shortcomings of current evaluation methods and provides practical solutions to boost your GenAI application's performance. Learn innovative strategies for rapid iteration and leveraging human feedback to ensure safer operations. Dhruv also covers how using other models (LLMs) to help with your evaluation pipeline is required to scale your evaluations framework.
👉 Sign up for our "No BS" Newsletter to get the latest technical data & AI content: hubs.li/Q02vz6xC0
ABOUT THE SPEAKER:
Dhruv Singh, Co-founder & CTO, HoneyHive (ex- Microsoft)
ABOUT DATA COUNCIL:
Data Council brings together the brightest minds in data to share industry knowledge, technical architectures and best practices in building cutting edge data & AI systems and tools.
FIND US:
Twitter: / datacouncilai
LinkedIn: / datacouncil-ai
Website: www.datacounci...
Негізгі бет Unlocking Reliable GenAI: Strategies for Assessing LLMs in Real-World Applications
Пікірлер