Why Build an LLM Benchmark 📊

Learn how to select and maintain effective LLM benchmarks for better AI model evaluation and development.

Big Data Demystified
3.8K views • Jan 17, 2024

About this video

📊 Dive Deep into the World of LLM Benchmarks! 📊

Objective: By the end of this session, you should have a good understanding of how to select and maintain your own LLM benchmark.

Agenda:
🔬 Demo!
🔍 Discover what ARC, HellaSwag, and MMLU actually are
🧫 Learn how to select the right benchmark
🧪 Methods for testing LLMs on your own use case
🧱 Q&A

Speaker: J. Yarkoni, ex-Google AI/ML Specialist (Shujin.ai)
Jonathan comes from a background of leading R&D teams. Previously he co-founded NAM, an advertising startup, and AA-TLV meetup, which at its peak had 3,500 members. Over the last six years, he spearheaded AI/ML initiatives at Google Cloud Israel. More recently, he established Shujin.AI, a consultancy specializing in ML projects with an emphasis on Generative AI.

https://big-data-demystified.ninja/2024/01/17/why-you-should-build-an-llm-benchmark/

Video Information

Views: 3.8K
Likes: 103
Duration: 37:53
Published: Jan 17, 2024
