Chatbot Arena: Test & Compare LLMs by Human Preference
Evaluate LLMs in the Chatbot Arena to see which delivers the highest human satisfaction. Let the best chatbot win! ๐ค

Ahmed Khaled
33 views โข Oct 4, 2024

About this video
Which is the best #LLM ? let's have them fight it out in the #Chatbot Arena!
The one that scores the best in "human satisfaction" wins! But how to measure this subjective metric? Anastasios Angelopoulo presented his work to me.
The core idea is simple. ๐ฅ Humans are presented with the outputs of two different LLMs for the same prompt and have to decide which one performs better. All votes are aggregated to a final ranking score. Try it out at: https://chat.lmsys.org/
The one that scores the best in "human satisfaction" wins! But how to measure this subjective metric? Anastasios Angelopoulo presented his work to me.
The core idea is simple. ๐ฅ Humans are presented with the outputs of two different LLMs for the same prompt and have to decide which one performs better. All votes are aggregated to a final ranking score. Try it out at: https://chat.lmsys.org/
Video Information
Views
33
Duration
1:52
Published
Oct 4, 2024
Related Trending Topics
LIVE TRENDSRelated trending topics. Click any trend to explore more videos.
Trending Now