Chatbot Arena: Test & Compare LLMs by Human Preference

Evaluate LLMs in the Chatbot Arena to see which delivers the highest human satisfaction. Let the best chatbot win! 🤖

Ahmed Khaled
33 views • Oct 4, 2024

About this video

Which is the best #LLM? Let's have them fight it out in the #Chatbot Arena!
The one that scores best on "human satisfaction" wins! But how do you measure such a subjective metric? Anastasios Angelopoulos presented his work to me.

The core idea is simple. 🥊 Humans are shown the outputs of two different LLMs for the same prompt and decide which one is better. All votes are then aggregated into a final ranking score. Try it out at: https://chat.lmsys.org/
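To make the aggregation step concrete, here is a minimal sketch of one common way to turn pairwise votes into a ranking: an Elo-style rating update. This is an illustration under stated assumptions, not a description of how the Arena actually computes its scores. The model names, the vote format, and the constants (`k=32`, a starting rating of 1000) are all hypothetical.

```python
from collections import defaultdict


def elo_ratings(votes, k=32.0, base=1000.0):
    """Aggregate pairwise votes into Elo-style ratings.

    votes: iterable of (model_a, model_b, winner), where winner is
    "a", "b", or "tie". The format is an assumption made for this sketch.
    """
    ratings = defaultdict(lambda: base)
    for model_a, model_b, winner in votes:
        ra, rb = ratings[model_a], ratings[model_b]
        # Probability that model_a wins, given the current ratings.
        expected_a = 1.0 / (1.0 + 10 ** ((rb - ra) / 400.0))
        score_a = {"a": 1.0, "b": 0.0, "tie": 0.5}[winner]
        # Move both ratings toward the observed outcome (zero-sum update).
        ratings[model_a] = ra + k * (score_a - expected_a)
        ratings[model_b] = rb + k * ((1.0 - score_a) - (1.0 - expected_a))
    return dict(ratings)


def leaderboard(votes):
    """Return model names sorted from highest to lowest rating."""
    ratings = elo_ratings(votes)
    return sorted(ratings, key=ratings.get, reverse=True)


# Hypothetical votes: each tuple is one human judgment on one prompt.
votes = [
    ("model-x", "model-y", "a"),
    ("model-y", "model-z", "a"),
    ("model-x", "model-z", "a"),
    ("model-x", "model-y", "tie"),
]
print(leaderboard(votes))
```

One caveat about this sketch: sequential Elo updates depend on the order in which votes arrive, so a real leaderboard might prefer an order-independent fit such as a Bradley-Terry model over the full vote set.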

Video Information

Views: 33
Duration: 1:52
Published: Oct 4, 2024
