Top LLM Leaderboard Based on 3 Benchmarks 🏆

A new leaderboard ranks LLMs on three benchmarks: Chatbot Arena (Elo ratings computed from user votes), MT-Bench, and MMLU.

1littlecoder
6.5K views • Nov 28, 2023

About this video

πŸ† This leaderboard is based on the following three benchmarks.

Chatbot Arena - a crowdsourced, randomized battle platform. We use 100K+ user votes to compute Elo ratings.
MT-Bench - a set of challenging multi-turn questions. We use GPT-4 to grade the model responses.
MMLU (5-shot) - a test to measure a model's multitask accuracy on 57 tasks.
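
To make the Chatbot Arena entry above concrete, here is a minimal sketch of how Elo ratings can be computed from pairwise votes. It is an illustration, not the leaderboard's actual code: the function name `compute_elo`, the battle tuple layout, the winner labels, and the default parameters (K-factor 4, scale 400, starting rating 1000) are all assumptions chosen for this example. See the Arena Leaderboard Elo Ranking Method notebook in the links for the method the leaderboard really uses.

```python
from collections import defaultdict


def compute_elo(battles, k=4.0, scale=400.0, base=10.0, init_rating=1000.0):
    """Online Elo over a sequence of battles (illustrative sketch).

    battles: iterable of (model_a, model_b, winner) tuples, where winner is
    "model_a", "model_b", or anything else for a tie (assumed labels).
    """
    ratings = defaultdict(lambda: init_rating)
    for model_a, model_b, winner in battles:
        ra, rb = ratings[model_a], ratings[model_b]
        # Expected score of model_a under the standard Elo logistic curve.
        expected_a = 1.0 / (1.0 + base ** ((rb - ra) / scale))
        if winner == "model_a":
            score_a = 1.0
        elif winner == "model_b":
            score_a = 0.0
        else:
            score_a = 0.5  # tie
        # Zero-sum update: what one model gains, the other loses.
        delta = k * (score_a - expected_a)
        ratings[model_a] = ra + delta
        ratings[model_b] = rb - delta
    return dict(ratings)


if __name__ == "__main__":
    # Hypothetical votes between two made-up models.
    votes = [("model-x", "model-y", "model_a"), ("model-x", "model-y", "tie")]
    print(compute_elo(votes))
```

Because the update is applied one vote at a time, the result depends on the order of the votes; a small K-factor keeps any single vote from moving a rating very far.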

🔗 Links 🔗

Chatbot Arena Leaderboard from LMSYS - https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

Arena Leaderboard Elo Ranking Method - https://colab.research.google.com/drive/1RAWb22-PFNI-X1gPVzc927SGUdfr6nsR?usp=sharing

Play at the Arena - https://chat.lmsys.org/?arena



Intro Sound from Honest Trailers - https://youtu.be/lZMzf-SDWP8

❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - https://ko-fi.com/1littlecoder

🧭 Follow me on 🧭
Twitter - https://twitter.com/1littlecoder
LinkedIn - https://www.linkedin.com/in/amrrs/


Video Information

Views: 6.5K
Likes: 249
Duration: 11:24
Published: Nov 28, 2023

