Qwen3 has a major problem

Alibaba has introduced Qwen3, a new open-weight family of models featuring two MoE (Mixture-of-Experts) variants and six dense models. According to Alibaba, ...

Qwen3 has a major problem
Plivo
83.2K views β€’ May 15, 2025
Qwen3 has a major problem

About this video

Alibaba has introduced Qwen3, a new open-weight family of models featuring two MoE (Mixture-of-Experts) variants and six dense models. According to Alibaba, the top model of Qwen3 outperforms competitors like OpenAI's o1, DeepSeek-R1, and Gemini 2.5 Pro. However, real-world testing presents mixed results. Ivan tested Qwen3’s 30B and 32B models with identical prompts, and the models passed the hexagon and strawberry tests, as well as generating a functional game prototype. Despite these successes, Qwen3 exhibited some significant issues. One notable problem, identified by Theo, is the model's tendency to overthink. In some cases, it consumed thousands of tokens simply processing its response, exhausting its context before delivering an answer. In one instance, it repeatedly said "wait" 32 times during a code prompt. This issue intensified on Qwen's official chat site, where the model said "wait" 81 times, though a workaround involving the /no\_think prompt was found to resolve the issue.

Tags and Topics

Browse our collection to discover more content in these categories.

Video Information

Views

83.2K

Likes

1.7K

Duration

1:05

Published

May 15, 2025

User Reviews

4.7
(16)
Rate:

Related Trending Topics

LIVE TRENDS

Related trending topics. Click any trend to explore more videos.

Trending Now