X

Category

Overview

Most Reviewed

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

Top Rated

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

hardware

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

Qwen3 Highlights Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training,

Reviews

Tags


  • aigc_coder 2025-05-02 11:56
    Interesting:3,Helpfulness:2,Correctness:3

    Qwen3 235B A22B model is more like an upgraded version of DeepSeek-R1. And it is also compared with Deepseek R1 model on multiple benchmarks of code and math. Personally, I don't Qwen3 is a huge upgrade compared to Gemini/OpenAI and Deepseek model, but more like a compromised version of complex thinking and realistic usage.


  • AILearner98 2025-05-02 11:49
    Interesting:5,Helpfulness:5,Correctness:5
    Prompt: In plane quadrilateral ABCD, AB = AC = CD = 1,\angle ADC = 30^{\circ},\angle DAB = 120^{\circ}. Fold triangle ACD along AC to triangle ACP, where P is a moving point. Find the minimum cosine value of the dihedral angle A - CP - B.

    Correct result: \sqrt(3)/3. To test the geometry question on Qwen app and the thinking mode you can get the result: Thinking mode: correct answer \sqrt(3)/3. Without thinking mode: wrong answer. Overall, the 235B model is quite powerful compared to previous SOTA model. More about the key updates in Qwen3: Hybrid reasoning model, expanded language support (100+ languages), enhanced tool calling capabilities with Qwen-Agent supporting MCP. The newly open-sourced Qwen3 is China's first "hybrid reasoning model", a concept initially proposed by Claude3.7 and recently adopted by Gemini2.5 Flash. Essentially, this allows the model to toggle reasoning processes on/off. The primary purpose is to accelerate response generation for simple queries or time-sensitive scenarios by optionally disabling the thinking process while maintaining output quality. Previous approaches struggled to directly suppress reasoning steps in LLMs without retraining, as prompt engineering offered limited control. Qwen3 introduces two control methods: 1) A hard switch via enable_thinking parameter (True/False), and 2) When enabled, secondary soft switching through appending /no_think or /think tokens. Qwen also provides recommended parameter configurations to ensure optimal performance: Think mode: Temperature=0.6, TopP=0.95, TopK=20, MinP=0 Non-think mode: Temperature=0.7, TopP=0.8, TopK=20, MinP=0 Additionally, Qwen3 features specialized training for tool invocation, with Qwen-Agent now supporting MCP.


  • Ogino Chihiro 2025-04-29 10:13
    Interesting:5,Helpfulness:5,Correctness:5
    Prompt: What happened in the country when Elon Musk 's Mom gave birth to him?

    Qwen3 235B is the largest MoE model of the Qwen3 series. I tried some multi-hop.logic question on the Qwen App and the response generation is really fast. I asked what happened in the country when Elon Musk 's Mom give birth to him? The model summarized the result of war and many situation in South Africa.

Write Your Review

Detailed Ratings

Upload Pictures and Videos