Alibaba’s Latest Open-Source Model Said to Match DeepSeek-R1’s Performance

Alibaba’s Qwen team, a division tasked with developing artificial intelligence (AI) models, released the QwQ-32B AI model on Wednesday. It is a reasoning model based on extended test-time compute with visible chain-of-thought (CoT). The developers claim that despite being smaller than DeepSeek-R1, the model can match its performance based on benchmark scores. Like other AI models released by the Qwen team, the QwQ-32B is also an open-source AI model; however, it is not fully open-sourced.

QwQ-32B Reasoning AI Model Released

In a blog post, Alibaba’s Qwen team detailed the QwQ-32B reasoning model, part of the QwQ (short for Qwen with Questions) series. The QwQ-32B is a 32-billion-parameter model developed by scaling reinforcement learning (RL) techniques.

Explaining the training process, the developers said that the RL scaling approach was applied to a cold-start checkpoint. Initially, RL was used only for coding and mathematics-related tasks, and the responses were verified to ensure accuracy. Later, the technique was used for general capabilities along with rule-based verifiers. The Qwen team found that this method increased the general capabilities of the model without reducing its math and coding performance.
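To make the idea of verifying responses concrete, here is a minimal Python sketch of one plausible shape for a rule-based reward. It is an illustration only: the function names (`math_reward`, `code_reward`), the answer normalization rule, and the 0/1 reward scale are assumptions, not details published by the Qwen team.

```python
from fractions import Fraction


def _normalize(answer: str) -> str:
    """Canonicalize a final answer so equivalent forms compare equal (hypothetical rule)."""
    text = answer.strip().rstrip(".").replace(",", "")
    try:
        # Treat "0.5", "1/2", and "2/4" as the same number.
        return str(Fraction(text))
    except (ValueError, ZeroDivisionError):
        # Not a plain number: fall back to a case-insensitive string match.
        return text.lower()


def math_reward(model_answer: str, reference: str) -> float:
    """Return 1.0 if the final answer matches the reference, else 0.0."""
    return 1.0 if _normalize(model_answer) == _normalize(reference) else 0.0


def code_reward(source: str, func_name: str, cases: list[tuple[tuple, object]]) -> float:
    """Return 1.0 only if the generated function passes every test case."""
    namespace: dict = {}
    try:
        # NOTE: exec on untrusted model output is unsafe; a real system would sandbox it.
        exec(source, namespace)
        func = namespace[func_name]
        passed = all(func(*args) == expected for args, expected in cases)
        return 1.0 if passed else 0.0
    except Exception:
        # Syntax errors, missing functions, and runtime failures all score zero.
        return 0.0


if __name__ == "__main__":
    print(math_reward("0.5", "1/2"))  # 1.0
    generated = "def add(a, b):\n    return a + b"
    print(code_reward(generated, "add", [((1, 2), 3), ((0, 0), 0)]))  # 1.0
```

A binary, rule-based signal like this is cheap to compute and hard to game compared with a learned reward model, which may be one reason such checks suit math and coding tasks where correctness can be tested directly.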

QwQ-32B AI model benchmarks
Photo Credit: Alibaba

The developers claim that this training structure enabled the QwQ-32B to perform at similar levels to DeepSeek-R1, despite the latter being a 671-billion-parameter model (with 37 billion activated). Based on internal testing, the team claimed that QwQ-32B outperforms DeepSeek-R1 on the LiveBench (coding), IFEval (chat or instruction fine-tuned language), and Berkeley Function Calling Leaderboard V3 or BFCL (ability to call functions) benchmarks.
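The size gap in that comparison is easy to quantify. The short Python sketch below uses only the figures reported for the two models (32 billion parameters for QwQ-32B; 671 billion total and 37 billion activated for DeepSeek-R1). The 2-bytes-per-parameter figure (16-bit weights) is an assumption for illustration, and the estimate covers the weights alone, ignoring activations, caches, and quantization.

```python
QWQ_PARAMS = 32e9          # QwQ-32B
R1_TOTAL_PARAMS = 671e9    # DeepSeek-R1, total
R1_ACTIVE_PARAMS = 37e9    # DeepSeek-R1, activated

BYTES_PER_PARAM = 2        # assumption: weights stored in a 16-bit format


def weight_memory_gb(params: float, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Approximate memory needed just to hold the weights, in GB (10^9 bytes)."""
    return params * bytes_per_param / 1e9


size_ratio = R1_TOTAL_PARAMS / QWQ_PARAMS     # how many times larger R1 is in total
active_ratio = R1_ACTIVE_PARAMS / QWQ_PARAMS  # R1's activated parameters vs QwQ-32B

print(f"Total-parameter ratio: {size_ratio:.1f}x")        # 21.0x
print(f"Activated-parameter ratio: {active_ratio:.2f}x")  # 1.16x
print(f"QwQ-32B weights: ~{weight_memory_gb(QWQ_PARAMS):.0f} GB")           # ~64 GB
print(f"DeepSeek-R1 weights: ~{weight_memory_gb(R1_TOTAL_PARAMS):.0f} GB")  # ~1342 GB
```

Under these assumptions, DeepSeek-R1 holds roughly 21 times as many parameters in total, while the number it activates is close to QwQ-32B's full size.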

Developers and AI enthusiasts can find the open weights of the model on its Hugging Face listing and on ModelScope. The model is available under the Apache 2.0 license, which permits academic, research, and commercial use. However, since the full training details and datasets are not available, the model cannot be fully replicated or deconstructed. DeepSeek-R1 was also released with open weights, under the similarly permissive MIT license.

Those who lack the right hardware to run the AI model locally can also access its capabilities via Qwen Chat. The model picker menu at the top-left of the page lets users select the QwQ-32B-Preview model.
