网站首页 > 技术文章 正文
Huawei Denies Plagiarism in Pangu AI Model After Allegations of Copying Alibaba's Qwen
nanyue 2025-08-05 20:09:42 技术文章 2 ℃AsianFin -- Huawei has issued a public rebuttal following mounting accusations that its newly open-sourced Pangu Pro MoE large language model repackages code and architecture from Alibaba Cloud’s Qwen-2.5 model, reigniting a debate over IP boundaries in the rapidly evolving open-source AI ecosystem.
The controversy emerged after a study published on GitHub on July 4 alleged that Huawei’s 72-billion-parameter Pangu Pro MoE model showed a striking resemblance to Alibaba’s 14B Qwen-2.5. Using a model fingerprinting technique, the unnamed author—claiming to be a Korean student at the University of Costa Rica—reported a 0.927 correlation between the two models’ attention parameter distributions, suggesting possible non-independent development. The post also cited leftover metadata in Huawei’s open-source code referencing “Copyright 2024 The Qwen team, Alibaba Group” as evidence.
Huawei responded on July 5 via a statement from its Noah’s Ark Lab, the company’s AI research division, denying any wrongdoing. The lab emphasized that the Pangu Pro MoE is a foundational model developed from scratch on Huawei’s Ascend hardware platform and is not based on incremental training from any other vendor’s models.
“We strictly adhere to open-source license requirements and clearly indicate copyright statements within the code,” Huawei said. “Referencing open-source industry practices is standard, and aligns with the collaborative spirit of the open-source community.”
The company highlighted key innovations behind Pangu Pro MoE, including a proprietary Grouped Mixture of Experts (MoGE) architecture designed to optimize load balancing in distributed training environments. Huawei said it welcomes constructive feedback and aims to promote open innovation, inclusivity, and sustainability in the open-source AI field.
Despite the GitHub repository being taken down, the episode has stirred widespread debate on social media platforms and Chinese developer forums such as Zhihu. Some commenters questioned the scientific validity of the fingerprinting methodology used to make the plagiarism claim, arguing that using parameter standard deviation is insufficient to determine model similarity.
The dispute underscores the legal and ethical gray areas in open-source AI development. While open-sourcing a model doesn’t waive intellectual property rights, confusion remains around licensing obligations, attribution requirements, and commercialization restrictions. Experts caution that developers must clearly acknowledge the original source and comply with license terms, particularly when adapting or commercializing open-source projects.
This isn’t the first time a major open-source AI model has faced plagiarism accusations. Earlier this year, 01.AI’s Yi-34B was criticized for borrowing architecture from Meta’s LLaMA model, and Stanford’s Llama3-V was revealed to have repackaged Moonshot AI’s MiniCPM-Llama3-V 2.5.
The open-source model space has become increasingly crowded, with low technical barriers allowing startups to quickly deploy services by wrapping models like GPT or DeepSeek in user-friendly apps. As generative AI adoption grows, the need for clearer licensing frameworks and enforcement mechanisms is becoming critical.
Huawei launched the Pangu model series in 2021, spanning NLP, computer vision, and scientific computing. On June 30, the company open-sourced its 7B dense model, the 72B Pangu Pro MoE, and inference tech based on Ascend, describing the move as a strategic push to strengthen its AI ecosystem.
Huawei Cloud said the Pangu model has been deployed in over 30 industries and 400 real-world scenarios—including government, finance, healthcare, autonomous driving, and industrial design—generating tangible business value.
While Alibaba has yet to comment on the matter, AI research teams from several major Chinese tech companies are closely watching the situation. For now, the incident highlights a growing tension in the AI community: how to encourage open-source innovation without eroding intellectual property protection or fueling unchecked model replication.
猜你喜欢
- 2025-08-05 KKR Nears Completion of Dayao Soda Buyout in Rare Foreign Takeover of Chinese Beverage Brand
- 2025-08-05 The Rise of China’s Machine Tool Industry Despite the West's Export Restrictions
- 2025-08-05 Tesla Logs Largest Revenue Decline in Over A Decade as Q2 EV Sales Continues to Plunge
- 2025-08-05 Chinese vice premier calls for championing humanity's common values, promoting multipolar world
- 2025-08-05 China Unveils 600 km/h Superconducting Maglev Train, Expected to Slash Beijing–Shanghai Travel Time to 2.5 Hours
- 2025-08-05 Partnership can once again prove its mettle
- 2025-08-05 Amundi sees "US Exceptionalism" eroding, while turns bullish on China's AI
- 2025-08-05 China's listed banks attract record investor visits on dividend appeal
- 2025-08-05 US consumers 'eat' force-fed tariffs
- 2025-08-05 MySQL技术内幕6:InnoDB索引技术
- 1522℃桌面软件开发新体验!用 Blazor Hybrid 打造简洁高效的视频处理工具
- 646℃Dify工具使用全场景:dify-sandbox沙盒的原理(源码篇·第2期)
- 527℃MySQL service启动脚本浅析(r12笔记第59天)
- 492℃服务器异常重启,导致mysql启动失败,问题解决过程记录
- 492℃启用MySQL查询缓存(mysql8.0查询缓存)
- 479℃「赵强老师」MySQL的闪回(赵强iso是哪个大学毕业的)
- 461℃mysql服务怎么启动和关闭?(mysql服务怎么启动和关闭)
- 459℃MySQL server PID file could not be found!失败
- 最近发表
- 标签列表
-
- cmd/c (90)
- c++中::是什么意思 (84)
- 标签用于 (71)
- 主键只能有一个吗 (77)
- c#console.writeline不显示 (95)
- pythoncase语句 (88)
- es6includes (74)
- sqlset (76)
- windowsscripthost (69)
- apt-getinstall-y (100)
- node_modules怎么生成 (87)
- chromepost (71)
- flexdirection (73)
- c++int转char (80)
- mysqlany_value (79)
- static函数和普通函数 (76)
- el-date-picker开始日期早于结束日期 (70)
- asynccallback (71)
- localstorage.removeitem (74)
- vector线程安全吗 (70)
- java (73)
- js数组插入 (83)
- mac安装java (72)
- 查看mysql是否启动 (70)
- 无效的列索引 (74)