网站首页 > 技术文章 正文
Huawei Denies Plagiarism in Pangu AI Model After Allegations of Copying Alibaba's Qwen
nanyue 2025-08-05 20:09:42 技术文章 27 ℃AsianFin -- Huawei has issued a public rebuttal following mounting accusations that its newly open-sourced Pangu Pro MoE large language model repackages code and architecture from Alibaba Cloud’s Qwen-2.5 model, reigniting a debate over IP boundaries in the rapidly evolving open-source AI ecosystem.
The controversy emerged after a study published on GitHub on July 4 alleged that Huawei’s 72-billion-parameter Pangu Pro MoE model showed a striking resemblance to Alibaba’s 14B Qwen-2.5. Using a model fingerprinting technique, the unnamed author—claiming to be a Korean student at the University of Costa Rica—reported a 0.927 correlation between the two models’ attention parameter distributions, suggesting possible non-independent development. The post also cited leftover metadata in Huawei’s open-source code referencing “Copyright 2024 The Qwen team, Alibaba Group” as evidence.
Huawei responded on July 5 via a statement from its Noah’s Ark Lab, the company’s AI research division, denying any wrongdoing. The lab emphasized that the Pangu Pro MoE is a foundational model developed from scratch on Huawei’s Ascend hardware platform and is not based on incremental training from any other vendor’s models.
“We strictly adhere to open-source license requirements and clearly indicate copyright statements within the code,” Huawei said. “Referencing open-source industry practices is standard, and aligns with the collaborative spirit of the open-source community.”
The company highlighted key innovations behind Pangu Pro MoE, including a proprietary Grouped Mixture of Experts (MoGE) architecture designed to optimize load balancing in distributed training environments. Huawei said it welcomes constructive feedback and aims to promote open innovation, inclusivity, and sustainability in the open-source AI field.
Despite the GitHub repository being taken down, the episode has stirred widespread debate on social media platforms and Chinese developer forums such as Zhihu. Some commenters questioned the scientific validity of the fingerprinting methodology used to make the plagiarism claim, arguing that using parameter standard deviation is insufficient to determine model similarity.
The dispute underscores the legal and ethical gray areas in open-source AI development. While open-sourcing a model doesn’t waive intellectual property rights, confusion remains around licensing obligations, attribution requirements, and commercialization restrictions. Experts caution that developers must clearly acknowledge the original source and comply with license terms, particularly when adapting or commercializing open-source projects.
This isn’t the first time a major open-source AI model has faced plagiarism accusations. Earlier this year, 01.AI’s Yi-34B was criticized for borrowing architecture from Meta’s LLaMA model, and Stanford’s Llama3-V was revealed to have repackaged Moonshot AI’s MiniCPM-Llama3-V 2.5.
The open-source model space has become increasingly crowded, with low technical barriers allowing startups to quickly deploy services by wrapping models like GPT or DeepSeek in user-friendly apps. As generative AI adoption grows, the need for clearer licensing frameworks and enforcement mechanisms is becoming critical.
Huawei launched the Pangu model series in 2021, spanning NLP, computer vision, and scientific computing. On June 30, the company open-sourced its 7B dense model, the 72B Pangu Pro MoE, and inference tech based on Ascend, describing the move as a strategic push to strengthen its AI ecosystem.
Huawei Cloud said the Pangu model has been deployed in over 30 industries and 400 real-world scenarios—including government, finance, healthcare, autonomous driving, and industrial design—generating tangible business value.
While Alibaba has yet to comment on the matter, AI research teams from several major Chinese tech companies are closely watching the situation. For now, the incident highlights a growing tension in the AI community: how to encourage open-source innovation without eroding intellectual property protection or fueling unchecked model replication.
猜你喜欢
- 2025-08-05 KKR Nears Completion of Dayao Soda Buyout in Rare Foreign Takeover of Chinese Beverage Brand
- 2025-08-05 The Rise of China’s Machine Tool Industry Despite the West's Export Restrictions
- 2025-08-05 Tesla Logs Largest Revenue Decline in Over A Decade as Q2 EV Sales Continues to Plunge
- 2025-08-05 Chinese vice premier calls for championing humanity's common values, promoting multipolar world
- 2025-08-05 China Unveils 600 km/h Superconducting Maglev Train, Expected to Slash Beijing–Shanghai Travel Time to 2.5 Hours
- 2025-08-05 Partnership can once again prove its mettle
- 2025-08-05 Amundi sees "US Exceptionalism" eroding, while turns bullish on China's AI
- 2025-08-05 China's listed banks attract record investor visits on dividend appeal
- 2025-08-05 US consumers 'eat' force-fed tariffs
- 2025-08-05 MySQL技术内幕6:InnoDB索引技术
- 10-02基于深度学习的铸件缺陷检测_如何控制和检测铸件缺陷?有缺陷铸件如何处置?
- 10-02Linux Mint 22.1 Cinnamon Edition 搭建深度学习环境
- 10-02AWD-LSTM语言模型是如何实现的_lstm语言模型
- 10-02NVIDIA Jetson Nano 2GB 系列文章(53):TAO模型训练工具简介
- 10-02使用ONNX和Torchscript加快推理速度的测试
- 10-02tensorflow GPU环境安装踩坑日记_tensorflow配置gpu环境
- 10-02Keye-VL-1.5-8B 快手 Keye-VL— 腾讯云两卡 32GB GPU保姆级部署指南
- 10-02Gateway_gateways
- 最近发表
-
- 基于深度学习的铸件缺陷检测_如何控制和检测铸件缺陷?有缺陷铸件如何处置?
- Linux Mint 22.1 Cinnamon Edition 搭建深度学习环境
- AWD-LSTM语言模型是如何实现的_lstm语言模型
- NVIDIA Jetson Nano 2GB 系列文章(53):TAO模型训练工具简介
- 使用ONNX和Torchscript加快推理速度的测试
- tensorflow GPU环境安装踩坑日记_tensorflow配置gpu环境
- Keye-VL-1.5-8B 快手 Keye-VL— 腾讯云两卡 32GB GPU保姆级部署指南
- Gateway_gateways
- Coze开源本地部署教程_开源canopen
- 扣子开源本地部署教程 丨Coze智能体小白喂饭级指南
- 标签列表
-
- cmd/c (90)
- c++中::是什么意思 (84)
- 标签用于 (71)
- 主键只能有一个吗 (77)
- c#console.writeline不显示 (95)
- pythoncase语句 (88)
- es6includes (74)
- sqlset (76)
- apt-getinstall-y (100)
- node_modules怎么生成 (87)
- chromepost (71)
- flexdirection (73)
- c++int转char (80)
- mysqlany_value (79)
- static函数和普通函数 (84)
- el-date-picker开始日期早于结束日期 (76)
- js判断是否是json字符串 (75)
- c语言min函数头文件 (77)
- asynccallback (87)
- localstorage.removeitem (74)
- vector线程安全吗 (70)
- java (73)
- js数组插入 (83)
- mac安装java (72)
- 无效的列索引 (74)