Huawei-led team claims it post-trained DeepSeek's 1.6-trillion-parameter model — 1,000 Ascend 910C chips used…
A research group that includes Huawei Technologies says it completed full-parameter post-training of DeepSeek's V4-Pro, a 1.6-trillion-parameter model. The group used a cluster of at least 1,000 Huawei Ascend 910C chips, according to the Shenzhen municipal government, as reported by the South China Morning Post. The revelation is evidence that Chinese accelerators can now handle a training-class workload on domestic silicon, the part of the AI pipeline Chinese firms have ...
Read more at tomshardware.com