DeepSeek R2: Chinese startup launches competition against American artificial intelligence giants


Welcome to the artificial intelligence arena, where startup DeepSeek is preparing its new model, R2, to challenge American giants like OpenAI and Nvidia. This refined, locally optimized model is meant to do more than compete: it promises to shake up the sector with advanced performance and an innovative architecture. In a context where privacy and technological autonomy are paramount, R2 could well redefine the rules of the game against its established competitors.

The official launch could take place during an online event scheduled for May 8, 2025, which would mark a turning point in the AI landscape. Thanks to its innovative architecture and cost-cutting strategy, R2 is positioning itself as a serious competitor in the international market.

A Revolutionary Local Architecture

DeepSeek has opted for a local architecture for its R2 model, building on the success of its predecessor, R1. Leaks suggest an advanced reasoning model, promising improved performance not only in coding, but also in multilingual reasoning and multimodal vision. With a reported 1.2 trillion parameters, R2 relies on a mixture-of-experts (MoE) structure that activates only 78 billion parameters per token, about 6.5% of the total, which is where its claimed efficiency comes from.

A Bold Technological Choice

In an effort to reduce its dependence on foreign technologies, DeepSeek has reportedly decided to abandon Nvidia's well-known GPUs and adopt Huawei Ascend 910B chips. This transition is said to reduce training costs by 97.3% compared to GPT-4, a decision that could well redraw the contours of competition between AI companies.

Autonomy and speed thanks to a local supply chain

The company doesn't stop there. It is deploying a dedicated local supply chain for AI hardware, which should give it greater autonomy and significantly shorter production times.
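To make the mixture-of-experts figures quoted above more concrete, here is a minimal Python sketch. It computes the share of parameters active per token from the leaked numbers (1.2 trillion total, 78 billion active) and shows a toy top-k router. The router, the function names, and the eight expert scores are illustrative assumptions; nothing here describes DeepSeek's actual implementation.

```python
import math

TOTAL_PARAMS = 1.2e12   # total parameters, as reported in the leaks
ACTIVE_PARAMS = 78e9    # parameters reportedly activated per token


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters used for a single token."""
    return active / total


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Pick the k highest-scoring experts and renormalize their weights.

    Returns (expert_index, weight) pairs. Experts outside the top k are
    never evaluated, which is the source of the compute saving in MoE.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"Active share per token: {share:.1%}")
    # Hypothetical router scores for 8 toy experts; only 2 are selected.
    print(route_top_k([0.1, 2.0, -1.0, 0.5, 1.5, 0.0, -0.3, 0.9], k=2))
```

In this toy setup each token only pays for the two selected experts, which mirrors, at a very small scale, the idea of activating 78 billion out of 1.2 trillion parameters.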
With only 5.2 petabytes of data reportedly needed to train R2, the model is presented as striking a balance between efficiency and cost-effectiveness, challenging Nvidia's historical dominance in the AI chip sector.

An open-source model for all

In line with its open-source philosophy, DeepSeek is releasing R2 under the MIT license, making the model freely available. It can be run locally and does not require an internet connection, which offers a way to avoid the risks associated with the cloud. With an inference cost of just $0.07 per million tokens, R2 could make waves on the stock markets, just like its predecessor, R1.

Privacy Concerns

However, questions remain regarding privacy. R1's reputation was tarnished by accusations that it transmitted data to China. Because R2 is also designed by a Chinese company, potential government access could raise concerns. In response, DeepSeek is promoting a 100% local installation in an attempt to defuse these fears.

Accusations in the Background

In addition, OpenAI's accusations of data distillation continue to weigh on R2's image. This could become a crucial issue for DeepSeek, as the startup strives not only to compete with American giants, but also to build a solid reputation in an industry where trust is paramount.

DeepSeek's advances with its R2 model promise to be a major turning point that could redefine the AI landscape. These developments are of interest to a wide audience eager to explore the implications artificial intelligence could have in a variety of fields. To explore the topic further, see our articles on the gadget side of artificial intelligence, its impact on the music industry, and the potential risks of AI.
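The cost figures quoted earlier in the article ($0.07 per million tokens for inference, and a 97.3% reduction in training cost compared to GPT-4) can be put in perspective with a quick calculation. The sketch below is plain arithmetic on those reported numbers; the function names and the one-billion-token workload are assumptions chosen for illustration.

```python
PRICE_PER_MILLION_TOKENS = 0.07   # USD, inference price quoted in the article
TRAINING_COST_REDUCTION = 0.973   # 97.3% lower than GPT-4, as reported


def inference_cost(tokens: int,
                   price_per_million: float = PRICE_PER_MILLION_TOKENS) -> float:
    """Cost in USD of processing `tokens` tokens at the given price."""
    return tokens / 1_000_000 * price_per_million


def cost_ratio(reduction: float) -> float:
    """How many times cheaper the reduced cost is than the baseline."""
    return 1.0 / (1.0 - reduction)


if __name__ == "__main__":
    # Hypothetical workload: one billion tokens.
    print(f"1B tokens: ${inference_cost(1_000_000_000):.2f}")
    print(f"Training is about {cost_ratio(TRAINING_COST_REDUCTION):.0f}x cheaper")
```

At the quoted price, a billion tokens would cost about $70, and a 97.3% reduction means paying 2.7% of the baseline, or roughly 37 times less.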



InterCoaching is an independent media outlet.
