Chinese startup DeepSeek releases upgraded AI model

Major Upgrade in the Chinese AI Landscape

Chinese artificial intelligence startup DeepSeek has announced the release of its upgraded AI model, aiming to compete directly with global leaders in generative AI. This release marks a significant development in China’s AI sector as the company strives to close the gap with international players.

About DeepSeek and its New Model

DeepSeek, founded in 2023 and headquartered in Hangzhou, has quickly emerged as a leader in China’s AI industry. The company’s latest model, DeepSeek-R1, is designed for advanced logical inference, mathematical reasoning, and real-time problem-solving. It builds on the company’s previous architecture, using techniques such as large-scale reinforcement learning to match the performance of competing systems such as OpenAI’s ChatGPT[1][2].
  • DeepSeek-R1 is fully open-source under the MIT License, enabling wide adoption and commercial use[2].
  • The model posts strong results on benchmarks such as the American Invitational Mathematics Examination (AIME), outperforming many rivals in mathematical and logical tasks[1][2].
  • It powers the DeepThink mode of the company’s conversational AI tool, accessible at chat.deepseek.com[2].

Technical Highlights

  • DeepSeek-R1 uses distillation to create smaller, more efficient models suited to a range of applications; the distilled models are built on the open-source Llama and Qwen model families[1].
  • The company offers six open-source distilled models with sizes up to 70B parameters, delivering performance on par with OpenAI o1-mini[2].
  • Reasoning data generated by the main model is used to fine-tune the smaller models.
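
The idea of reusing a larger model's outputs as fine-tuning data for a smaller one can be illustrated with a minimal sketch. This is not DeepSeek's pipeline: the function names, the record layout, and the `toy_teacher` stand-in are all hypothetical, and a real setup would call an actual model and apply far stricter filtering.

```python
import json
from typing import Callable, Dict, Iterable, List


def build_distillation_records(
    prompts: Iterable[str],
    teacher: Callable[[str], str],
) -> List[Dict[str, str]]:
    """Collect (prompt, completion) pairs produced by a teacher model.

    `teacher` is any callable mapping a prompt to generated text. Here it is
    a hypothetical stand-in for a large reasoning model.
    """
    records = []
    for prompt in prompts:
        completion = teacher(prompt)
        # Skip empty generations so they do not end up in the training set.
        if not completion.strip():
            continue
        records.append({"prompt": prompt, "completion": completion})
    return records


def to_jsonl(records: List[Dict[str, str]]) -> str:
    """Serialize records as JSON Lines, a common format for fine-tuning data."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


def toy_teacher(prompt: str) -> str:
    """Hypothetical teacher: returns a canned answer, or nothing for blank input."""
    if not prompt.strip():
        return ""
    return f"Step-by-step answer to: {prompt}"


if __name__ == "__main__":
    data = build_distillation_records(["What is 12 * 7?", "   "], toy_teacher)
    print(to_jsonl(data))
```

The resulting file would then serve as supervised fine-tuning data for the smaller student model; that training step is outside the scope of this sketch.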

Focus on Openness and Affordability

DeepSeek has emphasized open access to its technology, providing full documentation, API access, and transparent pricing.
  • APIs are live for developers and research communities.
  • Pricing is competitive, with costs as low as $0.14 per million input tokens on cache hits and $2.19 per million output tokens[2].
  • The MIT license facilitates community-driven innovation and commercial deployment.
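
To put the quoted prices in concrete terms, the following sketch estimates the bill for a workload. It uses only the two figures cited above. The function name and the example token counts are illustrative assumptions, and input tokens that miss the cache are left out because no price for them is given here.

```python
# Prices in USD per one million tokens, as quoted in the article.
CACHE_HIT_INPUT_PRICE_PER_M = 0.14
OUTPUT_PRICE_PER_M = 2.19


def estimate_cost_usd(cached_input_tokens: int, output_tokens: int) -> float:
    """Estimate cost for input tokens served from cache plus output tokens.

    Hypothetical helper: it ignores cache-miss input tokens, whose price
    is not stated in the article.
    """
    if cached_input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        cached_input_tokens * CACHE_HIT_INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M cached input tokens and 500K output tokens.
    cost = estimate_cost_usd(2_000_000, 500_000)
    print(f"Estimated cost: ${cost:.4f}")
```

For the example workload, the input side contributes $0.28 and the output side $1.095, for a total of about $1.375.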

China’s AI Ambitions

DeepSeek’s upgraded model underscores China’s focus on catching up with global leaders such as OpenAI and Google. With a combination of technical sophistication, open-source principles, and rapidly evolving capabilities, DeepSeek is well positioned to shape the future of China’s AI industry and potentially influence international competition[1][3].
