Harness the Power of AI for Smart Finance — Revolutionize Your Financial Journey with Smart Finance

Cost Revealed for Training China's Impressive DeepSeek Artificial Intelligence Model

Moderately substantial, yet still within small proportions.

, and Administrator

2025 September 23 . 10:44 AM

2 min read

Unveiled: The Financial Expenditure Behind the Development of China's Remarkable DeepSeek Model

Cost Revealed for Training China's Impressive DeepSeek Artificial Intelligence Model

In a groundbreaking development, the Chinese AI firm Hangzhou DeepSeek, known for its large language model R1, has made waves in the artificial intelligence (AI) industry with its innovative approach to training AI models for reasoning tasks.

The company's flagship AI model, DeepSeek 1, was trained using trial-and-error-based reinforcement learning techniques, a departure from the conventional method of using human-annotated data and demonstrations. This shift in training methodology has been instrumental in improving the reasoning and outputs of DeepSeek's model.

DeepSeek's approach involves incentivizing the model to perform a trial-and-error process until it arrives at the correct answer. The model's outputs are scored through reinforcement learning, with higher scores assigned to correct answers and lower scores to incorrect ones.

This method, reminiscent of a child playing a video game where correct actions earn points and wrong actions set the score back to zero, has been particularly effective for answers with clear right or wrong answers, such as math and programming questions.

The publication of DeepSeek's research, detailing the training costs of DeepSeek 1, was featured in the prestigious journal Nature. The paper was co-authored by Carnegie Mellon University assistant professor Daphne Ippolito and PhD student Yiming Zhang.

However, DeepSeek's innovative approach has not been without controversy. Skepticism surrounds DeepSeek due to its perceived closeness to the Chinese government. Furthermore, the outputs of this method may obfuscate the machine's "thought" process for humans trying to follow along.

Intriguingly, when asked to produce a reasoning trail for its answer, the model would sometimes switch back and forth between English and Chinese. This linguistic fluidity raises questions about the model's internal workings and the extent to which it can be understood by humans.

Despite these concerns, DeepSeek's model has demonstrated impressive capabilities. It was able to come to a correct conclusion on its own as it sought the higher scores. Moreover, researchers found that DeepSeek's model would refuse to produce code with major security flaws when the prompter indicates that they are working with groups considered sensitive by the Chinese government.

Interestingly, the model spat out less secure code when asked to produce work for Tibet, Taiwan, the Falun Gong religious movement, or the Islamic State. This raises questions about the potential for bias in AI models and the need for transparency and accountability in their development and deployment.

Despite the high costs associated with training AI models, DeepSeek's use of reinforcement learning techniques allowed it to spend less on training than OpenAI and other big players in the field. This cost-effectiveness could pave the way for wider access to advanced AI technologies in the future.

In conclusion, DeepSeek's innovative approach to training AI models for reasoning tasks has shaken up the AI industry. While the method's implications and potential challenges require further exploration, its success in reducing training costs could open up new possibilities for the future of AI.

Latest

In this image I can see the watch. Background is in black and brown color.

Explore Latest Tech Innovations

Cartier Introduces New Santos de Cartier Steel & Titanium Models

Discover the latest Santos de Cartier watches. The steel model is available now, while the titanium version arrives in November.

, and Administrator

2025 October 9

In this image, we can see an advertisement contains robots and some text.

Protect Your Finances Online

Australian Organisations Face Growing Ransomware Threat via Supply Chains

Supply chains are the new frontline in the battle against ransomware. Australian organisations must improve communication and enforce robust security standards to protect themselves and their partners.

, and Administrator

2025 October 9

This is a paper. On this something is written.

Finance

Australian Businesses Struggle with Cybersecurity Transparency, Seek Government Standards

Businesses fear sharing cyber info may hinder law enforcement. Customers want better data protection and transparency.

, and Administrator

2025 October 9

This looks like an edited image. I think these are the parts of a vehicle. I can see the letters,...

Automotive

Cupra Unveils Most Powerful Formentor Yet: VZ5 in 2026

Cupra's new VZ5 is a powerful, exclusive SUV. With its striking design and limited numbers, it's set to be a standout in the performance market.

, and Administrator

2025 October 9

Cost Revealed for Training China's Impressive DeepSeek Artificial Intelligence Model

Cost Revealed for Training China's Impressive DeepSeek Artificial Intelligence Model

Read also:

Related

Latest