OpenAI distinguishes the reasons why LLMs hallucinate, locating the problem in statistics and evaluation incentives rather than in missing safeguards.
OpenAI, one of the leading research organisations in artificial intelligence, has published a paper titled 'Why Language Models Hallucinate'. It sets out to explain one of the most pervasive issues in AI: hallucinations in language models.
The paper traces hallucinations to two sources: the statistical inevitability of errors in pre-training and flawed incentives in post-training. Language models, it explains, do not learn absolute truths but probabilities of which token follows which. Optimising for plausibility in this way can produce statements that sound convincing yet are false or contradictory, known as hallucinations.
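The mechanism can be pictured with a toy sketch, which is not taken from the paper: a hand-made probability table stands in for a trained model, and the generator simply picks whatever token the table says is most likely next, with no notion of whether the resulting statement is true. The table, the `next_token` helper, and the example sentence are invented purely for illustration.

```python
# Toy illustration only: a hand-made probability table standing in for a
# trained language model. Real models condition on far longer contexts.
toy_probs = {
    ("The", "Eiffel"): {"Tower": 0.92, "building": 0.05, "bridge": 0.03},
    ("Eiffel", "Tower"): {"is": 0.60, "was": 0.30, "opened": 0.10},
    ("Tower", "is"): {"in": 0.50, "located": 0.30, "closed": 0.20},
}

def next_token(context):
    """Pick the most probable continuation for a two-token context."""
    candidates = toy_probs.get(context, {})
    return max(candidates, key=candidates.get) if candidates else None

tokens = ["The", "Eiffel"]
while (nxt := next_token(tuple(tokens[-2:]))) is not None:
    tokens.append(nxt)

# Prints a fluent continuation chosen purely by probability; nothing in the
# procedure checks whether the statement is actually true.
print(" ".join(tokens))
```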
The publication also fits OpenAI's strategy of building trust. The behaviour of language models has shifted away from cautious answers that emphasize uncertainty towards an authoritative tone in which hallucinations are tolerated and even encouraged. This trend, the paper suggests, is not just a statistical problem but also a regulatory one.
The quality and origin of the training data are another significant factor in hallucinations. Training data comes from publicly accessible repositories such as Wikipedia dumps, forums, blog posts, and large parts of GitHub. The paper warns that these sources can be flawed, outdated, or manipulated, and that such defects feed through into the behaviour of language models.
One example given in the paper is a points system: an answer given above the required confidence threshold earns plus points, an 'I don't know' earns no points, and an answer given below the threshold (assume 90 percent) costs points. Such a rule rewards caution; today's benchmarks, by contrast, encourage language models to give confident, albeit potentially incorrect, answers.
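A minimal sketch of the arithmetic behind such a rule follows, under assumptions the article does not spell out: correct answers earn one point and wrong answers cost nine, a penalty chosen here only so that the break-even confidence lands at exactly 90 percent.

```python
# Sketch of the points system described above. REWARD_CORRECT and
# PENALTY_WRONG are illustrative assumptions; the article only speaks of
# plus and minus points without giving magnitudes.
REWARD_CORRECT = 1.0
PENALTY_WRONG = -9.0   # chosen so that the break-even confidence is 0.9

def score(answered: bool, correct: bool = False) -> float:
    """Points for a single benchmark question."""
    if not answered:                      # "I don't know"
        return 0.0
    return REWARD_CORRECT if correct else PENALTY_WRONG

def expected_score(confidence: float) -> float:
    """Expected points if the model answers and is right with
    probability `confidence`."""
    return confidence * REWARD_CORRECT + (1 - confidence) * PENALTY_WRONG

for p in (0.50, 0.85, 0.90, 0.99):
    print(f"confidence {p:.2f}: expected score {expected_score(p):+.2f}")
# Below 90 percent the expectation is negative, so abstaining (0 points) is
# the better strategy; a binary benchmark that never penalises wrong answers
# makes guessing the optimal strategy instead.
```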
Part of the problem lies with benchmarks: originally a research instrument, they have become a marketing tool. Benchmarks decide which language model is perceived as leading and thereby influence investors, media, and customers. That, in turn, shapes the development strategies of providers and creates a systematic incentive to guess rather than admit uncertainty.
OpenAI proposes a correction called Confidence Targets: a model should only respond if its confidence exceeds a defined threshold, and wrong answers are penalized. The aim is to make language models more cautious and their answers more reliable.
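At inference time, such a confidence target could be gated roughly as follows; the `toy_model` stand-in and its confidence estimate are hypothetical, since the article does not say how that estimate would be obtained in practice.

```python
from typing import Callable, Tuple

def confidence_gated_answer(
    model: Callable[[str], Tuple[str, float]],
    question: str,
    target: float = 0.9,
) -> str:
    """Return the model's answer only if its confidence meets the target,
    otherwise abstain."""
    answer, confidence = model(question)
    return answer if confidence >= target else "I don't know"

# Hypothetical stand-in for a real model call, for demonstration only.
def toy_model(question: str) -> Tuple[str, float]:
    if "capital of France" in question:
        return "Paris", 0.97
    return "a guess", 0.40

print(confidence_gated_answer(toy_model, "What is the capital of France?"))      # Paris
print(confidence_gated_answer(toy_model, "Who won the 1937 village chess cup?")) # I don't know
```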
The paper also highlights the issue of targeted data poisoning, in which deliberately crafted content is placed in training sources to influence the behaviour of later models. This raises concerns about manipulation and underlines the need for robust regulation.
The paper was written in collaboration with researchers from the Georgia Institute of Technology (Georgia Tech). Collaborations with universities, peer review, and mathematical proofs are intended to convey seriousness to the public, especially in light of OpenAI's growing legal challenges and CEO Sam Altman's admission of a possible AI bubble.
In conclusion, OpenAI's paper offers valuable insight into hallucinations in AI. It underscores the need for careful handling of training data, robust regulation, and benchmarks that reward accuracy and honesty rather than confident guessing.