Close Menu
  • Home
  • Education
  • Health
  • National News
  • Politics
  • Relationship & Wellness
  • World News
What's Hot

CBI charges 16 in Reliance ADAG case, names 5 senior executives

May 30, 2026

Government asks oil companies to build 30-day LPG reserves amid Hormuz supply risks

May 30, 2026

“Silence is part of the strategy”: Blake Lively reportedly taking calculated approach ahead of Taylor Swift and Travis Kelce’s wedding

May 30, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram YouTube
Global News Bulletin
SUBSCRIBE
  • Home
  • Education
  • Health
  • National News
  • Politics
  • Relationship & Wellness
  • World News
Global News Bulletin
Home»National News»Claude Opus 4.8 prioritises honesty over overconfidence, says Anthropic
National News

Claude Opus 4.8 prioritises honesty over overconfidence, says Anthropic

editorialBy editorialMay 29, 2026No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
Claude Opus 4.8 prioritises honesty over overconfidence, says Anthropic
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link

3 min readNew DelhiUpdated: May 29, 2026 01:31 PM IST

Large language models (LLMs) are often known to make claims they cannot support. Regardless of their size and prowess, LLMs are prone to making statements with complete confidence even when they are incorrect. While this has been a persistent problem, AI companies have been working on reducing these instances.

In this direction, Frontier AI lab, Anthropic, on Thursday, May 28, introduced its latest model – the Claude Opus 4.8 – which it claims to have made Claude more honest. The AI startup said that the model is more honest even with telling the user what they don’t understand.

An upgrade to Claude Opus 4.7, the Opus 4.8 is now Anthropic’s most powerful generally available model. While the improvements seem incremental, the early testers reported that the model is more likely to flag uncertainties about its work and less likely to make unsupported claims.

The company said that the improvement was possible owing to its evaluations that showed Opus 4.8 is around four times less likely than Opus 4.7 to let flaws in code written by it to pass unremarked.

Before release, Anthropic conducted a comprehensive alignment and safety evaluation of Opus 4.8, where it found that the model performed better than the earlier editions. It supported user autonomy and acted in the best interests of the user. The model also showed considerably lower rates of harmful behaviours, such as deception or assisting misuse, when compared to Claude Opus 4.7.

Moreover, its alignment levels were reportedly comparable to the company’s best-aligned model – Claude Mythos Preview, Anthropic’s frontier model that is so powerful that the company has given its access to a motley group of trusted partners.

“The assessment also showed Opus 4.8 to have rates of misaligned behaviour (such as deception or cooperation with misuse) that are substantially lower than Opus 4.7 and similar to our best-aligned model, Claude Mythos Preview. The full alignment assessment, accompanied by a suite of pre-deployment safety tests, is reported in the Claude Opus 4.8 System Card,” the company said in its blog.

Story continues below this ad

When it comes to benchmarking, Anthropic said that Opus 4.8 achieved the highest score on its Harvey’s Legal Agent Benchmark, which evaluates legal reasoning, becoming the first model to cross an overall 10 per cent on the benchmark. On computer use and browser agents, the model reportedly secured 84 per cent on Online-Mind2Web. The model demonstrated improvements in enterprise work and agentic reasoning.

Anthropic emphasised reduced unsupported claims and improved uncertainty reporting. These are the scores shared by the company; however, a thorough review by third-party testers may offer more objective results.

Claude Opus 4.8 is available immediately through Claude.ai, Claude Code, and its API. The model retains the same pricing as Opus 4.7, costing $5 per million input tokens and $25 per million output tokens. The AI lab has also introduced a Fast Mode priced at $10 per million input tokens and $50 per million output tokens. The company noted that prompt caching and batch processing can further reduce costs for developers and enterprise users.

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleDelhi HC issues notice to Centre, X on plea against blocking of Cockroach Janta Party account
Next Article Judge orders removal of Donald Trump's name from Kennedy Center, blocks closure plan; US president blasts ruling
editorial
  • Website

Related Posts

Government asks oil companies to build 30-day LPG reserves amid Hormuz supply risks

May 30, 2026

Relief for RIL in RPL trading case: Supreme Court sets aside SEBI order to pay Rs 447 crore, upholds Rs 25-crore fine

May 29, 2026

Why Kerala BJP’s legislature party leader B B Gopakumar is a surprise pick

May 29, 2026

Could Union Cabinet reshuffle be on the cards? BJP appointment of new state chiefs signals wider changes may be coming

May 29, 2026

Maharashtra FYJC Admission 2026 1st Merit List Live Updates: Round 1 list released at mahafyjcadmissions.in

May 29, 2026

Desk job isn’t enough: Why Delhi High Court ordered Rs 41 lakh for Cop who lost leg at picket duty

May 29, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

CBI charges 16 in Reliance ADAG case, names 5 senior executives

By editorialMay 30, 2026

MUMBAI: CBI filed the first chargesheet in one of the Reliance ADA Group cases in…

Government asks oil companies to build 30-day LPG reserves amid Hormuz supply risks

May 30, 2026

“Silence is part of the strategy”: Blake Lively reportedly taking calculated approach ahead of Taylor Swift and Travis Kelce’s wedding

May 30, 2026
Top Trending

CBI charges 16 in Reliance ADAG case, names 5 senior executives

By editorialMay 30, 2026

MUMBAI: CBI filed the first chargesheet in one of the Reliance ADA…

Government asks oil companies to build 30-day LPG reserves amid Hormuz supply risks

By editorialMay 30, 2026

The liquefied petroleum gas (LPG) supply squeeze due to the West Asia…

“Silence is part of the strategy”: Blake Lively reportedly taking calculated approach ahead of Taylor Swift and Travis Kelce’s wedding

By editorialMay 30, 2026

Blake Lively reportedly taking calculated approach ahead of Taylor Swift and Travis…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

Facebook X (Twitter) Instagram YouTube

News

  • Education
  • Health
  • National News
  • Relationship & Wellness
  • World News
  • Politics

Company

  • Information
  • Advertising
  • Classified Ads
  • Contact Info
  • Do Not Sell Data
  • GDPR Policy
  • Media Kits

Services

  • Subscriptions
  • Customer Support
  • Bulk Packages
  • Newsletters
  • Sponsored News
  • Work With Us

Subscribe to Updates

Get the latest creative news from FooBar about art, design and business.

© Copyright Global News Bulletin.
  • Privacy Policy
  • Terms
  • Accessibility
  • Website Developed by Plenary Media Solution

Type above and press Enter to search. Press Esc to cancel.