Smart Breaking News on AI, Business, Politics & Global Trends | WhistleBuzz

Cybersecurity researchers aren’t satisfied with Anthropic Fable’s guardrails

By Editor-In-Chief, June 10, 2026


Anthropic released its latest model, Fable, on Tuesday, billing it as a public, limited version of Mythos, its powerful and highly touted cybersecurity model.

However, not everyone is happy with the restrictions, and many cybersecurity researchers and experts have voiced complaints online.

“[Fable] denies any request that might have something to do with cyber, even something as innocuous as reading a blog post,” said Valentina “Chompy” Palmiotti, a prominent security researcher who works at IBM X-Force.

If the prompt triggers a guardrail, Fable will pause the chat and say, “Due to safety precautions, this message has been flagged as a cybersecurity or biology topic.”

The guardrails were put in place to limit the risk of Fable being used to develop malware or compromise software, a long-standing concern within Anthropic. Restrictions on biology stem from similar concerns regarding the development of biological weapons.

When the AI giant released Mythos in April, it restricted the model to a small number of businesses and organizations under Project Glasswing, an effort to deploy the model to protect critical software and infrastructure. Last week, Anthropic expanded access to Mythos to hundreds of organizations in 15 countries.

But despite the good intentions, many cybersecurity experts remain uncomfortable with the haphazard nature of the restrictions. “If you ask it to write secure code, it will think it’s cybersecurity work rather than software engineering best practices, and it will downgrade it,” cybersecurity veteran Matt Suiche told TechCrunch. Fable is programmed to fall back to Claude Opus 4.8 if it hits a guardrail. “It seems to be keyword-based, so anything in the vocabulary of ‘cybersecurity’ will trigger the guardrails.”

Contact us

Do you have more information about how hackers are using AI, or how cybersecurity companies are leveraging it? We’d love to hear from you. From a non-work device and network, you can contact Lorenzo Franceschi-Bicchierai securely on Signal at +1 917 257 1382, via Telegram and Keybase @lorenzofb, or by email.

“But we’re still in the early stages and they’re still adapting the guardrails, so that’s understandable. I’m sure it will evolve over time as Anthropic and other frontier model companies collaborate more with today’s new generation of cybersecurity companies,” said Suiche, who is a member of the technical staff at AI cybersecurity startup Tolmo. “When you do a rollout like this, it’s better to catch too much and loosen the guardrails over time than to not catch enough.”

Another researcher complained on X that “even requesting a code review” would trigger Fable’s guardrails.

Anthropic did not immediately respond to a request for comment.

Aside from the guardrails in its models, Anthropic also requires cybersecurity professionals to apply for a cyber validation program. Approved applicants face fewer restrictions when using Claude for cybersecurity work. OpenAI has a similar program called Trusted Access for Cyber.
