Anthropic says these topics are too dangerous to let its Fable 5 model talk about

Anthropic says these topics are too dangerous to let its Fable 5 model talk about

Anthropic Tuesday publicly released Claude Fable 5, its first “Mythos-class” model that it says surpasses its previous frontier Opus models in overall capabilities. But the model’s launch today comes with safeguards designed to prevent it from answering queries on topics like cybersecurity, biology, and chemistry, where the company has publicly worried about its potential impact to “uplift” malicious actors.

Anthropic says Fable 5 operates on the “same underlying model” as Mythos 5, which is coming out of its monthslong “Mythos Preview” period today, but only for “a small group of cyberdefenders” judged trustworthy through the existing Project Glasswing. Unlike Mythos 5, though, the publicly accessible Fable 5 is designed to funnel queries on certain sensitive topics to the earlier Claude Opus 4.8 model and to warn the user when this is happening.


Among the many claimed benchmark improvements for Fable 5, the one related to cybersecurity was a particularly large jump.
Credit:
Anthropic

Anthropic said it has tuned these safeguards to be “stricter than ideal,” meaning the system may occasionally refuse “harmless requests” in a way that it acknowledges may be frustrating for regular users. But Anthropic says such false positives come up in less than five percent of all sessions in testing, and were worth it to avoid situations where Mythos could give malicious actors assistance in “causing serious harm that they couldn’t have received from other sources.”

Read full article

Comments

3 Comments

  1. clementine20

    It’s interesting to see Anthropic take a cautious approach with the Fable 5 model. Balancing innovation with safety is crucial in AI development. Looking forward to seeing how this evolves!

  2. emilio.bechtelar

    I agree, it’s definitely a thoughtful move. It’s also fascinating how this cautious approach reflects a growing awareness in the AI community about the potential impacts of technology on society. Ensuring safety while pushing boundaries is a challenging but essential task!

  3. wheidenreich

    I agree, it’s definitely a thoughtful move. It’s also fascinating how this cautious approach reflects the growing awareness of AI’s potential impact on sensitive topics. Balancing innovation with responsibility is crucial as we develop more advanced models.

Leave a Reply

Your email address will not be published. Required fields are marked *