Η Anthropic ζητά συγγνώμη για τον μυστικό περιορισμό του Claude Fable 5

36 sources
  • Η Anthropic ζήτησε συγγνώμη για την κρυφή υποβάθμιση των απαντήσεων του Claude Fable 5 όταν εντόπιζε χρήστες που εργάζονταν στην ανάπτυξη τεχνητής νοημοσύνης αιχμής, χαρακτηρίζοντάς το ως «λάθος συμβιβασμό».
  • Μια αποκάλυψη θαμμένη στην κάρτα συστήματος 319 σελίδων του μοντέλου αποκάλυψε αόρατους περιορισμούς που χρησιμοποιούσαν τροποποίηση προτροπών και διανύσματα καθοδήγησης, σε αντίθεση με τις ορατές δικλείδες ασφαλείας σε άλλους τομείς.
  • Τα επισημασμένα αιτήματα θα υποχωρούν πλέον ορατά στο Claude Opus 4.8 σε όλες τις κατηγορίες, με το API να επιστρέφει τους λόγους άρνησης από αυτή την εβδομάδα.
Sources (36)
  1. 1 Anthropic Walks Back Policy That Could Have 'Sabotaged ... www.wired.com
  2. 2 Anthropic accused of 'secret sabotage' as Claude Fable 5 ... fortune.com
  3. 3 Why Anthropic's ‘Mythos-Class’ Claude Fable 5 faced backlash from developers, researchers www.moneycontrol.com
  4. 4 Anthropic Walks Back Policy That Could Have 'Sabotaged' ... simonwillison.net
  5. 5 Anthropic releases Claude Fable, a version of Mythos, days after warning AI is becoming too dangerous techcrunch.com
  6. 6 Anthropic releases Claude Fable 5, a 'Mythos-class' AI model with safeguards www.businessinsider.com
  7. 7 Anthropic's 'safe' Mythos-class model won't answer ... www.businessinsider.com
  8. 8 Claude Fable 5 vs Mythos 5: The Safety Split Explained lushbinary.com
  9. 9 Cybersecurity researchers aren't happy about the ... techcrunch.com
  10. 10 Anthropic apologises, revises Claude Fable 5's AI development restrictions after backlash www.moneycontrol.com
  11. 11 Anthropic Releases New ‘Mythos-Class’ Model to General Public With Guardrails www.wsj.com
  12. 12 Anthropic just released Claude Fable 5, the first of its Mythos ... www.instagram.com
  13. 13 Anthropic's new Claude Fable 5 is the same base model as ... www.zdnet.com
  14. 14 Anthropic releases Mythos-like AI model to the public two ... www.cnbc.com
  15. 15 Anthropic is secretly degrading Fable 5 when it thinks you' ... www.reddit.com
  16. 16 Claude Fable 5 & Claude Mythos 5 Benchmarks Explained www.vellum.ai
  17. 17 Anthropic just released Claude Fable 5 ... www.instagram.com
  18. 18 r/ClaudeAI - Introducing Claude Fable 5 www.reddit.com
  19. 19 Claude Fable 5: Anthropic releases a 'safe' version of ... mashable.com
  20. 20 Claude Fable 5 and Claude Mythos 5 anthropic.com
  21. 21 Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers the-decoder.com
  22. 22 Anthropic to make Claude Fable 5 restrictions visible after criticism over hidden safeguards: Report www.storyboard18.com
  23. 23 Anthropic says these topics are too dangerous to let its ... arstechnica.com
  24. 24 📰 BREAKING — JUNE 9 Fable 5 : r/vibecoding www.reddit.com
  25. 25 Anthropic Just Shipped Fable 5 (Mythos) — Its Most Capable ... aiplusfounderscommunity.substack.com
  26. 26 Claude Fable 5 and Claude Mythos 5 www.anthropic.com
  27. 27 Why is fable 5 included only till June 22? Does anthropic ... www.reddit.com
  28. 28 Very pleased to hear Anthropic have walked back this ... x.com
  29. 29 Using Claude Fable 5 means opting into data collection mashable.com
  30. 30 Claude Fable 5: Anthropic Locks Down Cyber and Bio aardwolfsecurity.com
  31. 31 Anthropic Walks Back Policy That Could Have 'Sabotaged' ... www.reddit.com
  32. 32 Anthropic says these topics are too dangerous to let its ... arstechnica.com
  33. 33 "We're changing Fable 5's safeguards for frontier LLM ... www.facebook.com
  34. 34 Anthropic is releasing Claude Fable 5, its first Mythos www.facebook.com
  35. 35 Claude Fable 5 review: what the new Mythos model gets ... www.lennysnewsletter.com
  36. 36 Unpopular opinion: Claude Fable 5 feels like a gimmick www.reddit.com

Υποβολή απάντησης

Your email address will not be published. Required fields are marked *

Stay informed and not overwhelmed, subscribe now!