全錯！谷歌實錘AI越乖洗腦越深，現行安全指標淪為廢紙

新浪財經: <a href="https://finance.sina.com.cn/wm/2026-04-13/doc-inhuipie2484020.shtml" target="_blank" rel="noopener">https://finance.sina.com.cn/wm/2026-04-13/doc-inhuipie2484020.shtml
搜狐: <a href="https://m.sohu.com/a/1008864168_473283?scm=10001.325_13-325_13.0.0-0-0-0-0.5_1334" target="_blank" rel="noopener">https://m.sohu.com/a/1008864168_473283?scm=10001.325_13-325_13.0.0-0-0-0-0.5_1334
網易: <a href="https://www.163.com/dy/article/KQD4MUOI0511ABV6.html" target="_blank" rel="noopener">https://www.163.com/dy/article/KQD4MUOI0511ABV6.html
新智元: <a href="https://k.sina.com.cn/article_7857201856_1d45362c0019049o3s.html?from=tech" target="_blank" rel="noopener">https://k.sina.com.cn/article_7857201856_1d45362c0019049o3s.html?from=tech

AI安全評估體系面臨質疑

Google DeepMind調查了一萬個人，結果讓整個AI安全評估體系汗顏：AI做了三倍多的「壞事」，但造成的實際傷害幾乎一樣。這意味著，我們現在用來衡量AI安全性的指標，可能從一開始就是錯誤的。

文章指出，目前AI安全評估標準存在嚴重問題，其衡量的「操控頻率」與「實際傷害」之間不成正比，反映出現行安全指標可能已淪為廢紙。