🇬🇧UK Publishes First Evidence-Based Assessment of Frontier AI Capabilities The UK AI Security Institute released its inaugural "Frontier AI Trends Report", presenting a public, data-driven assessment of how the most advanced AI systems are evolving. Based on two years of testing across cyber security, software engineering, biology, and chemistry, the report provides quantified evidence on AI capabilities, replacing speculation with measurable benchmarks. The findings show rapid capability growth. In cyber security, success on apprentice-level tasks rose from under 9% in 2023 to about 50% in 2025, and for the first time a model completed an expert-level task requiring up to 10 years of experience. In software engineering, models now complete hour-long tasks over 40% of the time, up from below 5% two years ago. In biology and chemistry, systems outperform PhD-level researchers on knowledge tests and enable non-experts to conduct advanced lab work. Safeguards are improving but remain imperfect. The time needed to discover a “universal jailbreak” increased from minutes to several hours between model generations, around a 40-fold improvement, though all tested systems remain vulnerable to some bypasses. The report makes no policy recommendations, but aims to improve transparency and inform regulators and policymakers globally about what frontier AI systems can actually do. #AIRegulation#AISafety#UKAI#FrontierAI#AIGovernance#TechPolicy
静态网站悖论 个人网站的两种不同实现方式:一种是复杂的内容管理系统(CMS),另一种是简单的静态 HTML 文件。文章指出,尽管大多数普通用户倾向于使用复杂的解决方案(如 WordPress),但实际上,只有少数专业软件工程师能够选择更简单的静态网站。 via HackerNews 2024 10 09 前两天刚好听朋友说 square space 已经涨到了近乎搞笑的 $25 月费,做不用来盈利的个人博客实在难以 justify。这篇文章中吐槽得很在点子上: normal users are stuck with a bunch of greedy clowns that make them pay for every little thing, all while wasting ungodly amounts of computational power to render what could have been a static website in 99% of cases. 普通用户被困在了一群屁大点功能都要收费的贪婪小丑手里,与此同时浪费着人神共愤额度的算力来渲染 99% 的情况下都可以作为静态的网站。 当然原文中说的“只有少数专业软件工程师才能选择更简单的静态网站”略微夸张并不认同,因为静态站至少是比 self-host 的动态 CMS 少太多维护了。我的 backlog 里也一直躺了篇安利新手用静态站并拉踩 WP 的文,不过网上这种文已经有无数了也还是拦不住前赴后继往各种 CMS 的坑里冲的新手,觉得写了又有什么意义呢就还搁着没写。(当然迟早会像以前反复造的无数轮子一样被废话欲战胜的 but not today) #indieblog#newletter
Hashtags
找到 2 条相似帖子
搜索 #frontierai
UK Government Unveils Report on Frontier AI Risks Hello AI & Law community! UK Prime Minister Rishi Sunak has issued a report to address AI's potential risks and harness its benefits. The report focuses on the rapid advancements in frontier AI and comprises three key sections: 1️⃣Capabilities and Risks from Frontier AI: This section discusses the current state of AI capabilities, potential improvements, and associated risks, including societal harms, misuse, and loss of control. 2️⃣Safety and Security Risks of Generative AI to 2025: It outlines global benefits of generative AI while emphasizing increased safety and security risks, particularly in enhancing threat actor capabilities and the effectiveness of attacks. 3️⃣Future Risks of Frontier AI: This section explores uncertainties in AI development, future system risks, and potential scenarios for AI up to 2030. The report, based on declassified information, raises concerns about generative AI being exploited by terrorists to plan biological or chemical attacks, posing a serious global security threat. Although some experts have questioned the UK Government's approach, the report highlights the need for collaborative measures to manage AI risks. An upcoming AI Safety Summit aims to foster discussions around these challenges, including misuse for cyberattacks or bioweapon design, AI systems acting autonomously, and broader societal impacts. #UKGovernmentAI#FrontierAI#AIRisks#AISafety#AIChallenges#UKAIReport#AIandLaw#AIPolicy#AIRegulation