How can we build human values into AI?
Responsibility & Safety Published 24 April 2023 Authors Iason Gabriel and Kevin McKee Drawing from philosophy to identify fair principles…
Responsibility & Safety Published 24 April 2023 Authors Iason Gabriel and Kevin McKee Drawing from philosophy to identify fair principles…
A major step forward in mathematical reasoning is the use of computer-verifiable formal languages such as Lean to prove mathematical…
During the journey from the suburbs to the city, the tree canopy often dwindles down as skyscrapers rise up. A…
We present Cappy, a small pre-trained scorer model that enhances and surpasses the performance of large multi-task language models. We…
Rethinking the Role of PPO in RLHF TL;DR: In RLHF, there’s tension between the reward learning phase, which uses human…
We’re reader-supported. When you buy through links on our site, we may earn an affiliate commission. The ability to create…
Exploring AI safety, adaptability, and efficiency for the real world Next week marks the start of the 40th International Conference…
Generative AI jailbreaking involves crafting prompts that trick the AI into ignoring its safety guidelines, allowing the user to potentially…
The impact of artificial intelligence will never be equitable if there’s only one company that builds and controls the models…
Today, we introduce Health Equity Assessment of machine Learning performance (HEAL), a novel evaluation framework designed to quantitatively assess whether…