klotz: safety*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. Sam Altman discusses the imminent arrival of digital superintelligence, its potential impacts on society, and the future of technological progress. He highlights the rapid advancements in AI, the economic and scientific benefits, and the challenges of ensuring safety and equitable access.
  2. Symbotic Inc. acquires Veo Robotics Inc. for $8.7M, gaining its FreeMove technology that enhances robot safety and productivity. Key Veo executives join Symbotic.
  3. This post discusses a study that finds that refusal behavior in language models is mediated by a single direction in the residual stream of the model. The study presents an intervention that bypasses refusal by ablating this direction, and shows that adding in this direction induces refusal. The study is part of a scholars program and provides more details in a forthcoming paper.
  4. 2023-10-31 Tags: , , , by klotz
  5. 2019-09-20 Tags: , , by klotz

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: safety

About - Propulsed by SemanticScuttle