Sam Bowman
@sleepinyourhat
AI SAFETYAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. Into @givingwhatwecan.
Dwarkesh Patel
@dwarkesh_sp
CREATORHost of @dwarkeshpodcast https://www.youtube.com/DwarkeshPatel https://open.spotify.com/show/4JH4tybY1zX6e5hjCwU6gF https://apple.co/3ujLQkZ
Joshua Achiam
@jachiam0
AI SAFETYFreedom, flourishing, and abundance. Chief Futurist @openai. Main author of http://spinningup.openai.com
Sholto Douglas
@_sholtodouglas
RESEARCH ENGINEERScaling RL @AnthropicAI, ex @DeepMind - working towards intelligence too cheap to meter
Tim Dettmers
@Tim_Dettmers
OPEN SOURCECreator of bitsandbytes. Professor @CarnegieMellon and Research Scientist @allen_ai . I blog about deep learning and PhD life at http://timdettmers.com.
Nathan Lambert
@natolambert
RESEARCH ENGINEERResearch @allen_ai, reasoning, open models, RL(VR/HF)... Contact via email. Writes @interconnectsai, @readsail Wrote The RLHF Book, 🏔️🏃♂️
Joe Carlsmith
@jkcarlsmith
AI SAFETYPhilosophy, futurism, AI. Working on Claude's values @AnthropicAI. Formerly @coeff_giving. Opinions my own.
kipply
@kipperrii
RESEARCH ENGINEER"uncanny ability to be mentioned in every slack thread about code that's mysteriously breaking" - claude
Victoria Krakovna
@vkrakovna
AI SAFETYResearch scientist in AI alignment at Google DeepMind. Co-founder of Future of Life Institute @FLI_org. Views are my own and do not represent GDM or FLI.
Owain Evans
@OwainEvans_UK
AI SAFETYRuns an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
Ajeya Cotra
@ajeya_cotra
AI SAFETYHelping the world prepare for extremely powerful AI. Risk assessment @METR_evals. Writing at Planned Obsolescence (about AI), Good Bones (about whatever).
Helen Toner
@hlntnr
POLICYAI, national security, China. Part of the founding team at @CSETGeorgetown (opinions my own). Author of Rising Tide on substack: http://helentoner.substack.com
Meredith Whittaker
@mer__edith
EXECUTIVEPresident of @signalapp, Chief Advisor to @ainowinstitute (Also on Mastodon @mer__edith@mastodon.world, also on bsky @meredithmeredith.bsky.social)
Dylan HadfieldMenell
@dhadfieldmenell
AI SAFETYAssociate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at http://Character.AI; @dhadfieldmenell@bsky.social; he/him
Buck Shlegeris
@bshlgrs
AI SAFETYCEO@Redwood Research (@redwood_ai), working on technical research to reduce catastrophic risk from AI misalignment. bshlegeris@gmail.com
⿻ Andrew Trask
@iamtrask
AI SAFETYi teach AI on X building AI with attribution-based control @openminedorg, @GoogleDeepMind, @OxfordUni, @UN, @GovAIOrg, and @CFR_org
Stella Biderman @ ICLR
@BlancheMinerva
OPEN SOURCEEnsuring that tech companies don't have a monopoly on being able to do research on cutting edge AI @AiEleuther. She/her
rishi
@RishiBommasani
POLICYSocietal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
Katja Grace 🔍
@KatjaGrace
AI SAFETYThinking about AI destroying the world at http://aiimpacts.org and everything at http://worldspiritsockpuppet.substack.com. DM or email for media requests.
Alexander Berger
@albrgr
FOUNDEREnjoys a good applied micro paper. CEO of @coeff_giving. Views my own, tweets self-destruct every once in a while.
William MacAskill
@willmacaskill
AI SAFETYConsider donating 10% to effective charities: http://www.givingwhatwecan.org/pledge Or a career for impact: http://80000hours.org My research: http://forethought.org
Rob Wiblin
@robertwiblin
CREATORHost of the 80,000 Hours Podcast. Exploring the inviolate sphere of ideas one interview at a time: http://80000hours.org/podcast/
Alex Tamkin
@AlexTamkin
AI SAFETYmachine learning, science & society @AnthropicAI | recently: Clio, Anthropic Economic Index, Claude Artifacts | prev: phd @StanfordAILab, @stanfordnlp
Jason Crawford
@jasoncrawford
CREATORI write and speak about the history & philosophy of progress. Founder, @rootsofprogress. Host of the Progress Conference. Author, The Techno-Humanist Manifesto.
Toby Ord
@tobyordoxford
AI SAFETYSenior Researcher at Oxford University. Author — The Precipice: Existential Risk and the Future of Humanity.
Roberta Raileanu
@robertarail
RESEARCHEROpen-Ended Team Lead and Senior Staff Research Scientist @GoogleDeepMind. Honorary Lecturer @UCL. ex @Meta | @NYU | @Princeton.
Yo Shavit
@yonashav
AI SAFETYpolicy for v smart things @openai. Past: CS PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Tweets my own; on my head be it.
Allan Dafoe
@AllanDafoe
AI SAFETYAGI governance: navigating the transition to beneficial AGI (Google DeepMind)
Rosie Campbell
@RosieCampbell
AI SAFETYForever expanding my nerd/bimbo Pareto frontier. AI welfare 🤝 AI safety. Managing Director @eleosai, Ex-OpenAI, 2024 @rootsofprogress fellow
Adam Gleave
@ARGleave
AI SAFETYCEO & co-founder @FARAIResearch non-profit | PhD from @berkeley_ai | Alignment & robustness | on bsky as http://gleave.me
Ian Hogarth
@soundboy
AI SAFETYinvestor at @pluralplatform, chair UK AI Security Institute, co-founder @songkick
Adam.GPT
@TheRealAdamG
ENGINEERForward deployed token slinger. GTM at @OpenAI. A fan of NY sports, memes, tech & nice people. NJ. My opinions are my own.