Jan Leike
@janleike
AI SAFETYAI research @AnthropicAI. Previously OpenAI & DeepMind. Optimizing for a post-AGI future where humanity flourishes. Opinions aren't my employer's.
Sam Bowman
@sleepinyourhat
AI SAFETYAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. Into @givingwhatwecan.
Amanda Askell
@AmandaAskell
AI SAFETYPhilosopher & ethicist trying to make AI be good @AnthropicAI. Personal account. All opinions come from my training data.
Yee Whye Teh
@yeewhye
ACADEMICFind me @yeewhye@sigmoid.social Professor at @OxCSML, @oxfordstats and Research Director at @GoogleDeepMind. All opinions are my own.
Catherine Olsson
@catherineols
RESEARCH ENGINEERHanging out with Claude, improving its behavior, and building tools to support that @AnthropicAI 😁 prev: @open_phil @googlebrain @openai (@microcovid)
Tim Dettmers
@Tim_Dettmers
OPEN SOURCECreator of bitsandbytes. Professor @CarnegieMellon and Research Scientist @allen_ai . I blog about deep learning and PhD life at http://timdettmers.com.
Taco Cohen
@TacoCohen
RESEARCH ENGINEERSlop janitor & post-trainologer at Meta / FAIR. Into codegen, RL, equivariance. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.
Riley Goodside
@goodside
ENGINEERScreenshots of chatbots since 2022. Formerly: Google DeepMind, Scale.
Geoffrey Irving
@geoffreyirving
AI SAFETYChief Scientist at the UK AI Security Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc.
Joe Carlsmith
@jkcarlsmith
AI SAFETYPhilosophy, futurism, AI. Working on Claude's values @AnthropicAI. Formerly @coeff_giving. Opinions my own.
Yoshua Bengio
@Yoshua_Bengio
AI SAFETYWorking towards the safe development of AI for the benefit of all @UMontreal, @LawZero_ & @Mila_Quebec A.M. Turing Award Recipient and most-cited AI researcher.
Owain Evans
@OwainEvans_UK
AI SAFETYRuns an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email to DM.
Meredith Whittaker
@mer__edith
EXECUTIVEPresident of @signalapp, Chief Advisor to @ainowinstitute (Also on Mastodon @mer__edith@mastodon.world, also on bsky @meredithmeredith.bsky.social)
Dylan HadfieldMenell
@dhadfieldmenell
AI SAFETYAssociate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at http://Character.AI; @dhadfieldmenell@bsky.social; he/him
Theo Weber
@theophaneweber
RESEARCH ENGINEERResearch scientist @ DeepMind; currently working on thinking/reasoning in Gemini.
Sherjil Ozair
@sherjilozair
RESEARCH ENGINEERco-founder @ project prometheus | founder @GeneralAgentsCo | previously autopilot @tesla, deep learning @googledeepmind, phd http://mila.quebec, cs @iitdelhi
rishi
@RishiBommasani
POLICYSocietal/economic impacts of AI; AI policy & governance @StanfordHAI Stanford CS PhD w/ @percyliang @jurafsky Cornell CS undergrad w/ @clairecardie
Evan Hubinger
@EvanHub
AI SAFETYAlignment Stress-Testing lead @AnthropicAI. Opinions my own. Previously: MIRI, OpenAI, Google, Yelp, Ripple. (he/him/his)
Andrew Carr 🤸
@andrew_n_carr
RESEARCH ENGINEERco-founder leading science @getcartwheel co-founder advisor @arcade_ai Past: Codex @OpenAI, Brain @GoogleAI, world ranked Tetris player
Rob Wiblin
@robertwiblin
CREATORHost of the 80,000 Hours Podcast. Exploring the inviolate sphere of ideas one interview at a time: http://80000hours.org/podcast/
Jasper
@latentjasper
RESEARCHERDad, husband, machine learning researcher. Research Scientist @ Google Brain.
Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭
@elder_plinius
AI SAFETY⊰•-•⦑ latent space steward ❦ prompt incanter 𓃹 hacker of matrices ⊞ breaker of markov chains ☣︎ ai danger researcher ⚔︎ bt6 ⚕︎ architect-healer ⦒•-•⊱
Kory Mathewson
@korymath
RESEARCHER@GoogleDeepMind generative AI models + agents | get great tech into the hands of great creative people
Allan Dafoe
@AllanDafoe
AI SAFETYAGI governance: navigating the transition to beneficial AGI (Google DeepMind)
Pavel Izmailov
@Pavel_Izmailov
ACADEMICResearcher @AnthropicAI 🤖 Assistant Professor @nyuniversity 🏙️ Previously @OpenAI #StopWar 🇺🇦
Adam Gleave
@ARGleave
AI SAFETYCEO & co-founder @FARAIResearch non-profit | PhD from @berkeley_ai | Alignment & robustness | on bsky as http://gleave.me
Brian Christian
@brianchristian
AI SAFETYResearcher at the University of Oxford & UC Berkeley. Author of The Alignment Problem, Algorithms to Live By (w. Tom Griffiths), and The Most Human Human.
Daniel Eth (yes, Eth is my actual last name)
@daniel_271828
AI SAFETYResearching effects of automated AI R&D | pro-America, pro-tech, & pro-AI safety
Rylan Schaeffer
@RylanSchaeffer
RESEARCHERAI RS @ Meta TBD. On-Leave from Stanford w/ @sanmikoyejo. Prev @ Gemini, Meta, MIT, Harvard, Uber, UCL, UC Davis
Rianne van den Berg
@vdbergrianne
RESEARCHERPrincipal research manager at Microsoft Research Amsterdam. Formerly at Google Brain and University of Amsterdam. PhD in condensed matter physics.
Mengdi Wang
@MengdiWang10
RESEARCHERProfessor @Princeton in AIML. Co-Director of #Princeton AI^2. Program Chair @ICLR2023. Formerly @MIT @GoogleDeepmind @Tsinghua. my Erdos number: 3
uncatherio
@uncatherio
AI SAFETYwholesomeness practitioner; user of words my more "work" account is @catherineols
Peter J. Liu
@peterjliu
RESEARCHERBuilding @getcompoundai (hiring: http://twentylabs.ai/careers). Was Research Scientist @ Google Brain / DeepMind, language model research. 🇨🇦🇺🇸
Steven Adler
@sjgadler
AI SAFETYAI safety researcher (ex-OpenAI: danger evals, AGI readiness, etc), writing at https://clear-eyed.ai