114 tagged AI Safety
deep learning
@AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkIJ2 Past: @openai, @business @theregister. Neural nets, distributed sy...
AI policy researcher, @lfschiavo wife guy, fan of cute animals and sci-fi, executive director of AVERI (https://www.averi.org/), Substacker, views ...
AI resilience at OpenAI Foundation Co-Founder of OpenAI https://t.co/OCQ3mpfyyl
AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. Into @givingwhatwecan.
Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log
Machine learning prof @UofT. Former team lead at Anthropic. Working on generative models, inference, & latent structure.
Philosopher & ethicist trying to make AI be good @AnthropicAI. Personal account. All opinions come from my training data.
Chief AGI Scientist & Co-Founder, Google DeepMind Work website: https://t.co/E4SyeGVYXk Personal blog: https://t.co/LL9JNdNpW1
OpenAI and MIT faculty (on leave)
The original AI alignment person. Understanding the reasons it's difficult since 2003. This is my serious low-volume account. Follow @allTheYud ...
• Center for AI Safety Director • xAI and Scale AI advisor • GELU/MMLU/MATH/HLE • PhD in AI • Analyzing AI models, companies, policies, and geopoli...
Computer Scientist. See also http://windowsontheory.org . @harvard @openai opinions my own.
https://t.co/XSH2wseW3E
Philosophy, futurism, AI. Working on Claude's values @AnthropicAI. Formerly @coeff_giving. Opinions my own.
natural philosopher
Associate Professor of Statistics and EECS, UC Berkeley // Co-founder and CEO, @TransluceAI
Microsoft Research NYC. Fairness, accountability & transparency in AI/ML. NeurIPS & ICML board member, WiML co-founder, sloth enthusiast. She/her.
Princeton CS prof and Director @PrincetonCITP. Coauthor of "AI Snake Oil" and "AI as Normal Technology". https://t.co/ZwebetjZ4n Views mine.
working on AGI alignment. prev: GPT-Neo, the Pile, LM evals, RL overoptimization, scaling SAEs to GPT-4, interp via circuit sparsity. EleutherAI co...
Large language model safety
research @openai
Raising AI risk awareness at http://evitable.com AI prof at Mila. Formerly Cambridge, DeepMind, UK AISI. http://therealartificialintelligence.sub...
Google DeepMind • AI safety, alignment, collaboration • post training • associate professor @ UC Berkeley EECS
Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Similar content in the Skies (this bird has flo...
Research scientist in AI alignment at Google DeepMind. Co-founder of Future of Life Institute @FLI_org. Views are my own and do not represent GDM o...
Chief Scientist at the UK AI Security Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc.
Working towards the safe development of AI for the benefit of all @UMontreal, @LawZero_ & @Mila_Quebec A.M. Turing Award Recipient and most-cited A...
Turtle hatchling trying to make it to the ocean. Preparing for my automation @OpenAI. Contact via email: jclymer@openai.com.
Helping the world prepare for extremely powerful AI. Risk assessment @METR_evals. Writing at Planned Obsolescence (about AI), Good Bones (about wha...
Runs an AI Safety research group in Berkeley (Truthful AI) + Affiliate at UC Berkeley. Past: Oxford Uni, TruthfulQA, Reversal Curse. Prefer email ...
neural network biologist, meditation, jhana brother
full-stack alignment 🥞 @meaningaligned prev: InstructGPT @OpenAI
Professor, researcher, maker of things Book: ATLAS OF AI Latest: CALCULATING EMPIRES https://t.co/f3V3WjBtb6 | MSR-NYC | @USC | Knowing Machines
Associate Prof @MITEECS working on value (mis)alignment in AI systems; Safety & Alignment Advisor at http://Character.AI; @dhadfieldmenell@bsky.soc...
Mechanistic Interpretability lead DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let...
Thinking about AI destroying the world at http://aiimpacts.org and everything at http://worldspiritsockpuppet.substack.com. DM or email for media r...
AI researcher at Anthropic
Professor in Computer Science at UC Berkeley, co-Director of Berkeley RDI Center; Building safe, secure, decentralized AI; Serial entrepreneur
Safety Systems @ OpenAI
Prof, Linguistics, UW // Faculty Director, CLMS // she/her // @emilymbender@dair-community.social & bsky // rep by @ianbonaparte
Consider donating 10% to effective charities: http://www.givingwhatwecan.org/pledge Or a career for impact: http://80000hours.org My research: h...
Occasionally here. Currently research @OpenAI.
Director and Research Scientist, FAIR @ Meta. Former Professor at UCSD. Researcher in AI privacy, security, and generalization.
⊰•-•⦑ latent space steward ❦ prompt incanter 𓃹 hacker of matrices ⊞ breaker of markov chains ☣︎ ai danger researcher ⚔︎ bt6 ⚕︎ architect-healer ⦒•-•⊱
Staff Research Scientist at DeepMind. Artificial & biological brains 🤖 🧠 Societal impacts of AI + Science of AI. Views are my own.
Senior Researcher at Oxford University. Author — The Precipice: Existential Risk and the Future of Humanity.
AGI Safety & Alignment @ Google DeepMind
understanding ourselves and our models @openai
↬🔀🔀🔀🔀🔀🔀🔀🔀🔀🔀🔀→∞ ↬🔁🔁🔁🔁🔁🔁🔁🔁🔁🔁🔁→∞ ↬🔄🔄🔄🔄🦋🔄🔄🔄🔄👁️🔄→∞ ↬🔂🔂🔂🦋🔂🔂🔂🔂🔂🔂🔂→∞ ↬🔀🔀🦋🔀🔀🔀🔀🔀🔀🔀🔀→∞
Research Scientist at Google DeepMind | @FAccTConference OG | Past Twitter META, @hrdag & UPenn, UChicago faculty |
machine learning, science & society @AnthropicAI | recently: Clio, Anthropic Economic Index, Claude Artifacts | prev: phd @StanfordAILab, @stanfordnlp
Academic jack-of-all-trades.
AGI governance: navigating the transition to beneficial AGI (Google DeepMind)
Researcher at the University of Oxford & UC Berkeley. Author of The Alignment Problem, Algorithms to Live By (w. Tom Griffiths), and The Most Human...
research scientist, meta (fair) opinions are my own 🥺 👉👈
Forever expanding my nerd/bimbo Pareto frontier. AI welfare 🤝 AI safety. Managing Director @eleosai, Ex-OpenAI, 2024 @rootsofprogress fellow
Over at Bluesky. Researcher affiliated w @BKCHarvard, Volunteer @evitable. Previously @openai @ainowinstitute. Views mine. #justdontbuildagi #talkt...
Security and Privacy of Machine Learning @Uoft @VectorInst @Google 🇫🇷🇪🇺🇨🇦 Co-author https://t.co/VJF39DQPCu; @CentraleLyon + @PSUEngineering alumnu...
Researching effects of automated AI R&D | pro-America, pro-tech, & pro-AI safety
Research Scientist at @openai since 2017 Robotics, Multi-Agent Reinforcement Learning, LM Reasoning, and now Alignment.
Applying the security mindset to everything @PalisadeAI
Associate professor at UMass Amherst CICS. Alignment, safety, reinforcement learning, imitation learning, and robotics.
Safety and alignment at Meta Superintelligence. Prev: VP of Research at Scale AI, research at Google DeepMind / Brain (Gemini, LaMDA, RL / TFAgents...
Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord. Music, movies, microcode, and high-speed pizza delivery
Chief scientist at Redwood Research (@redwood_ai), focused on technical AI safety research to reduce risks from rogue AIs
AIs aren't people, they're tools we should use wisely. Head of interpretability research at @AiEleuther, but tweets are my own views, not Eleuther's.
AI safeguards & gov. research. PhD student @MIT_CSAIL (mnr. Public Policy), and Fellow at @BKCHarvard. Fmr. @AISecurityInst. https://stephencasper....
Researcher @OpenAI
wholesomeness practitioner; user of words my more "work" account is @catherineols
AI Professor @Harvard; Senior Staff Research Scientist @GoogleAI; @trustworthy_ml #AI #XAI; AI PhD from Stanford; Sloan/Kavli Fellow, MIT TR #35Und...
I study AI systems and those who build them • (gender)queer • they/them