Deepak Pathak: Excited to share Sim2Reason -- training LLMs in simulation to learn Olympiad-level physics (mechanics)! Today, LLMs learn science by readin

#334

Deepak Pathak

@pathak2206· 27.7K followers

CEORank #334

Original Tweet

Excited to share Sim2Reason -- training LLMs in simulation to learn Olympiad-level physics (mechanics)! Today, LLMs learn science by reading what humans have already written, absorbing distilled knowledge from textbooks and the internet. But human-annotated physics data is fundamentally scarce, and that bottleneck isn't going away. Analogy to robotics: Sim2Real transformed robotics, where we train in simulation and deploy zero-shot in the real world. We do not try to teach robots by describing physics to them, but they have to experience it. Approach: Our Sim2Reason makes the same bet we made in robotics -- skip the descriptions, go straight to the source. Let models learn directly from simulated worlds, observing how objects move, collide, and interact, much like scientists build intuition through experiment. Result: Models trained purely on simulated experience develop transferable physical reasoning skills, improving even on problems that were never simulated. Zero-shot gains on IPhO, IIT JEE Advanced, OlympiadBench — problems the model never saw during training.

View on X →

AI Classification

Whether our pipeline considers this post AI-relevant

Noise

Enriched Text

Assembled input used for vector embedding and topic clustering

Context: Quoting @mihirp98: "What if AI learned physics the way Newton did – by experiencing it? We built Sim2Reason: train LLMs inside virtual worlds governed by real physics laws, zero human annotation. Result: +5–10% improvement on International Physics Olympiad, zero-shot. 🧵 @mihirp98: "What if AI learned physics the way Newton did – by experiencing it? We built Sim2Reason: train LLMs inside virtual worlds governed by real physics laws, zero human annotation. Result: +5–10% improvement on International Physics Olympiad, zero-shot. 🧵 https://x.com/mihirp98/status/2044830431850250400/video/1" Tweet: Excited to share Sim2Reason -- training LLMs in simulation to learn Olympiad-level physics (mechanics)! Today, LLMs learn science by reading what humans have already written, absorbing distilled knowledge from textbooks and the internet. But human-annotated physics data is fundamentally scarce, and that bottleneck isn't going away. Analogy to robotics: Sim2Real transformed robotics, where we train in simulation and deploy zero-shot in the real world. We do not try to teach robots by describing physics to them, but they have to experience it. Approach: Our Sim2Reason makes the same bet we made in robotics -- skip the descriptions, go straight to the source. Let models learn directly from simulated worlds, observing how objects move, collide, and interact, much like scientists build intuition through experiment. Result: Models trained purely on simulated experience develop transferable physical reasoning skills, improving even on problems that were never simulated. Zero-shot gains on IPhO, IIT JEE Advanced, OlympiadBench — problems the model never saw during training.

Current Stats

18.5KViews

174Likes

24Retweets

5Replies

96Bookmarks

1Quotes

Engagement Timeline(95 snapshots)

Time	Views	Likes	Bookmarks	RTs	Replies
11:00 AM UTC	+93	+1	—	—	—
10:50 AM UTC	+85	—	—	—	—
10:40 AM UTC	+73	+1	+3	+1	—
10:30 AM UTC	+128	+2	—	+1	—
10:20 AM UTC	+130	+1	—	—	—
10:10 AM UTC	+113	—	—	—	—
10:00 AM UTC	+84	+1	+1	—	—
9:50 AM UTC	+10	—	—	—	—
9:40 AM UTC	+217	+1	+3	—	—
9:30 AM UTC	+118	+2	—	+1	—

Time

Views

Likes

Bookmarks

RTs

Replies

11:00 AM UTC

+93

—

10:50 AM UTC

+85

—

10:40 AM UTC

+73

—

10:30 AM UTC

+128

—

10:20 AM UTC

+130

—

10:10 AM UTC

+113

—

10:00 AM UTC

+84

—

9:50 AM UTC

+10

—

9:40 AM UTC

+217

—

9:30 AM UTC

+118

—