Excited to share Sim2Reason -- training LLMs in simulation to learn Olympiad-level physics (mechanics)! Today, LLMs learn science by reading what humans have already written, absorbing distilled knowledge from textbooks and the internet. But human-annotated physics data is fundamentally scarce, and that bottleneck isn't going away. Analogy to robotics: Sim2Real transformed robotics, where we train in simulation and deploy zero-shot in the real world. We do not try to teach robots by describing physics to them, but they have to experience it. Approach: Our Sim2Reason makes the same bet we made in robotics -- skip the descriptions, go straight to the source. Let models learn directly from simulated worlds, observing how objects move, collide, and interact, much like scientists build intuition through experiment. Result: Models trained purely on simulated experience develop transferable physical reasoning skills, improving even on problems that were never simulated. Zero-shot gains on IPhO, IIT JEE Advanced, OlympiadBench — problems the model never saw during training.
View on X →Context: Quoting @mihirp98: "What if AI learned physics the way Newton did – by experiencing it? We built Sim2Reason: train LLMs inside virtual worlds governed by real physics laws, zero human annotation. Result: +5–10% improvement on International Physics Olympiad, zero-shot. 🧵 @mihirp98: "What if AI learned physics the way Newton did – by experiencing it? We built Sim2Reason: train LLMs inside virtual worlds governed by real physics laws, zero human annotation. Result: +5–10% improvement on International Physics Olympiad, zero-shot. 🧵 https://x.com/mihirp98/status/2044830431850250400/video/1" Tweet: Excited to share Sim2Reason -- training LLMs in simulation to learn Olympiad-level physics (mechanics)! Today, LLMs learn science by reading what humans have already written, absorbing distilled knowledge from textbooks and the internet. But human-annotated physics data is fundamentally scarce, and that bottleneck isn't going away. Analogy to robotics: Sim2Real transformed robotics, where we train in simulation and deploy zero-shot in the real world. We do not try to teach robots by describing physics to them, but they have to experience it. Approach: Our Sim2Reason makes the same bet we made in robotics -- skip the descriptions, go straight to the source. Let models learn directly from simulated worlds, observing how objects move, collide, and interact, much like scientists build intuition through experiment. Result: Models trained purely on simulated experience develop transferable physical reasoning skills, improving even on problems that were never simulated. Zero-shot gains on IPhO, IIT JEE Advanced, OlympiadBench — problems the model never saw during training.
| Time | Views | Likes | Bookmarks | RTs | Replies |
|---|---|---|---|---|---|
| 11:00 AM UTC | +93 | +1 | — | — | — |
| 10:50 AM UTC | +85 | — | — | — | — |
| 10:40 AM UTC | +73 | +1 | +3 | +1 | — |
| 10:30 AM UTC | +128 | +2 | — | +1 | — |
| 10:20 AM UTC | +130 | +1 | — | — | — |
| 10:10 AM UTC | +113 | — | — | — | — |
| 10:00 AM UTC | +84 | +1 | +1 | — | — |
| 9:50 AM UTC | +10 | — | — | — | — |
| 9:40 AM UTC | +217 | +1 | +3 | — | — |
| 9:30 AM UTC | +118 | +2 | — | +1 | — |