John Yang (@jyangballin) · Quote tweet
WildChat- and OpenAssistant-type datasets were very useful for understanding the effect of chatbots at scale. Now that SWE-agents are the norm, SWE-chat aims to do the same for AI coding. Lots of fun findings, great effort led by @joabaum!
We present SWE-chat: the first large-scale dataset of coding agent interactions from real users in the wild. In 40% of real coding sessions, the agent writes ~all the code. Users push back 39% of the time – agents almost never stop to check. Data, paper, & findings in the 🧵👇
