2d ago

Thinking Machines introduces Interaction Models for real-time collaboration

0

Thinking Machines introduced Interaction Models, an AI system that engages users through simultaneous real-time talking, listening, watching, thinking, and collaborating. The company posted its technical approach, early experimental results, and a video demonstration on its blog at thinkingmachines.ai/blog/interaction-models. The demonstration highlights live visual generation and multi-stream exchanges. The work addresses a shift in AI bottlenecks from raw compute or intelligence toward human-AI bandwidth, positioning the models as full-duplex multimodal systems that enable continuous adaptation without turn-based constraints.

Original post

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

1:42 PM · May 11, 2026 View on X
Reposted by

Sharing our work on full-duplex multimodal models -- real-time interaction that's natural and intuitive without compromising on intelligence.

We started Thinky in part to differentially advance capabilities for human-AI collaboration, which are underemphasized relative to intelligence/autonomy because they're harder to eval.

In the future, we think every AI system will have something like an interaction model as the outer user-facing layer, continually keeping the user informed and learning what they actually want.

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:48 PM · May 11, 2026 · 79.6K Views

Seeing the demos come together over the last week has been awesome -- so many things that previously required a special-purpose model (e.g. real-time translation, event detection in video) turn out to be zero-shot instruction following once you have a general-purpose model with the right type signature -- continuous/simultaneous audio+video+text->audio+text

John Schulman@johnschulman2

Sharing our work on full-duplex multimodal models -- real-time interaction that's natural and intuitive without compromising on intelligence. We started Thinky in part to differentially advance capabilities for human-AI collaboration, which are underemphasized relative to intelligence/autonomy because they're harder to eval. In the future, we think every AI system will have something like an interaction model as the outer user-facing layer, continually keeping the user informed and learning what they actually want.

8:48 PM · May 11, 2026 · 79.6K Views
8:50 PM · May 11, 2026 · 6.4K Views

Thinky's secret plan:

1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world

We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:48 PM · May 11, 2026 · 112.9K Views

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book.

Turns out human-human collaboration is important to improving human-AI collaboration. 😊

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:50 PM · May 11, 2026 · 73.8K Views

@soumithchintala congrats Soumith!!

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
9:21 PM · May 11, 2026 · 2.4K Views

Very cool announcement from Thinky!

The model looks nice (they go into some reasonable amount of detail), and reading some parts of the blog you can definitely see that the infea guys had a lot of fun there!

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:41 PM · May 11, 2026 · 49.5K Views

@rown @liliyu_lili @saurabh_garg67 @AndreaMadotto Yes but 40 though 🤔

Rowan Zellers@rown

Our interaction model is the first general video+speech model that's visually proactive. It was super fun working on this with @liliyu_lili / @saurabh_garg67 / @AndreaMadotto and others - after countless versions it was amazing when visual interruptions suddenly worked!

9:27 PM · May 11, 2026 · 9.7K Views
6:01 AM · May 12, 2026 · 1.1K Views

@cHHillee Wait didn't you draw a picture almost like this for a blogpost sometime over a year ago?

Horace He@cHHillee

In modern ML accelerators, FLOPS have absolutely exploded. Often though, the bottleneck is not FLOPS but memory bandwidth. Similarly, model intelligence has exploded, causing the bottleneck to be human&lt;-&gt;AI bandwidth. At Thinky, we think that it’s important to solve this. 1/4

8:48 PM · May 11, 2026 · 91.5K Views
11:32 PM · May 11, 2026 · 4.8K Views

This is the demo that hits me as being genuinely different -- both model and user talking at once! Great stuff.

Congrats on the release @thinkymachines

Thinking Machines@thinkymachines

With the model's simultaneous speech capability, Horace has gotten a lot easier to work with recently.

8:42 PM · May 11, 2026 · 253K Views
10:54 PM · May 11, 2026 · 38.5K Views

@natolambert @thinkymachines Yeah I think concurrency is actually a pretty fundamental property of human interaction! Model talking/listening + user talking, model talking/watching + user doing something on screen (e.g. sports commentary).

Nathan Lambert@natolambert

This is the demo that hits me as being genuinely different -- both model and user talking at once! Great stuff. Congrats on the release @thinkymachines

10:54 PM · May 11, 2026 · 38.5K Views
11:42 PM · May 11, 2026 · 1.4K Views

To stretch this analogy further, when accelerators (humans) are severely limited by bandwidth you have no choice but to move everything into SRAM (make everything fully autonomous). However, this prohibits say, keeping kv-cache in DRAM (having humans contribute). 2/4

Horace He@cHHillee

In modern ML accelerators, FLOPS have absolutely exploded. Often though, the bottleneck is not FLOPS but memory bandwidth. Similarly, model intelligence has exploded, causing the bottleneck to be human&lt;-&gt;AI bandwidth. At Thinky, we think that it’s important to solve this. 1/4

8:48 PM · May 11, 2026 · 91.5K Views
8:48 PM · May 11, 2026 · 5.8K Views

In modern ML accelerators, FLOPS have absolutely exploded. Often though, the bottleneck is not FLOPS but memory bandwidth. Similarly, model intelligence has exploded, causing the bottleneck to be human&lt;-&gt;AI bandwidth. At Thinky, we think that it’s important to solve this. 1/4

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:48 PM · May 11, 2026 · 91.5K Views

@giffmana

Lucas Beyer (bl16)@giffmana

@cHHillee Wait didn't you draw a picture almost like this for a blogpost sometime over a year ago?

11:32 PM · May 11, 2026 · 4.8K Views
11:35 PM · May 11, 2026 · 3.6K Views

@johnschulman2 Congrats!

John Schulman@johnschulman2

Sharing our work on full-duplex multimodal models -- real-time interaction that's natural and intuitive without compromising on intelligence. We started Thinky in part to differentially advance capabilities for human-AI collaboration, which are underemphasized relative to intelligence/autonomy because they're harder to eval. In the future, we think every AI system will have something like an interaction model as the outer user-facing layer, continually keeping the user informed and learning what they actually want.

8:48 PM · May 11, 2026 · 79.6K Views
3:53 AM · May 12, 2026 · 461 Views

Haven’t tried this but it seems very neat…

Yet all of the demos (except maybe one) are the model being fun and/or annoying by correcting or reminding in real time. There are obvious uses for this sort of model in meetings, education, training, etc. Why not demo valuable cases?

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
11:46 PM · May 11, 2026 · 26.6K Views

Great series of demos overall, but why are we spoiling a movie lol?

9:59 PM · May 11, 2026 · 3.4K Views

The Babel fish is here

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:22 PM · May 11, 2026 · 2.2K Views

We are so back!

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:46 PM · May 11, 2026 · 45.8K Views

Our interaction model is the first general video+speech model that's visually proactive. It was super fun working on this with @liliyu_lili / @saurabh_garg67 / @AndreaMadotto and others - after countless versions it was amazing when visual interruptions suddenly worked!

Lili Yu@liliyu_lili

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

8:56 PM · May 11, 2026 · 14.1K Views
9:27 PM · May 11, 2026 · 9.7K Views

@liliyu_lili @saurabh_garg67 @AndreaMadotto If you're interested in working on realtime video+speech specifically, or human AI collaboration more generally, please reach out!

Rowan Zellers@rown

Our interaction model is the first general video+speech model that's visually proactive. It was super fun working on this with @liliyu_lili / @saurabh_garg67 / @AndreaMadotto and others - after countless versions it was amazing when visual interruptions suddenly worked!

9:27 PM · May 11, 2026 · 9.7K Views
9:28 PM · May 11, 2026 · 933 Views

super cool interleaving!!!

Thinking Machines@thinkymachines

While Lilian is telling a story, the interaction model can track when she is thinking, yielding, self-correcting, or inviting a response; there is no specific built dialogue management system.

8:42 PM · May 11, 2026 · 243.6K Views
9:14 PM · May 11, 2026 · 5.2K Views

@lilianweng so cool

Lilian Weng@lilianweng

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

8:50 PM · May 11, 2026 · 73.8K Views
9:20 PM · May 11, 2026 · 957 Views

@soumithchintala That is so awesome. Can't wait to try it.

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
9:05 PM · May 11, 2026 · 872 Views

So many people were bearish on Thinky Machines... they were just quietly building!

Really cool work, TML continues to be one of my favorite neolabs!

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
2:15 AM · May 12, 2026 · 4.8K Views

Making a Thinky compilation thread for today's announcement.

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:28 PM · May 11, 2026 · 18.2K Views
John Schulman@johnschulman2

Sharing our work on full-duplex multimodal models -- real-time interaction that's natural and intuitive without compromising on intelligence. We started Thinky in part to differentially advance capabilities for human-AI collaboration, which are underemphasized relative to intelligence/autonomy because they're harder to eval. In the future, we think every AI system will have something like an interaction model as the outer user-facing layer, continually keeping the user informed and learning what they actually want.

8:48 PM · May 11, 2026 · 79.6K Views
9:29 PM · May 11, 2026 · 1.8K Views
Lilian Weng@lilianweng

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

8:50 PM · May 11, 2026 · 73.8K Views
9:30 PM · May 11, 2026 · 1.1K Views
Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
9:31 PM · May 11, 2026 · 970 Views
Mira Murati@miramurati

We started Thinking Machines to advance human-AI collaboration, and this is our first bet on what that looks like. Most labs treat autonomy as the goal and interactivity as scaffolding around a turn-based core. We think the way we work with AI matters as much as how smart it is. Interactivity has to be in the model, and it has to scale with intelligence rather than trail behind it. https://thinkingmachines.ai/blog/interaction-models/

8:43 PM · May 11, 2026 · 49.5K Views
9:34 PM · May 11, 2026 · 844 Views
Lili Yu@liliyu_lili

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

8:56 PM · May 11, 2026 · 14.1K Views
9:35 PM · May 11, 2026 · 792 Views
Horace He@cHHillee

In modern ML accelerators, FLOPS have absolutely exploded. Often though, the bottleneck is not FLOPS but memory bandwidth. Similarly, model intelligence has exploded, causing the bottleneck to be human&lt;-&gt;AI bandwidth. At Thinky, we think that it’s important to solve this. 1/4

8:48 PM · May 11, 2026 · 91.5K Views
9:36 PM · May 11, 2026 · 787 Views
Alexander Kirillov@_alex_kirillov_

We strongly believe in the bitter lesson. Making interactivity an integral part of the model will outpace any harness-based approach and will lead to a better experience working with AI models. This is a step in that direction.

8:45 PM · May 11, 2026 · 7.1K Views
9:38 PM · May 11, 2026 · 2.9K Views

@soumithchintala Dude.

You just posted your secret plan!!

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
8:35 PM · May 12, 2026 · 83 Views

@soumithchintala please. my commute just got longer. fix it

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
2:15 AM · May 12, 2026 · 809 Views

@soumithchintala Great to see a compelling vision articulated. @humansand is pushing for a similar goal. I'm glad that humans will have a role in the future :)

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
10:43 PM · May 11, 2026 · 941 Views

@johnschulman2 super cool - videos on the blog are neat.

John Schulman@johnschulman2

Sharing our work on full-duplex multimodal models -- real-time interaction that's natural and intuitive without compromising on intelligence. We started Thinky in part to differentially advance capabilities for human-AI collaboration, which are underemphasized relative to intelligence/autonomy because they're harder to eval. In the future, we think every AI system will have something like an interaction model as the outer user-facing layer, continually keeping the user informed and learning what they actually want.

8:48 PM · May 11, 2026 · 79.6K Views
8:52 PM · May 11, 2026 · 1.8K Views

very cool

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:22 PM · May 11, 2026 · 1K Views

we are excited to share our latest work on interactive human-AI collaboration!

as intelligence increases, we think progress will be bottlenecked by the ability of AI to work *with* humans -- thereby enabling AI to positively impact the long tail of human experiences

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:47 PM · May 11, 2026 · 1.6K Views

Early days but what’s most impressive is how natural the interactions are becoming with these omnimodels. Real-time, low-latency interactive AI models unlock applications that are very hard to imagine today. Brace yourselves!

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
1:36 AM · May 12, 2026 · 5.3K Views

We need this asap

Thinking Machines@thinkymachines

With the model's simultaneous speech capability, Horace has gotten a lot easier to work with recently.

8:42 PM · May 11, 2026 · 253K Views
10:17 PM · May 11, 2026 · 1.6K Views
Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:44 PM · May 11, 2026 · 101.7K Views

Working on the interaction models is a lot of fun at TML! I can't imagine doing that in a turn-based world. Building it from scratch makes a lot of things so much easier. I am very excited about the future of natively multi-modal, multi-stream, multi-task models.

Alexander Kirillov@_alex_kirillov_
8:44 PM · May 11, 2026 · 101.7K Views
8:45 PM · May 11, 2026 · 17.5K Views

We strongly believe in the bitter lesson. Making interactivity an integral part of the model will outpace any harness-based approach and will lead to a better experience working with AI models. This is a step in that direction.

Alexander Kirillov@_alex_kirillov_

Working on the interaction models is a lot of fun at TML! I can't imagine doing that in a turn-based world. Building it from scratch makes a lot of things so much easier. I am very excited about the future of natively multi-modal, multi-stream, multi-task models.

8:45 PM · May 11, 2026 · 17.5K Views
8:45 PM · May 11, 2026 · 7.1K Views

Wait, TML is actually cooking??

Interesting model for higher bandwidth human-machine comms

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:26 PM · May 11, 2026 · 14.2K Views

@soumithchintala interesting

Soumith Chintala@soumithchintala

Thinky's secret plan: 1: Increase Human<->AI bandwidth 2: Raise ceiling of human+AI intelligence 3: Help humans continue as main-characters in the new world We are at Step 1. Interaction Models are great real-time collaborative tools for humans. Here's a preview:

8:48 PM · May 11, 2026 · 112.9K Views
6:53 AM · May 13, 2026 · 548 Views

notable

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
10:48 PM · May 11, 2026 · 64.1K Views

Working on pretraining, it often feels like we are building an intricate physical system complete with dynamical laws and dimensional scaling. I did not expect that using our models would also start to feel like interacting with a dynamical system

(1/2)

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
1:53 AM · May 12, 2026 · 23K Views

Interaction models continuously absorb information from the world and think and respond in real time. They are aware of the passage of time, know when to listen and can interrupt based on audio or visual cues. I am so excited by all the work being done here at Thinky

(2/2)

Jeremy Bernstein@jxbz

Working on pretraining, it often feels like we are building an intricate physical system complete with dynamical laws and dimensional scaling. I did not expect that using our models would also start to feel like interacting with a dynamical system (1/2)

1:53 AM · May 12, 2026 · 23K Views
1:53 AM · May 12, 2026 · 1.8K Views

@lilianweng @clarejtbirch Nice. Is this your version of Her?

Lilian Weng@lilianweng

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

8:50 PM · May 11, 2026 · 73.8K Views
9:33 PM · May 11, 2026 · 2.9K Views

I want more proactivity like this

Could I have it track pigeons that walk by my place and then call the police if it’s more than 10 pigeons?

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:36 PM · May 11, 2026 · 62.8K Views

holy shit they made Her

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:05 PM · May 11, 2026 · 281.6K Views

Subtle point: building excellent AI models and agents will likely require us to reconsider organizational structure and communication patterns (of humans!)

Lilian Weng@lilianweng

In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊

8:50 PM · May 11, 2026 · 73.8K Views
2:34 PM · May 12, 2026 · 2.8K Views

If you care about building an AGI future where humans are not left behind, come join us.

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
10:23 PM · May 11, 2026 · 3.5K Views

It's refreshing to see an AI lab go in a different direction! But also, as someone who's obsessed over getting my inverse planning algorithms down to milliseconds of latency, I feel like I'm being gaslit when they caption those 1.5s to 3s delays as "instant reactions" 😵‍💫

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
11:31 AM · May 12, 2026 · 8.9K Views

The real world is physical, the real world is continuous, the real world is real-time

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
11:57 PM · May 12, 2026 · 5.3K Views

Thinking Machines launch should be @prozd and @cHHillee using the model to swap places giving talks at GDC and Jane Street

Thinking Machines@thinkymachines

With the model's simultaneous speech capability, Horace has gotten a lot easier to work with recently.

8:42 PM · May 11, 2026 · 253K Views
11:01 PM · May 11, 2026 · 4K Views

are you getting it yet? now youre thinking in microturns

5:59 AM · May 12, 2026 · 2.7K Views

last fall, I read Walter Ong's Orality and Literacy twice and on my 40k step walks from Potrero Hill to the Presidio, I couldn't stop thinking about the "Some psychodynamics of orality" section:

- additive rather than subordinate - aggregating rather than analytic - close to the human lifeworld - agonistically toned - empathetic and participatory rather than objectively distanced - homeostatic - situational rather than abstract

then I had about 4 months of When We Cease to Understand the World level psychosis, where I repeatedly accused anybody I could of not being responsive enough and not being collaborative and being too objective and not touching the world in a high frequency high fidelity way, lost in their modern plato's cave of literary sauce.

in January, I explained my job as the guy that makes the sous vide machine do weird things it wasn't meant to do so that the chefs I work with can make the best dish of their lives and that someone told me actually that role exists at Lazy Bear and it was what created their asparagus dish:

turns out sous vide machines are designed assuming they would only ever be used with water:

- the motor expects certain viscosity - no way to clean insides this tool design constrains the chef; he cannot sous vide asparagus in asparagus juice

historically - immersion blenders, vitamix -> era of purees - cheap nitro -> foams

in the same way training runtimes are designed shapes the path of AI: - chat is turn based, now training is turn based, there's no synchronicity, there is no time, reality freezes - chat is turn based, what can you scale? ok scale the model turn -> cot -> o1 - and now here we are, sitting on our thumbs, waiting for claude

the shape of a tool is what it enables a creator do an intelligence transcends those limitations

so in a desperate attempt to end my psychosis, we went katabatic, wrote a bunch of rust, argued a lot with @_alex_kirillov_

and saw a bunch of the best chefs in the world begin eliciting flavors and textures I have never experienced before

here are some

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
8:59 PM · May 11, 2026 · 7.6K Views

What if working with AI felt less like a chat box and more like talking to another person? Today we're sharing a preview of how we think about humans and AI collaborating together at @thinkymachines, with full-duplex models that handle interactivity natively.

It's still early, and we have a lot more to do ahead of us!

Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
10:06 PM · May 11, 2026 · 6.9K Views

“natively multi-modal, multi-stream, multi-task models”, good direction.

Alexander Kirillov@_alex_kirillov_
8:44 PM · May 11, 2026 · 101.7K Views
10:02 PM · May 11, 2026 · 5.1K Views

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries.

For audio, this feels natural: listen, speak, interrupt, update.

For video, we think an important version of this is visual proactivity — models that respond when something happens visually:

“Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

Thinking Machines@thinkymachines

Tessa's quality of life has improved a lot with some nagging.

8:42 PM · May 11, 2026 · 223K Views
8:56 PM · May 11, 2026 · 14.1K Views

My go-to test for visual proactivity in realtime systems is live finger counting.

It sounds simple, but it requires the model to watch continuously, track visual changes, and respond at the right time.

Other models we’ve tried could not do it.

thinkingmachines.ai
/blog/interaction-models/#benchmarks:~:text=Examples%20from%20our%20internal%20audio%20and%20video%20benchmark.
Lili Yu@liliyu_lili

We’re interested in AI systems that can collaborate in real time, without relying only on artificial turn boundaries. For audio, this feels natural: listen, speak, interrupt, update. For video, we think an important version of this is visual proactivity — models that respond when something happens visually: “Tell me when I start slouching.” “Count my pushups.” “Say stop when the person stops doing X.”

8:56 PM · May 11, 2026 · 14.1K Views
9:20 PM · May 11, 2026 · 552 Views

Find some joys while pushing Human-AI Collaboration.

and "green tea is ok".

Martin Ziqiao Ma@ziqiao_ma

P.S. The demo is basically my life at thinky: I start to cut coffee, @liliyu_lili is visually prompt-injecting my human intelligence with sweet snack every day, and I've gained weight since joining TML.

8:51 PM · May 11, 2026 · 12.7K Views
9:07 PM · May 11, 2026 · 1.7K Views
Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:29 PM · May 11, 2026 · 895 Views
Thinking Machines@thinkymachines

People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. https://thinkingmachines.ai/blog/interaction-models

8:42 PM · May 11, 2026 · 6.7M Views
9:10 PM · May 11, 2026 · 66.8K Views
Thinking Machines introduces Interaction Models for real-time collaboration · KRO · Digg