Discussion about this post

User's avatar
Jaci Turner's avatar

One thing that stands out across reliability engineering, safety frameworks, and observability is a shared requirement: systems must recognize uncertainty and respond appropriately at the boundary.

In practice, deployment success often hinges not just on containing failures technically, but on whether humans trust a system’s judgment — especially when it chooses not to act. That trust layer is hard to benchmark, but critical at scale.

Rafayel Ghasabyan's avatar

Fantastic piece, Oliver. It highlights the exact tensions we see in the field. At TACTUN, we’ve focused on building the infrastructure for real-time deterministic control (System 1) so that frontier models like RT-2 and π-0.5 can actually run on real machines in the wild. I wrote a short response on our approach here – thank you for spurring this important conversation!

https://rafayelg.substack.com/p/bridging-the-physical-ai-deployment

7 more comments...

No posts

Ready for more?