Automate 2026 opened this week at Chicago's McCormick Place, and if you walked the floor, one thing was clear: the industry crossed a threshold. Physical AI stopped being a conference theme and became a commercial reality.

NVIDIA's sponsored Humanoid Robot Pavilion was packed. ABB debuted its Physical AI Toolchain, a full software stack for training, validating, and deploying robot AI models at industrial-grade accuracy. FANUC demonstrated cobots responding to natural language commands. This wasn't a demo show. It was a deployment show.

But as we walked the floor, one question kept coming up that almost nobody was answering. Robots are getting smarter. The industry is racing to build the training infrastructure, the simulation frameworks, the sensor stacks. Every major automation vendor has a physical AI story now. But the most important variable on the factory floor is still the one nobody is capturing: human judgment. The judgment calls, the exception handling, the adaptations that no system can fully automate, and that you wouldn't want to.

Physical AI Is Real, and the Race Has Started

At GTC earlier this year, NVIDIA declared 'the big bang of physical AI has started.' Automate was where that declaration met factory reality. And walking the floor, it was hard to argue with. The hardware, the deployments and the investments are real. But it raised a question almost nobody was asking

Walk any factory and ask yourself: what's actually being captured here? Machine data, mostly. Cycle times, sensor readings, equipment states. That data tells you what the machines did, but it doesn't tell you what the people decided. The operator who caught a deviation before it became a defect. The engineer who adjusted a parameter based on something she noticed, not something the system flagged. That human execution layer is what explains why operations actually go right or wrong. Yet, it's still invisible to most of the AI systems that were on that floor.

The robot choice matters. But the more durable dependency is the operational layer underneath it. What system captures how work actually happens, where people, machines, and processes meet? That context is what AI learns from and what makes decisions trustworthy enough to act on. Most manufacturers at Automate were focused on the hardware. Few were asking who owns that layer.

Robots Still Can't Handle the Hard Parts

Robots are genuinely impressive at the work they were designed for. When it comes to repetitive, structured, high-volume tasks, they do that better and faster than any human. But what happens when something falls outside the expected range?

Take a final assembly process where a technician is torquing fasteners on a critical component. A robot can apply torque consistently all day. What it can't do is notice that the parts coming down the line have been sitting in a humid environment overnight and the threading feels slightly off. An experienced operator notices that. They slow down, flag it, and escalate. That call, made in a second, based on years of torquing parts, is the difference between a clean build and a field failure.

That's not a gap the technology will close anytime soon, and honestly, it shouldn't have to. The goal was never full automation. It was always augmentation to free operators from the repetitive load so they can focus on the work that actually requires judgment. The problem is that judgment is almost never captured. Data historians, the systems operations teams have relied on for decades, were designed around machines. They're excellent at recording what equipment did. They were never built to capture what people decided. Factory Playback was built to close that gap. Think of it as a historian for the full operation: machine behavior, human behavior, and critically, the interaction between the two. Because the interaction of an operator catching something a sensor missed or an engineer making a call that prevents a failure is where the real story of your operation lives.

The Layer That Makes Physical AI Actually Work

Think about what's happened in AI over the last few years. ChatGPT changed how people think about what machines can do with language. Claude, Gemini, and the models that followed kept pushing that forward. These systems got extraordinarily good at reasoning over text.

Physical AI is the next chapter. The same shift in capability, but directed at the real world. Not documents and data, but rather machines in motion, operators at work, parts moving through complex processes. AI that can see what's happening on the floor, reason about it, and surface something useful in real time.

But physical AI has a prerequisite that most vendors aren't talking about. You need a data model that originates at the action, not in an extract downstream of it. Most industrial AI today sits on top of lagging records and generates summaries that can't be traced back to what actually happened. That's not a data quality problem, but a structural one.

Tulip was built around a different premise. When apps, agents, and automations all run on one data model, every event on the floor generates governed operational data in real time. AI agents can then surface trends, flag anomalies, and draft reports grounded in that same record, but the human is always involved. Think of it as a continuous loop: composition, execution, analysis, and back again. Engineers design the process. Operators run it. The platform records everything — not just what the machines did, but what the people decided. AI analyzes that record and surfaces what matters. Humans act on it and update the process. Each cycle, the operation gets a little smarter. That's what physical AI actually looks like when it works.

At Automate, we showed Factory Playback at the NVIDIA booth as one concrete expression of what this makes possible. Factory Playback creates a synchronized, time-aligned record of production by combining operational data and video into a unified operational timeline. It reconstructs what actually happened on the floor, step by step — who did what, when, and what the downstream effect was. Not a dashboard, but rather a visual record of how work actually unfolded, traceable back to source.

That's one capability on a platform that's been capturing operational context in production environments for years. The whole industry is now racing to build this layer. We've been building it on real factory floors, with real operators, long before physical AI was a conference theme.

The Question Automate Left Unanswered

The conversation at Automate 2026 was about robots. The smarter robots, the faster robots, the robots that can finally work alongside people without a safety cage. That's a real and important story that will have meaningful impact for many manufacturers.

But the harder problem, and the more valuable one to solve, is what happens in the space between the robot and the outcome. The human decision that catches something before it becomes a defect. The operator adjustment that keeps a line running. The judgment call that never makes it into any system. That's where operations actually live, and it's almost entirely dark to the AI systems the industry is building.

The manufacturers who figure this out first won't just have better robots. They'll have something no competitor can copy: a real-time, traceable record of how their operations actually work, built on years of human and machine data working together. That's the foundation physical AI needs to be more than impressive demos.

Automate made one thing clear: the industry has a convincing answer for the machine side of physical AI. The human side is still waiting for one.

See how Tulip can keep the human judgment in your operations

Take a virtual tour of our experience center and watch on-demand demos of our platform

Day in the life CTA illustration