Episode Overview
Claude went down, the memes flew, and suddenly developers everywhere were being forced to remember how to… write code. In this Hot Takes episode, Ari unpacks what the outage revealed: AI is becoming operational infrastructure. He also discusses how organizations should start thinking about resilience, failover, and "graceful degradation" in an AI-first world.
Key Takeaways
- Claude going down was a glimpse into what happens when AI becomes infrastructure. Ari’s core takeaway is that AI is no longer a novelty or side experiment for enterprise users. It is now woven into day-to-day work for developers, support teams, researchers, and operators, which means outages increasingly affect not “just” systems, but the people relying on those systems to think and work.
- This outage felt different from older cloud outages because knowledge work itself stopped. When AWS or Azure went down, applications failed but humans still had alternate paths. In this case, the “thinking layer” disappeared. Ari frames that as the truly profound shift: not just technology going offline, but reasoning agents and AI-supported workflows suddenly vanishing midstream.
- AI adoption is scaling like a stampede. Ari contrasts steady infrastructure growth with what happens when a new AI capability drops and millions of users shift behavior all at once. That kind of surge creates a fundamentally different reliability challenge, one that traditional infrastructure may not be equipped to absorb.
- The answer is not panic or retreat; it’s better architecture. Ari argues that organizations need to stop treating AI like magic and start treating it like infrastructure. That means asking harder questions about whether a workflow can keep delivering outcomes even when a model goes down.
- Zero Ticket thinking applies here too: resilience matters more than perfection. Outages will happen. The real goal is not to pretend they won’t, but to design operations so a model failure does not take business outcomes offline. Whether that means alternate agents or different tolerances by use case, the next phase of AI maturity will be defined by how well companies prepare for interruption.
FAQ
Q: Why did the Claude outage feel like such a big deal compared to other recent tech outages?
A: Because this one affected knowledge work directly. Ari explains that past outages often stopped systems while people kept working around them. Here, AI-supported reasoning and task execution stalled out, which made it feel less like losing a server and more like losing a coworker. That’s a sign AI has become deeply embedded in real work.
Timestamp: 4:13–7:22, 20:51–22:50
Q: What does Ari mean when he says AI is scaling through “stampedes” instead of steady growth?
A: He means user behavior shifts all at once. When a model introduces a new capability or users migrate from one platform to another, adoption doesn’t rise gradually like older infrastructure cycles. It surges. That creates a new class of demand problem where providers have to prepare not just for growth, but for sudden waves of usage that hit like Black Friday.
Timestamp: 15:29–16:44, 17:47–18:20
Q: How should enterprises prepare for future AI outages?
A: Ari’s advice is to treat AI like infrastructure, not magic. Teams should think through fallback models, alternate agents, human override paths, and what graceful degradation looks like if the LLM disappears mid-workflow. The question is no longer “How do we prevent every outage?” but “How do we keep business outcomes moving when an outage happens?”
Timestamp: 26:14–29:04, 29:25–30:36
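The fallback pattern Ari describes can be sketched in a few lines: try a primary model, fail over to an alternate, and degrade gracefully to a human path if every model is down. This is a minimal illustration only; the model names, the `call_model` function, and the `OUTAGES` set are hypothetical stand-ins, not a real provider API.

```python
OUTAGES = {"primary-model"}  # simulate the primary provider being down


def call_model(name: str, prompt: str) -> str:
    """Stand-in for a real LLM call; raises to simulate an outage."""
    if name in OUTAGES:
        raise ConnectionError(f"{name} is unavailable")
    return f"[{name}] answer to: {prompt}"


def resilient_complete(prompt: str,
                       models=("primary-model", "backup-model")) -> str:
    """Walk the failover chain, then degrade gracefully."""
    for model in models:
        try:
            return call_model(model, prompt)
        except ConnectionError:
            continue  # this model is down; try the next one in the chain
    # Graceful degradation: no model answered, so route the task to a
    # human queue instead of failing the whole workflow.
    return f"[human-queue] escalated: {prompt}"
```

With the primary simulated as down, `resilient_complete("summarize this ticket")` falls through to the backup model; knock out the backup as well and the request lands in the human queue rather than erroring out, which is the "keep business outcomes moving" behavior the episode argues for.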