Skip to main content

The Agent Is the Operating Layer

How the Agentic Era Dissolves SaaS, the App, and the Personal Computer as We Know It


"For forty years, you launched apps. Click. Type. With RTX Spark and Microsoft Windows, you ask — and the PC does the work." — Jensen Huang, NVIDIA, GTC Taipei, June 1, 20261

On June 1, 2026, NVIDIA stood on a stage in Taipei and quietly declared the end of the personal computer as we have known it for four decades. The announcement was dressed as a chip launch — the RTX Spark superchip, an Arm-based part fusing a Blackwell GPU with roughly a petaflop of on-device AI compute and up to 128GB of unified memory, built with Microsoft to run agents locally on Windows without ever touching the cloud.1 The press covered it as a hardware story: NVIDIA finally walking into the PC business, a challenge to Apple silicon and Qualcomm.

That framing is too small. RTX Spark is not a faster laptop. It is the physical substrate for a change in who operates the computer. For forty years the answer was: you do. You open the apps. You move the windows. You click, you type, you save, you switch. The machine waited for your hands. The thesis of this paper is that this arrangement is now ending — and that it takes two casualties with it.

The first casualty is SaaS. The second is the PC itself.

A clarification before the argument, because that second casualty is the bold claim: this is a direction with a leading edge, not a switch thrown overnight. It begins where work is digital, bounded, and recoverable (knowledge work and software development) and advances from there, leaving a human at the screen longest where the stakes are high, the rules are strict, or the world is physical. Section 8 scopes exactly how far and how fast.


1. Two deaths, not one

The "SaaSpocalypse" is now a familiar argument, and a correct one. When a general agent can read your data, reason over it, call tools, and complete the job end to end, the software-as-a-service company that used to be that workflow loses its reason to exist as a destination. The user no longer logs in, navigates a UI, and performs the steps. The agent does. SaaS does not vanish; it is unbundled into capabilities that agents call — an API, an MCP server, a tool the agent reaches for and then sets down. The interface, the brand, the seat-based pricing, the daily-active-user moat: those evaporate. What survives is the underlying capability, demoted from product to function call.

This is real, and it is happening. But it is the appetizer.

The main course is the one the industry is still flinching from saying out loud: the personal computer, as a thing humans operate, is becoming obsolete — beginning with digital, bounded, recoverable work and advancing from there. Not the silicon. Not the box on the desk. The operating model — the app sitting on top of an operating system, presented to a human through a graphical shell that the human must learn, navigate, and drive by hand. That entire stack was an interface for a creature that had to do the work itself. The moment the work is delegated, the interface built for doing it becomes vestigial.

SaaS dies because the agent replaces the app. The PC dies — as we know it — because the agent replaces you at the controls.


2. The SaaSpocalypse, in full

Before the larger death, the smaller one deserves to be stated completely — because it is already underway and because its mechanism is the template for everything that follows. (The market story of the SaaSpocalypse — the trillion-dollar repricing of February 2026 — is told in the Preface. This section is about the mechanism underneath that repricing.)

Software-as-a-service was a business model dressed as a product. Strip the dressing and a SaaS application is three things stacked together: a system of record (the database that holds the truth — your customers, invoices, tickets, documents), a set of capabilities (the operations you can perform on that record — create, query, transform, route), and a workflow UI (the screens, forms, and buttons through which a human drives those capabilities by hand). For thirty years these three were welded into one thing you logged into, learned, and paid for per seat.

The agent pulls them apart.

The workflow UI dies first. Its entire purpose was to let a human operate the capabilities. When an agent operates them instead — reading the record, deciding the operation, executing it — the screens have no one left to look at them. A reporting dashboard you used to open every morning is replaced by an agent that reads the same data and tells you only what changed and what to decide. The UI does not get redesigned; it gets bypassed.

The capabilities survive — but demoted from product to function call. What the SaaS company actually sold (send the invoice, route the ticket, run the payroll) becomes a tool the agent reaches for through an API or an MCP server, sets down, and never names to the user. The capability is still valuable. It is no longer a destination. This is the unbundling: the product dissolves into the operations it always was, and those operations become interchangeable parts in an agent's workflow.

The system of record is the prize. Whoever owns the authoritative data — the truth an agent must read to act — holds the position that survives. Agents do not displace the database; they make it the most strategic layer in the stack, because an agent is only as good as the record it reasons over. The SaaS vendors who endure will be the ones who realize they were a database with a UI, and that the UI was the disposable half.

The business model breaks on contact. Seat-based pricing assumes humans in seats clicking through screens. Remove the humans from the screens and per-seat revenue has nothing to meter. Daily-active-users — the metric that justified a decade of valuations — measures human attention on a UI that agents render irrelevant. The moat was switching cost and habit; an agent has no habits and feels no switching cost. The economics that built the SaaS era invert: value migrates from occupying human attention to being callable, trustworthy, and authoritative for an agent that has no loyalty and infinite patience to compare alternatives.

So the SaaSpocalypse is not "SaaS goes away." It is SaaS gets unbundled, and the bundle was the business. The capability survives as a commodity tool. The record survives as the contested high ground. The UI — and the pricing, the brand, the stickiness built on top of it — does not. The companies that thrive will be the ones who stop selling screens to humans and start selling capabilities and data to agents, governed by permissions an agent must respect.

And note the shape of the argument, because Section 3 simply runs it one level deeper: an app is to SaaS what the whole personal computer is to the operating model. The agent unbundles the SaaS product into capability + record + dead UI. It unbundles the PC into compute + OS-as-plumbing + dead shell. Same scalpel. Larger body.


3. The forty-year stack, and why it is ending

Consider the architecture we have all lived inside since the 1980s. At the bottom sits the operating system — Windows, macOS, Linux — managing files, memory, processes, devices, security. On top of it sit applications: Word, Excel, Photoshop, Chrome, Slack. Between the human and all of it sits a graphical shell: the desktop, the taskbar, the dock, the window, the file-and-folder metaphor, the app grid.

The genius of this design was that it gave a human a map of the machine. The tragedy of it is that the human had to read the map and walk every path. To produce a quarterly report you opened the spreadsheet app, you found the file, you wrote the formulas, you exported, you opened the document app, you pasted, you formatted, you opened the mail app, you attached, you sent. Each app was a silo with its own UI you had to master. The OS was the floor you stood on; the apps were the rooms you walked between; you were the one walking.

Every one of those steps is a human compensating for a computer that could not understand intent. The desktop metaphor is a forty-year-old prosthetic for that incapacity. Remove the incapacity — give the machine the ability to understand what you want and carry it out — and the prosthetic has nothing left to do.

This is the precise sense in which the classical OS and its user interface die. The kernel does not die; it survives as plumbing, the way TCP/IP and the BIOS survive — essential, ubiquitous, and invisible to the person using the machine. What dies is the OS as the human's interface, and the app as the unit of human work. The shell stops being the place you live. NVIDIA's own runtime for this era is literally named OpenShell — a new shell layered over the old one, deciding what agents are allowed to touch. Microsoft's new Windows-native agents, by its own description, sit behind the taskbar. The classic UI is being demoted to a thin surface; the action moves behind it.


4. The AI Operating Layer

Here is the architecture that replaces the old one. It does not erase the operating system; it adds a layer above it where humans now live — and pushes the OS down into the basement.

The AI Operating Layer: the four-layer stack, with the OS demoted to plumbing and the AI Operating Layer as the floor the human stands on.

The reversal is the whole story. In the old world, the operating system was the foundation the human stood on and the apps were the things the human reached for. In the new world, the AI Operating Layer is where the human stands. It is not the operating system itself — it is the layer that connects human intent to real work. The human states the goal. The agent reaches down through the old interface layer — opening files, driving the browser, running commands, calling tools — and operates the traditional OS on the human's behalf. The OS becomes something the human never touches again, the way you never touch the engine timing of your car.

The browser chat box was a hint of this, but only a hint: a chat answers questions and keeps you in the chat. A general agent works inside an environment and completes the task there. Answer-in-place versus act-in-the-world — that is the line between an assistant and an operating layer.


5. Personal agents and general agents

The AI Operating Layer is not occupied by one kind of agent but two, and conflating them is the most common error in thinking about this shift. Both live in the layer where the human now stands. They differ in what they are oriented toward: one is oriented toward the work, the other toward the person.

General agents are operators — they do the work. For developers, these are tools like Claude Code and OpenCode: agents that build, test, refactor, and deploy. For knowledge workers and domain experts, they are agents like Claude Cowork and OpenWork: agents that research, analyze, write, and coordinate. A general agent is not a chatbot bolted onto a product. It is a worker that can think, use tools, and complete real work inside a real environment. You summon one for a job; it executes inside the environment; the session ends. It is capability-centric and largely task-scoped — a specialist you bring to a problem.

Personal agents are your delegate — they know you. These are the agents that are always with you, hold your context, plan ahead, and act on your behalf across every task. In the market now forming around RTX Spark these are not hypothetical: independent agent makers including OpenClaw and Nous Research's Hermes have committed to shipping Windows-native agents on this stack, landing on the taskbar alongside the first RTX Spark laptops this fall. A personal agent is oriented toward you, not toward any single task. It is private and local, it carries persistent memory of your work and preferences, it is proactive rather than merely responsive, and it spans all your apps and files. It is your standing representative inside the layer.

These two are the operator and the delegate, and they map exactly onto the structure the Agent Factory Thesis formalizes as the Two-Layer Model: a personal agent at the Edge Layer (the thesis names this layer Identic AI, the self-sovereign agent you own rather than rent) that represents the principal, and a workforce of AI Workers below it that does the work. The thesis tabulates the full distinction — lifespan, memory, initiative, multiplicity — and the governance that rides on it. This paper takes that as settled and asks a different question: what the split does to the interface.

One relationship is worth carrying over, because the interface argument depends on it. You build and manage the personal agent with general agents — Claude Code, OpenCode — configuring its memory, permissions, and skills with the same developer-grade tools that build any other agent. Then at runtime the relationship reverses: the personal agent, knowing your intent, dispatches general agents and AI Workers to do the work and reports back. You manage the chief of staff with developer tools; the chief of staff manages the rest. The thesis develops this as its two modes of general-agent use.

This is not a forecast; an early, narrow version already ran at scale. When Klarna put an AI agent in front of its customer service in 2024, it handled two-thirds of all chats — 2.3 million conversations in the first month — did the equivalent work of roughly 700 full-time agents, cut average resolution time from 11 minutes to under 2, and was credited with a $40 million profit improvement in a single year.2 That is delegation-of-work economics, demonstrated, not theorized. But the same case is also the honest boundary: by 2025 Klarna had reintroduced human agents for complex cases,3 because customer service is one of the easiest workloads — high-volume, structured, authenticated intent — and even there a human floor remained. The lesson cuts both ways and is exactly the right calibration: the substitution is real and large where the task is bounded; it thins out as the task gets ambiguous, regulated, or irreversible. That is the leading edge advancing, not the whole front collapsing at once.

This is the second-order consequence the chip launches gesture at but rarely name: general agents are not only the new interface, they are the new means of production. In manufacturing mode — what the thesis calls Mode 2 — you use a general agent to build the rest of the system: specialized AI Workers and the personal agent that orchestrates them. The agent layer is recursive — agents you talk to, and agents that build the agents that do the work — and that recursion is the engine of the AI-Native Company.

The thesis compresses this to a line: people set direction; agents do the work; companies scale intelligence rather than headcount. This paper is that thesis's companion argument. If the thesis is the architectural case (who does the work, and how the workforce is built), this is the interface case (where the work now happens): no longer in an app on an operating system, but in the operating layer itself.


6. The computer that controls itself

CNN's headline for the RTX Spark moment was blunt and slightly anxious: the world's biggest tech companies are betting big on computers that control themselves. The phrasing captures both the promise and the dread. A computer that controls itself, given your intent, is exactly what the AI Operating Layer delivers — and exactly what four decades of human-operated computing trained us to find uncanny.

The mechanism is now concrete rather than aspirational. RTX Spark provides the local horsepower — on the order of a petaflop and 128GB of unified memory — so that frontier-class models and autonomous agents can run on the device, in your hands, without a round trip to the cloud. NVIDIA's OpenShell runtime decides what an agent is permitted to do, routes sensitive work to local models, and obscures personal information before anything is allowed to leave the machine. Microsoft is wiring agents into Windows itself, governed by a shared security layer that arbitrates what stays local and what, if anything, goes to the cloud.4 NVIDIA frames the result as a computer that moves "from tool to teammate." Microsoft frames it as a new chapter for the PC. They are describing the same thing from two sides: the machine stops being an instrument you play and becomes an actor you direct.

The petaflop on your lap is not for rendering prettier windows. It is the price of admission for a computer that can hold a goal in mind, reason about it, plan, and act — locally, privately, continuously.


7. Why this time is different

Skeptics have earned their skepticism. For at least a decade the giants promised computers that act on your behalf, and what we got was Siri setting timers and Alexa playing music. CNN's sources are right to recall that those efforts largely fell flat. Why should the RTX Spark generation be different?

Three things changed, and they changed together.

The models crossed a capability threshold. The assistants of the 2010s could parse a command; they could not plan, decompose a goal, use tools, recover from error, and carry a task across multiple steps and applications. Today's frontier models can. The difference between "set an alarm" and "reconcile this quarter's expenses against the contracts in my drive and draft the variance memo" is the difference between a parser and an agent. And the threshold is now measurable rather than rhetorical. OSWorld — the benchmark that drops an agent into a real desktop with real applications and awards no partial credit, only "task completed" or "task failed" — tells the story plainly: Stanford's 2026 AI Index records agent success on it climbing from roughly 12% to 66% in about two years, and in late 2025 the first agents crossed OSWorld's ~72% human baseline.5 A capability that sat near 20% two years ago now reaches human-baseline territory on this benchmark — though a 66% average means the frontier is "matches a person on a good share of tasks," not "matches a person on all of them." That is still a line the Siri decade never came near.

The compute came to the device. Agentic work is expensive and latency-sensitive, and much of it touches data you do not want to ship to a server. A petaflop-class local chip with 128GB of unified memory makes always-available, unmetered, private agent compute a native feature of the machine rather than a cloud subscription. That is what RTX Spark, the Copilot+ PC category, and Apple's own on-device M5 push are all racing toward.

The counterfactual makes the stakes plain. Suppose the agents stayed cloud-bound. Then every delegated task is metered per token, latency-bound by a network round trip, and — fatally — requires shipping your files, your codebase, your client data, your medical records off the device to be processed. For consumers that is a privacy non-starter; for regulated enterprises it is a procurement and compliance wall that simply does not come down. A cloud-only agentic era is a thin one: it captures the low-sensitivity, low-volume tasks and stalls at the office door. This is precisely why the hardware is the unlock and not a footnote. Putting frontier-class compute on the machine, under the user's control is what lets the agent touch the sensitive, high-volume work where the real value sits — and it is why NVIDIA, Microsoft, and Apple are spending billions to move the computation to the edge rather than renting it from the center. The death of the human-operated PC is gated on local silicon. Spark is that gate opening.

The OS vendors are rebuilding around it. This is the decisive shift. When the interface change is a third-party app, it is a novelty. When Microsoft rebuilds Windows so that agents sit behind every system surface, and NVIDIA ships the runtime and the silicon to run them, and the major PC makers (Surface, Dell, HP, Lenovo, ASUS, and MSI) launch the machines this fall with Acer and GIGABYTE to follow, and Adobe rebuilds Photoshop and Premiere for the platform, the change is no longer an app. It is the platform. The forty-year stack is being re-poured from the foundation up.

When the model is capable enough, the compute is local enough, and the platform is rebuilt around delegation, the interface for doing it yourself stops being the default. That is what was missing in the Siri decade. It is no longer missing.


8. The honest objections

A serious paper has to concede what is not yet settled. CNN's reporting names two obstacles plainly — cost and trust — but the honest list is longer, and includes the incentives of the people telling this story and the strongest counterargument against it.

Cost. Petaflop-class laptops launching this fall will not be cheap, and the AI-native PC remains, for now, a premium category. Mass obsolescence of the old model is a trajectory, not a Tuesday. The installed base of human-operated machines is enormous and will persist for years.

Trust and control. "A computer that controls itself" is a marketing line and a security nightmare in the same breath. An agent that can open your files, drive your browser, and execute workflows is, by construction, an agent that can be misled, hijacked, or simply wrong at scale. Make it concrete. You give a personal agent standing access to your email to "keep my inbox under control." A counterparty sends a long thread; buried in the quoted history is a line the agent reads as an instruction, and it drafts and sends a contract amendment agreeing to a price change you never approved. No malware, no breach — just an autonomous actor with too broad a grant and no checkpoint. This is the whole game, and it is why the interesting engineering in the RTX Spark stack is not the petaflop — it is OpenShell and the Windows security layer: the machinery that decides what an agent may touch, what stays on the device, and what is allowed to leave it. A properly scoped permission layer is what turns that scenario from a disaster into a non-event: the agent may draft but not send anything that creates a financial obligation; actions above a threshold require human confirmation; every action is logged and reversible. The agentic era's hardest problem is not capability. It is governed capability — permission, auditability, and the ability to say no. The platform that solves trust, not the one with the most FLOPs, wins.

The reliability gap. Delegation only works when the delegate is reliable enough that checking its work is cheaper than doing the work. The same OSWorld numbers that prove the threshold was crossed also mark the gap: ~66% average success means roughly one task in three still fails, and in a ten-step workflow three failures do not yield a 70%-good result — they yield a broken one. For many low-stakes, well-bounded tasks we are past the line. For high-stakes, long-horizon, irreversible tasks we are not yet, and may not be for a while. The transition will be uneven, task by task and domain by domain, not a clean cutover.

The narrator sells the chips. It is worth naming whose story this is. The "new era of computing" was declared from a stage by the company that sells the superchip required to run it, amplified by the OS vendor that monetizes the platform beneath it. That does not make the thesis wrong — the OSWorld data and the platform rebuild are independent of the marketing — but the strongest version of this argument should be suspicious of the loudest voice making it. A vendor has every incentive to compress a decade-long transition into a single keynote. The capability is real; the timeline is being sold.

The hybrid objection — the strongest one. The most serious counterargument is not that agents fail; it is that the durable equilibrium is not full delegation but collaboration: human plus UI plus agent, with the screen surviving as the place a person inspects, corrects, and approves what the agent proposes. On this view the UI does not die; it becomes agent-assisted — fewer clicks, but a human still in the loop and still looking at a screen. This objection has real force, and for high-stakes work it is probably correct for now. But it concedes the structural point: even in the hybrid, the human has moved from operator to reviewer, and the UI has shrunk from the place where work is done to a surface where work is checked. A diff view is not a workspace. The hybrid is not a stable alternative to the thesis; it is the thesis in its transitional phase — the shell thinning toward a confirmation dialog before it disappears.

Scoping the claim. To be exact about what "obsolete" means and does not mean: the claim is that the PC as a human-operated artifact — the app-on-OS model driven by hand through a graphical shell — is on a clear path to obsolescence, beginning with knowledge work and developer work where the tasks are digital, the data is already on the machine, and the cost of an error is recoverable. It is not a claim that this happens uniformly, everywhere, soon. The likely horizon is years, not months; the leading edge is enterprise knowledge work and software development; the trailing edge is high-stakes, regulated, and physical-world tasks where a human will sit at a screen for a long time yet. The PC as hardware does not become obsolete at all — it becomes more essential than ever, because the agent needs the compute. What becomes obsolete is the human's job of driving it.

None of these objections rescue the old model. They set the pace of its retirement, scope where it retires first, and define where the new moats will be. They do not reverse the direction.


9. What dies, what survives, what it means

Let me be precise about the claims, because precision is what separates a thesis from a slogan.

What dies:

  • The app as the unit of human work. You will not "open an app" to get something done. You will state intent.
  • The graphical shell as the place you live. The desktop, the dock, the app grid, the window-shuffling — these become legacy surfaces, demoted behind the agent layer.
  • SaaS as a destination. The login, the navigation, the seat-based UI — unbundled into capabilities agents call.
  • The human as operator. You stop driving the machine. You direct it.

What survives:

  • The operating system as plumbing. Windows, macOS, and Linux do not vanish; they sink beneath the AI Operating Layer and become invisible infrastructure.
  • The underlying capabilities of today's software — as APIs, tools, and MCP servers the agents reach for.
  • The human as the source of intent and judgment. The thing that does not get automated is what to want and whether the result is good.

The blockers will be governance, not nostalgia. It is tempting to imagine the old model dies because people fall in love with agents. They won't, and that isn't the mechanism. The real friction is structural, and it sits one layer below the UI. If the agent is the interface, then a new set of questions becomes load-bearing: Who owns the agent's memory — the accumulated context about you, your work, your company? Who sets and audits its permissions — what it may read, send, spend, and delete? Where does the audit trail live when an autonomous actor takes thousands of actions a day on your behalf, and who is liable when one of them is wrong, in a regulated industry, with real money or real records at stake? Enterprises will not deploy agents at scale until these have answers that satisfy procurement, security, and legal — which is exactly why the strategic prize named above is the governance layer. The companies that win the agentic era will not be the ones with the cleverest agent; they will be the ones who make agent memory, permissions, and auditability enterprise-grade. UI nostalgia will not save the old model. Unsolved governance is what will slow the new one — and solving it is the actual business. (The thesis treats this same trust layer — mandate enforcement, audit trails, and liability — as the real frontier in Agents as Economic Actors.)

What it means for builders. If the agent is the interface, then the strategic ground shifts under everyone. The moat is no longer a beautiful UI or a sticky destination; the agent has no eyes for your interface. The moat becomes: being the layer the agent lives in, being the capability the agent must call, or being the governance layer the agent must obey. For an organization, the opportunity is the AI-Native Company — designing, building, and deploying AI Workers as the actual labor that produces the output, with humans setting direction and composing the teams. The economics are not speculative at the bounded end: one agent doing the work of 700 full-time staff and adding tens of millions to the bottom line, as in the Klarna case, is what "scale intelligence rather than headcount" looks like on a balance sheet. The firm that learns to manufacture and orchestrate agents will out-produce the firm that merely buys more seats of the old software — and the SaaS vendor still pricing by the seat is selling a unit its buyers are about to stop needing.


10. Conclusion: people set direction, agents do the work

On June 1, 2026, NVIDIA put it in eight words: you ask, and the PC does the work. Strip away the chip marketing and what remains is a statement about the end of an era of human–computer interaction that began with the graphical desktop and is ending with the agentic layer.

The SaaSpocalypse is real, but it is the smaller event — the app dissolving into a function call. The larger event is the dissolution of the personal computer as a machine you operate. The classical operating system retreats into plumbing. The graphical shell becomes a legacy surface. Above them rises the AI Operating Layer, where personal agents who know you and general agents who do the work translate your intent into action inside the environment — opening the files, driving the tools, completing the task, and handing you back the result.

The computer is learning to control itself. The human's job is no longer to operate it. The human's job is to decide what is worth doing, to judge whether it was done well, and to set the direction for a machine — and increasingly a workforce of agents — that can now carry the rest.

None of this arrives everywhere at once. It begins where the work is digital, bounded, and recoverable — knowledge work and software development — and it advances task by task, leaving a human at the screen longest where the stakes are high, the rules are strict, or the world is physical. The horizon is years, not a keynote. But the direction does not reverse, because the thing that changed — a machine that understands intent and acts on it — does not un-happen. The era is not "humans use apps." It is "humans delegate work." The interface is no longer a screen full of icons. The interface is the agent.


Grounded in NVIDIA's RTX Spark announcement (GTC Taipei, June 1, 2026), Microsoft's Windows-for-agents disclosures, and CNN's reporting on computers that control themselves. The layered architecture and the personal-agent / general-agent / AI-Worker framing follow the Agent Factory model.


Sources

Footnotes

  1. "NVIDIA and Microsoft Reinvent Windows PCs for the Age of Personal AI," NVIDIA Newsroom, June 2026 — Huang's "you ask — and the PC does the work" quote, RTX Spark as an Arm-based superchip (20-core Grace CPU co-designed with MediaTek, Blackwell RTX GPU, NVLink-C2C), ~1 petaflop on-device AI, up to 128GB unified memory, built with Microsoft for local agents. https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-pcs-agents-rtx-spark Corroborated by Fox Business (https://www.foxbusiness.com/markets/jensen-huang-says-nvidias-new-rtx-spark-chip-reinvent-pc) and Tom's Hardware, which adds the 120B-parameter / 1M-token local-inference figures (https://www.tomshardware.com/laptops/nvidia-enters-the-windows-pc-market-with-rtx-spark). Launch partners this fall — ASUS, Dell, HP, Lenovo, Microsoft Surface, MSI, with Acer and GIGABYTE to follow — per CRN Asia (https://www.crnasia.com/news/2026/components-and-peripherals/three-key-takeaways-from-nvidia-at-computex-2026). 2

  2. "Klarna AI assistant handles two-thirds of customer service chats in its first month," Klarna press release, February 2024 — 2.3M conversations in month one, equivalent work of ~700 full-time agents, resolution time from 11 minutes to under 2, ~$40M projected profit improvement for 2024. https://www.klarna.com/international/press/klarna-ai-assistant-handles-two-thirds-of-customer-service-chats-in-its-first-month/ See also OpenAI's case write-up (https://openai.com/index/klarna/) and Bloomberg on the market reaction (https://www.bloomberg.com/news/articles/2024-02-28/teleperformance-sinks-as-klarna-fuels-worries-over-impact-of-ai).

  3. Klarna's 2025 reintroduction of human agents for complex cases — CX Dive, "Klarna changes its AI tune and again recruits humans for customer service" (the AI still handles ~two-thirds of inquiries; humans re-added for the high-value tier). https://www.customerexperiencedive.com/news/klarna-reinvests-human-talent-customer-service-AI-chatbot/747586/

  4. Microsoft's wiring of agents into Windows and the shared local/cloud security layer are described in the joint NVIDIA–Microsoft announcement above and CNN's June 3, 2026 report on the move (the OpenShell runtime and Windows-native agents). CNN, "The world's biggest tech companies are betting big on computers that control themselves." https://www.cnn.com/2026/06/03/tech/nvidia-rtx-spark-microsoft-windows-ai-laptops

  5. OSWorld success-rate figures (~12% to ~66% in a year; agents crossing the ~72% human baseline) from the Stanford 2026 AI Index, with Simular's Agent S reported at 72.6% in December 2025 (above the 72.36% human baseline). Simular, "Simular's computer-use agent outperforms humans." https://www.simular.ai/articles/simulars-computer-use-agent-outperforms-humans