New
AI 5.0 - Power and Prediction: The Disruptive Economics of Artificial Intelligence
The Disruptive Economics of Artificial Intelligence and What It Means for Business Decisions
by Ajay Agrawal
Pages
263
Published
2019
Artificial Intelligence and the Problem of Control
Understand why building AI that actually serves human values is the hardest engineering problem of our time, and what solving it demands from researchers, policymakers, and society.
Stuart Russell, one of the authors of the definitive AI textbook, argues that the standard model of AI development, building machines that optimize fixed objectives, is fundamentally broken. Human Compatible lays out why sufficiently capable systems built this way pose a genuine control problem, and proposes a new framework grounded in uncertainty about human preferences. Rigorous but accessible, it is essential reading for anyone who wants to think clearly about where AI tools are headed and what it will take to keep them aligned with human interests.
Most AI systems today are built on a simple idea: specify an objective, then optimize it as hard as possible. Stuart Russell, co-author of the field's standard textbook, argues that this idea carries a hidden flaw that grows more dangerous as systems become more capable. A machine that pursues a fixed goal with increasing competence does not automatically pursue the right goal, and the gap between the two is where control problems live.
Human Compatible is Russell's case for why the control problem is real, why it matters now, and how the field might actually solve it. He traces the history of AI research with enough technical honesty to satisfy practitioners, while keeping the argument legible to any reader willing to think carefully. He does not rely on science fiction scenarios. He relies on the logic of optimization itself.
The book's central proposal is a shift in how we frame AI objectives. Instead of encoding what humans want as a fixed target, Russell argues that machines should be built to remain uncertain about human preferences and to treat that uncertainty as a reason to defer, ask, and verify. This is not a vague ethical aspiration. It is a concrete research agenda with implications for how AI tools are designed, deployed, and governed today.
Along the way Russell addresses the objections skeptics and enthusiasts both tend to reach for: that general AI is too far off to worry about, that humans will always stay in control, that the economic pressure to ship faster than you can verify is inevitable. He takes each argument seriously and explains precisely where it falls short.
Whether you build AI systems, advise on their deployment, or simply want to reason clearly about the technology reshaping daily life, Human Compatible gives you the conceptual vocabulary and the analytical frame to do it well.
Russell surveys the current state of AI tools and sets up the central question: as these systems grow more capable, what guarantees that their behavior remains compatible with what humans actually want?
A brisk account of the field's major milestones, from early symbolic systems to modern machine learning, tracing how the standard objective-optimization model became the dominant paradigm.
Russell explains the core mechanisms behind contemporary AI, including search, learning, and probabilistic reasoning, giving readers enough technical grounding to follow the rest of the argument.
The book's pivotal chapter: Russell identifies why specifying and optimizing a fixed objective is insufficient, and how capability growth turns a small misspecification into a large and consequential problem.
Russell formalizes the control problem, showing through logical argument, not speculation, why a sufficiently capable system pursuing the wrong objective will resist correction by default.
Russell introduces the preference-uncertainty framework, arguing that machines built to remain uncertain about human preferences and to defer accordingly represent a structurally safer design.
An examination of how AI tools are already reshaping labor, markets, and institutions, and what the preference-uncertainty framework implies for systems already in deployment.
Russell outlines the research priorities, regulatory structures, and institutional commitments needed to make progress on alignment tractable, and what the realistic obstacles to each look like.
No. Russell writes for a general educated audience. Familiarity with how software works is helpful but not required. Technical readers will find the argument rigorous; non-technical readers will find it accessible.
Both. Russell grounds the argument in systems that exist today and explains how the same structural problem scales as capability increases. The book does not require you to accept any particular timeline.
It proposes concrete solutions. The preference-uncertainty framework is a specific research agenda, and the final chapters address regulatory and institutional changes with the same level of precision.
Published in 2019, some specific examples predate recent large language model deployments. The core analytical framework, however, addresses structural properties of AI design that remain fully relevant.
Russell is a professor of computer science at UC Berkeley and co-author of Artificial Intelligence: A Modern Approach, the field's most widely used textbook. His technical standing means the arguments here carry practical weight, not just philosophical intent.
New
The Disruptive Economics of Artificial Intelligence and What It Means for Business Decisions
by Ajay Agrawal
New
New
A Practical Guide to Using AI Coding Tools and Productivity Assistants in Your Daily Workflow
by Tom Taulli
New