The discourse surrounding artificial intelligence in the labor market has, for the better part of the last decade, generated significantly more heat than light. We stand at a juncture characterized by a pervasive dichotomy: on one side, dystopian determinism suggesting widespread substitution of human labor; on the other, techno-optimist visions of frictionless augmentation. This binary perspective, while rhetorically convenient, is empirically deficient.
It is within this theoretical vacuum that a seminal contribution emerges. In their paper "Roles of Artificial Intelligence in Collaboration with Humans" published in Management Science, Fügener, Walzner, and Gupta[Fügener, 2025] offer a rigorous framework to disentangle the specific economic mechanisms governing human-AI integration. Unlike previous studies treating "AI adoption" as a monolithic variable, they descend into the granular architecture of the task itself.
Their distinction between between-task complementarity and within-task complementarity serves as what I call the "Archimedean point" of their analysis—a fixed reference from which the entire edifice of optimal task allocation can be understood and operationalized.
1The Great Decoupling of Task and Labor
To understand the gravity of this contribution, we must first contextualize it within the broader history of economic thought regarding technological change. Since the industrial revolution, economists have grappled with the "Luddite fallacy"—the fear that machines would permanently displace labor. The standard neoclassical answer has been that while technology substitutes for labor in specific tasks, it complements labor in others, raising productivity and aggregate demand[Autor, 2015].
However, the AI revolution presents a unique challenge to this equilibrium because it targets cognitive non-routine tasks—the very domain previously identified as the sanctuary of human comparative advantage. Recent work on the "jagged technological frontier"[Dell'Acqua, 2023] suggests that AI capabilities do not scale linearly with task difficulty; rather, they fluctuate unevenly, creating pockets of high performance amidst surprising incompetence.
The Polanyi Paradox
By leveraging the Polanyi paradox—"we know more than we can tell"[Polanyi, 1966]—Fügener et al. construct a model where tacit human knowledge and codified algorithmic rationality interact dynamically. This explains why AI excels at pattern recognition yet fails at tasks requiring contextual judgment.
The fundamental gap in the literature, which this paper addresses, is the lack of a structural model that can predict when AI will substitute for a human (automation) and when it will collaborate with a human (augmentation). Should the radiologist be replaced by the algorithm? Should the algorithm merely advise the radiologist? Or should the radiologist be reallocated to "edge cases" that confound the algorithm entirely?
The answer, as Fügener et al. demonstrate, depends on the precise calibration of complementarity.
2Data and Methodology
To empirically disentangle the effects of automation and augmentation, one requires a data architecture that is both controlled and representative of complex judgment. The authors employ a controlled experimental setting using image classification—a domain where the "Polanyi condition" holds: human decisions rely on tacit knowledge that is difficult to codify into rules.
Experimental Design: Three Treatment Arms
Human Solo: Humans classify images without assistance. Establishes baseline human performance vector (PH).
Augmentation: Humans receive AI recommendations (GoogLeNet Inception v3). Measures the "augmentation benefit."
AI Solo: Deterministic AI performance (pAI) is known for the dataset. The automation baseline.
A critical innovation is the use of the AI's likelihood score (lAI). For every image, the AI assigns a probability to its classification. This score serves as an ex-ante estimator of task difficulty from the machine's perspective. High lAI indicates a task that is "easy" for the machine (codified knowledge), while low lAI indicates a task requiring tacit knowledge.
Using Monte Carlo simulation with 10,000 iterations, the authors model the performance of human crowds. By sampling n humans and aggregating decisions via majority voting, they simulate a "wisdom of crowds" configuration—crucial for the reallocation argument.
The Optimization Framework
The goal is to maximize average performance over a set of tasks T, subject to a fixed constraint on human resources R. This allows empirical isolation of three distinct value sources: substitution benefit, augmentation benefit, and reallocation benefit.
3Empirical Findings
The application of the task allocation framework to experimental data yields stylized facts that challenge prevailing narratives of both pure substitution and pure augmentation.
| Strategy | Accuracy | Gain vs. Baseline | Key Driver |
|---|---|---|---|
| Human Baseline | 68% | — | Tacit knowledge |
| Full Automation (AI Only) | 77% | +9 pp | Pattern recognition |
| Full Augmentation (Human + AI) | 80% | +12 pp | Human discernment |
| Optimal Task Allocation | 88% | +20 pp | Reallocation efficiency |
Source: Fügener, Walzner, & Gupta (2025), Table 3
The economic magnitude is profound. Even more striking is the comparison with "Full Augmentation"—the strategy often championed by proponents of "human-centered AI." The optimal framework outperforms universal augmentation by 8 percentage points. This gain is purely a result of allocative efficiency: by automating "easy" tasks, the framework liberates human attention for reinvestment into "hard" tasks via crowd-based mechanisms.
Between-Task Complementarity
Measures the correlation between human and AI performance across different tasks. Low correlation (b = 0.31 in the study) implies strong complementarity—the AI's "jagged frontier" doesn't map onto the human frontier.
Within-Task Complementarity
Measures human ability to discern valid AI signals from noise within a specific task. The study found rAI = 0.78 for correct advice vs. rAI̅ = 0.52 for incorrect advice.
4The Tripartite Structure of Labor
Perhaps the most significant contribution is the empirical mapping of the division of labor. When the optimization model distributes work based on AI certainty (lAI), a consistent tripartite structure emerges:
Zone of Automation
For tasks where AI confidence is high (the 'easy' cases), the optimal strategy is pure automation. Human involvement would be a waste of resources. These tasks match the AI's training distribution with clear, unambiguous features.
Zone of Augmentation
For tasks of intermediate difficulty—where AI certainty is moderate and comparable to human ability—the optimal configuration is a single human augmented by AI advice. The synergy of within-task complementarity is maximized in this band.
Zone of Human Crowds
For the most challenging tasks, where AI signals are noisy or low-confidence, the framework allocates groups of humans working without AI. When AI is confused, its advice becomes a 'distractor.' Instead, the model leverages the wisdom of crowds.
Counter-Intuitive Finding
Zone III explicitly rejects AI assistance for the hardest tasks. When the AI is confused, its advice can bias humans toward incorrect answers (negative within-task complementarity). The reallocation benefit enables concentration of human resources on these edge cases.
5Strategic Implications
This research fundamentally reframes the question facing organizations. The choice is not "automate or augment?" but rather "how do we optimally allocate tasks across a tripartite structure?" This has profound implications for:
Process Design
Organizations must map their task portfolios onto the tripartite framework, identifying which tasks belong in each zone. This requires understanding both AI confidence distributions and human error profiles.
Workforce Planning
The "reallocation benefit" suggests that automation doesn't simply reduce headcount—it creates capacity for crowd-based quality assurance on high-stakes decisions. HR strategies should plan for this shift.
Training & Development
Workers in Zone II need "meta-cognitive" skills to audit AI outputs effectively. Training programs should focus on calibrating trust—knowing when to defer and when to override.
This framework also connects directly to recent theoretical work on the knowledge economy[Ide, 2025] and organizational hierarchies[Garicano, 2000]. The tripartite structure mirrors the stratification of cognitive labor: routine tasks to machines, complex judgment to human-AI teams, and irreducible tacit knowledge to aggregated human expertise.
6Conclusion: Beyond the Binary
The Fügener, Walzner, and Gupta framework provides the Archimedean point we have been seeking: a fixed reference from which to understand the complex dynamics of human-AI collaboration. Their work demonstrates that the "future of work" is not a monolithic shift but a stratified ecosystem, where the division of labor is dictated by the limits of codification and the persistence of tacit knowledge.
For practitioners, this means moving beyond the "AI everywhere" heuristic. For researchers, it opens new questions about how complementarity structures evolve as AI capabilities improve. And for policymakers, it suggests that labor market impacts will be highly heterogeneous—varying not just by occupation, but by the granular task composition within each role.
Key Takeaways
- Optimal task allocation outperforms both pure automation and universal augmentation by significant margins (+20pp vs. human baseline).
- Between-task complementarity enables automation gains; within-task complementarity enables augmentation gains.
- The tripartite structure—Automation, Augmentation, Human Crowds—provides a practical framework for process design.
- The 'reallocation benefit' is a distinct value driver: labor saved from automation can be concentrated on high-stakes decisions.
- AI assistance can be counterproductive for the hardest tasks when the machine is uncertain—'wisdom of crowds' dominates.
Related Reading
Norms and the Development of New Knowledge in the Era of AI
How artificial intelligence reshapes the hierarchies of tacit knowledge—and the normative trade-offs between efficiency and equity.
Canaries in the Coal Mine
Generative AI as a Determinant of Labor Market Dynamics. New empirical evidence reveals a 13% employment decline for early-career workers in AI-exposed fields.
The Great Fracture: AI, Labor, and the New Industrial Organization
A comprehensive synthesis of late 2025 research revealing a 'Great Fracture' in labor markets and industrial organization.
The Great Sobering in Climate Economics
Structural rigidities and the macroeconomic consequences of the 'missed decade' in climate policy.
References
Fügener, A., Walzner, D. D., & Gupta, A. (2025). Roles of Artificial Intelligence in Collaboration with Humans: Automation, Augmentation, and the Future of Work. Management Science.Link
Autor, D. H. (2015). Why Are There Still So Many Jobs? The History and Future of Workplace Automation. Journal of Economic Perspectives.
Polanyi, M. (1966). The Tacit Dimension. New York: Doubleday.
Dell'Acqua, F. et al. (2023). Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality. Harvard Business School Working Paper.
Ide, E. & Talamàs, E. (2025). Artificial Intelligence in the Knowledge Economy. Journal of Political Economy.
Garicano, L. (2000). Hierarchies and the Organization of Knowledge in Production. Journal of Political Economy.