Blog | Schym

The discourse surrounding artificial intelligence in the labor market has, for the better part of the last decade, generated significantly more heat than light. We stand at a juncture characterized by a pervasive dichotomy: on one side, dystopian determinism suggesting widespread substitution of human labor; on the other, techno-optimist visions of frictionless augmentation. This binary perspective, while rhetorically convenient, is empirically deficient.

It is within this theoretical vacuum that a seminal contribution emerges. In their paper "Roles of Artificial Intelligence in Collaboration with Humans" published in Management Science, Fügener, Walzner, and Gupta^{[Fügener, 2025]} offer a rigorous framework to disentangle the specific economic mechanisms governing human-AI integration. Unlike previous studies treating "AI adoption" as a monolithic variable, they descend into the granular architecture of the task itself.

Their distinction between between-task complementarity and within-task complementarity serves as what I call the "Archimedean point" of their analysis—a fixed reference from which the entire edifice of optimal task allocation can be understood and operationalized.

1The Great Decoupling of Task and Labor

To understand the gravity of this contribution, we must first contextualize it within the broader history of economic thought regarding technological change. Since the industrial revolution, economists have grappled with the "Luddite fallacy"—the fear that machines would permanently displace labor. The standard neoclassical answer has been that while technology substitutes for labor in specific tasks, it complements labor in others, raising productivity and aggregate demand^{[Autor, 2015]}.

However, the AI revolution presents a unique challenge to this equilibrium because it targets cognitive non-routine tasks—the very domain previously identified as the sanctuary of human comparative advantage. Recent work on the "jagged technological frontier"^{[Dell'Acqua, 2023]} suggests that AI capabilities do not scale linearly with task difficulty; rather, they fluctuate unevenly, creating pockets of high performance amidst surprising incompetence.

Loading diagram...

The Polanyi Paradox

By leveraging the Polanyi paradox—"we know more than we can tell"^{[Polanyi, 1966]}—Fügener et al. construct a model where tacit human knowledge and codified algorithmic rationality interact dynamically. This explains why AI excels at pattern recognition yet fails at tasks requiring contextual judgment.

The fundamental gap in the literature, which this paper addresses, is the lack of a structural model that can predict when AI will substitute for a human (automation) and when it will collaborate with a human (augmentation). Should the radiologist be replaced by the algorithm? Should the algorithm merely advise the radiologist? Or should the radiologist be reallocated to "edge cases" that confound the algorithm entirely?

The answer, as Fügener et al. demonstrate, depends on the precise calibration of complementarity.

2Data and Methodology

To empirically disentangle the effects of automation and augmentation, one requires a data architecture that is both controlled and representative of complex judgment. The authors employ a controlled experimental setting using image classification—a domain where the "Polanyi condition" holds: human decisions rely on tacit knowledge that is difficult to codify into rules.

Experimental Design: Three Treatment Arms

Treatment 1

Human Solo: Humans classify images without assistance. Establishes baseline human performance vector (P^H).

Treatment 2

Augmentation: Humans receive AI recommendations (GoogLeNet Inception v3). Measures the "augmentation benefit."

Counterfactual

AI Solo: Deterministic AI performance (p^AI) is known for the dataset. The automation baseline.

A critical innovation is the use of the AI's likelihood score (l^AI). For every image, the AI assigns a probability to its classification. This score serves as an ex-ante estimator of task difficulty from the machine's perspective. High l^AI indicates a task that is "easy" for the machine (codified knowledge), while low l^AI indicates a task requiring tacit knowledge.

Using Monte Carlo simulation with 10,000 iterations, the authors model the performance of human crowds. By sampling n humans and aggregating decisions via majority voting, they simulate a "wisdom of crowds" configuration—crucial for the reallocation argument.

Loading visualization...

The Optimization Framework

The goal is to maximize average performance over a set of tasks T, subject to a fixed constraint on human resources R. This allows empirical isolation of three distinct value sources: substitution benefit, augmentation benefit, and reallocation benefit.

3Empirical Findings

The application of the task allocation framework to experimental data yields stylized facts that challenge prevailing narratives of both pure substitution and pure augmentation.

Strategy	Accuracy	Gain vs. Baseline	Key Driver
Human Baseline	68%	—	Tacit knowledge
Full Automation (AI Only)	77%	+9 pp	Pattern recognition
Full Augmentation (Human + AI)	80%	+12 pp	Human discernment
Optimal Task Allocation	88%	+20 pp	Reallocation efficiency

Source: Fügener, Walzner, & Gupta (2025), Table 3

The economic magnitude is profound. Even more striking is the comparison with "Full Augmentation"—the strategy often championed by proponents of "human-centered AI." The optimal framework outperforms universal augmentation by 8 percentage points. This gain is purely a result of allocative efficiency: by automating "easy" tasks, the framework liberates human attention for reinvestment into "hard" tasks via crowd-based mechanisms.

Between-Task Complementarity

Measures the correlation between human and AI performance across different tasks. Low correlation (b = 0.31 in the study) implies strong complementarity—the AI's "jagged frontier" doesn't map onto the human frontier.

Enables substitution arbitrage

Within-Task Complementarity

Measures human ability to discern valid AI signals from noise within a specific task. The study found r^AI = 0.78 for correct advice vs. r^AI̅ = 0.52 for incorrect advice.

Enables meta-cognitive auditing

4The Tripartite Structure of Labor

Perhaps the most significant contribution is the empirical mapping of the division of labor. When the optimization model distributes work based on AI certainty (l^AI), a consistent tripartite structure emerges:

Loading visualization...

Zone ILow Difficulty

Zone of Automation

For tasks where AI confidence is high (the 'easy' cases), the optimal strategy is pure automation. Human involvement would be a waste of resources. These tasks match the AI's training distribution with clear, unambiguous features.

Optimal Strategy:Pure AI Automation

Zone IIMedium Difficulty

Zone of Augmentation

For tasks of intermediate difficulty—where AI certainty is moderate and comparable to human ability—the optimal configuration is a single human augmented by AI advice. The synergy of within-task complementarity is maximized in this band.

Optimal Strategy:Human + AI Collaboration

Zone IIIHigh Difficulty

Zone of Human Crowds

For the most challenging tasks, where AI signals are noisy or low-confidence, the framework allocates groups of humans working without AI. When AI is confused, its advice becomes a 'distractor.' Instead, the model leverages the wisdom of crowds.

Optimal Strategy:Aggregated Human Judgment

Counter-Intuitive Finding

Zone III explicitly rejects AI assistance for the hardest tasks. When the AI is confused, its advice can bias humans toward incorrect answers (negative within-task complementarity). The reallocation benefit enables concentration of human resources on these edge cases.

5Strategic Implications

This research fundamentally reframes the question facing organizations. The choice is not "automate or augment?" but rather "how do we optimally allocate tasks across a tripartite structure?" This has profound implications for:

Process Design

Organizations must map their task portfolios onto the tripartite framework, identifying which tasks belong in each zone. This requires understanding both AI confidence distributions and human error profiles.

Workforce Planning

The "reallocation benefit" suggests that automation doesn't simply reduce headcount—it creates capacity for crowd-based quality assurance on high-stakes decisions. HR strategies should plan for this shift.

Training & Development

Workers in Zone II need "meta-cognitive" skills to audit AI outputs effectively. Training programs should focus on calibrating trust—knowing when to defer and when to override.

This framework also connects directly to recent theoretical work on the knowledge economy^{[Ide, 2025]} and organizational hierarchies^{[Garicano, 2000]}. The tripartite structure mirrors the stratification of cognitive labor: routine tasks to machines, complex judgment to human-AI teams, and irreducible tacit knowledge to aggregated human expertise.

6Conclusion: Beyond the Binary

The Fügener, Walzner, and Gupta framework provides the Archimedean point we have been seeking: a fixed reference from which to understand the complex dynamics of human-AI collaboration. Their work demonstrates that the "future of work" is not a monolithic shift but a stratified ecosystem, where the division of labor is dictated by the limits of codification and the persistence of tacit knowledge.

For practitioners, this means moving beyond the "AI everywhere" heuristic. For researchers, it opens new questions about how complementarity structures evolve as AI capabilities improve. And for policymakers, it suggests that labor market impacts will be highly heterogeneous—varying not just by occupation, but by the granular task composition within each role.

Key Takeaways

Optimal task allocation outperforms both pure automation and universal augmentation by significant margins (+20pp vs. human baseline).
Between-task complementarity enables automation gains; within-task complementarity enables augmentation gains.
The tripartite structure—Automation, Augmentation, Human Crowds—provides a practical framework for process design.
The 'reallocation benefit' is a distinct value driver: labor saved from automation can be concentrated on high-stakes decisions.
AI assistance can be counterproductive for the hardest tasks when the machine is uncertain—'wisdom of crowds' dominates.

References

Fügener, A., Walzner, D. D., & Gupta, A. (2025). Roles of Artificial Intelligence in Collaboration with Humans: Automation, Augmentation, and the Future of Work. Management Science.Link

Autor, D. H. (2015). Why Are There Still So Many Jobs? The History and Future of Workplace Automation. Journal of Economic Perspectives.

Polanyi, M. (1966). The Tacit Dimension. New York: Doubleday.

Dell'Acqua, F. et al. (2023). Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality. Harvard Business School Working Paper.

Ide, E. & Talamàs, E. (2025). Artificial Intelligence in the Knowledge Economy. Journal of Political Economy.

Garicano, L. (2000). Hierarchies and the Organization of Knowledge in Production. Journal of Political Economy.

Dr. Michael Schymura

The Archimedean Pointof AI CollaborationAutomation, Augmentation, and the Tripartite Division of Labor