[edit]
Reclaiming the Loop: From the Consensus Trap to Pluralistic Data Annotation
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:1216-1218, 2026.
Abstract
This research challenges the dominant “ground truth” paradigm in machine learning, arguing that current annotation practices suppress meaningful human disagreement in favor of artificial consensus. It identifies two structural failures in annotation pipelines: the allocation gap (mismatch between annotator identity and data context) and the representation gap (erasure of nuance during label aggregation). The proposed solution introduces a pluralistic annotation infrastructure that incorporates identity-aware task assignment and rationale-aware aggregation to preserve lived experience and dissent. By reframing disagreement as a high-fidelity epistemic signal rather than noise, the work advances a model of situated knowledge stewardship aimed at promoting epistemic justice in AI systems.