[edit]
Representation Effects in Child and Youth Mental Health Emergency Readmission Predictions
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:1028-1035, 2026.
Abstract
Predicting mental health–related emergency department readmission in youth remains challenging, and the role of data representation is underexplored. Using the National Survey on Drug Use and Health (ages 12–18), we compare three representations: (1) structured tabular features, (2) template-generated clinical text, and (3) LLM-derived sentence embeddings. Classical models are trained on tabular data and embeddings, while LLMs are applied to text. Results show that tabular features consistently yield the best and most stable performance. Templated text introduces a representational bottleneck and is less robust under distribution shift, while embeddings preserve some semantics but do not outperform tabular inputs. Representation choice is thus critical for predictive performance.