[edit]
Beyond Recency Bias: Combining Sequential and Global Collaborative Signals in LLM-Based Generative Recommendation for Sparse Data
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:284-295, 2026.
Abstract
Recommender systems often use multi-stage retrieval and ranking pipelines, where errors made early cannot be fixed later, which hurts overall recommendation quality. LLM-based generative recommendation avoids this by directly generating item identi- fiers, but many methods represent each item as a token sequence, which creates two concrete problems: generation is slow because tokens must be produced step by step, and it can fail due to beam-search local optima, where the correct item is dropped early because its first token has low probability. SETRec addresses these issues by represent- ing each item as an order-agnostic set of tokens and using query-guided simultaneous token generation, so the item’s CF and semantic tokens are generated in parallel without intra-item token dependency. However, SETRec still uses only one collaborative filtering (CF) token, and when that CF signal comes from a sequence-aware model, it is vulnera- ble to recency bias, especially in sparse and cold-start settings where recent interactions dominate. CFs-SETRec reduces this recency bias by adding a second CF token chosen to capture long-term preferences, and combining it with the sequential CF signal, which preserves both short-term behavior and long-term affinity and leads to more balanced recommendations under sparse data.