Enhancing The Recommendation of High-impact Rare-event Business News for Professionals with LLM-based Augmentation
Keywords:
recommender systems, LLMs, augmentationAbstract
Personalized news recommendation has become an essential tool for professionals around the world to keep track of news events matching their interests and alleviate information overload. Beyond personalization, an essential aspect of useful news recommendations for professional use is that they highlight events that are more significant and of higher impact. However, we find that state-of-the-art recommenders struggle to identify and recommend news about significant events. In this paper, we address this gap as follows. To mitigate the relative scarcity of news about significant events, we use an LLM to create a synthetic dataset of significant news seeded from business-relevant news in the MIND dataset. We train four state-of-the-art recommendation models (MINER, UNBERT, UniTRec, Fastformer) with synthetically enhanced versions of a subset of the MIND dataset. We find that this successfully improves the performance of two of the recommendation models on the MIND-large dataset restricted to news about significant events in terms of the MRR, NDCG@5 and Hit@5 metrics and the performance of UNBERT on the AUC metric. The contribution of this paper is three-fold: we highlight news significance as an important aspect of useful news recommendation, we demonstrate the use of generative LLMs to create synthetic datasets for training on rare data and lastly, we demonstrate that augmenting some recommendation models with more significant news improves news recommendation performance on the MIND dataset.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Felipe Bivort Haiek, Anupriya Ankolekar

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Acorde a estos términos, el material se puede compartir (copiar y redistribuir en cualquier medio o formato) y adaptar (remezclar, transformar y crear a partir del material otra obra), siempre que a) se cite la autoría y la fuente original de su publicación (revista y URL de la obra), b) no se use para fines comerciales y c) se mantengan los mismos términos de la licencia.











