TY - JOUR
T1 - Upper bound on the predictability of rating prediction in recommender systems
AU - Xu, En
AU - Zhao, Kai
AU - Yu, Zhiwen
AU - Wang, Hui
AU - Ren, Siyuan
AU - Cui, Helei
AU - Liang, Yunji
AU - Guo, Bin
N1 - Publisher Copyright:
© 2024
PY - 2025/1
Y1 - 2025/1
N2 - The task of rating prediction has undergone extensive scrutiny, employing diverse modeling approaches to enhance accuracy. However, it remains uncertain whether a maximum accuracy, synonymous with predictability, exists for a given dataset, guiding the quest for optimal algorithms. While existing theories quantify predictability in one-dimensional symbol sequences, extending this to multidimensional and heterogeneous data poses challenges, rendering it unsuitable for rating prediction tasks. Our approach initially employs conditional entropy to quantify rating entropy, overcoming its inherent complexity by transforming it into two easily calculable entropies. Unlike conventional entropy measures, we utilize sample entropy to account for the numerical impact of rating sequences. Furthermore, novel metrics for quantifying entropy in numerical sequences are integrated to enhance predictability scaling. Demonstrating the effectiveness of our method across datasets of varying sizes and domains, current leading rating prediction algorithms achieve approximately 80% predictability.
AB - The task of rating prediction has undergone extensive scrutiny, employing diverse modeling approaches to enhance accuracy. However, it remains uncertain whether a maximum accuracy, synonymous with predictability, exists for a given dataset, guiding the quest for optimal algorithms. While existing theories quantify predictability in one-dimensional symbol sequences, extending this to multidimensional and heterogeneous data poses challenges, rendering it unsuitable for rating prediction tasks. Our approach initially employs conditional entropy to quantify rating entropy, overcoming its inherent complexity by transforming it into two easily calculable entropies. Unlike conventional entropy measures, we utilize sample entropy to account for the numerical impact of rating sequences. Furthermore, novel metrics for quantifying entropy in numerical sequences are integrated to enhance predictability scaling. Demonstrating the effectiveness of our method across datasets of varying sizes and domains, current leading rating prediction algorithms achieve approximately 80% predictability.
KW - Information retrieval
KW - Predictability
KW - Rating prediction
KW - Recommender systems
UR - http://www.scopus.com/inward/record.url?scp=85208224302&partnerID=8YFLogxK
U2 - 10.1016/j.ipm.2024.103950
DO - 10.1016/j.ipm.2024.103950
M3 - 文章
AN - SCOPUS:85208224302
SN - 0306-4573
VL - 62
JO - Information Processing and Management
JF - Information Processing and Management
IS - 1
M1 - 103950
ER -