TY - GEN
T1 - Generalizing Query Performance Prediction under Retriever and Concept Shifts via Data-driven Correction
AU - Jung, Jaehwan
AU - Jeon, Jong June
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/11/10
Y1 - 2025/11/10
N2 - Query Performance Prediction (QPP) aims to estimate the effectiveness of an information retrieval (IR) system without access to ground-truth relevance judgments. Existing supervised QPP methods typically follow a regression model framework that maps query-document representations to target metrics such as RR@10 or nDCG@10. However, these approaches often suffer from degraded performance under concept shift, where the distribution of relevance given a query-document pair changes between training and test datasets. This paper proposes a novel classification-based framework, QPP-MLC (QPP Multi-Label Classification), which formulates QPP as a multi-label classification task. QPP-MLC infers the relevance of each document among the top-k retrieved results and aggregates these document-level relevance predictions to predict the overall query performance. As a result, QPP-MLC provides a diagnosis tool for the concept shift and a correction method under the concept shift by modulating a threshold level of classification tasks. Experiments on MS MARCO and TREC DL benchmarks show that QPP-MLC achieves strong prediction accuracy and outperforms traditional regression-based QPP methods.
AB - Query Performance Prediction (QPP) aims to estimate the effectiveness of an information retrieval (IR) system without access to ground-truth relevance judgments. Existing supervised QPP methods typically follow a regression model framework that maps query-document representations to target metrics such as RR@10 or nDCG@10. However, these approaches often suffer from degraded performance under concept shift, where the distribution of relevance given a query-document pair changes between training and test datasets. This paper proposes a novel classification-based framework, QPP-MLC (QPP Multi-Label Classification), which formulates QPP as a multi-label classification task. QPP-MLC infers the relevance of each document among the top-k retrieved results and aggregates these document-level relevance predictions to predict the overall query performance. As a result, QPP-MLC provides a diagnosis tool for the concept shift and a correction method under the concept shift by modulating a threshold level of classification tasks. Experiments on MS MARCO and TREC DL benchmarks show that QPP-MLC achieves strong prediction accuracy and outperforms traditional regression-based QPP methods.
KW - concept shift
KW - query performance prediction
KW - robustness
UR - https://www.scopus.com/pages/publications/105023165526
U2 - 10.1145/3746252.3761404
DO - 10.1145/3746252.3761404
M3 - Conference contribution
AN - SCOPUS:105023165526
T3 - CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
SP - 1261
EP - 1271
BT - CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery, Inc
T2 - 34th ACM International Conference on Information and Knowledge Management, CIKM 2025
Y2 - 10 November 2025 through 14 November 2025
ER -