Adaptive Query Performance Prediction for Retrieval-Augmented Generation: Bridging Retrieval Quality and Generation Relevance
Aparajita Sinha, Kunal Chakma
Query Performance Prediction (QPP) estimates retrieval system effectiveness without relevance judgments, a challenge that intensifies in Retrieval-Augmented Generation (RAG) pipelines where retrieval quality directly shapes downstream answer quality. We introduce
RAG-QPP
, a retrieval-centric framework that predicts query difficulty from a twelve-dimensional post-retrieval feature set combining semantic similarity, lexical, and score-distribution signals, extending beyond classical QPP measures (Clarity, WIG, and NQC). Unlike generator-dependent approaches that require model-internal signals such as perplexity or token-level uncertainty, RAG-QPP operates exclusively on post-retrieval features, remaining applicable to any black-box generator without modification. Random Forest, XGBoost, and LightGBM are evaluated across four retrieval paradigms (sparse, dense, hybrid, and late-interaction) on MS MARCO Passage, MS MARCO Document, Natural Questions, and Robust04 datasets. Prediction accuracy is assessed via Pearson