Document Type
Article
Publication Date
2010
Abstract
Rocchio’s similarity-based relevance feedback algorithm, one of the most important query reformation methods in information retrieval, is essentially an adaptive supervised learning algorithm from examples. In practice, Rocchio’s algorithm often uses a fixed query updating factor. When this is the case, we strengthen the linear Ω(n) lower bound obtained by Chen and Zhu (Inf. Retr. 5:61–86, 2002) and prove that Rocchio’s algorithm makes Ω(k(n−k)) mistakes in searching for a collection of documents represented by a monotone disjunction of k relevant features over the n-dimensional binary vector space {0,1}n, when the inner product similarity measure is used. A quadratic lower bound is obtained when k is linearly proportional to n. We also prove an O(k(n−k)3) upper bound for Rocchio’s algorithm with the inner product similarity measure in searching for such a collection of documents with a constant query updating factor and a zero classification threshold.
Recommended Citation
Chen, Z., Fu, B. & Abraham, J. A quadratic lower bound for Rocchio’s similarity-based relevance feedback algorithm with a fixed query updating factor. J Comb Optim 19, 134–157 (2010). https://doi.org/10.1007/s10878-008-9169-6
Publication Title
J Comb Optim
DOI
https://doi.org/10.1007/s10878-008-9169-6
Comments
Reprints and Permissions