Abstract
This thesis research aims to conduct a study on a cost-sensitive listwise approach to learning to rank.
Learning to Rank is an area of application in machine learning, typically supervised, to build ranking models for Information Retrieval systems. The training data consists of lists of items with some partial order specified induced by an ordinal score or a binary judgment (relevant/not relevant). The model purpose is to produce a permutation of the items in this list in a way which is close to the rankings in the training data. This technique has been
successfully applied to ranking, and several approaches have been proposed since then, including the listwise approach.
A cost-sensitive version of that is an adaptation of this framework which treats the documents within a list with different probabilities, i.e. attempt to impose weights for the documents with higher cost. We then take this algorithm to the next level by kernelizing the loss and exploring the optimization in different spaces.
Among the different existing likelihood algorithms, we choose ListMLE as primary focus of experimentation, since it has been shown to be the approach with the best empirical performance. The theoretical framework is given along with its mathematical properties.
Experimentation is done on the benchmark LETOR dataset. They contain queries and some characteristics of the retrieved documents and its human judgments on the relevance of the documents on the queries.
Based on that we will show how the Kernel Cost-Sensitive ListMLE performs compared to the baseline Plain Cost-Sensitive ListMLE, ListNet, and RankSVM and show different aspects of the proposed loss function within different families of kernels.
Library of Congress Subject Headings
Ranking and selection (Statistics); Kernel functions; Machine learning
Publication Date
5-9-2016
Document Type
Thesis
Student Type
Graduate
Degree Name
Applied Statistics (MS)
Department, Program, or Center
School of Mathematical Sciences (COS)
Advisor
Ernest Fokoue
Advisor/Committee Member
Joseph Voelkel
Advisor/Committee Member
Steven LaLonde
Recommended Citation
Guimaraes Olinto, Gabriela, "Kernelized Cost-Sensitive Listwise Ranking" (2016). Thesis. Rochester Institute of Technology. Accessed from
https://repository.rit.edu/theses/8996
Campus
RIT – Main Campus
Comments
Physical copy available from RIT's Wallace Library at QA278.75 .G84 2016