Active Ranking with Subset-wise Preferences

Aadirupa Saha; Aditya Gopalan

Active Ranking with Subset-wise Preferences

Aadirupa Saha, Aditya Gopalan

Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, PMLR 89:3312-3321, 2019.

Abstract

We consider the problem of probably approximately correct (PAC) ranking

$n$ items by adaptively eliciting subset-wise preference feedback. At each round, the learner chooses a subset of

$k$ items and observes stochastic feedback indicating preference information of the winner (most preferred) item of the chosen subset drawn according to a Plackett-Luce (PL) subset choice model unknown a priori. The objective is to identify an

$\epsilon$ -optimal ranking of the

$n$ items with probability at least

$1 - \delta$ . When the feedback in each subset round is a single Plackett-Luce-sampled item, we show

$(\epsilon, \delta)$ -PAC algorithms with a sample complexity of

$O\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$ rounds, which we establish as being order-optimal by exhibiting a matching sample complexity lower bound of

$\Omega\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$ —this shows that there is essentially no improvement possible from the pairwise comparisons setting (

$k = 2$ ). When, however, it is possible to elicit top-

$m$ (

$\leq k$ ) ranking feedback according to the PL model from each adaptively chosen subset of size

$k$ , we show that an

$(\epsilon, \delta)$ -PAC ranking sample complexity of

$O\left(\frac{n}{m \epsilon^2} \ln \frac{n}{\delta} \right)$ is achievable with explicit algorithms, which represents an

$m$ -wise reduction in sample complexity compared to the pairwise case. This again turns out to be order-wise unimprovable across the class of symmetric ranking algorithms. Our algorithms rely on a novel {pivot trick} to maintain only

$n$ itemwise score estimates, unlike

$O(n^2)$ pairwise score estimates that has been used in prior work. We report results of numerical experiments that corroborate our findings.

Cite this Paper

BibTeX


@InProceedings{pmlr-v89-saha19a,
  title = 	 {Active Ranking with Subset-wise Preferences},
  author =       {Saha, Aadirupa and Gopalan, Aditya},
  booktitle = 	 {Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics},
  pages = 	 {3312--3321},
  year = 	 {2019},
  editor = 	 {Chaudhuri, Kamalika and Sugiyama, Masashi},
  volume = 	 {89},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {16--18 Apr},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v89/saha19a/saha19a.pdf},
  url = 	 {https://proceedings.mlr.press/v89/saha19a.html},
  abstract = 	 {We consider the problem of probably approximately correct (PAC) ranking $n$ items by adaptively eliciting subset-wise preference feedback. At each round, the learner chooses a subset of $k$ items and observes stochastic feedback indicating preference information of the winner (most preferred) item of the chosen subset drawn  according to a Plackett-Luce (PL) subset choice model unknown a priori. The objective is to identify an $\epsilon$-optimal ranking of the $n$ items with probability at least $1 - \delta$. When the feedback in each subset round is a single Plackett-Luce-sampled item, we show $(\epsilon, \delta)$-PAC algorithms with a sample complexity of $O\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$ rounds, which we establish as being order-optimal by exhibiting a matching sample complexity lower bound of $\Omega\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$—this shows that there is essentially no improvement possible from the pairwise comparisons setting ($k = 2$). When, however, it is possible to elicit top-$m$ ($\leq k$) ranking feedback according to the PL model from each adaptively chosen subset of size $k$, we show that an $(\epsilon, \delta)$-PAC ranking sample complexity of $O\left(\frac{n}{m \epsilon^2} \ln \frac{n}{\delta} \right)$ is achievable with explicit algorithms, which represents an $m$-wise reduction in sample complexity compared to the pairwise case. This again turns out to be order-wise unimprovable across the class of symmetric ranking algorithms. Our algorithms rely on a novel {pivot trick} to maintain only $n$ itemwise score estimates, unlike $O(n^2)$ pairwise score estimates that has been used in prior work. We report results of numerical experiments that corroborate our findings.}
}

Endnote

%0 Conference Paper
%T Active Ranking with Subset-wise Preferences
%A Aadirupa Saha
%A Aditya Gopalan
%B Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2019
%E Kamalika Chaudhuri
%E Masashi Sugiyama	
%F pmlr-v89-saha19a
%I PMLR
%P 3312--3321
%U https://proceedings.mlr.press/v89/saha19a.html
%V 89
%X We consider the problem of probably approximately correct (PAC) ranking $n$ items by adaptively eliciting subset-wise preference feedback. At each round, the learner chooses a subset of $k$ items and observes stochastic feedback indicating preference information of the winner (most preferred) item of the chosen subset drawn  according to a Plackett-Luce (PL) subset choice model unknown a priori. The objective is to identify an $\epsilon$-optimal ranking of the $n$ items with probability at least $1 - \delta$. When the feedback in each subset round is a single Plackett-Luce-sampled item, we show $(\epsilon, \delta)$-PAC algorithms with a sample complexity of $O\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$ rounds, which we establish as being order-optimal by exhibiting a matching sample complexity lower bound of $\Omega\left(\frac{n}{\epsilon^2} \ln \frac{n}{\delta} \right)$—this shows that there is essentially no improvement possible from the pairwise comparisons setting ($k = 2$). When, however, it is possible to elicit top-$m$ ($\leq k$) ranking feedback according to the PL model from each adaptively chosen subset of size $k$, we show that an $(\epsilon, \delta)$-PAC ranking sample complexity of $O\left(\frac{n}{m \epsilon^2} \ln \frac{n}{\delta} \right)$ is achievable with explicit algorithms, which represents an $m$-wise reduction in sample complexity compared to the pairwise case. This again turns out to be order-wise unimprovable across the class of symmetric ranking algorithms. Our algorithms rely on a novel {pivot trick} to maintain only $n$ itemwise score estimates, unlike $O(n^2)$ pairwise score estimates that has been used in prior work. We report results of numerical experiments that corroborate our findings.

APA


Saha, A. & Gopalan, A.. (2019). Active Ranking with Subset-wise Preferences. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 89:3312-3321 Available from https://proceedings.mlr.press/v89/saha19a.html.

Active Ranking with Subset-wise Preferences

Abstract

Cite this Paper

Related Material