Published 2009
| public
Book Section - Chapter
Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem
- Creators
-
Yue, Yisong
- Joachims, Thorsten
Chicago
Abstract
We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, we only require pairwise comparisons which were shown to be reliably inferred from implicit feedback (Joachims et al., 2007; Radlinski et al., 2008b). We will present an algorithm with theoretical guarantees as well as simulation results.
Additional Information
Copyright 2009 by the author(s)/owner(s). The work was funded under NSF Award IIS-0713483, NSF CAREER Award 0237381, and a gift from Yahoo! Research. The first author is also partly funded by a Microsoft Research Graduate Fellowship and a Yahoo! Key Technical Challenges Grant. The authors also thank Robert Kleinberg, Josef Broder and the anonymous reviewers for their helpful comments.Additional details
- Eprint ID
- 49538
- DOI
- 10.1145/1553374.1553527
- Resolver ID
- CaltechAUTHORS:20140910-105428880
- NSF
- IIS-0713483
- NSF CAREER Award
- 0237381
- Yahoo! Research
- Microsoft Research Graduate Fellowship
- Yahoo! Key Technical Challenges Grant
- Created
-
2014-09-11Created from EPrint's datestamp field
- Updated
-
2021-11-10Created from EPrint's last_modified field