Publications [#370056] of Lawrence Carin

	Fitzpatrick Institute for Photonics Pratt School of Engineering Duke University
	HOME > pratt > FIP	Search Help Login

Papers Published

Lobel, S; Li, C; Gao, J; Carin, L, RACT: TOWARDS AMORTIZED RANKING-CRITICAL TRAINING FOR COLLABORATIVE FILTERING, 8th International Conference on Learning Representations, ICLR 2020 (January, 2020)
(last updated on 2024/12/31)
Abstract:
We investigate new methods for training collaborative filtering models based on actor-critic reinforcement learning, to more directly maximize ranking-based objective functions. Specifically, we train a critic network to approximate ranking-based metrics, and then update the actor network to directly optimize against the learned metrics. In contrast to traditional learning-to-rank methods that require re-running the optimization procedure for new lists, our critic-based method amortizes the scoring process with a neural network, and can directly provide the (approximate) ranking scores for new lists. We demonstrate the actor-critic's ability to significantly improve the performance of a variety of prediction models, and achieve better or comparable performance to a variety of strong baselines on three large-scale datasets.