What is it about?

Reinforcement Learning is a powerful technique for decision making. Moreover, while properly optimizing objective, unexpected strategies can be found. We observe cheating strategies in popular recommendation system dataset MovieLens 1M. Resulted recommendations are precise, but unnecessary in the production (the same popular item small subset of possible items is recommended for all users). The new noise injection strategy was proposed for DDPG algorithm. After applying such strategy agent managed to find both precise and diverse recommendations.

Featured Image

Why is it important?

There are many examples of using Deep Deterministic Policy Gradient (DDPG) for recommendations. Nevertheless, for exploration they use different techniques compared to Ornstein-Uhlenbeck noise from non-recsys environments. This particular article successfully incorporate OU noise injection strategy for improving relevance-diversity tradeoff.

Perspectives

Article is related to both multiobjective evaluation and reinforcement learning for recommendations.

Alexey Grishanov
Moscow Institute of Physics and Technology

Read the Original

This page is a summary of: Multiobjective Evaluation of Reinforcement Learning Based Recommender Systems, September 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3523227.3551485.
You can read the full text:

Read

Contributors

The following have contributed to this page