rw-book-cover

Metadata

Highlights

  • In the final stage of a recommendation system, the system can re-rank the candidates to consider additional criteria or constraints. One re-ranking approach is to use filters that remove some candidates. (View Highlight)
  • Another re-ranking approach is to manually transform the score returned by the ranker. (View Highlight)
  • This section briefly discusses freshness, diversity, and fairness. These factors are among many that can help improve your recommendation system. Some of these factors often require modifying different stages of the process. Each section offers solutions that you might apply individually or collectively. (View Highlight)
  • Freshness Most recommendation systems aim to incorporate the latest usage information, such as current user history and the newest items. Keeping the model fresh helps the model make good recommendations. (View Highlight)
  • Re-run training as often as possible to learn on the latest training data. We recommend warm-starting the training so that the model does not have to re-learn from scratch. Warm-starting can significantly reduce training time. For example, in matrix factorization, warm-start the embeddings for items that were present in the previous instance of the model. (View Highlight)
  • Create an “average” user to represent new users in matrix factorization models. You don’t need the same embedding for each user—you can create clusters of users based on user features. (View Highlight)
  • Use a DNN such as a softmax model or two-tower model. Since the model takes feature vectors as input, it can be run on a query or item that was not seen during training. (View Highlight)
  • Add document age as a feature. For example, YouTube can add a video’s age or the time of its last viewing as a feature. (View Highlight)
  • If the system always recommend items that are “closest” to the query embedding, the candidates tend to be very similar to each other. This lack of diversity can cause a bad or boring user experience. For example, if YouTube just recommends videos very similar to the video the user is currently watching, such as nothing but owl videos (as shown in the illustration), the user will likely lose interest quickly. Solutions • Train multiple candidate generators using different sources. • Train multiple rankers using different objective functions. • Re-rank items based on genre or other metadata to ensure diversity. (View Highlight)