Questions and Answers
What motivates the use of a logarithmic term when computing TF and IDF values?
- The concept of conditional independence of terms and documents.
- The TF monotonicity assumption.
- The user background describing general information about the user.
- Zipf’s law about the frequency of word occurrences in English texts. (correct)
What is the assumption about terms that appear infrequently in a document according to the TF monotonicity assumption?
- They are less important than terms that appear in many documents.
- They are more important than terms that appear in many documents. (correct)
- They have no effect on the document's relevance.
- They are only relevant to the user's background.
What is the relationship between LSA and LDA?
- LDA is a probabilistic variant of LSA. (correct)
- LSA is a topic modeling approach, while LDA is a metric.
- They are unrelated topic modeling approaches.
- LSA is a probabilistic variant of LDA.
What is a characteristic of the topic modeling approaches pLSA and LDA?
- They assume conditional independence of terms and documents, given a topic z. (correct)
What type of information does the user background describe?
- General, rather static information about the user, e.g., knowledge or demographics. (correct)
What does the average precision (AP) metric account for?
- The positions at which relevant items appear in the ranked result list. (correct)
What is the purpose of context acquisition?
- To obtain context information, which can happen explicitly, implicitly, or by inference. (correct)
What describes the purpose the content creator had in mind when creating an item?
What is one of the reasons why people use recommender systems according to Herlocker et al.?
- To find all good items. (correct)
What is the purpose of the regularization term in the optimization function of an SGD-trained MF model?
- To avoid unbounded values in the w and h vectors and to prevent overfitting. (correct)
What is a common choice for regularization when using stochastic gradient descent (SGD) to create a MF model?
- Tikhonov regularization. (correct)
What is the effect of the mutual proximity approach on hubness in recommender systems?
- It can effectively reduce hubness. (correct)
Is a user bias factor required when computing memory-based CF with binary ratings?
- No, with binary ratings no user bias factor is required. (correct)
How does user-based CF scale with the number of users and items?
- It tends to scale poorly with the number of users and items. (correct)
What is the main idea behind the IDF monotonicity assumption?
- Terms that appear in only a few documents of the corpus are more important than terms that appear in many documents. (correct)
Why is content-based filtering well-suited to recommend “long tail” items?
- Content features are less affected by popularity biases than user ratings. (correct)
Study Notes
Recommender Systems
- According to Herlocker et al., reasons for using recommender systems include finding all good items, finding good items in context, and helping others.
- The regularization term in the optimization function of an SGD-trained MF model is used to avoid unbounded values in the w and h vectors and to prevent overfitting (see the SGD sketch under Model-Based Collaborative Filtering below).
Model-Based Collaborative Filtering
- A common choice for regularization is Tikhonov (L2) regularization, as in the first sketch after this list.
- The mutual proximity approach can effectively reduce hubness in recommender systems, as in the second sketch after this list.
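To make the regularization points concrete, here is a minimal Python sketch of SGD-trained matrix factorization with a Tikhonov (L2) penalty. The function name, hyperparameters, and toy ratings are illustrative assumptions, not taken from the lecture.

```python
import numpy as np

def train_mf(ratings, n_factors=2, lr=0.01, lam=0.1, epochs=200, seed=0):
    """SGD matrix factorization: r_ui ~ w_u . h_i.

    The lam * w / lam * h terms implement Tikhonov (L2) regularization,
    which keeps the factor vectors from growing without bound and thus
    helps prevent overfitting.
    """
    rng = np.random.default_rng(seed)
    n_users = max(u for u, _, _ in ratings) + 1
    n_items = max(i for _, i, _ in ratings) + 1
    W = rng.normal(0, 0.1, (n_users, n_factors))  # user factors (w vectors)
    H = rng.normal(0, 0.1, (n_items, n_factors))  # item factors (h vectors)
    for _ in range(epochs):
        for u, i, r in ratings:
            err = r - W[u] @ H[i]                    # prediction error
            wu = W[u].copy()                         # keep old w for h update
            W[u] += lr * (err * H[i] - lam * W[u])   # regularized updates
            H[i] += lr * (err * wu - lam * H[i])
    return W, H

# Toy data: (user, item, rating) triples.
ratings = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0), (2, 1, 2.0)]
W, H = train_mf(ratings)
print("predicted r(0,2):", W[0] @ H[2])
```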
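And a sketch of the mutual proximity idea, using the empirical variant (which MP variant the lecture uses is an assumption here): a distance d(x, y) is rescaled to the probability that a random third object is farther from both x and y, which symmetrizes neighborhood relations and thereby counteracts hub items.

```python
import numpy as np

def mutual_proximity(D):
    """Empirical mutual proximity for a symmetric distance matrix D.

    MP(x, y) = P(d(x, .) > d(x, y) and d(y, .) > d(y, x)),
    estimated by counting over all other objects. Returned values are
    similarities in [0, 1]; 1 - MP can serve as the rescaled distance.
    """
    n = D.shape[0]
    MP = np.zeros_like(D, dtype=float)
    for x in range(n):
        for y in range(n):
            if x == y:
                continue
            others = [z for z in range(n) if z not in (x, y)]
            both_farther = sum(
                1 for z in others if D[x, z] > D[x, y] and D[y, z] > D[y, x]
            )
            MP[x, y] = both_farther / len(others)
    return MP

# Toy symmetric distance matrix for 4 items.
D = np.array([[0.0, 1.0, 2.0, 3.0],
              [1.0, 0.0, 2.5, 2.0],
              [2.0, 2.5, 0.0, 1.5],
              [3.0, 2.0, 1.5, 0.0]])
print(mutual_proximity(D))
```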
Memory-Based Collaborative Filtering
- With binary ratings, no user bias factor is required when computing memory-based CF, as the sketch after this list illustrates.
- User-based CF (in the memory-based variant) tends to scale poorly with the number of users and items.
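A minimal sketch of user-based, memory-based CF on binary ratings; the function and variable names are illustrative assumptions. Because all ratings live on the same 0/1 scale, no user bias (mean-centering) term is needed; and because scoring one user means comparing against every other user over all items, this variant scales poorly as both grow.

```python
import numpy as np

def recommend_user_based(R, user, k=2):
    """User-based CF on a binary user-item matrix R (1 = consumed).

    No user bias term: with binary ratings every user rates on the
    same 0/1 scale, so mean-centering is unnecessary.
    """
    sims = np.zeros(R.shape[0])
    for v in range(R.shape[0]):
        if v == user:
            continue
        # Cosine similarity between binary profiles.
        denom = np.linalg.norm(R[user]) * np.linalg.norm(R[v])
        sims[v] = (R[user] @ R[v]) / denom if denom else 0.0
    neighbors = np.argsort(sims)[::-1][:k]    # k most similar users
    scores = sims[neighbors] @ R[neighbors]   # similarity-weighted votes
    scores[R[user] == 1] = -np.inf            # mask already-seen items
    return int(np.argmax(scores))

R = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1]])
print("recommend for user 0:", recommend_user_based(R, user=0))
```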
Information Retrieval
- The IDF monotonicity assumption states that terms that appear in only a few documents of the corpus are more important than terms that appear in many documents.
- The use of a logarithmic term when computing TF and IDF values is motivated by Zipf’s law about the frequency of word occurrences in English texts: term frequencies are highly skewed, so the logarithm dampens raw counts and keeps a few very frequent terms from dominating the weights (see the sketch after this list).
- The TF monotonicity assumption states that terms that appear frequently in a document are more important than terms that appear infrequently.
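A minimal sketch of log-dampened TF-IDF under one common convention (tf = 1 + log f for f > 0, idf = log(N / df)); the exact weighting variant used in the lecture is an assumption.

```python
import math

def tf_idf(docs):
    """Compute log-dampened TF-IDF weights for a list of tokenized docs."""
    N = len(docs)
    # Document frequency: in how many docs does each term appear?
    df = {}
    for doc in docs:
        for term in set(doc):
            df[term] = df.get(term, 0) + 1
    weights = []
    for doc in docs:
        w = {}
        for term in set(doc):
            f = doc.count(term)
            tf = 1 + math.log(f)           # log dampens raw counts (Zipf)
            idf = math.log(N / df[term])   # rarer terms get higher weight
            w[term] = tf * idf
        weights.append(w)
    return weights

docs = [["the", "cat", "sat"], ["the", "dog", "sat", "sat"], ["the", "mat"]]
for w in tf_idf(docs):
    print({t: round(v, 3) for t, v in w.items()})  # "the" gets weight 0
```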
Content-Based Filtering
- Content-based filtering is well-suited to recommend “long tail” items because content features are less affected by popularity biases than user ratings.
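A brief sketch of why this works, using scikit-learn's TfidfVectorizer on hypothetical item descriptions: similarities are computed from the text alone, so an item with zero ratings is just as reachable as a popular one. The toy descriptions are invented for illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Toy catalog: the last item is a "long tail" item nobody has rated yet.
descriptions = [
    "space opera with laser battles and empires",
    "romantic comedy set in a small coastal town",
    "obscure space documentary about distant empires",
]
X = TfidfVectorizer().fit_transform(descriptions)
sims = cosine_similarity(X)

# Similarity to item 0 depends only on content, not on rating counts,
# so the unrated long-tail item 2 can still be recommended.
print(sims[0])
```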
Topic Modeling
- The topic modeling approaches pLSA and LDA assume conditional independence of terms and documents, given a topic z (see the factorization below).
- LDA is a probabilistic variant of LSA, not the other way around.
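The conditional-independence assumption can be written out explicitly; this is the standard pLSA factorization (standard model notation, not copied from the notes above):

```latex
% Given a topic z, document d and term w are conditionally independent:
%   P(d, w \mid z) = P(d \mid z)\, P(w \mid z)
% Marginalizing over topics yields the pLSA model:
P(d, w) = \sum_{z} P(z)\, P(d \mid z)\, P(w \mid z)
```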
Context Awareness
- Context acquisition can be explicit, implicit, or inferred.
- The user background describes general, rather static information about the user, e.g., knowledge or demographics.
- The user intent is distinct from the purpose the content creator had in mind when creating the item.