|
Hi, I am very new to this field. I am working to develop a recommendation model where I push the right kind of content(text) to users based on his stated(optional) interests and how he has interacted with content previously.This particularly important to me as I want to (mainly) engage the user in intensive tasks with the content (e.g. summarising, transcreating etc) hence it is absolutely essential to get the reccomendations right. 1. I understand that I should serve him content similar to one he has interacted with previously, and hence there are text features I should look at particularly. Any good pointers as to what I should be looking for? 2. I am not sure collaborative filtering methods would be able to deliver for this probem. Is this so? Please let me know what you think. Any other comments, suggestions in this direction are welcome. Any pointers to papers or websites that already do something similar are welcome. Edit: I should add, recommendation of content based on what other users similar to him have interacted with might not be of much use, as since they(other users) have already begun working on that content (e.g begun summarisation) its a job ideally they should themselves complete. And it wont serve the primary purpose of making the former work on new relevant content. |
|
Although the question is vague on several points, it appears that the intended users of the application are subject matter experts of sorts who will be summarizing and otherwise converting or improving the text content, in ways which imply understanding and expertise in specific domains. From the question, we can maybe assume that the application has (or should have) a list of domains of interest (the list off which users may optionally enumerate their interest). With this in mind a tentative list of features to focus on is
As to the validity of collaborative filtering... That's a broad subject... I'll be back on that... |
|
This is quite an interesting problem, because you have a lot of information to harvest :
So from C you have a matrix, from which you can extract another representation of Documents and Users, using SVD or RSVD or many other methods. This is true unless you have no document watched by two users, in this case, you won't obtain anything usable. So you have two incompatible measures (A and B), and a matrix from which you can extract no information. That's really bad unless you can inject the A and B knowledge into the matrix C.
|
|
Interesting thing about hunch. Yes we are trying to build such taste profiles. So that users interact with data more and more often. Btw is there a technical name for knowledge transfer? Or any paper/article you can point me to. (googling knowledge transfer natural langauge processing is leading me elsewhere) Check this one http://portal.acm.org/citation.cfm?id=1273592, it is called transfer learning, sorry for the mistake
(Jan 02 '11 at 05:52)
Leon Palafox ♦
|
|
You might want to take a look in knowledge transfer, that is, how other kind of data relates statistically to your current space.i.e. How the variety on the movies you watch correlates with the variety of furnitures you have in your house. Is kind of a hot topic right now, and an engine built with a correct implementation of it might be something of an improvement over existing engines. Try looking into "hunch.com" which is a startup that has a fairly similar idea. Other ways are to look into the Netflix Contests, which had as a target to provide recommendations based on previously seen movies, some papers were written and their solutions are more on the practical side than in the theoretical, so if you want to look fast clean implementations that is the way to go. |
|
Thanks for the answer. It has given me good pointers in the direction I should move. To be a little specific, the content transformers would be everyday users, I want to serve them content according to their interest to ensure sustained involvement. Any comments on using OpenCalais for majority of things that you described? Would you suggest any other? |