Suppose we have news reader like Google reader. each document has its topics. each topics has its followers. each item when seen by user has mark - is it interest for him or not. Is the are approaches to ranking document feed based on those data?

asked Oct 10 '10 at 12:48

yura's gravatar image

yura
1025374854


2 Answers:

This is exactly the problem we are trying to solve at the moment at FUSELabs Project Emporia. We are crawling Twitter (and many other less beefy sources) for links and try to rank them according to your personal taste (you tell us whether you like the story or not in the UI). We are currently using Matchbox to do the recommendation part. It scales very well to large sets of users and we can do online learning using expectation propagation.

answered Oct 10 '10 at 17:28

Jurgen's gravatar image

Jurgen
99531419

1

Very interesting. Is it opensource?

(Oct 11 '10 at 07:56) yura

I assume you mean whether Matchbox is open source (the rest of the code is making it run on the Azure cloud platform)? Matchbox is published (see link above) so it contains all the details of how to build the factor graph and message passing. It is fairly straightforward to implement Matchbox on top of Infer.NET (http://research.microsoft.com/en-us/um/cambridge/projects/infernet/). Definitely ask on the forum if you need any help constructing it (Infer.NET runs fine on Linux and Apple using Mono I believe).

(Oct 11 '10 at 17:11) Jurgen

You can try to use a standard learning to rank method, such as RankSVM (a good implementation is google's sofia-ml, decribed in the Sculley paper Large scale learing to rank in NIPS 2010) with features such as how many times a user has voted on that feed, how many times a user has voted on that topic, given that we're ranking a document of topic y how many times the user has voted another topic (to cover the fact that topic x might be a good predictor of topic y), etc.

answered Oct 10 '10 at 13:23

Alexandre%20Passos's gravatar image

Alexandre Passos ♦
2554154278421

Your answer
toggle preview

powered by OSQA

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.