I am trying to take the MemeTracker data and mine it for the best media websites. Which algorithm would you suggest for this kind of dataset, and why?

I want to extract the websites along with their percentage of top quotes and determine which website is the best, something similar to http://memetracker.org/lag.html

asked Oct 12 '10 at 00:58

zengr

edited Oct 12 '10 at 02:08

Can you explain which problem you want to solve exactly?

(Oct 12 '10 at 01:31) Justin Bayer

updated...

(Oct 12 '10 at 02:09) zengr

Do you mean the problem of downloading web pages with timestamps, or the problem of determining which pages are talking about the same news event?

(Oct 12 '10 at 07:32) Alexandre Passos ♦


User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.