|
Hi, I would like to extract relevant information from the HTML pages and free-text as well. I have read several approaches and IE tools. I found that there are some the approaches might be useful, such as: WHISK, RAPIER, RoadRunner, SRV. So, anyone has tried those approaches before or used another ones. I need any comments and review about this problem. Thanks!
This question is marked "community wiki".
|
|
You can use grammar description language to extract information such as http://code.google.com/p/graph-expression/
This answer is marked "community wiki".
|
|
Thanks your reply! So, the open-source which you preferred has any documentation, docs or APIs? And how about powerful of this one for Information Extraction field?
This answer is marked "community wiki".
pretty powerful it used in several NLP commercial startups with some extension as replacer of GATE.
(Jul 21 '11 at 03:33)
yura
|
|
Here are my bookmarks on the topic: http://pinboard.in/u:lrwiman/t:information+extraction I've been collecting all the papers and links I've seen on the topic for the past several months. I hope that's helpful.
This answer is marked "community wiki".
|
|
You can find this blog post helpful. And here is a huge list of approaches and resources which could guide you.
This answer is marked "community wiki".
|