I need to extract a company description from the "About us" page that most websites have. From what I have read, this looks like a query-based summarization problem, where the query could be something like "What does this company manufacture?" or "What services does this company provide?". How should I approach this problem?

asked Jun 22 '11 at 10:17


Saurabh Saxena


This looks more like question-answering than summarization itself.

(Jun 22 '11 at 12:34) Alexandre Passos ♦

If I approach it as a question-answering problem, how can I extract the relevant content?

(Jun 23 '11 at 12:49) Saurabh Saxena

One Answer:

Yes, I think it is a focused summarisation problem. You have to define rules that separate better sentences from worse ones (for example, the sentence contains the company name, or contains a definition word), and then run summarisation to extract the most frequent, non-duplicate sentences.
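A minimal sketch of that kind of rule-based scoring, under some assumptions of mine: "most frequent" is read as average word frequency within the page, "non-duplicate" as low Jaccard token overlap with sentences already picked, and the cue-word list, stopword list, and score weights are all hypothetical values you would tune on real pages.

```python
import re
from collections import Counter

# Hypothetical "definition" cue words; the list is only an illustration.
DEFINITION_CUES = {"is", "are", "provides", "offers", "manufactures",
                   "develops", "specializes"}
# Tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = {"a", "an", "the", "of", "and", "or", "to", "in", "on", "by",
             "for", "with", "is", "are", "we", "us", "our"}


def split_sentences(text):
    # Naive splitter: break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    return re.findall(r"[a-z0-9']+", sentence.lower())


def score_sentence(tokens, company_name, word_freq):
    content = [t for t in tokens if t not in STOPWORDS]
    if not content:
        return 0.0
    # Frequency rule: average page-level frequency of the content words.
    score = sum(word_freq[t] for t in content) / len(content)
    # Rule: the sentence mentions the company name (assumed weight 2.0).
    if " ".join(tokenize(company_name)) in " ".join(tokens):
        score += 2.0
    # Rule: the sentence contains a definition cue word (assumed weight 1.0).
    if DEFINITION_CUES & set(tokens):
        score += 1.0
    return score


def jaccard(a, b):
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def describe_company(text, company_name, max_sentences=2, max_overlap=0.6):
    sentences = split_sentences(text)
    tokenized = [tokenize(s) for s in sentences]
    word_freq = Counter(t for toks in tokenized for t in toks
                        if t not in STOPWORDS)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: score_sentence(tokenized[i], company_name, word_freq),
        reverse=True,
    )
    chosen = []
    for i in ranked:
        if len(chosen) >= max_sentences:
            break
        # Skip sentences that overlap too much with ones already chosen.
        if all(jaccard(tokenized[i], tokenized[j]) <= max_overlap
               for j in chosen):
            chosen.append(i)
    # Return the chosen sentences in their original page order.
    return [sentences[i] for i in sorted(chosen)]
```

You would call it as `describe_company(about_page_text, "Acme Corp")`, where `about_page_text` is the plain text you have already stripped out of the HTML. The splitter and tokenizer are deliberately crude; swapping in a proper sentence tokenizer should not change the overall idea.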

answered Jun 27 '11 at 03:37


yura


User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.