Papernotes: Document Summarization for Answering Non-Factoid Queries

Paper url:

  1. Good abandonment – where users leave the search engine without reading any webpage, because the answer is provided on the SERP.
  2. Non-factoid queries are more frequently asked on the web.
  3. Past work – providing passage-level answers to non-factoid queries.
  4. Summarization could be better – because answers might be in different sentences scattered in the underlying document.
  5. Answer biased summary – extracting a summary from each retrieved document that is expected to contain answers to a non-factoid query.
  6. Designed to hint at the whereabouts of likely answers.
  7. Using Community Question Answering (CQA) content to guide the extraction of answer-biased summaries.
  8. Why bother if CQA is present? a) better summaries than CQA answers, b) even imperfect CQA answers can help find summaries, c) a learning-to-rank based model helps extract summaries even where CQA answers are not available.
  9. Contributions:
    1. Novel use of CQA content in a summarization algorithm for locating answer-bearing sentences in the document.
    2. Three optimization-based methods and a learning-to-rank based method for answering non-factoid queries.
    3. Analyse the effect of CQA quality on such methods.

The paper then goes on to propose 3 optimization methods, solved using CPLEX. There is also some discussion of the learning-to-rank model, to be used when CQA content is not available.
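
To make the idea of an answer-biased summary concrete, here is a minimal sketch in Python. It is not the paper's formulation: the paper solves optimization problems with CPLEX, while this sketch swaps in a simple greedy heuristic that scores each document sentence by term overlap with a CQA answer and picks sentences under a word budget. The function names, the tiny stopword list and the `max_words` parameter are all my own assumptions for illustration.

```python
import re

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "and", "of", "the", "then", "to", "was", "with", "my"}


def tokenize(text):
    """Lowercase the text and return its set of non-stopword terms."""
    return {t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOPWORDS}


def split_sentences(document):
    """Naive sentence splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]


def answer_biased_summary(document, cqa_answer, max_words=30):
    """Greedily pick the sentences that share the most terms with the CQA answer.

    This is a stand-in heuristic for the paper's optimization methods:
    sentences are ranked by term overlap and added while the word budget allows.
    """
    answer_terms = tokenize(cqa_answer)
    scored = []
    for idx, sentence in enumerate(split_sentences(document)):
        overlap = len(tokenize(sentence) & answer_terms)
        scored.append((overlap, idx, sentence))

    # Highest overlap first; ties are broken by position in the document.
    scored.sort(key=lambda item: (-item[0], item[1]))

    chosen, used = [], 0
    for overlap, idx, sentence in scored:
        length = len(sentence.split())
        if overlap == 0 or used + length > max_words:
            continue  # skip irrelevant sentences and those over budget
        chosen.append((idx, sentence))
        used += length

    # Return the selected sentences in their original document order.
    return [sentence for _, sentence in sorted(chosen)]


if __name__ == "__main__":
    doc = (
        "Sourdough starter needs flour and water. "
        "The weather was nice yesterday. "
        "Feed the starter daily with equal parts flour and water. "
        "My cat likes boxes."
    )
    answer = "Mix flour and water, then feed the starter every day."
    print(answer_biased_summary(doc, answer, max_words=20))
```

The selected sentences can come from anywhere in the document, which is the point of item 4 above: the answer may be scattered rather than sitting in one passage.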


Interestingly, there is no mention of Knowledge Graphs (KGs) being used, though query expansion could be done using KGs.


I have been trying to use FAQs, which are not too different from CQAs. So overall, a very interesting paper.
