TextSum: An Automatic Text Summarization Tool using Bayesian Classification

G. Rohini, P. Krishna Subba Rao, Ch. Avinash

Abstract


Automatic text summarization is the process of summarizing text from one or more documents in order to help the reader to determine if [s]he has to read the document(s) in full. Although automatic summarization is around for many years, there is a renewed interest in this area from the government, industry, and academia. The reason is the availability of large quantities of information due to world wide web and the necessity to summarize such information. Automatic summarization involves reducing text document(s) into shorter form which conveys the same meaning as the original documents. Summarization may be done by extractive methods or abstractive methods. This paper consists of an Automatic Text Summarization Tool which takes into account several features, including sentence position, positive and negative key words, sentence centrality etc., and employs the Bayesian classification algorithm to identify the sentences that are to be included in the summary.


Keywords


Automatic text summarization, bayesian classification

Full Text:

PDF

References


Fattah Mohamed Abdel. Fuji Ren: GA, MR, FFNN, PNN and GMM based models for automatic text summarization. Computer Speech & Language. 2009; 23(1): 126–144p.

T. Hirao et al. Supervised automatic evaluation for summarization with voted regression model. Information Processing & Management. 2007; 43(6):1521–1535p.

Sjöbergh, J. Older versions of the ROUGEeval summarization evaluation system were easier to fool. Information Processing & Management. 2007; 43(6): 1500–1505p.

Lin, C.-Y., Hovy, E. Manual and automatic evaluation of summaries. In Proceedings of the ACL-02 Workshop on Automatic Summarization, Morristown, NJ, USA. 2002: 45–51p.

Yeh, J. et al. Text summarization using a trainable summarizer and latent semantic analysis. Information Processing & Management. 2005; 41(1): 75–95p.


Refbacks

  • There are currently no refbacks.


This site has been shifted to https://stmcomputers.stmjournals.com/