Unsupervised Extractive Text Summarization Using Frequency-Based Sentence Clustering

Bibliographic Details
Main Author:	Hajjar, Ali (author)
Other Authors:	Tekli, Joe (author)
Format:	conferenceObject
Published:	2022
Subjects:	Database management > Congresses; Artificial intelligence > Congresses
Online Access:	http://hdl.handle.net/10725/16287 https://doi.org/10.1007/978-3-031-15743-1_23 http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php https://link.springer.com/chapter/10.1007/978-3-031-15743-1_23

Description
Summary:	Large texts are not always entirely meaningful: they might include repetitions and useless details, and might not be easy to interpret by humans. Automatic text summarization aims to simplify text by making it shorter and (possibly) more informative. This paper describes a new solution for extractive text summarization, designed to efficiently process flat (unstructured) text. It performs unsupervised frequency-based document processing to identify the candidate sentences having the highest potential to represent informative content in the document. It introduces a dedicated feature vector representation for sentences to evaluate the relative impact of different sentence terms. The sentence feature vectors are run through a partitional k-means clustering process, to build the extractive summary based on the cluster representatives. Experimental results highlight the quality and efficiency of our approach.
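
The pipeline outlined in the summary above (frequency-based sentence vectors, k-means clustering, one representative sentence per cluster) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not specify the paper's feature vector, so the weighting used here (document-wide term frequency for each term present in a sentence) is an assumption, as are the stopword list, the naive sentence splitter, the evenly spaced centroid initialization, and every function name.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "and", "are", "be", "by", "in", "is", "it", "of", "the", "to"}

def split_sentences(text):
    # Naive splitter: break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def tokenize(sentence):
    # Lowercase alphabetic tokens with stopwords removed.
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS]

def sentence_vectors(sentences):
    # Assumed feature: a term present in a sentence is weighted by its
    # frequency in the whole document; absent terms get 0.
    doc_tf = Counter(t for s in sentences for t in tokenize(s))
    vocab = sorted(doc_tf)
    vectors = []
    for s in sentences:
        present = set(tokenize(s))
        vectors.append([float(doc_tf[t]) if t in present else 0.0 for t in vocab])
    return vectors

def sq_dist(a, b):
    # Squared Euclidean distance between two equal-length vectors.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(vectors, k, iterations=20):
    # Plain Lloyd iterations with a deterministic start: centroids are
    # copies of k evenly spaced sentence vectors.
    n = len(vectors)
    centroids = [list(vectors[(i * n) // k]) for i in range(k)]
    labels = [0] * n
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c])) for v in vectors]
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def summarize(text, k=2):
    # Pick, for each non-empty cluster, the sentence closest to the centroid,
    # then return the picks in their original document order.
    sentences = split_sentences(text)
    k = min(k, len(sentences))
    if k == 0:
        return []
    vectors = sentence_vectors(sentences)
    labels, centroids = kmeans(vectors, k)
    chosen = set()
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        if members:
            chosen.add(min(members, key=lambda i: sq_dist(vectors[i], centroids[c])))
    return [sentences[i] for i in sorted(chosen)]

if __name__ == "__main__":
    sample = ("Cats chase mice. Cats chase birds. "
              "Stocks fell sharply today. Stocks fell again.")
    for sentence in summarize(sample, k=2):
        print(sentence)
```

Here `k` sets the maximum summary length in sentences. A production system would more likely use an established k-means implementation and a proper sentence tokenizer; the hand-written versions only keep the sketch dependency-free.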
