Unsupervised Extractive Text Summarization Using Frequency-Based Sentence Clustering

Large texts are not always entirely meaningful: they might include repetitions and useless details, and might not be easy to interpret by humans. Automatic text summarization aims to simplify text by making it shorter and (possibly) more informative. This paper describes a new solution for extractiv...

Full description

Saved in:
Bibliographic Details
Main Author: Hajjar, Ali (author)
Other Authors: Tekli, Joe (author)
Format: conferenceObject
Published: 2022
Subjects:
Online Access:http://hdl.handle.net/10725/16287
https://doi.org/10.1007/978-3-031-15743-1_23
http://libraries.lau.edu.lb/research/laur/terms-of-use/articles.php
https://link.springer.com/chapter/10.1007/978-3-031-15743-1_23
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Large texts are not always entirely meaningful: they might include repetitions and useless details, and might not be easy to interpret by humans. Automatic text summarization aims to simplify text by making it shorter and (possibly) more informative. This paper describes a new solution for extractive text summarization, designed to efficiently process flat (unstructured) text. It performs unsupervised frequency-based document processing to identify the candidate sentences having the highest potential to represent informative content in the document. It introduces a dedicated feature vector representation for sentences to evaluate the relative impact of different sentence terms. The sentence feature vectors are run through a partitional k-means clustering process, to build the extractive summary based on the cluster representatives. Experimental results highlight the quality and efficiency of our approach.