Методи абстрактного реферування текстів: огляд літератури
The paper contains a literature review for automatic abstractive text summarization. The classification of abstractive text summarization methods was considered. Since the emergence of text summarization in the 1950s, techniques for summaries generation were constantly improving, but because the abs...
Gespeichert in:
| Datum: | 2019 |
|---|---|
| Hauptverfasser: | , |
| Format: | Artikel |
| Sprache: | Englisch |
| Veröffentlicht: |
The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
2019
|
| Schlagworte: | |
| Online Zugang: | https://journal.iasa.kpi.ua/article/view/188372 |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Назва журналу: | System research and information technologies |
| Завантажити файл: | |
Institution
System research and information technologies| _version_ | 1866391919039348736 |
|---|---|
| author | Shypik, D. V. Bidyuk, Petro I. |
| author_facet | Shypik, D. V. Bidyuk, Petro I. |
| author_sort | Shypik, D. V. |
| baseUrl_str | http://journal.iasa.kpi.ua/oai |
| collection | OJS |
| datestamp_date | 2020-03-02T17:05:10Z |
| description | The paper contains a literature review for automatic abstractive text summarization. The classification of abstractive text summarization methods was considered. Since the emergence of text summarization in the 1950s, techniques for summaries generation were constantly improving, but because the abstractive summarization require extensive language processing, the greatest progress was achieved only recently. Due to the current fast pace of development of both Natural Language Processing in general and Text Summarization in particular, it is essential to analyze the progress in these areas. The paper aims to give a general perspective on both the state-of-the-art and older approaches, while explaining the methods and approaches. Additionally, evaluation results of the research papers are presented. |
| doi_str_mv | 10.20535/SRIT.2308-8893.2019.4.07 |
| first_indexed | 2025-07-17T10:26:35Z |
| format | Article |
| fulltext |
D.V. Shypik, P.I. Bidyuk, 2019
66 ISSN 1681–6048 System Research & Information Technologies, 2019, № 4
UDC 004.89
DOI: 10.20535/SRIT.2308-8893.2019.4.07
A LITERATURE REVIEW OF ABSTRACTIVE
SUMMARIZATION METHODS
D.V. SHYPIK, P.I. BIDYUK
Abstract. The paper contains a literature review for automatic abstractive text sum-
marization. The classification of abstractive text summarization methods was con-
sidered. Since the emergence of text summarization in the 1950s, techniques for
summaries generation were constantly improving, but because the abstractive sum-
marization require extensive language processing, the greatest progress was
achieved only recently. Due to the current fast pace of development of both Natural
Language Processing in general and Text Summarization in particular, it is essential
to analyze the progress in these areas. The paper aims to give a general perspective
on both the state-of-the-art and older approaches, while explaining the methods and
approaches. Additionally, evaluation results of the research papers are presented.
Keywords: natural language processing, text summarization, abstractive text sum-
marization, sequence to sequence models.
INTRODUCTION
Today there is a problem of processing large amounts of text information caused
by a constantly growing volume of textual information. It is possible that this is-
sue can be addressed by using natural language processing approaches, in particu-
lar, text summarization.
The main goal of Text Summarization (TS) is to create a summary — “a re-
ductive transformation of source text to summary text through content reduction
by selection and/or generalization on what is important in the source.”
In the future, TS may be essential for users to efficiently manage informa-
tion, allowing saving time and resources, as well as to quickly find the specific
information they are looking for within documents.
TS has experienced great development in recent years, and a wide range of
techniques and paradigms have been proposed to tackle this research field.
However, to produce a summary automatically is very challenging [2].
Extraction methods reached some serious progress [3], but there is an em-
pirical limit intrinsic to pure extraction, as compared to abstraction [4]. Also,
Laura Hasler claims that the technique humans practice is to copy and paste the
same material present in the source documents [5]. However, some slight changes
are applied in most of the cases, and two types of operations, atomic and complex,
are identified, involving deletion, insertion, replacement, reordering or merging
(the first two are atomic operations while the last three are complex). From the
coherency evaluation standpoint, the results showed that 78% of the abstracts
were more coherent than extracts.
A literature review of abstractive summarization methods
Системні дослідження та інформаційні технології, 2019, № 4 67
GRAPH-BASED APPROACHES
The studies for abstractive sentence summarization used to be largely based on
sentence compression [6, 7] and sentence fusion [8, 9]. Graph-based approaches
also were very popular among older abstractive approaches, in particular, they
were shown to be very successful for producing multi-document summaries [10].
Ganesan, Zhai, and Han in their work [11] proposed Opinosis – graph-
based summarization framework, which generates abstractive summaries of opin-
ions. The system considers a high redundancy of opinions. Opinosis employs
shallow NLP. Firstly, the input text is represented as a textual directed graph. The
work introduces lexical links usage in graph building, which should help with dis-
covering new sentences or reinforcing existing ones. Then, candidate abstractive
summaries are generated by choosing various sub-paths in the graph. They are
analyzed and scored by using three unique properties of graphs (redundancy cap-
ture, gapped subsequence capture, collapsible structures), duplicated or extremely
similar paths are excluded by using similarity measure. Authors created a dataset,
consisting of reviews of various products. The evaluation of the created dataset
shows that the summaries generated by the system have a higher correlation with
human-made summaries than baseline extractive method. Evaluation conducted
on dataset created by authors, resulting in recall for ROUGE-1 – 28,3; ROUGE-2
– 8,53; ROUGE-SU4 – 8,51; F-score for ROUGE-1 – 32,7; ROUGE-2 – 9,98;
ROUGE-SU4 – 10,27. Unfortunately, as with all custom datasets, the results are
not directly comparable to other algorithms. The authors also point out that this
solution is more extractive than abstractive, as only the words from the original
text can occur in the summary. It is abstractive in the sense that generated sen-
tences are in general not from the original sentences set.
Lloret and Palomar [12] proposed to combine Graph-based abstractive ap-
proach with extractive approach (COMPENDIUM) in several ways. The graph-
based approach is used to create new sentences by computing the shortest ‘valid’
path on word graph. The validity of resulting sentences is checked using several
heuristics (sentence should have more than 3 words, one of the words is a verb,
the sentence doesn’t end in an article). The performance is tested on the DUC
2002 test set, resulting in F-Measure ROUGE-1 – 20,85; ROUGE-2 – 6,68;
ROUGE-SU4 – 7,04 for the best of proposed models. Authors state that even
though the results are not very high, the approach is promising for future research.
Banerjee and Sugiyama [13] described new multi-document abstractive
summarizer. Authors state that documents are not equal by information quantity
about the topic, so initially the system evaluates the most important document
from the set by using Lex Rank, Pairwise Cosine Similarity, and Overall Docu-
ment Collection Similarity. Then each sentence from the selected document is
used to generate separate clusters or appended to existing. Each cluster consists of
a word-graph structure. Clusters are further constructed by including sentences
from other documents that have high similarity with them. For each one,
K-shortest paths are selected, which are then used to construct the sentences by
using a proposed integer linear programming problem that maximizes information
content and linguistic quality and reduces redundancy in the final summary. The
system outperforms the best extractive summarizer by ROUGE scores on both
DUC 2004 and DUC 2005 datasets. For DUC-2004 best proposed system reached
D.V. Shypik, P.I. Bidyuk
ISSN 1681–6048 System Research & Information Technologies, 2019, № 4 68
recall for ROUGE-2 – 11,99; ROUGE-SU4 – 14,76. Other metrics were used for
DUC-2005 – recall for ROUGE-L – 35,77; ROUGE-SU4 – 12,41. Also, the work
included manual evaluation of Informativeness and Language Quality by 10 eval-
uators – the proposed system reached 4,1 / 5 in Informativeness, 3,63 / 5 in Lan-
guage Quality on randomized set of summaries.
HEURISTIC BASED METHODS
Some researches s use heuristic methods for sentence generation, for example, the
article of Ganest and Lapalme [14]. The work introduces the concept of Infor-
mation Items (INIT). INIT is “the smallest element of coherent information in the
text or a sentence”. The framework consists of the following steps:
1) INIT retrieval. Proposed: As the definition of INIT is intentionally vague
and leaves out implementation details, authors proposed two candidates for this
step: Semantic Role Labeling (SRL) and predicate-logic analysis. Implemented:
extraction of subject–verb–object triples, with tags for date and location;
2) INIT selection. Proposed and implemented: frequency-based models (as
in extractive summarization);
3) Sentence generation. Proposed: text generation patterns or heuristic rules.
Implemented: heuristic engine (implementing linguistic rules) for sentence
generation (SimpleNLG) is used.
Evaluation results on the TAC 2010 dataset are – Pyramid Score – 0,315;
Linguistic Quality – 2,17 and overall responsiveness – 2,30.
It is also interesting to note, that in fact, part of the implementation of Khan,
Salim, and Kumar [15] were close to some of the proposed ideas of Ganest and
Lapalme [14] (former proposed SRL usage, sentence generation through using
heuristics on the last step).
The proposed framework of Khan, Salim, and Kumar [15] uses heuristics
for sentence generation, genetic algorithm for information selection.
Novelty:
1) The first work to implement semantic role labeling (SRL) in multi-
document abstractive summarization. Rather than simply selecting sentences from
source documents, semantic representation is used to represent source documents;
2) Proposed clustering of the semantically similar PASes (Predicate Argu-
ment Structures) by utilizing semantic similarity measure;
3) Ranking PAS based on the features weighted and optimized by genetic
algorithm; since text features are sensitive to the quality of the generated sum-
mary.
Algorithm:
1) the document is split into sentences;
2) PASes are selected from each sentence in the document collection using
semantic role labeler (SENNA);
3) semantic similarity matrix of PAS is computed;
4) semantically similar PASes are clustered using ‘Agglomerative hierarchi-
cal clustering’ (HAC) algorithm based on the average linkage method;
5) the PASs in each cluster are scored based on features, weighted and opti-
mized by genetic algorithm. Highest ranked PASes are selected from each cluster;
A literature review of abstractive summarization methods
Системні дослідження та інформаційні технології, 2019, № 4 69
6) heuristic engine for generation sentences in English (SimpleNLG) is used
to generate sentences from argument structures.
Content selection is conducted by ranking PASes based on optimized fea-
tures (step 4, 5). In step 6 language generation is used to generate sentences from
PASes.
On DUC 2002 dataset evaluation of Pyramid Score (0,5) and Average Preci-
sion (0,7) showed that authors’ approach outperforms comparison models. Hu-
man-written summaries reached 0,69 for Pyramid Score, 0,85 for Average
Precision.
HYBRID METHODS
TOPIARY, the system described by Zajic and Dorr [16], combines sentence
compression algorithm by means of linguistically-motivated heuristics (modified
Hedge Trimmer) and Unsupervised Topic Discovery (UTD) – statistical method,
which generates a set of topics from document corpus. Hedge Trimmer algorithm
is modified in order to take a list of topics with relevance scores as additional in-
put, dynamically change compression rate in order to include highest scoring top-
ic if it is missed. The algorithm won a prize on Document Understanding Confer-
ence 2004 Workshop as best performing by ROUGE-1, ROUGE-2 and ROUGE-
L measures, scoring 24,9; 6,45; 19,95 respectively.
Compress (Clarke and Lapata, 2008) [17] – uses integer linear programming
(ILP) to infer globally optimal compressions in the presence of linguistically mo-
tivated constraints. The authors introduced the usage of global constraints, de-
signed their system to use less local syntactic knowledge. Three models are pre-
sented and compared – unsupervised, semi-supervised, and fully supervised
approaches. The results have shown that semi-supervised model with the pro-
posed constraints performing the best in terms of human-evaluated Grammatical-
ity and Importance (information content of summary). Evaluation conducted on
data-set created by author. The best scores are 3,76 for Grammaticality, 3,53 for
Importance.
Woodsend and others proposed a novel model [18], which consists of three
components. Content selection was performed by an SVM, which gave a salience
score for each phrase. To generate compressions and paraphrases the model used
Quasi-Synchronous Grammar (QG) rules. The third component, an Integer Linear
Programming ILP model combined the output of previously mentioned compo-
nents into an output summary by optimizing content selection and surface realiza-
tion preferences jointly. Similar to Clarke and Lapata, the ILP model includes
global constraints relating to sentence length, overall summary length, grammati-
cality and topics inclusion. The work is evaluated by humans on DUC-2004 head-
lines dataset, by means of Grammaticality and Importance, reaching scores 5,36
and 4,94 respectively. In comparison conducted by authors, proposed model out-
performed TOPIARY by 2,33 points in Grammaticality, 1,49 points in Impor-
tance.
Bing and others [19] proposed an abstraction-based multi-document sum-
marization system that creates new sentences by employing the proposed integer
linear optimization model. The system operates syntactic units like a noun or verb
phrases instead of whole sentences. In the first step, a noun or verb phrases are
D.V. Shypik, P.I. Bidyuk
ISSN 1681–6048 System Research & Information Technologies, 2019, № 4 70
extracted from the documents via constituency trees. Then, for each phrase, a sa-
lience score is calculated using concept-based weight incorporating position in-
formation. New sentences are created by selecting and merging phrases, insuring
their validity by solving a proposed linear optimization model. Performance eval-
uations are carried out on TAC 2011 dataset. The system reached recall for
ROUGE-2 – 11,7; ROUGE-SU4 – 14,7; F-score for ROUGE-2 – 11,7; ROUGE-
SU4 – 14,8. The average Linguistic Quality assessed for resulting summaries
was 3,43.
METHODS BASED ON SEQUENCE TO SEQUENCE MODELS
The task of abstractive sentence summarization i.e. generating a shorter sentence
version while trying to preserve its original meaning is increasingly intriguing for
researchers, especially with the development of sequence-to-sequence frame-
work [20].
Creation of Encoder-Decoder Recurrent Neural Networks and their utiliza-
tion for encoding variable sentence length with further decoding into variable sen-
tence length [21, 22] lead to significant progress in machine translation. Together
with previously proposed Bidirectional Neural [23], it led to the invention of At-
tentional Encoder-Decoder RNNs [24] and their usage in the field of abstractive
sentence summarization [25]. Also, creation of Gigaword dataset [26] had a huge
role in providing the previously inaccessible amount of training data.
Rush, Chopra, and Weston [25] proposed a fully data-driven approach to
abstractive sentence summarization (which authors call ABS – Attention-Based
Summarization). The method utilizes a local attention-based model that generates
each word of the summary conditioned on the input sentence. The work tries
several encoder architectures:
1. Bag-of-word encoder ignores properties of original order and
relationships between neighboring words. This model can capture the importance
of words, potentially can learn to combine words, but is inherently limited in
representing continuous phrases.
2. The convolutional encoder allows capturing local interactions between
words. Standard TDNN is used. Minuses: it produces a single representation
(vector) of the entire sentence, ignoring length and other differences.
3. Attention-based encoder based on the article of Bahdanau, Cho, and
Bengio [24], removes the need for single representation. The author proposes to
think of this as of “soft alignment” between input and output summary.
Generation model uses a feed-forward neural network – Neural Network
Language Model (NNLM). The encoder and the generation model are trained
jointly on the sentence summarization task. Expectedly, the Attention-based en-
coder significantly outperforms other proposed encoders and baseline.
Several other articles significantly influenced the further route of research:
1. Gated Recurring Unit (GRU) was proposed [27], as an alternative to Long
Short-Term Memory (LSTM) gated unit in RNN. The article of Chung and others
[28] has shown that proposed units are comparable or better than LSTM in the
task of sequence modeling. Also, the additive nature of both models has
advantages over classical tanh units:
A literature review of abstractive summarization methods
Системні дослідження та інформаційні технології, 2019, № 4 71
the unit can remember existence of a specific feature in the input stream
for a long sequence of steps. Forget gate of the LSTM unit or the update gate of
the GRU should specifically decide when the feature is forgotten;
the memory of these units effectively creates shortcut paths that bypass
multiple temporal steps. This property allows to back-propagate the error easily,
as it vanishes longer.
2. Large Vocabulary Trick (LVT) was proposed [29] for machine
translation. The approach is an approximate training algorithm based on (biased)
importance sampling that allows training neural models with a much larger target
vocabulary. The algorithm effectively keeps the computational complexity during
training at the level of using only a small subset of the full vocabulary. Authors
claim that the proposed approach allows us to efficiently use a fast computing
device with limited memory, such as a GPU, to train a neural machine translation
model with a much larger target vocabulary.
In “sequence-to-sequence RNNS for text summarization” Nallapati,
Xiang and Zhou [30] demonstrated that the sequence-to-sequence models are ex-
tremely promising for summarization. Full seq2seq model is used, as previously
proposed [24], with encoder Bidirectional GRU-RNN as and Unidirectional At-
tentional GRU-RNN as the decoder. Their experiments showed that LVT-trick
significantly improves training speed without sacrificing performance. Also, more
classic features, like PoS tags, named-entity tags, TF-, IDF that are encoded to-
gether with words provided an additional performance improvement. Results of
this works have shown that the proposed model outperforms the previous state-of-
art [25] on Gigaword, DUC 2003, DUC 2004 corpora.
During further research Nallapati and others in their work “Abstractive text
summarization using sequence-to-sequence RNNs and beyond” [31] proposed
Switching Pointer-Generator to avoid the generation of “UNK” token (a token
that is generated by most summarization systems, which try to generate a word
that is out of their training dictionary). Authors show, that even though the model
learns to use pointers very accurately not only for named entities but also for mul-
ti-word phrases, the performance improvement of the overall model is not signifi-
cant. It is proposed, that model impact may be clearer in other document sets,
where the tail distribution of rare words is heavier. Hierarchical attention model
is proposed (attention not only on the word level, but also on sentence level), but
it didn’t show a significant difference on all datasets. Also the work introduced a
new large scale dataset “CNN/Daily Mail”, which is very important due to the
lack of the former.
The next year, same authors published new work [32] which proposed new
extractive approach and compared the results with their previous model on the
CNN/Daily Mail dataset using ROUGE metric. The comparison showed that ex-
tractive model significantly outperformed abstractive on the longer texts.
See, Liu, and Manning pointed [33], that even though the system proposed
in “Abstractive text summarization using sequence-to-sequence RNNs and be-
yond” [31] reached a new level of accuracy, it still has typical drawbacks of se-
quence to sequence models – it inaccurately reproduces factual details, has high
level of repeating themselves and not always deals with out-of-vocabulary (OOV)
words. The work tries to address this issue through a proposed novel variant of
coverage vector and modified pointer-generator network. In contrast to the
D.V. Shypik, P.I. Bidyuk
ISSN 1681–6048 System Research & Information Technologies, 2019, № 4 72
abovementioned work where pointer-generator network is activated only for OOV
words or named entities, the model is allowed to learn when to apply the network.
The proposed model is evaluated to have higher both ROUGE and METEOR
scores than the predecessor, was shown to reduce inaccuracies and repetition.
In “Selective encoding for abstractive sentence summarization” [34] it is
pointed, that unlike machine translation, where alignment between all parts of
input and output is required, there is no explicit alignment between input sen-
tences in sentence summarization. The challenge is not to infer the alignment, but
to select the highlights and filter secondary information. To solve this task, it is
proposed to extend seq2seq framework with additional selective encoding model.
The result consists of sentence encoder (bidirectional GRU), and attention
equipped decoder (attentional GRU). The selective gate network constructs a sec-
ond level sentence representation by controlling the information flow from the
encoder to the decoder. Also, the proposed layer was shown to perform as ex-
pected – it has highlighted the representation of important words from the input
sentence. The model is evaluated on the Gigaword, DUC 2004 and MSR abstrac-
tive sentence summarization datasets. The proposed selective encoding model
outperformed the state-of-art baseline models.
As it was shown, seq2seq framework performance quickly deteriorates with
a length of the generated sequence [27]. So, in “Retrieve, rerank and rewrite: Soft
template based neural summarization.” authors point that similar sentences should
hold similar summary patterns, so existing summaries can be used as “soft tem-
plates” to guide the seq2seq model [20]. As the name of the article hints, the
model has three steps:
1. Retrieve step: Popular IR platform (Lucene) is used to retrieve candidate
templates. The system finds out analogies of the given sentence in the corpus and
picks their summaries as candidate templates. Recurrent Neural Network (RNN)
encoder is applied to convert the input sentence and each candidate template into
hidden states, encoder output is shared by “rerank” and “rewrite” steps.
2. Rerank: In Retrieve, the template candidates are ranked according to the
text similarity between the corresponding indexed sentences and the input
sentence. However, for the summarization task, the soft template is expected to
resemble the actual summary as much as possible. So, this step tries to choose the
“closest” template to the processed sentence.
3. Rewrite: summary generation. The summary is generated according to the
hidden states of both the sentence and template. Concatenation function is applied
to combine the hidden states of the sentence and template.
By comparison to other strategies of choosing soft templates, the authors
show that proposed “Rerank” step has a room for improvement. Also, the quality
of the summaries depends on the quality of the imported external summaries,
which shows that soft templates themselves, not other architecture changes have
great importance. In total, the proposed model significantly outperforms the state-
of-art seq2seq models, and even soft templates themselves demonstrate high
competitiveness.
Inspired by results of “Retrieve, rerank and rewrite” [20] with soft templates
Wang and others [35] proposed a new model called BiSET (Bi-directional Se-
lective Encoding with Template for Abstractive Summarization) to enhance soft
template usage in text summarization. The work introduces:
A literature review of abstractive summarization methods
Системні дослідження та інформаційні технології, 2019, № 4 73
1. A novel bi-directional selective mechanism with two gates to mutually
select important information from both article and template to assist with
summary generation. The mechanism is inspired by the research of “bidirectional
attention flow mechanism” in machine reading comprehension [36] and the
selective mechanism described earlier [34].
2. Fast Rerank method to automatically select high-quality templates from
the training corpus. The method is based on Convolutional Encoder, Similarity
Matrix, and Pooling layer.
F-1 scores for ROUGE-1, ROUGE-2, ROUGE-L were used as metrics. Re-
moval of template-to-article attention or article-to-template attention lead to re-
duction in all used metrics, showing that every part of proposed bi-directional se-
lective mechanism is improving performance. Comparison with simple
‘Concatenation’ approach to soft templates showed that proposed selective mech-
anism significantly outperformed in all metrics.
Results of the works, presented in this section are shown in Tables 1–4.
T a b l e 1 . Comparison of methods’ performance on Gigaworld dataset, full-length
ROUGE F1
Model ROUGE-1 ROUGE-2 ROUGE-L
ABS (Rush 2015) [25] 29,55 11,32 26,42
ABS+ (Rush 2015) [25] 29,78 11,89 26,97
CA2s2 (Chopra 2016) 33,78 15,97 31,15
FeatSeq2Seq (Nallapati 2016) [30] 32,67 15,59 30,64
SEASS (Zhou 2017) [34] 36,15 17,54 33,63
R^3 Sum (Cao 2018) [20] 37,04 19,03 34,46
BiSET (Wang 2019) [35] 39,11 19,78 36,87
T a b l e 2 . Comparison of methods’ performance on Gigaworld dataset, ROUGE
Recall (output capped at 75 bytes)
Model ROUGE-1 ROUGE-2 ROUGE-L
Compress (Clarke and Lapata, 2008) * [17] 19,63 5,13 18,28
ABS (Rush 2015) [25] 30,88 12,22 27,77
ABS+ (Rush 2015) [25] 31,00 12,65 28,34
* provided in the work of Rush and others [25], not in original work
T a b l e 3 . Comparison of methods’ performance on DUC-2004 dataset,
ROUGE Recall (output capped at 75 bytes)
Model ROUGE-1 ROUGE-2 ROUGE-L
Compress (Clarke and Lapata) * [17] 19,77 4,02 17,3
W&L (Woodsend) * [18] 22 6 17
TOPIARY (Zajic) * [19] 25,12 6,46 20,12
ABS (Rush) [25] 26,55 7,06 22,05
ABS+ (Rush) [25] 28,18 8,49 23,81
FeatSeq2Seq (Nallapati 2016) [30] 28,35 9,46 24,59
SEASS (Zhou 2017) [34] 29,21 9,56 25,51
* provided in the work of Rush and others [25], not in original work.
D.V. Shypik, P.I. Bidyuk
ISSN 1681–6048 System Research & Information Technologies, 2019, № 4 74
T a b l e 4 . Comparison of methods’ performance on CNN/Daily Mail dataset,
ROUGE Recall (output capped at 75 bytes)
Model ROUGE-1 ROUGE-2 ROUGE-L
Nallapati 2016 [30] 35,46 13,30 32,65
Nallapati 2017 (extractive) [32] 39,6 16,2 35,3
Nallapati 2017 (abstractive) [32] 37,5 14,5 33,4
See 2017 ** [33] 39,53 17,28 36,38
** uses non-anonymized dataset, so not directly comparable.
CONCLUSIONS
This paper introduces some important information concerning both state-of-art
and older approaches of abstractive text summarization. This review could serve
as a starting point for novice researchers to get familiar with the field. Modern
data-driven methods had more focus, as they are mostly left out from the topic
literature reviews, and also as they tend to give better results and have pretty sub-
stantial differences from the older approaches, that were more structure-based or
linguistic-focused. However, in contrary to more classical models, sequence to
sequence-based ones tend to “lose control” on the long text samples, and also they
require much bigger datasets. As we can see, Gigaword dataset is dominating in
training new models, as older ‘DUC’ and ‘TAC’ datasets do not provide the de-
sired sample quantity.
To summarize, further research seems to be aligned with improving the qual-
ity of the sequence to sequence models on the long texts or proposing an alterna-
tive machine learning method.
REFERENCES
1. Jones S.K. Automatic summarizing: factors and directions [Електронний ресурс] /
S.K. Jones // MIT Press. — 1999. — Available at: https://www.cl.cam.ac.uk/
archive/ksj21/ksjdigipapers/summbook99.pdf.
2. Multi-document summarization by sentence extraction [Електронний ресурс] /
J.S. Goldstein, V. Mittal, J.G. Carbonell, M. Kantrowitz. — 2000. — Available at::
http://scholar.google.com.ua/scholar_url?url=https://kilthub.cmu.edu/articles/Mul
ti-Document_Summarization_By_Sentence_Extraction/6624470/files/
12121496.pdf&hl=uk&sa=X&scisig=AAGBfm3dUni3D9yq1qbG7bN3z4ow9Ch
pyA&nossl=1&oi=scholarr.
3. Lloret E. Text summarisation in progress: a literature review / E. Lloret, M. Palomar
// Artificial Intelligence Review. — 2011. — N 37. — P. 1–41.
4. Genest P. HEXTAC: the Creation of a Manual Extractive Run [Електронний
ресурс] / P. Genest, G. Lapalme, M. Yousfi-Monod. — 2009. — Available at:
http://www.mymcorner.net/files/Genest-Lapalme-Yousfi-Monod-09.pdf.
5. Hasler L. From extracts to abstracts: human summary production operations for
computer-aided summarisation [Електронний ресурс] / L. Hasler. — 2007. —
Available at: http://rgcl.wlv.ac.uk/events/CALP07/papers/10.pdf.
6. Erkan G. Lexrank: graph-based lexical centrality as salience in text summarization /
G. Erkan, D. Radev // Journal of Artificial Intelligence Research. — 2004. —
Vol. 22. — P. 457–479.
7. A perspective-based approach for solving textual entailment recognition
[Електронний ресурс] / O.Ferrandez, D. Micol, R. Munoz, M. Palomar. —
A literature review of abstractive summarization methods
Системні дослідження та інформаційні технології, 2019, № 4 75
2007. — Available at: https://www.researchgate.net/profile/Joao_Cordeiro11/
publication/234803426_Biology_Based_Alignments_of_Paraphrases_for_Senten
ce_Compression/links/560fa81b08ae0fc513ef311e/Biology-Based-Alignments-
of-Paraphrases-for-Sentence-Compression.pdf#page=80.
8. Filippova K. Multi-Sentence Compression: Finding Shortest Paths in Word Graphs
[Електронний ресурс] / K. Filippova. — 2010. — Available at:
https://www.aclweb.org/anthology/C10-1037.pdf.
9. Gardner J. An integrated framework for de-identifying unstructured medical data
[Електронний ресурс] / J. Gardner, L. Xiong // Elsevier. — 2009. — Available
at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.157.7185&rep=
rep1&type=pdf.
10. Hliaoutakis A. The AMTEx approach in the medical document indexing and
retrieval application / A. Hliaoutakis, K. Zervanou, E. Petrakis // Data &
Knowledge Engineering. — 2009. — N 68. — P. 380–392.
11. Ganesan K. Opinosis: A Graph Based Approach to Abstractive Summarization of
Highly Redundant Opinions [Електронний ресурс] / K. Ganesan, C. Zhai,
H. Jiawei. — 2010. — Available at: https://www.aclweb.org/anthology/C10-
1039.pdf.
12. Lloret E. Analyzing the Use of Word Graphs for Abstractive Text Summarization
[Електронний ресурс] / E. Lloret, M. Palomar. — 2011. — Available at:
https://pdfs.semanticscholar.org/7d25/67d7eefc772865992e93996c1cd7f6ba6319
.pdf.
13. Banerjee S. Multi-document abstractive summarization using ILP based
multisentence compression [Електронний ресурс] / S. Banerjee, P. Mitra,
K. Sugiyama. — 2015. — Available at: https://www.ijcai.org/Proceedings/
15/Papers/174.pdf.
14. Ganest P. Framework for Abstractive Summarization — Available at:
https://pdfs.semanticscholar.org/fdf9/e7d06bf21093e29923742d2040b0e495bc1d
.pdf.
15. Khan A. A framework for multi-document abstractive summarization based on
semantic role labelling / A. Khan, N. Salim, Jaya Kumar Y. // Appl Soft Comput
30:737–747. — 2015. — doi:10.1016/j.asoc.2015.01.070
16. Zajic D. BBN/UMD at DUC-2004:Topiary [Електронний ресурс] / D. Zajic,
B. Dorr, R. Schwartz. — 2004. — Available at: http://users.umiacs.umd.edu/
~bonnie/Publications/Attic/DUC2004-HEADLINE.pdf.
17. Clarke J. Global inference for sentence compression an integer linear programming
approach / J. Clarke, M. Lapata // Journal of Artificial Intelligence Research. —
2008. — N 31. — P. 399–429.
18. Woodsend K. Title Generation with Quasi-Synchronous Grammar [Електронний
ресурс] / K. Woodsend, Y. Feng, M. Lapata. — 2010. — Available at:
https://www.research.ed.ac.uk/portal/files/23634327/2010_Woodsend_Feng_ET
_AL_Title_Generation_with_Quasi_Synchronous_Grammar.pdf.
19. Abstractive Multi-Document Summarization via Phrase Selection and Merging
[Електронний ресурс] / L. Bing, P. Li, Y. Liao et al. — 2015. — Available at:
https://www.cs.cmu.edu/~lbing/pub/acl2015-bing.pdf.
20. Retrieve, Rerank and Rewrite: Soft Template Based Neural Summarization
[Електронний ресурс] / Z. Cao, W. Li, F. Wei, S. Li. — 2018. — Available at:
https://pdfs.semanticscholar.org/c93b/8518204ef722f4c749628023c6d5d061a5fa
.pdf.
21. Learning Phrase Representations using RNN Encoder–Decoderfor Statistical
Machine Translation [Електронний ресурс] / K. Cho, B. Merrienboer,
C. Gulcehre et al. — 2014. — Available at: https://www.aclweb.org/anthology/
D14-1179.pdf.
22. Sutskever I. Sequence to Sequence Learning with Neural Networks [Електронний
ресурс] / I. Sutskever, O. Vinyals, Q.V. Le. — 2014. — Available at:
D.V. Shypik, P.I. Bidyuk
ISSN 1681–6048 System Research & Information Technologies, 2019, № 4 76
https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-
networks.pdf.
23. Schuster M. Bidirectional recurrent neural networks / M. Schuster, K. Paliwal //
IEEE Transactions on Signal Processing. — 1997. — N 45. — P. 2673–2681.
24. Bahdanau D. Neural machine translation by jointly learning to align and translate
[Електронний ресурс] / D. Bahdanau, K. Cho, Y. Bengio. — 2015. —
Available at: https://arxiv.org/pdf/1409.0473.pdf.
25. Rush A.M. A Neural Attention Model for Sentence Summarization [Електронний
ресурс] / A.M. Rush, S. Chopra, J. Weston. — 2015. — Available at:
https://www.aclweb.org/anthology/D15-1044.pdf.
26. Napoles C. Annotated Gigaword [Електронний ресурс] / C. Napoles, M. Gormley,
B. Van Durme. — 2012. — Available at: https://www.cs.cmu.edu/~mgormley/
papers/napoles+gormley+van-durme.naaclw.2012.pdf.
27. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
[Електронний ресурс] / K.Cho, B. Van Merrienboer, D. Bahdanau,
Y. Bengio. — 2014. — Available at: https://arxiv.org/pdf/1409.1259.pdf.
28. Empirical evaluation of gated recurrent neural networks on sequence modeling
[Електронний ресурс] / J.Chung, Ç. Gülçehre, K. Hyun Cho, Y. Bengio. —
2014. — Available at: https://arxiv.org/pdf/1412.3555.pdf.
29. On Using Very Large Target Vocabulary forNeural Machine Translation
[Електронний ресурс] / S. Jean, K. Cho, R. Memisevic, Y. Bengio. — 2014. —
Available at: https://www.aclweb.org/anthology/P15-1001.pdf.
30. Nallapati R. Sequence-to-sequence RNNs for text summarization [Електронний
ресурс] / R. Nallapati, B. Xiang, B. Zhou. — 2016. — Available at:
https://pdfs.semanticscholar.org/033b/c4febf590f6e011e9b0f497cadfe6a4c292d.pdf.
31. Abstractive text summarization using sequence-to-sequence RNNs and beyond
[Електронний ресурс] / R. Nallapati, B. Zhou, C. Santos et al. — 2016. —
Available at: https://arxiv.org/pdf/1602.06023.pdf.
32. Nallapati R. SummaRuNNer: A Recurrent Neural Network based Sequence Model
for Extractive Summarization of Documents [Електронний ресурс] /
R. Nallapati, F. Zhai, B. Zhou. — 2017. — Available at: https://arxiv.org/pdf/
1611.04230.
33. See A. Get To The Point: Summarization with Pointer-Generator Networks
[Електронний ресурс] / A. See, P.J. Liu, C.D. Manning. — 2017. — Available
at: https://arxiv.org/pdf/1704.04368.
34. Selective Encoding for Abstractive Sentence Summarization [Електронний ресурс]
/ Q. Zhou, N. Yang, F. Wei, M. Zhou. — 2017. — Available at:
https://www.aclweb.org/anthology/P17-1101.pdf.
35. Wang K. BiSET: Bi-directional Selective Encoding with Template for Abstractive
Summarization [Електронний ресурс] / K. Wang, X. Quan, R. Wang. —
2017. — Available at: https://www.aclweb.org/anthology/P19-1207.pdf.
36. Bidirectional attention flow for machine comprehension [Електронний ресурс] /
M. Seo, A. Kembhavi, A. Farhadi, H. Hajishirzi. — 2017. — Available at:
https://arxiv.org/pdf/1611.01603.pdf.
Received 25.09.2019
From the Editorial Board: the article corresponds completely to submitted manuscript.
|
| id | journaliasakpiua-article-188372 |
| institution | System research and information technologies |
| keywords_txt_mv | keywords |
| language | English |
| last_indexed | 2025-07-17T10:26:35Z |
| publishDate | 2019 |
| publisher | The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" |
| record_format | ojs |
| resource_txt_mv | journaliasakpiua/d8/36865fd0f5ea458c2474477a168f50d8.pdf |
| spelling | journaliasakpiua-article-1883722020-03-02T17:05:10Z A literature review of abstractive summarization methods Методы абстрактного реферирования текстов: обзор литературы Методи абстрактного реферування текстів: огляд літератури Shypik, D. V. Bidyuk, Petro I. обробка природної мови реферування тексту абстрактне реферування тексту sequence to sequence моделі natural language processing text summarization abstractive text summarization sequence to sequence models обработка естественного языка реферирование текста абстрактное реферирование текста sequence to sequence модели The paper contains a literature review for automatic abstractive text summarization. The classification of abstractive text summarization methods was considered. Since the emergence of text summarization in the 1950s, techniques for summaries generation were constantly improving, but because the abstractive summarization require extensive language processing, the greatest progress was achieved only recently. Due to the current fast pace of development of both Natural Language Processing in general and Text Summarization in particular, it is essential to analyze the progress in these areas. The paper aims to give a general perspective on both the state-of-the-art and older approaches, while explaining the methods and approaches. Additionally, evaluation results of the research papers are presented. Содержит обзор литературы, посвященной методам абстрактного реферирования текстов. Рассмотрена классификация методов абстрактного реферирования. С появлением методов реферирования текстов в 1950-х гг. техники создания рефератов постоянно улучшались, но поскольку абстрактное реферирование требует мощных техник для обработки/генерации текста, наибольший прогресс был достигнут в последние годы. Текущее быстрое развитие в сфере как обработки естественного языка в целом, так и автоматического реферирования в частности делает особенно необходимым анализ прогресса в этой сфере. Дано общее представление как про более предыдущие подходы, так и про новые, включая обьяснение методов и подходов.Дополнительно представлены оценки методов, предложенных в рассмотреных работах. Містить огляд літератури, присвяченої методам абстрактного реферування текстів. Розглянуто класифікацію методів абстрактного реферування. Із появою методів реферування текстів у 1950-х рр. техніки створення рефератів постійно покращувались, але оскільки абстрактне реферування потребує потужних технік для оброблення/генерації тексту, найбільший прогрес досягнуто в останні роки. Поточний швидкий розвиток у сфері як оброблення природної мови в цілому, так і автоматичного реферування зокрема робить особливо необхідним аналіз прогресу в цій сфері. Надано загальне уявлення як про попередні підходи, так і найновіші, включаючи пояснення методів і підходів. Додатково подано кількісні оцінки методів, запропонованих в оглянутих джерелах. The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2019-12-23 Article Article application/pdf https://journal.iasa.kpi.ua/article/view/188372 10.20535/SRIT.2308-8893.2019.4.07 System research and information technologies; No. 4 (2019); 66-76 Системные исследования и информационные технологии; № 4 (2019); 66-76 Системні дослідження та інформаційні технології; № 4 (2019); 66-76 2308-8893 1681-6048 en https://journal.iasa.kpi.ua/article/view/188372/190127 Copyright (c) 2021 System research and information technologies |
| spellingShingle | обробка природної мови реферування тексту абстрактне реферування тексту sequence to sequence моделі Shypik, D. V. Bidyuk, Petro I. Методи абстрактного реферування текстів: огляд літератури |
| title | Методи абстрактного реферування текстів: огляд літератури |
| title_alt | A literature review of abstractive summarization methods Методы абстрактного реферирования текстов: обзор литературы |
| title_full | Методи абстрактного реферування текстів: огляд літератури |
| title_fullStr | Методи абстрактного реферування текстів: огляд літератури |
| title_full_unstemmed | Методи абстрактного реферування текстів: огляд літератури |
| title_short | Методи абстрактного реферування текстів: огляд літератури |
| title_sort | методи абстрактного реферування текстів: огляд літератури |
| topic | обробка природної мови реферування тексту абстрактне реферування тексту sequence to sequence моделі |
| topic_facet | обробка природної мови реферування тексту абстрактне реферування тексту sequence to sequence моделі natural language processing text summarization abstractive text summarization sequence to sequence models обработка естественного языка реферирование текста абстрактное реферирование текста sequence to sequence модели |
| url | https://journal.iasa.kpi.ua/article/view/188372 |
| work_keys_str_mv | AT shypikdv aliteraturereviewofabstractivesummarizationmethods AT bidyukpetroi aliteraturereviewofabstractivesummarizationmethods AT shypikdv metodyabstraktnogoreferirovaniâtekstovobzorliteratury AT bidyukpetroi metodyabstraktnogoreferirovaniâtekstovobzorliteratury AT shypikdv metodiabstraktnogoreferuvannâtekstívoglâdlíteraturi AT bidyukpetroi metodiabstraktnogoreferuvannâtekstívoglâdlíteraturi AT shypikdv literaturereviewofabstractivesummarizationmethods AT bidyukpetroi literaturereviewofabstractivesummarizationmethods |