Determining the weights of links in networks of terms

One of the most urgent problems in natural language processing is considered: the formalization and creation of ontological models of subject domains based on thematic text corpora. Using text mining and natural language processing, together with linguo-statistical methods and computational...

Bibliographic Details
Date: 2019
Main authors: Lande, D. V., Dmytrenko, O. O.
Format: Article
Language: Ukrainian
Published: Інститут проблем реєстрації інформації НАН України, 2019
Online access: http://drsp.ipri.kiev.ua/article/view/199357
Journal title: Data Recording, Storage & Processing

Institution

Data Recording, Storage & Processing
id drspiprikievua-article-199357
record_format ojs
spelling drspiprikievua-article-199357
2020-03-31T09:02:39Z
Determining the weights of links in networks of terms
Визначення вагових значень зв’язків у мережі термінів
Lande, D. V.
Dmytrenko, O. O.
інформаційний простір
глибинний аналіз тексту
мережева модель
предметна область
мережа термінів
граф горизонтальної видимості
ненаправлена мережа термінів
направлена зважена мережа термінів
information space
Text Mining
terminological ontology
subject domain
network of terms
horizontal visibility graph
undirected networks of terms
directed weighted networks of terms
One of the most urgent problems in natural language processing is considered: the formalization and creation of ontological models of subject domains based on thematic text corpora. Using text mining and natural language processing, together with linguo-statistical methods and computational linguistics, network models of subject domains have been created to provide better interaction between human communicative acts presented in sign and verbal form and computer systems. A new approach is proposed for determining the weights of links in a network of terms that correspond to certain concepts of the subject domain under consideration. In particular, as a trial of the approach, a terminological ontology of the subject domain related to the climate emergency has been created. Further analysis of the created model made it possible to determine the most influential and significant links between the corresponding nodes in the networks of terms, which in turn correspond to certain concepts of the subject domain. The Python programming language and selected functions of the NLTK module (the open-source Natural Language Toolkit library) were used to implement the proposed and considered approaches and methods in software. The resulting directed networks of terms were visualized with Gephi, a software tool for modelling and visualizing graphs. The weighted directed networks of terms built according to the proposed approach can be used to create terminological ontologies of subject domains automatically, with the participation of experts. The results can also be used to create personal search interfaces for users of information retrieval systems and in database navigation systems, which should simplify the search for relevant information. Tabl.: 2. Fig.: 2. Refs: 18 titles.
One of the most pressing problems in the computer analysis of natural language is considered: the formalization and construction of ontological models of subject domains from text corpora on a given topic. Through text mining and natural language processing, using linguo-statistical methods and techniques of computational linguistics, network models of subject domains have been built to provide better interaction between communicative acts presented in sign and verbal form and computer systems. In particular, by applying a new approach to determining the weights of links in a network of concepts, an ontological model of the subject domain related to the climate emergency was built as a trial. Further analysis of the model made it possible to identify the most influential and significant links between the corresponding nodes in the network of terms, which correspond to certain concepts of the subject domain under consideration.
Інститут проблем реєстрації інформації НАН України
2019-12-24
Article
application/pdf
http://drsp.ipri.kiev.ua/article/view/199357
10.35681/1560-9189.2019.21.4.199357
Data Recording, Storage & Processing; Vol. 21 No. 4 (2019); 40-48
Регистрация, хранение и обработка данных; Том 21 № 4 (2019); 40-48
Реєстрація, зберігання і обробка даних; Том 21 № 4 (2019); 40-48
1560-9189
uk
http://drsp.ipri.kiev.ua/article/view/199357/199720
Copyright (c) 2021 Реєстрація, зберігання і обробка даних
institution Data Recording, Storage & Processing
baseUrl_str
datestamp_date 2020-03-31T09:02:39Z
collection OJS
language Ukrainian
topic information space
Text Mining
terminological ontology
subject domain
network of terms
horizontal visibility graph
undirected networks of terms
directed weighted networks of terms
spellingShingle information space
Text Mining
terminological ontology
subject domain
network of terms
horizontal visibility graph
undirected networks of terms
directed weighted networks of terms
Lande, D. V.
Dmytrenko, O. O.
Determining the weights of links in networks of terms
topic_facet інформаційний простір
глибинний аналіз тексту
мережева модель
предметна область
мережа термінів
граф горизонтальної видимості
ненаправлена мережа термінів
направлена зважена мережа термінів
information space
Text Mining
terminological ontology
subject domain
network of terms
horizontal visibility graph
undirected networks of terms
directed weighted networks of terms
format Article
author Lande, D. V.
Dmytrenko, O. O.
author_facet Lande, D. V.
Dmytrenko, O. O.
author_sort Lande, D. V.
title Determining the weights of links in networks of terms
title_short Determining the weights of links in networks of terms
title_full Determining the weights of links in networks of terms
title_fullStr Determining the weights of links in networks of terms
title_full_unstemmed Determining the weights of links in networks of terms
title_sort determining the weights of links in networks of terms
title_alt Визначення вагових значень зв’язків у мережі термінів
description One of the most urgent problems in natural language processing is considered: the formalization and creation of ontological models of subject domains based on thematic text corpora. Using text mining and natural language processing, together with linguo-statistical methods and computational linguistics, network models of subject domains have been created to provide better interaction between human communicative acts presented in sign and verbal form and computer systems. A new approach is proposed for determining the weights of links in a network of terms that correspond to certain concepts of the subject domain under consideration. In particular, as a trial of the approach, a terminological ontology of the subject domain related to the climate emergency has been created. Further analysis of the created model made it possible to determine the most influential and significant links between the corresponding nodes in the networks of terms, which in turn correspond to certain concepts of the subject domain. The Python programming language and selected functions of the NLTK module (the open-source Natural Language Toolkit library) were used to implement the proposed and considered approaches and methods in software. The resulting directed networks of terms were visualized with Gephi, a software tool for modelling and visualizing graphs. The weighted directed networks of terms built according to the proposed approach can be used to create terminological ontologies of subject domains automatically, with the participation of experts. The results can also be used to create personal search interfaces for users of information retrieval systems and in database navigation systems, which should simplify the search for relevant information. Tabl.: 2. Fig.: 2. Refs: 18 titles.
publisher Інститут проблем реєстрації інформації НАН України
publishDate 2019
url http://drsp.ipri.kiev.ua/article/view/199357
work_keys_str_mv AT landedv determiningtheweightsoflinksinnetworksofterms
AT dmytrenkooo determiningtheweightsoflinksinnetworksofterms
AT landedv viznačennâvagovihznačenʹzvâzkívumerežítermínív
AT dmytrenkooo viznačennâvagovihznačenʹzvâzkívumerežítermínív
first_indexed 2025-07-17T10:57:48Z
last_indexed 2025-07-17T10:57:48Z
_version_ 1850411409536450560
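
The abstract in the description field names Python, horizontal visibility graphs, and weighted directed networks of terms, but the record gives no formula for the link weights. As a rough illustration only, the sketch below builds a horizontal visibility graph over a token sequence and counts the visibility links between distinct terms as link weights. The use of raw term frequency as the series value, the earlier-to-later edge direction, and the function names are all assumptions made for this example, not the authors' method. Tokens are assumed to be already tokenized, for instance with NLTK.

```python
from collections import Counter


def horizontal_visibility_edges(series):
    """Return index pairs (i, j), i < j, that see each other horizontally.

    Two points are linked when every value strictly between them is
    lower than both endpoints (the usual horizontal visibility rule).
    """
    edges = []
    n = len(series)
    for i in range(n - 1):
        blocker = float("-inf")  # highest value seen strictly between i and j
        for j in range(i + 1, n):
            if blocker < min(series[i], series[j]):
                edges.append((i, j))
            blocker = max(blocker, series[j])
            if blocker >= series[i]:
                break  # nothing further to the right is visible from i
    return edges


def weighted_term_network(tokens):
    """Collapse the visibility graph onto terms and count links as weights.

    Assumption for illustration: the series value of a token is its raw
    frequency in the sequence, and edges run from the earlier token to
    the later one. Links between two occurrences of the same term are dropped.
    """
    tf = Counter(tokens)
    series = [tf[t] for t in tokens]
    weights = Counter()
    for i, j in horizontal_visibility_edges(series):
        if tokens[i] != tokens[j]:
            weights[(tokens[i], tokens[j])] += 1
    return weights


if __name__ == "__main__":
    sample = ["climate", "emergency", "climate", "emergency"]
    for (src, dst), w in sorted(weighted_term_network(sample).items()):
        print(src, "->", dst, w)
```

The resulting dictionary of (source, target) pairs with integer weights could then be written out as an edge list for a graph tool such as Gephi.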