A method for extracting data from semistructured documents
A linguistic method for extracting data from weakly structured documents is developed, tested, and described in detail in the paper. Sample data were taken from the thesis catalogue of the Vernadsky National Library of Ukraine. The sequence of all stages is described: document collection...
| Date: | 2020 |
|---|---|
| Main Authors: | Kudim, K.A., Proskudina, G.Yu. |
| Format: | Article |
| Language: | Russian |
| Published: | PROBLEMS IN PROGRAMMING, 2020 |
| Subjects: | |
| Online Access: | https://pp.isofts.kiev.ua/index.php/ojs1/article/view/388 |
| Journal Title: | Problems in programming |
Institution
Problems in programming
Similar Items
Methods and tools for extracting personal data from theses abstracts
by: Kudim, K.A., et al.
Published: (2019)
Extracting structure from text documents based on machine learning
by: Kudim, K.A., et al.
Published: (2023)
About technologies of use of external data on creating and editing of encyclopedic texts
by: Proskudina, G.Yu., et al.
Published: (2018)
Mixed topic-entity ontology for enhanced topic vector-spaced model
by: Shabinskiy, A.S.
Published: (2025)
Overview of global open access resource aggregation services and their requirements for data providers
by: Proskudina, G.Yu., et al.
Published: (2025)
Global open access resource aggregation services and their requirements for data providers
by: Proskudina, G.Yu., et al.
Published: (2024)
Decompositional Extraction and Retrieval of Conceptual Knowledge
by: Terletskyi, D.O., et al.
Published: (2023)
Use of domain ontology for homonymy clarification into the natural language texts
by: Lesko, O.N., et al.
Published: (2018)
Review of methods of events extraction «from the stream of news»
by: Pryshchepa, S. V.
Published: (2015)
The technology of new events extraction on a defined topic from Twitter social network
by: Pryshchepa, S. V.
Published: (2017)
A method of tuning programs on .Net platform with rewriting rules
by: Mamedov, T.A., et al.
Published: (2019)
The main functional blocks of the test bench for the archival electronic documents validation
by: Melaschenko, A.O., et al.
Published: (2018)
A method for extracting data from semistructured documents
by: K. A. Kudim, et al.
Published: (2020)
INTERSTELLAR MEDIUM AND DECAMETER RADIO SPECTROSCOPY
by: Stepkin, S. V., et al.
Published: (2021)
CREATING THE RT-32 RADIO TELESCOPE ON THE BASIC OF MARK-4B ANTENNA SYSTEM. 2. ESTIMATION OF THE POSSIBILITY FOR MAKING SPECTRAL OBSERVATIONS OF RADIO ASTRONOMICAL OBJECTS
by: Antyufeyev, A. V., et al.
Published: (2019)
Ontological similar systems for analysis of texts of natural language
by: Kryvyi, S.L., et al.
Published: (2018)
Automated extraction of structured information from a variety of web pages
by: Pogorilyy, S.D., et al.
Published: (2018)
The definition of formal languages in the meta language of normal forms of knowledge
by: Kurgaev, A.F., et al.
Published: (2018)
Satellite monitoring for the areas of illegal extraction of amber
by: Filipovich, Volodymyr
Published: (2015)
UWN: The ontological base of knowledge of the Ukrainian language
by: Anisimov, A.V., et al.
Published: (2015)
Actual problems of long-term preservation of documentation in insurance fund of documentation of Ukraine
by: Podorozhnyi, V. I.
Published: (2016)
PROSPECTS TO THERMAL WATERS EXTRACTION AT ILLICHIVSK OF ODESA REGION
by: DIDKIVSKA, G.G., et al.
Published: (2013)
Analysis of formal models and standards for structured electronic document in corporate informational system
by: Sharypanov, A.V., et al.
Published: (2018)
Estimation Method for Compatibility of Normative Documents
by: Mezentsev, O. V.
Published: (2014)
Anti-proliferative effects of a blueberry extract on a panel of tumor cell lines of different origin
by: Lamdan, H., et al.
Published: (2023)
DIRECTIVITY OF ANTENNA ARRAYS
by: Bulgakova, A. A., et al.
Published: (2016)
The implementation of legal electronic documents
by: Melaschenko, A.O., et al.
Published: (2015)
Methods and tools for extracting personal data from theses abstracts
by: K. A. Kudim, et al.
Published: (2019)
Performance analysis of a new LP stage located upstream the extraction point in a 225 MW turbine
by: Шиманяк, М., et al.
Published: (2016)
Metastatic cardiac tumors: literature review and own observation of testicular tumor metastasis in the right ventricle of the heart
by: Zakhartseva, L.M., et al.
Published: (2018)
On equivalence of some subcategories of modules in Morita contexts
by: Kashu, A. I.
Published: (2018)
Some issues of registration and reproduction of information touching upon objects of material and spiritual culture using technologies of the state insurance documentation fund of Ukraine.
by: Babenko, V. V., et al.
Published: (2019)
Scientific documents metadata as a component of the system of the “open science” information resources
by: Zakharova, O.V.
Published: (2023)
Estimation of influence of technical re-equipment of the coal-mining enterprises on volumes of extraction and change of the coal prime cost
by: Rublevsky N.T., et al.
Published: (2004)
PECULIARITIES OF FORMING OF COAL SEAMS OF DEEP HORIZONS OF LVIV-VOLUN BASIN. Paper 2. Visean coal seam ʋ03
by: SHULGA, V.F., et al.
Published: (2013)
Methods and software for significant indicators determination of the natural language texts author profile
by: Shynkarenko, V.I., et al.
Published: (2023)
Antiproliferative and apoptotic effect of ethanolic extract of Calocybe indica on PANC-1 and MIAPaCa2 cell lines of pancreatic cancer
by: Ghosh, S.K., et al.
Published: (2023)
An approach of intelligent searching of information in texts
by: Chebanuyk, O.V.
Published: (2023)
Structure and mechanical properties of vacuum-arc multilayer condensates of nitrides of titanium and its alloys
by: A. V. Demchishin, et al.
Published: (2014)