2025-02-24T08:07:53-05:00 DEBUG: VuFindSearch\Backend\Solr\Connector: Query fl=%2A&wt=json&json.nl=arrarr&q=id%3A%22journaliasakpiua-article-33341%22&qt=morelikethis&rows=5
2025-02-24T08:07:53-05:00 DEBUG: VuFindSearch\Backend\Solr\Connector: => GET http://localhost:8983/solr/biblio/select?fl=%2A&wt=json&json.nl=arrarr&q=id%3A%22journaliasakpiua-article-33341%22&qt=morelikethis&rows=5
2025-02-24T08:07:53-05:00 DEBUG: VuFindSearch\Backend\Solr\Connector: <= 200 OK
2025-02-24T08:07:53-05:00 DEBUG: Deserialized SOLR response
Модель вторинних некорельованих семантичних полів для анализу текстових даних
The model of derived uncorrelated semantic fields generated by the method of principal components and singular decomposition of the matrix of semantic fields frequencies has been considered. This model describes a new semantic space with orthonormal basis of displaying text documents. The dimension...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | Ukrainian |
Published: |
The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
2014
|
Online Access: | http://journal.iasa.kpi.ua/article/view/33341 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The model of derived uncorrelated semantic fields generated by the method of principal components and singular decomposition of the matrix of semantic fields frequencies has been considered. This model describes a new semantic space with orthonormal basis of displaying text documents. The dimension of the space of derived semantic fields is significantly less than the dimension of the space of initial semantic fields as a result of replacement of interconnected components by uncorrelated semantic characteristics. The analysis of the test sample of text documents showed the possibility to take into consideration only those components of secondary semantic fields which are described by the first singular numbers. The use of the low-dimension orthonormal basis of derived semantic fields can be effective in the problems of the text data classification and clustering. |
---|