A research of invisible errors in bibliographic data input and its impact on the quality and search accessibility
The research is devoted to a special class of errors in bibliographic data input into automated library information system, which is invisible to users, but affects the functioning of the electronic catalog system. The cause of the problem is the misuse of visually similar Latin characters where Cyr...
Збережено в:
Дата: | 2021 |
---|---|
Автор: | |
Формат: | Стаття |
Мова: | Ukrainian |
Опубліковано: |
Інститут проблем реєстрації інформації НАН України
2021
|
Теми: | |
Онлайн доступ: | http://drsp.ipri.kiev.ua/article/view/239252 |
Теги: |
Додати тег
Немає тегів, Будьте першим, хто поставить тег для цього запису!
|
Назва журналу: | Data Recording, Storage & Processing |
Репозитарії
Data Recording, Storage & ProcessingРезюме: | The research is devoted to a special class of errors in bibliographic data input into automated library information system, which is invisible to users, but affects the functioning of the electronic catalog system. The cause of the problem is the misuse of visually similar Latin characters where Cyrillic characters should have been and vice versa.
The study is based on bibliographic information collected from 141 public libraries in Kyiv for the period from 1993 to 2021 (obtained from two sources). This allows fully explore the features of the problem, its prevalence and impact on the functioning of automated library information system and its OPAC module.
Attention is drawn to the text fields common in search and identification tasks — «Book Title», «Author», «Publisher».
The investigation provides one by information about: 1) the method of automatic error identification is applied; 2) prevalence of errors by type and their percentage in each source; 3) the impact of errors on the search; 4) the impact of errors on the search for duplicates; 5) distribution of errors by symbols; 6) errors and use of reference tables;
The research has shown that all characters with the same appearance are used incorrectly. The frequency of use of symbols differs significantly. There are many mistakes related to Cyrillic using in Roman numerals. Often some part of the number is written in Cyrillic and some part in Latin. But it affects comparison more than search.
The conclusions state that this class of errors affects the search accessibility of hundreds of book records in the libraries of Kyiv and provide suggestions for measures to eliminate and prevent errors in the future. Some records correspond to several real books, so there are thousands of real books in different libraries. The problem can be solved only with software using. Effective prevention is possible with the appropriate improvements of automated library information systems. Tabl.: 4. Fig.: 3. Refs: 8 titles. |
---|