1. Using digitised documents as a source for machine learning
- Author
-
Stančić, Hrvoje, Seljan, Sanja, and Dunđer, ivan
- Subjects
digitisation, OCR, ML - Abstract
Information and communication technologies have changed the way of communication, users’ habits and expectations in all types of settings (business, education, entertainment, service production, tourism, manufacture, …), but also set new requirements for institutions. One of them is online access to digitised authentic materials, which creates added value for users and institutions.The process includes selection, digitisation, format processing, annotation and information retrieval conducted on the collection of materials from the Archives of the Faculty of Humanities and Social Sciences, University of Zagreb, consisting of minutes from the Faculty Council meetings from 1874 until the digital era. Results confirm the presented process as one of possible solutions but requires planned strategy and interdisciplinary approach.
- Published
- 2020