Back to Search Start Over

Using digitised documents as a source for machine learning

Authors :
Stančić, Hrvoje
Seljan, Sanja
Dunđer, ivan
Publication Year :
2020

Abstract

Information and communication technologies have changed the way of communication, users’ habits and expectations in all types of settings (business, education, entertainment, service production, tourism, manufacture, …), but also set new requirements for institutions. One of them is online access to digitised authentic materials, which creates added value for users and institutions.The process includes selection, digitisation, format processing, annotation and information retrieval conducted on the collection of materials from the Archives of the Faculty of Humanities and Social Sciences, University of Zagreb, consisting of minutes from the Faculty Council meetings from 1874 until the digital era. Results confirm the presented process as one of possible solutions but requires planned strategy and interdisciplinary approach.

Subjects

Subjects :
digitisation, OCR, ML

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.57a035e5b1ae..16bd0d6d4655cfbe3a8d045825583d53