TY - JOUR AU - Lehtonen, Tommi AU - Piukkula, Juha PY - 2020/03/31 Y2 - 2024/03/19 TI - Automaattinen asiasanoitus Radio- ja televisio-ohjelmatietokanta Ritvassa JF - Informaatiotutkimus JA - INF VL - 39 IS - 1 SE - Katsaukset DO - 10.23978/inf.88107 UR - https://journal.fi/inf/article/view/88107 SP - 27–45 AB - <p>National Audiovisual Institute’s (KAVI) radio and television archive started a joint project with the Finnish broadcasting company (Yle) and the National Library of Finland to develop automated indexing using program subtitles as a source. Project relies on Annif tool originally developed by Osma Suominen. Annif is built upon a combination of existing natural language processing and machine learning tools. It is designed to be multilingual and it can support any subject vocabulary.&nbsp; Annif can use several different backends. During the spring and summer of 2019, 313 Yle programmes were jointly annotated by KAVI and Yle for Annif testing. Analysis was made using a cross-validation technique. It was noted that television programme may be produced so that the central theme is not mentioned at all.&nbsp; When a brief programme description was included, the results improved. Results and quality were promising and the project will continue.</p> ER -