{"id":16617,"date":"2024-09-20T12:35:34","date_gmt":"2024-09-20T10:35:34","guid":{"rendered":"https:\/\/is.ijs.si\/?p=16617"},"modified":"2025-03-26T13:23:00","modified_gmt":"2025-03-26T12:23:00","slug":"borrowing-words-transfer-learning-for-reported-speech-detection-in-slovenian-news-texts","status":"publish","type":"post","link":"https:\/\/is.ijs.si\/?p=16617","title":{"rendered":"Borrowing Words: Transfer Learning for Reported Speech Detection in Slovenian News Texts"},"content":{"rendered":"\n<p>Zoran Fijav\u017e<\/p>\n<p><strong>Abstract<\/strong><br \/>This paper describes the development of a reported speech clas-<br \/>sifier for Slovenian news texts using transfer learning. Due to a<br \/>lack of Slovenian training data, multilingual models were trained<br \/>on English and German reported speech datasets, reaching an<br \/>F-score of 66.8 on a small manually annotated Slovenian news<br \/>dataset and a manual error analysis was performed. While the<br \/>developed model captures many aspects of reported speech, fur-<br \/>ther refinement and annotated data would be needed to reliably<br \/>predict less frequent instances, such as indirect speech and nom-<br \/>inalizations.<\/p>\n<p>\u00a0<\/p>\n\n\n\n<div data-wp-interactive=\"core\/file\" class=\"wp-block-file\"><object data-wp-bind--hidden=\"!state.hasPdfPreview\" hidden class=\"wp-block-file__embed\" data=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_21-1.pdf\" type=\"application\/pdf\" style=\"width:100%;height:600px\" aria-label=\"Embed of IS2024_-_SIKDD_2024_paper_21-1.\"><\/object><a id=\"wp-block-file--media-d159786c-0478-4e6d-a312-9884a7cadf28\" href=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_21-1.pdf\">IS2024_-_SIKDD_2024_paper_21-1<\/a><a href=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_21-1.pdf\" class=\"wp-block-file__button wp-element-button\" download aria-describedby=\"wp-block-file--media-d159786c-0478-4e6d-a312-9884a7cadf28\">Download<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":29,"featured_media":24966,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[109,102],"tags":[],"class_list":["post-16617","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-doi-sikdd-2024","category-papers"],"_links":{"self":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16617","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/users\/29"}],"replies":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=16617"}],"version-history":[{"count":3,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16617\/revisions"}],"predecessor-version":[{"id":25013,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16617\/revisions\/25013"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/media\/24966"}],"wp:attachment":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=16617"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=16617"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=16617"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}