{"id":16560,"date":"2024-09-20T12:15:16","date_gmt":"2024-09-20T10:15:16","guid":{"rendered":"https:\/\/is.ijs.si\/?p=16560"},"modified":"2025-03-26T13:14:34","modified_gmt":"2025-03-26T12:14:34","slug":"predicting-pronunciation-types-in-the-sloleks-morphological-lexicon-of-slovene","status":"publish","type":"post","link":"https:\/\/is.ijs.si\/?p=16560","title":{"rendered":"Predicting Pronunciation Types in the Sloleks Morphological Lexicon of Slovene"},"content":{"rendered":"\n<p>Jaka \u010cibej<\/p>\n<p>Abstract<br \/>In the paper, we present an experiment in automatic prediction<br \/>of pronunciation types for lemmas in the Sloleks Morphological<br \/>Lexicon of Slovene. We perform a statistical analysis on a num-<br \/>ber of mostly n-gram-based features and use a set of statistically<br \/>significant features to train and test several machine learning<br \/>models to discriminate between lemmas for which pronuncia-<br \/>tion transcription can be generated automatically using Slovene<br \/>grapheme-to-phoneme (G2P) conversion rules (e.g. Novak), and<br \/>lemmas with pronunciation that follows other G2P rules (e.g.<br \/>Shakespeare).<\/p>\n<p>\u00a0<\/p>\n\n\n\n<div data-wp-interactive=\"core\/file\" class=\"wp-block-file\"><object data-wp-bind--hidden=\"!state.hasPdfPreview\" hidden class=\"wp-block-file__embed\" data=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_2-1.pdf\" type=\"application\/pdf\" style=\"width:100%;height:600px\" aria-label=\"Embed of IS2024_-_SIKDD_2024_paper_2-1.\"><\/object><a id=\"wp-block-file--media-692e899f-2bc0-4d80-bed9-2a445628aeb0\" href=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_2-1.pdf\">IS2024_-_SIKDD_2024_paper_2-1<\/a><a href=\"https:\/\/is.ijs.si\/wp-content\/uploads\/2024\/10\/IS2024_-_SIKDD_2024_paper_2-1.pdf\" class=\"wp-block-file__button wp-element-button\" download aria-describedby=\"wp-block-file--media-692e899f-2bc0-4d80-bed9-2a445628aeb0\">Download<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":29,"featured_media":24966,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[109,102],"tags":[],"class_list":["post-16560","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-doi-sikdd-2024","category-papers"],"_links":{"self":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16560","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/users\/29"}],"replies":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=16560"}],"version-history":[{"count":2,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16560\/revisions"}],"predecessor-version":[{"id":16924,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/posts\/16560\/revisions\/16924"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=\/wp\/v2\/media\/24966"}],"wp:attachment":[{"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=16560"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=16560"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/is.ijs.si\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=16560"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}