Advanced search options

Advanced Search Options 🞨

Browse by author name (“Author name starts with…”).

Find ETDs with:

in
/  
in
/  
in
/  
in

Written in Published in Earliest date Latest date

Sorted by

Results per page:

You searched for id:"handle:1946/37025". One record found.

Search Limiters

Last 2 Years | English Only

No search limiters apply to these results.

▼ Search Limiters


University of Iceland

1. Hinrik Hafsteinsson 1994-. A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation .

Degree: 2020, University of Iceland

This thesis describes the development of a dedicated, high-accuracy part-of-speech (PoS) tagging solution for Faroese. To achieve this, a state-of-the-art neural PoS tagger for Icelandic, ABLTagger, was trained on the 100,000 word Sosialurin PoS-tagged corpus for Faroese, standardised with methods previously applied to Icelandic corpora. This tagger was supplemented with a novel Experimental Database of Faroese Inflection (EDFM), which contains morphological information on 67,488 Faroese words with about one million inflectional forms. This approach produced a PoS-tagging model for Faroese which achieves a 91.40% overall accuracy when evaluated with 10-fold cross validation, which is currently the highest accuracy for a dedicated Faroese PoS-tagging implementation. The tagging model, morphological database, proposed revised PoS tagset for Faroese as well as a revised and standardised Sosialurin corpus are all presented as products of this project and are made available for use in further research in Faroese language technology.; Þessi ritgerð lýsir þróun nákvæms málfræðimarkara fyrir færeysku. Til að ná slíku fram var íslenski tauganetsmarkarinn ABLTagger, sem hefur náð besta birta árangri í íslenskri málfræðimörkun, þjálfaður á færeyskri markaðri málheild sem kennd er við dagblaðið Sosialurin og inniheldur u.þ.b. 100.000 lesmálsorð. Færeyska mörkunarlíkanið notast við nýja Bráðabirgðabeygingarlýsingu færeysks nútímamáls (BBFN) til að betrumbæta mörkunina en beygingarlýsingin inniheldur beygingargögn fyrir um 67,488 færeysk orð, samtals u.þ.b. milljón stakar beygingarmyndir. Þessi aðferð skilaði mörkunarlíkani fyrir færeysku sem nær 91,40% mörkunarnákvæmni, sem er besti birti árangur í sjálfvirkri málfræðimörkun á færeysku. Mörkunarlíkanið, beygingarlýsingin, tillaga að endurbættu færeysku markamengi og yfirfarin Sosialurin málheild eru allt afurðir þessa verkefnis og eru gerðar aðgengilegar, svo þær megi nýtast sem best í frekari rannsóknum í færeyskri máltækni. Efnisorð: Færeyska, Máltækni, Málfræðimörkun, Tauganet

Subjects/Keywords: Máltækni

Record DetailsSimilar RecordsGoogle PlusoneFacebookTwitterCiteULikeMendeleyreddit

APA · Chicago · MLA · Vancouver · CSE | Export to Zotero / EndNote / Reference Manager

APA (6th Edition):

1994-, H. H. (2020). A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation . (Thesis). University of Iceland. Retrieved from http://hdl.handle.net/1946/37025

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Chicago Manual of Style (16th Edition):

1994-, Hinrik Hafsteinsson. “A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation .” 2020. Thesis, University of Iceland. Accessed September 20, 2020. http://hdl.handle.net/1946/37025.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

MLA Handbook (7th Edition):

1994-, Hinrik Hafsteinsson. “A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation .” 2020. Web. 20 Sep 2020.

Vancouver:

1994- HH. A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation . [Internet] [Thesis]. University of Iceland; 2020. [cited 2020 Sep 20]. Available from: http://hdl.handle.net/1946/37025.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Council of Science Editors:

1994- HH. A Faroese part-of-speech tagger built with Icelandic methods. Data preperation, training and evaluation . [Thesis]. University of Iceland; 2020. Available from: http://hdl.handle.net/1946/37025

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

.