language-icon Old Web
English
Sign In

MIM-GOLD 20.05 - train/test

2020 
Training and testing sets from MIM-GOLD 20.05, which is a gold standard for PoS-tagging Icelandic texts. This new version uses a revised tagset. The gold standard contains approximately 1 million running words with manually annotated PoS-tags. The texts are from The Tagged Icelandic Corpus (MIM), which was published in 2013. The tagset was revised in 2019-2020. It builds upon a tagging scheme created for the Icelandic Frequency Dictionary in 1991. All changes to the tagging scheme are described in the package. ----------- Þjalfunar- og profunargogn ur MIM-GULL 20.05 sem er gullstaðall fyrir morkun islenskra texta. Þessi nýja utgafa notast við endurskoðað markamengi. Gullstaðallinn inniheldur u.þ.b. 1 milljon orða og morkin eru handyfirfarin. Textarnir eru ur Markaðri islenskri malheild (MIM), sem var gefin ut 2013. Markamengið var endurskoðað 2019-2020. Það byggir a markaskra sem var gerð fyrir Islenska orðtiðnibok arið 1991. Ollum breytingum a markamenginu er lýst i skra sem fylgir gullstaðlinum.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []