The Faroese Parsed Historical Corpus

2020 
The Faroese Parsed Historical Corpus (FarPaHC) is a manually corrected treebank, parsed according to the annotation guidelines of The Penn Parsed Corpora of Historical English (PPCHE) and The Icelandic Parsed Historical Corpus (IcePaHC), with minor modifications that are specific to Faroese. It consists of 53,000 words in three texts from the 19th and 20th century, all religious biblical texts. The file format is labeled bracketing as in the Penn Treebank with a UTF-8 encoding. The corpus is released under a CC BY 4.0 license. Sogulegi faereyski trjabankinn (FarPaHC) er handleiðrettur trjabanki sem er greindur samkvaemt þattunarskema sogulegu ensku Penn-trjabankanna (Penn Parsed Corpora of Historical English; PPCHE) og Sogulega islenska trjabankans (IcePaHC), þo með nokkrum breytingum til samraemis við faereyska malfraeði. Bankinn inniheldur 53.000 orð i þremur textum fra 19. og 20. old sem allir eru truarlegir bibliutextar. Skraarsniðið er svigasnið (e. labeled bracketing) eins og i Penn-trjabankanum og textinn er i UTF-8-stafasetti. Malheildinni er dreift með CC BY 4.0-leyfi.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    2
    Citations
    NaN
    KQI
    []