The Spoken/Written Language Classification of English Sentences with Bilingual Information *

2013 
To alleviate the problem with Chinese being poor at telling the difference between spoken and written English which is important for learning and using the language, we propose to classify English sentences with bilingual information into the two categories automatically. Based on the text categorization technology, we explore a variety of features, including words, statistics and their combinations, and find that a classification accuracy nearly 95% can be achieved in the open test through Chinese characters + sentence length + average syllable number, or other similar combinations.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    0
    Citations
    NaN
    KQI
    []