Gradually Improving the Computation of Semantic Textual Similarity in Portuguese

Hugo Gonçalo Oliveira,Ana Oliveira Alves,Ricardo J. Rodrigues

Gradually Improving the Computation of Semantic Textual Similarity in Portuguese

2017

Hugo Gonçalo Oliveira
Ana Oliveira Alves
Ricardo J. Rodrigues

There is much research on Semantic Textual Similarity (STS) in English, specially since its inclusion in the SemEval evaluations. For other languages, it is not as common, mostly due to the unavailability of benchmarks. Recently, the ASSIN shared task targeted STS in Portuguese and released training and test collections. This paper describes an incremental approach to ASSIN, where the computed similarity is gradually improved by exploiting different features (e.g., token overlap, semantic relations, chunks, and negation) and approaches. The best reported results, obtained with a supervised approach, would get second place overall in ASSIN.

Keywords:

Computation
Semantic similarity
Unavailability
Machine learning
Negation
Artificial intelligence
SemEval
Security token
Computer science
Portuguese

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations