An Analysis of Capsule Networks for Part of Speech Tagging in High- and Low-resource Scenarios

Andrew Zupon,Faiz Rafique,Mihai Surdeanu

An Analysis of Capsule Networks for Part of Speech Tagging in High- and Low-resource Scenarios

2020

Neural networks are a common tool in NLP, but it is not always clear which architecture to use for a given task. Different tasks, different languages, and different training conditions can all affect how a neural network will perform. Capsule Networks (CapsNets) are a relatively new architecture in NLP. Due to their novelty, CapsNets are being used more and more in NLP tasks. However, their usefulness is still mostly untested.In this paper, we compare three neural network architecturesLSTM, CNN, and CapsNeton a part of speech tagging task. We compare these architectures in both high- and low-resource training conditions and find that no architecture consistently performs the best. Our analysis shows that our CapsNet performs nearly as well as a more complex LSTM under certain training conditions, but not others, and that our CapsNet almost always outperforms our CNN. We also find that our CapsNet implementation shows faster prediction times than the LSTM for Scottish Gaelic but not for Spanish, highlighting the effect that the choice of languages can have on the models.

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations