Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks

Vitalii Zhelezniak,Dan Busbridge,April Shen,Samuel L. Smith,Nils Y. Hammerla

Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks

2018

Vitalii Zhelezniak
Dan Busbridge
April Shen
Samuel L. Smith
Nils Y. Hammerla

Experimental evidence indicates that simple models outperform complex deep networks on many unsupervised similarity tasks. Introducing the concept of an optimal representation space, we provide a simple theoretical resolution to this apparent paradox. In addition, we present a straightforward procedure that, without any retraining or architectural modifications, allows deep recurrent models to perform equally well (and sometimes better) when compared to shallow models. To validate our analysis, we conduct a set of consistent empirical evaluations and introduce several new sentence embedding models in the process. Even though this work is presented within the context of natural language processing, the insights are readily applicable to other domains that rely on distributed representations for transfer tasks.

Keywords:

Artificial intelligence
Decoding methods
Embedding
Machine learning
Unsupervised learning
Retraining
Computer science
Feature learning
Sentence
encoder decoder

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations