Neural Interaction Transparency (NIT): Disentangling Learned Interactions for Improved Interpretability

2018 
Neural networks are known to model statistical interactions, but they entangle the interactions at intermediate hidden layers for shared representation learning. We propose a framework, Neural Interaction Transparency (NIT), that disentangles interactions by counteracting the shared learning across different interactions to obtain their intrinsic lower-order and interpretable structure. This is done through a novel regularizer that directly penalizes interaction order. We show that disentangling interactions reduces a feedforward neural network to a generalized additive model with interactions, which can lead to transparent models that perform comparably to state-of-the-art models. NIT is also flexible and efficient; it can learn generalized additive models with maximum K-order interactions by training only O(1) models.
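The abstract does not spell out the form of the interaction-order regularizer, so the sketch below shows one plausible way such a penalty on a feedforward network's first layer might look: each hidden unit's interaction order is approximated by a soft count of the input features it connects to with non-negligible weight, and orders above a budget K are penalized. The function name `order_penalty`, the tanh-based soft count, and the hyperparameters are illustrative assumptions, not the paper's exact formulation.

```python
# Minimal illustrative sketch (PyTorch) of an interaction-order penalty on the
# first layer of a feedforward network. Assumption: a hidden unit's interaction
# order is approximated by softly counting the inputs with non-negligible
# incoming weight; only the excess over `max_order` (K) is penalized.

import torch
import torch.nn as nn


def order_penalty(first_layer: nn.Linear, max_order: int, sharpness: float = 10.0) -> torch.Tensor:
    """Penalize hidden units whose soft interaction order exceeds `max_order`."""
    # Each row of `weight` holds one hidden unit's incoming weights over all inputs.
    # tanh(sharpness * |w|) ~ 1 for non-negligible weights and ~ 0 otherwise,
    # so summing over inputs approximates how many features the unit interacts with.
    soft_active = torch.tanh(sharpness * first_layer.weight.abs())  # (hidden, inputs)
    soft_order = soft_active.sum(dim=1)                             # per-unit order estimate
    excess = torch.clamp(soft_order - max_order, min=0.0)           # penalize only orders > K
    return excess.sum()


# Usage sketch: add the penalty to the task loss so training drives each hidden
# unit toward depending on at most ~K input features.
model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))
x, y = torch.randn(64, 10), torch.randn(64, 1)
loss = nn.functional.mse_loss(model(x), y) + 0.1 * order_penalty(model[0], max_order=2)
loss.backward()
```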