Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

Xilun Chen; Yu Sun; Ben Athiwaratkun; Claire Cardie; Kilian Weinberger

Vol. 6 (2018)

TACL approved

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

Published 2018-12-31

Xilun Chen
Yu Sun
Ben Athiwaratkun
Claire Cardie
Kilian Weinberger

Xilun Chen
Cornell University

Yu Sun
Cornell University

Ben Athiwaratkun
Cornell University

Claire Cardie
Cornell University

Kilian Weinberger
Cornell University

Abstract

In recent years great success has been achieved in sentiment classification for English, thanks in part to the availability of copious annotated resources. Unfortunately, most languages do not enjoy such an abundance of labeled data. To tackle the sentiment classification problem in low-resource languages without adequate annotated data, we propose an Adversarial Deep Averaging Network (ADAN1) to transfer the knowledge learned from labeled data on a resource-rich source language to low-resource languages where only unlabeled data exist. ADAN has two discriminative branches: a sentiment classifier and an adversarial language discriminator. Both branches take input from a shared feature extractorto learn hidden representations that are simultaneously indicative for the classification task and invariant across languages. Experiments on Chinese and Arabic sentiment classification demonstrate that ADAN significantly outperforms state-of-the-art systems.

Article at MIT Press (presented at EMNLP 2018)