Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models. (arXiv:1911.09762v1 [cs.CL])
In this paper, we propose to use pre-trained features from end-to-end ASR models to solve the speech sentiment analysis problem as a down-stream task. We show that end-to-end ASR features, which integrate both acoustic and text information from speech, achieve promising results. We use RNN with self-attention as the sentiment classifier, which also provides an…