In this paper, we propose an end to end RNN architecture for Multimodal Sentiment Analysis and Emotion Detection. Our proposed model, at the time of writing, out-performs the state of the art on a benchmark dataset on a variety of accuracy and regression metrics