Publications
(2016). Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding.
CoRR, abs/1606.01847.
(2015). A Dataset for Movie Description.
(2014). Coherent Multi-Sentence Video Description with Variable Level of Detail.
8753, 184-195.
