Learning joint representations of videos and sentences with web image search8月 1, 2016·Mayu Otani,Yuta Nakashima,Esa Rahtu,Janne Heikkilä,Naokazu Yokoya· 0 分で読める 引用 DOIタイプ学会論文収録Proc. Workshop on Web-scale Vision and Social Media最終更新 8月 1, 2016 ← Video summarization using deep semantic features 9月 1, 2016Human action recognition-based video summarization for RGB-D personal sports video 7月 1, 2016 →