TREC Video Retrieval Evaluation 

Partial bibliography of peer-reviewed journal and conference papers
based on TRECVID resources

(comprising mainly work publicly accessible via the ACM Digital
Library and IEEE Explorer)


2022 (59)
------------------------------------------------------------------
Karbalaie, Abdolamir, Farhad Abtahi, and Mårten Sjöström. "Event detection in surveillance videos: a review." Multimedia tools and applications 81.24 (2022): 35463-35501.

Chavate, Shrikant, and Ravi Mishra. "Efficient detection of abrupt transitions using statistical methods." ECS Transactions 107.1 (2022): 6541.

Roomi, Mohamed Mansoor, and Saurav Gupta. "Pyramidal-Relative Entropy Based Temporal Signature for Video Transition Detection using LSTM." (2022).

Dave, Ishan, et al. "Gabriellav2: Towards better generalization in surveillance videos for action detection." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022.

Hu, Fan, et al. "Lightweight attentional feature fusion: A new baseline for text-to-video retrieval." European Conference on Computer Vision. Cham: Springer Nature Switzerland, 2022.

Yu, Lijun, et al. "Argus++: Robust real-time activity detection for unconstrained video streams with overlapping cube proposals." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022.

Du, Yunhao, et al. "Pami-ad: An activity detector exploiting part-attention and motion information in surveillance videos." 2022 IEEE International Conference on Multimedia and Expo Workshops (ICMEW). IEEE, 2022.

Chakraborty, Saptarshi, et al. "ALO-SBD: a hybrid shot boundary detection technique for video surveillance system." Edge Analytics: Select Proceedings of 26th International Conference—ADCOM 2020. Singapore: Springer Singapore, 2022.

Lin, Qiubin, Wenming Cao, and Zhiquan He. "Level-wise aligned dual networks for text–video retrieval." EURASIP Journal on Advances in Signal Processing 2022.1 (2022): 58.

Nandini, H. M., H. K. Chethan, and B. S. Rashmi. "Shot based keyframe extraction using edge-LBP approach." Journal of King Saud University-Computer and Information Sciences 34.7 (2022): 4537-4545.

Benoughidene, Abdelhalim, and Faiza Titouna. "A novel method for video shot boundary detection using CNN-LSTM approach." International Journal of
Multimedia Information Retrieval 11.4 (2022): 653-667.

Chakraborty, Saptarshi, Alok Singh, and Dalton Meitei Thounaojam. "A novel bifold-stage shot boundary detection algorithm: invariant to motion and illumination." The Visual Computer 38.2 (2022): 445-456.

Lebron, Luis, et al. "Evaluation of Automatically Generated Video Captions Using Vision and Language Models." 2022 IEEE International Conference on Image Processing (ICIP). IEEE, 2022.

Jose, Jasmin T., et al. "Efficient shot boundary detection with multiple visual representations." Mobile Information Systems 2022 (2022).

Mishra, Ravi. "Hybrid feature extraction and optimized deep convolutional neural network based video shot boundary detection." Concurrency and Computation: Practice and Experience 34.25 (2022): e7256.

Naveen Kumar, G. S., and V. S. K. Reddy. "High performance algorithm for content-based video retrieval using multiple features." Intelligent Systems and Sustainable Computing: Proceedings of ICISSC 2021. Singapore: Springer Nature Singapore, 2022. 637-646.

Kalaivani, A., and S. Anusuya. "The Detection of Video Shot Transitions Based on Primary Segments Using the Adaptive Threshold of Colour-Based Histogram Differences and Candidate Segments Using the SURF Feature Descriptor." Symmetry 14.10 (2022): 2041.

Singh, Alok, Thoudam Doren Singh, and Sivaji Bandyopadhyay. "V2t: video to text framework using a novel automatic shot boundary detection algorithm." Multimedia Tools and Applications 81.13 (2022): 17989-18009.

Deotale, Disha, et al. "Optimized hybrid RNN model for human activity recognition in untrimmed video." Journal of Electronic Imaging 31.5 (2022): 051409-051409.

Zhang, Binyu, et al. "Multi-actor activity detection by modeling object relationships in extended videos based on deep learning." Engineering Applications of Artificial Intelligence 114 (2022): 105055.

Hamroun, Mohamed, Karim Tamine, and Benoît Crespin. "Multimodal video indexing (mvi): A new method based on machine learning and semi-automatic annotation on large video collections." International Journal of Image and Graphics 22.02 (2022): 2250022.

Lokoč, Jakub, et al. "A task category space for user-centric comparative multimedia search evaluations." International conference on multimedia modeling. Cham: Springer International Publishing, 2022.

Zhang, Yue, Chao Liang, and Longxiang Jiang. "Confidence-Aware Active Feedback for Interactive Instance Search." IEEE Transactions on Multimedia (2022).

Sandeep, R., and Bora K. Prabin. "Application of Perceptual Video Hashing for Near-duplicate Video Retrieval." Evolutionary Computing and Mobile Sustainable Networks: Proceedings of ICECMSN 2021. Singapore: Springer Singapore, 2022. 253-275.

Zare, Samin, and Mehran Yazdi. "A Survey on Semi-Automated and Automated Approaches for Video Annotation." 2022 12th International Conference on Computer and Knowledge Engineering (ICCKE). IEEE, 2022.

Kaur, Lakhwinder, and Pankaj Kumar Mishra. "Estimation of concise video summaries from long sequence videos using deep learning via LSTM."

Khan, Omar Shahbaz, Jan Zahálka, and Björn Þór Jónsson. "Influence of Late Fusion of High-Level Features on User Relevance Feedback for Videos." Proceedings of the 2nd International Workshop on Interactive Multimedia Retrieval. 2022.

Ma, Zhixin, et al. "Reinforcement learning-based interactive video search." International Conference on Multimedia Modeling. Cham: Springer International Publishing, 2022.

Wu, Weifei. "Multi-source selection transfer learning with privacy-preserving." Neural Processing Letters 54.6 (2022): 4921-4950.

Prabavathy, A. Kethsy, M. Mythily, and J. A. M. Rexie. "Object based Video Retrieval with multiple features matching approach." 2022 8th International Conference on Advanced Computing and Communication Systems (ICACCS). Vol. 1. IEEE, 2022.

Nijhawan, Rahul, et al. "Gun identification from gunshot audios for secure public places using transformer learning." Scientific reports 12.1 (2022): 13300.

Balaji, Avantika, et al. "Shot Boundary Detection and Video Captioning Using Neural Networks." Disruptive Technologies for Big Data and Cloud Applications:
Proceedings of ICBDCC 2021. Singapore: Springer Nature Singapore, 2022. 277-285.

Heller, Silvan, et al. "Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown." International Journal of Multimedia Information Retrieval 11.1 (2022): 1-18.

Mai, Tien-Dung, Tien Do, and Duy-Dinh Le. "A Framework for Evaluating Video Summary Approaches." 2022 International Conference on Multimedia Analysis and Pattern Recognition (MAPR). IEEE, 2022.

Harrando, Ismail. Representation, information extraction, and summarization for automatic multimedia understanding. Diss. Sorbonne Université, 2022.

Perez-Martin, Jesus, et al. "A comprehensive review of the video-to-text problem." Artificial Intelligence Review (2022): 1-75.

Allouche, Mohamed, and Mihai Mitrea. "Video fingerprinting: Past, present, and future." Frontiers in Signal Processing 2 (2022): 984169.

Liang, Guoqiang, et al. "Video summarization with a dual-path attentive network." Neurocomputing 467 (2022): 1-9.

Khan, Shakir, and Lulwah AlSuwaidan. "Agricultural monitoring system in video surveillance object detection using feature extraction and classification by deep learning techniques." Computers and Electrical Engineering 102 (2022): 108201.

Behera, Nayan Kumar Subhashis, et al. "Person re-identification: A taxonomic survey and the path ahead." Image and Vision Computing 122 (2022): 104432.

Yu, Qinghao, et al. "SUM-GAN-GEA: Video Summarization Using GAN with Gaussian Distribution and External Attention." Electronics 11.21 (2022): 3523.

Harzig, Philipp. "Automatic generation of natural language descriptions of visual data: describing images and videos using recurrent and self-attentive models." (2022).

Apostolidis, Evlampios, et al. "Summarizing videos using concentrated attention and considering the uniqueness and diversity of the video frames." Proceedings of the 2022 International Conference on Multimedia Retrieval. 2022.

Khan, Omar Shahbaz, et al. "Exquisitor at the Video Browser Showdown
2022." International Conference on Multimedia Modeling. Cham: Springer International Publishing, 2022.

Presa-Reyes, Maria, et al. "Multi-Source Weak Supervision Fusion for Disaster Scene Recognition in Videos." 2022 IEEE 5th International Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 2022

Wang, Xu, et al. "A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency." Sensors 22.19 (2022): 7689.

Lughofer, Edwin. "Evolving multi-label fuzzy classifier." Information Sciences 597 (2022): 1-23.

Ramesh, Raksha, et al. "Leveraging Text Representation and Face-head Tracking for Long-form Multimodal Semantic Relation Understanding." Proceedings of the 30th ACM International Conference on Multimedia. 2022.

Li, Haopeng, et al. "Video joint modelling based on hierarchical transformer for co-summarization." IEEE Transactions on Pattern Analysis and Machine Intelligence 45.3 (2022): 3904-3917.

Mavroudi, Effrosyni, Prashast Bindal, and René Vidal. "Actor-Centric Tubelets for Real-Time Activity Detection in Extended Videos." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2022.

Tao, Yudong. Data Analytics and Deep Learning for Multimodal Data. Diss. University of Miami, 2022.

Li, Ding, and Scott Dick. "Semi-supervised multi-label classification using an extended graph-based manifold regularization." Complex & Intelligent Systems 8.2 (2022): 1561-1577.

Li, Lihuan, Maurice Pagnucco, and Yang Song. "Graph-based spatial transformer with memory replay for multi-future pedestrian trajectory prediction." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022.

Li, Changwei. -based video summarization using attention networks. Diss. University of Cincinnati, 2022.

Loc, Erika, et al. "Development of a MultiModal Annotation Framework and Dataset for Deep Video Understanding." Proceedings of the 2nd Workshop on People in Vision, Language, and the Mind. 2022.

Shi, Huizhong, Yana Zhang, and Yanfang Li. "Decision Fusion Based Multi-type Shot Boundary Detection in Real Time." 2022 5th International Conference on Information Communication and Signal Processing (ICICSP). IEEE, 2022.

Chen, Yaosen, et al. "Video summarization with u-shaped transformer." Applied Intelligence 52.15 (2022): 17864-17880.

Yan, Xue, et al. "Multimodal based attention-pyramid for predicting pedestrian trajectory." Journal of Electronic Imaging 31.5 (2022): 053008-053008.

Reboud, A. (2022). Towards automatic understanding of narrative audiovisual content (Doctoral dissertation, Sorbonne université).

------------------------------------------------------------------
2021 (62)
------------------------------------------------------------------
Chen, Aozhu, et al. "What matters for ad-hoc video search? A large-scale evaluation on TRECVID." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.

Dong, Jianfeng, et al. "Dual encoding for video retrieval by text." IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).

Rashmi, B. S., and H. S. Nagendraswamy. "Video shot boundary detection using block based cumulative approach." Multimedia Tools and Applications 80.1 (2021): 641-664.

Hao, Yanbin, Chong-Wah Ngo, and Bin Zhu. "Learning to match anchor-target video pairs with dual attentional holographic networks." IEEE Transactions on Image Processing 30 (2021): 8130-8143.

Gkountakos, Konstantinos, et al. "Visual Recognition of Abnormal Activities in Video Streams." Technology Development for Security Practitioners. Springer, Cham, 2021. 151-165.

Reboud, Alison, et al. "Exploring multimodality, perplexity and explainability for memorability prediction." Multimedia Benchmark Workshop. 2021.

Luo, Minnan, Xiaojun Chang, and Chen Gong. "Reliable shot identification for complex event detection via visual-semantic embedding." Computer Vision and Image Understanding 213 (2021): 103300.

Yang, Wenhao, et al. "Instance Search via Fusing Hierarchical Multi-level Retrieval and Human-object Interaction Detection." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.

Dzabraev, Maksim, et al. "Mdmmt: Multidomain multimodal transformer for video retrieval." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021.

Chakraborty, Saptarshi, Dalton Meitei Thounaojam, and Nidul Sinha. "A shot boundary detection technique based on visual colour information." Multimedia Tools and Applications 80.3 (2021): 4007-4022.

Nguyen, E-Ro, et al. "HCMUS at MediaEval2021: Attention-based Hierarchical Fusion Network for Predicting Media Memorability." (2021).

Kleinlein, Ricardo, Cristina Luna-Jiménez, and Fernando Fernández-Martínez. "THAU-UPM at MediaEval 2021: From Video Semantics To Memorability Using Pretrained Transformers." (2021).

Savran Kiziltepe, Rukiye, et al. "Overview of The MediaEval 2021 Predicting Media Memorability Task." (2021).

Presa-Reyes, Maria, et al. "Deep Learning With Weak Supervision for Disaster Scene Description in Low-Altitude Imagery." IEEE transactions on geoscience and remote sensing 60 (2021): 1-10.

Li, Changsheng, et al. "Deep Unsupervised Active Learning via Matrix Sketching." IEEE Transactions on Image Processing 30 (2021): 9280-9293.

Chakraborty, Saptarshi, and Dalton Meitei Thounaojam. "SBD-Duo: a dual stage shot boundary detection technique robust to motion and illumination effect." Multimedia Tools and Applications 80.2 (2021): 3071-3087.

Apostolidis, Evlampios, et al. "Video summarization using deep neural networks: A survey." Proceedings of the IEEE 109.11 (2021): 1838-1863.

Lokoč, Jakub, et al. "Is the reign of interactive search eternal? Findings from the video browser showdown 2020." ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17.3 (2021): 1-26.

Raja, T. Naga, V. V. Ramana, and A. Damodaram. "A Novel Framework for Video Retrieval Algorithm Evaluations and Methods for Effective Context-Aware Video Content Retrial Method on Cloud." 
Proceedings of International Conference on Advances in Computer Engineering and Communication Systems. Springer, Singapore, 2021.

Jin, Yang, et al. "Zero-shot video event detection with high-order semantic concept discovery and matching." IEEE Transactions on Multimedia 24 (2021): 1896-1908.

Chavate, Shrikant, and Ravi Mishra. "A comparison of different procedures for hardware-based video shot boundary detection." 
Advances in Image and Data Processing using VLSI Design, Volume 1: Smart vision systems. IOP Publishing, 2021.

Nishimoto, Koki, and Kimiaki Shirahama. "Acquisition of Human's Memory Mechanism for Video Frames." ITE Technical Report; ITE Tech. Rep. 45.31 (2021): 17-20.

Dilawari, Aniqa, et al. "Natural language description of videos for smart surveillance." Applied Sciences 11.9 (2021): 3730.

Gowri, S., et al. "Human Action Detection Using Deep Learning." Machine Learning for Predictive Analysis. Springer, Singapore, 2021. 229-235.

Constantin, Mihai Gabriel, and Bogdan Ionescu. "Using Vision Transformers and Memorable Moments for the Prediction of Video Memorability." (2021).

Nandini, H. M., H. K. Chethan, and B. S. Rashmi. "An efficient method for video shot transition detection using probability binary weight Approach." International Journal of Computer Vision and Image Processing (IJCVIP) 11.3 (2021): 1-20.

Mishra, Ravi. "Video shot boundary detection using hybrid dual tree complex wavelet transform with Walsh Hadamard transform." Multimedia Tools and Applications 80.18 (2021): 28109-28135.

Xinwei, Li, Xu Lianghao, and Yang Yi. "Compact video fingerprinting via an improved capsule net." Systems Science & Control Engineering 9.sup1 (2021): 122-130.

Chavate, Shrikant, Ravi Mishra, and Pranay Yadav. "A Comparative Analysis of Video Shot Boundary Detection using Different Approaches." 
2021 10th International Conference on System Modeling & Advancement in Research Trends (SMART). IEEE, 2021.

Dong, Jianfeng, et al. "Multi-level alignment network for domain adaptive cross-modal retrieval." Neurocomputing 440 (2021): 207-219.

Nguyen, Phuong-Anh, and Chong-Wah Ngo. "Interactive search vs. automatic search: an extensive study on video retrieval." ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 17.2 (2021): 1-24.

Jose, Jasmin T., and S. Rajkumar. "Multiple Grey-scale Feature Based Shot Boundary Detection." 2021 Asian Conference on Innovation in Technology (ASIANCON). IEEE, 2021.

Han, Tingting, Yuankai Qi, and Suguo Zhu. "A Continuous Semantic Embedding Method for Video Compact Representation." Electronics 10.24 (2021): 3106.

Kiziltepe, Rukiye Savran, et al. "An annotated video dataset for computing video memorability." Data in Brief 39 (2021): 107671.

Wu, Jiaxin, et al. "SQL-like interpretable interactive video search." International Conference on Multimedia Modeling. Springer, Cham, 2021.

Huang, Qing, Hongcai Feng, and Li Liu. "A Video Scene Segmentation Optimization Algorithm Based on Convolutional Neural Network." 2021 2nd International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI). IEEE, 2021.

Iinuma, Yuko, and Shin'ichi Satoh. "Video Action Retrieval Using Action Recognition Model." Proceedings of the 2021 International Conference on Multimedia Retrieval. 2021.

Lu, Youwei, and Xiaoyu Wu. "Cross-modal Interaction for Video Memorability Prediction." (2021).

Galanopoulos, Damianos, and Vasileios Mezaris. "Hard-negatives or Non-negatives? A hard-negative selection strategy for cross-modal retrieval using the improved marginal ranking loss." 
Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.

Galanopoulos, Damianos, et al. "Automatic and Semi-automatic Augmentation of Migration Related Semantic Concepts for Visual Media Retrieval." 
Proceedings of the 2021 Workshop on Open Challenges in Online Social Networks. 2021.

Gkountakos, Konstantinos, et al. "Spatio-temporal activity detection and recognition in untrimmed surveillance videos." Proceedings of the 2021 International Conference on Multimedia Retrieval. 2021.

Kumar, Neetish. "Shot Boundary Detection Framework For Video Editing Via Adaptive Thresholds And Gradual Curve Point." Turkish Journal of Computer and Mathematics Education (TURCOMAT) 12.11 (2021): 3820-3828.

Godil, Afzal, et al. "2020 Sequestered Data Evaluation for Known Activities in Extended Video: Summary and Results." Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2021.

Manjunath Aradhya, V. N., H. T. Basavaraju, and Devanur S. Guru. "Decade research on text detection in images/videos: a review." Evolutionary Intelligence 14.2 (2021): 405-431.

Hezel, Nico, et al. "Video search with sub-image keyword transfer using existing image archives." International Conference on Multimedia Modeling. Springer, Cham, 2021.

Rizve, Mamshad Nayeem, et al. "Gabriella: An online system for real-time activity detection in untrimmed security videos." 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 2021.

Pavithra, N., and Y. H. Sharath Kumar. "FSBRS: Framework for Sketch-Based Retrieval System of the Color Images." Soft Computing and Signal Processing. Springer, Singapore, 2021. 1-15.

Bouyahi, Mohamed, and Yassine Ben Ayed. "Multimodal features for shots boundary detection." Thirteenth International Conference on Machine Vision. Vol. 11605. SPIE, 2021.

Jiang, Xuekun, et al. "Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos." IEEE Transactions on Multimedia (2021).

Tao, Jianwen, Yufang Dan, and Di Zhou. "Robust multi-source co-adaptation with adaptive loss minimization." Signal Processing: Image Communication 99 (2021): 116455.

Valand, Joakim O., et al. "Automated Clipping of Soccer Events using Machine Learning." 2021 IEEE International Symposium on Multimedia (ISM). IEEE, 2021.

Thallinger, Georg, and Werner Bailer. "Automatic Analysis of Amateur Film and Video Collections." 2021 International Conference on Content-Based Multimedia Indexing (CBMI). IEEE, 2021.

Yao, Wei, et al. "Early and Late Fusion of Multiple Modalities in Sentinel Imagery and Social Media Retrieval." International Conference on Pattern Recognition. Springer, Cham, 2021.

Valand, Joakim Olav, et al. "AI-Based Video Clipping of Soccer Events." Machine Learning and Knowledge Extraction 3.4 (2021): 990-1008.

Lan, Libin, and Chunxiao Ye. "Recurrent generative adversarial networks for unsupervised WCE video summarization." Knowledge-Based Systems 222 (2021): 106971.

Zhu, Yunzhang, et al. "Collaborative multilabel classification." Journal of the American Statistical Association (2021): 1-12.

Li, Yuke, Pin Wang, and Ching-Yao Chan. "RESTEP into the future: relational spatio-temporal learning for multi-person action forecasting." IEEE Transactions on Multimedia (2021).

Li, Yu-Ke, et al. "Imitative Learning for Multi-Person Action Forecasting." Proceedings of the 29th ACM International Conference on Multimedia. 2021.

Apostolidis, Evlampios, et al. "Combining global and local attention with positional encoding for video summarization." 2021 IEEE International Symposium on Multimedia (ISM). IEEE, 2021.

Chen, Bo, Decai Li, and Yuqing He. "Simultaneous Prediction of Pedestrian Trajectory and Actions based on Context Information Iterative Reasoning." 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2021.

Chen, Bo, et al. "SCR-graph: Spatial-causal relationships based graph reasoning network for human action prediction." The 2nd International Conference on Computing and Data Science. 2021.

Kavitha, R., and D. Chitra. "An improved hybridized deep structured model for accurate video event recognition." Journal of Ambient Intelligence and Humanized Computing 12.6 (2021): 6019-6028.

------------------------------------------------------------------
2020 (55)
------------------------------------------------------------------
Mejzlík, F. (2020). Evaluation of Keyword-Based Search Models for Known-Item Search
  
Wang, Ying, Yongchen Wang, Cong Shi, Long Cheng, Huawei Li, and Xiaowei Li. "An Edge 3D CNN Accelerator for Low-Power Activity Recognition."
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 40, no. 5 (2020): 918-930.
  
Zafirova, Deana. "Shot Boundary Detection: A fundamental base for automatic video analysis." PhD diss., Wien, 2020.
  
Atencio Ortiz, Pedro Sandino. "Query-based Video summarization using machine learning and coordinated representations."
  
Bátoryová, Jana. "Searching Image Collections Using Deep Representations of Local Regions." (2020).
  
Wang, Han, Hao Song, Xinxiao Wu, and Yunde Jia. "Incremental transfer learning for video annotation via grouped heterogeneous sources." IET Computer Vision 14, no. 1 (2020): 26-35.
  
Čech, Přemysl, Jakub Lokoč, and Yasin N. Silva. "Pivot-based approximate k-NN similarity joins for big high-dimensional data."
Information Systems 87 (2020): 101410.
  
Fan, S., Shen, Z., Koenig, B.L., Ng, T.T. and Kankanhalli, M.S., 2020. When and Why Static Images Are More Effective Than Videos. IEEE Transactions on Affective Computing, (01), pp.1-1.
  
Lin, Sung-Chiang, Chih-Jou Chen, and Tsung-Ju Lee. "A multi-label classification with hybrid label-based meta-learning method in internet of things." IEEE Access 8 (2020): 42261-42269.
  
Harsha, B. K., and G. Indumathi. "Skin Detection in Images based on Pattern Matching Algorithms-A Review." In 2020 International Conference on Inventive Computation Technologies (ICICT), pp. 359-363. IEEE, 2020.  
  
Qian, Yijun, Lijun Yu, Wenhe Liu, Guoliang Kang, and Alexander G. Hauptmann. "Adaptive feature aggregation for video object detection." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, pp. 143-147. 2020.
  
Sandeep, R., and Prabin Kumar Bora. "Detection of Malicious Video Modifications using Perceptual Video Hashing." 2020 5th International Conference on Computing, Communication and Security (ICCCS). IEEE, 2020.
  
Apostolidis, E., Adamantidou, E., Metsai, A.I., Mezaris, V. and Patras, I., Unsupervised Video Summarization via Attention-Driven Adversarial Learning.
  
Wang, Liyuan, et al. "Multilevel fusion of multimodal deep features for porn streamer recognition in live video."
Pattern Recognition Letters 140 (2020): 150-157.

Liu, Yanbing, Sanjev Dhakal, and Binyao Hao. "Multimedia image and video retrieval based on an improved HMM."
Multimedia Systems (2020): 1-11.

Bekhet, Saddam, and Amr Ahmed. "Evaluation of similarity measures for video retrieval."
Multimedia Tools and Applications 79.9 (2020): 6265-6278.

Subudhi, Badri Narayan, et al. "Automatic lecture video skimming using shot categorization and contrast based features."
Expert Systems with Applications 149 (2020): 113341.

Gornishka, Iva, Stevan Rudinac, and Marcel Worring. "Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings."
International Conference on Multimedia Modeling. Springer, Cham, 2020.

Qi, Haifeng, et al. "Hash length: a neglected element." Multimedia Tools and Applications 79.7 (2020): 4763-4782.

Jónsson, Björn Þór, et al. "Exquisitor at the video browser showdown 2020."
International Conference on Multimedia Modeling. Springer, Cham, 2020.

Kim, Byoungjun, et al. "Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes."
International Conference on Multimedia Modeling. Springer, Cham, 2020.

Mantsis, Damianos Florin, et al. "Multimodal Fusion of Sentinel 1 Images and Social Media Data for Snow Depth Estimation."
IEEE Geoscience and Remote Sensing Letters (2020).

Lee, Yooyoung, et al. "Summary of the 2019 Activity Detection in Extended Videos Prize Challenge."
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops. 2020.

Shen, Ling, Richang Hong, and Yanbin Hao. "Advance on large scale near-duplicate video retrieval." Frontiers of Computer Science 14.5 (2020): 1-24.

Bhaumik, Hrishikesh, Siddhartha Bhattacharyya, and Susanta Chakraborty. "Real-time video segmentation using a vague adaptive threshold."
Hybrid Computational Intelligence. Academic Press, 2020. 191-220.

Jadhav, Dattatraya, Yogesh Kumar Sharma, and Dr Arora. "Profound Learning Approach for Shot Boundary Location." Available at SSRN 3645409 (2020).

Nandini, H. M., H. K. Chethan, and B. S. Rashmi. "Shot based keyframe extraction using edge-LBP approach."
Journal of King Saud University-Computer and Information Sciences (2020).

Xinwei, Li, Xu Lianghao, and Yang Yi. "Compact video fingerprinting via an improved capsule net."
Systems Science & Control Engineering (2020): 1-9.

GogiReddy, Hema Sundara Srinivasula Reddy, and Neelam Sinha. "Video Key Frame Detection Using Block Sparse Coding."
Proceedings of 3rd International Conference on Computer Vision and Image Processing. Springer, Singapore, 2020.

Janwe, Nitin, and Kishor Bhoyar. "Semantic concept based video retrieval using convolutional neural network." SN Applied Sciences 2.1 (2020): 1-8.

Kar, T., and P. Kanungo. "Abrupt Scene Change Detection Using Block Based Local Directional Pattern."
Data Management, Analytics and Innovation. Springer, Singapore, 2020. 191-203.

Soboroff, Ian, et al. "Evaluating Multimedia and Language Tasks." Frontiers in Artificial Intelligence 3 (2020).

Andreadis, Stelios, et al. "Verge in vbs 2020." International Conference on Multimedia Modeling. Springer, Cham, 2020.

Sasithradevi, A., and S. Mohamed Mansoor Roomi. "A new pyramidal opponent color-shape model based video shot boundary detection."
Journal of Visual Communication and Image Representation 67 (2020): 102754.

Sasithradevi, A., and S. Mohamed Mansoor Roomi. "Video classification and retrieval through spatio-temporal Radon features."
Pattern Recognition 99 (2020): 107099.

Jakub Lokoć, Tomáš Soućek, Patrik Veselý, František Mejzlík, Jiaqi Ji, Chaoxi Xu, and Xirong Li. 2020.
A W2VV++ Case Study with Automated and Interactive Text-to-Video Retrieval. In Proceedings of the 28th ACM International Conference on Multimedia (MM '20). Association for Computing Machinery, New York, NY, USA, 2553–2561. DOI:https://doi.org/10.1145/3394171.3414002

Jiaxin Wu and Chong-Wah Ngo. 2020. Interpretable Embedding for Ad-Hoc Video Search.
In Proceedings of the 28th ACM International Conference on Multimedia (MM '20). Association for Computing Machinery, New York, NY, USA, 3357–3366. DOI:https://doi.org/10.1145/3394171.3413916

Zhihui Li, Xiaojun Chang, Lina Yao, Shirui Pan, Ge Zongyuan, and Huaxiang Zhang. 2020. Grounding Visual Concepts for Zero-Shot Event Detection and Event Captioning. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '20).
Association for Computing Machinery, New York, NY, USA, 297–305. DOI:https://doi.org/10.1145/3394486.3403072

Shuo Chen, Pascal Mettes, Tao Hu, and Cees G.M. Snoek. 2020. Interactivity Proposals for Surveillance Videos.
In Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR '20). Association for Computing Machinery, New York, NY, USA, 108–116. DOI:https://doi.org/10.1145/3372278.3390680

Damianos Galanopoulos and Vasileios Mezaris. 2020. Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks. In Proceedings of the 2020 International Conference on Multimedia Retrieval (ICMR '20).
Association for Computing Machinery, New York, NY, USA, 336–340. DOI:https://doi.org/10.1145/3372278.3390737

Pascal Mettes, Dennis C. Koelma, and Cees G. M. Snoek. 2020. Shuffled ImageNet Banks for Video Event Detection and Search.
ACM Trans. Multimedia Comput. Commun. Appl. 16, 2, Article 44 (June 2020), 21 pages. DOI:https://doi.org/10.1145/3377875

Kazuya Ueki and Takayuki Hori. 2020. Comparison and Evaluation of Video Retrieval Approaches Using Query Sentences.
In Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing (IMIP 2020).
Association for Computing Machinery, New York, NY, USA, 103–107. DOI:https://doi.org/10.1145/3399637.3399657
  
X. Li, H. Li and Y. Dong, "Meta Learning for Task-Driven Video Summarization,"
in IEEE Transactions on Industrial Electronics, vol. 67, no. 7, pp. 5778-5786, July 2020, doi: 10.1109/TIE.2019.2931283.
  
S. H. Abdulhussain, S. A. R. Al-Haddad, M. I. Saripan, B. M. Mahmmod and A. Hussien, "Fast Temporal Video Segmentation Based on Krawtchouk-Tchebichef Moments,"
in IEEE Access, vol. 8, pp. 72347-72359, 2020, doi: 10.1109/ACCESS.2020.2987870.
  
X. Wang, Q. Wang and H. Wang, "Active Video Hashing via Structure Information Learning for Activity Analysis,"
in IEEE Access, vol. 8, pp. 96428-96437, 2020, doi: 10.1109/ACCESS.2020.2994783.
  
J. Gleason, S. Schwarcz, R. Ranjan, C. D. Castillo, J. Chen and R. Chellappa, "Activity Detection in Untrimmed Videos Using Chunk-based Classifiers,"
2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 107-116, doi: 10.1109/WACVW50321.2020.9096912.
  
J. Gleason, C. D. Castillo and R. Chellappa, "Real-time Detection of Activities in Untrimmed Videos,"
2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 117-125, doi: 10.1109/WACVW50321.2020.9096937.  
  
F. -F. Duan and F. Meng, "Video Shot Boundary Detection Based on Feature Fusion and Clustering Technique,"
in IEEE Access, vol. 8, pp. 214633-214645, 2020, doi: 10.1109/ACCESS.2020.3040861.
  
C. Wang, L. Pang, X. Jiang and L. Jin, "SVD of Shot Boundary Detection Based on Accumulative Difference,"
2020 International Conference on Culture-oriented Science & Technology (ICCST), Beijing, China, 2020, pp. 367-372, doi: 10.1109/ICCST50977.2020.00077.
  
X. Li, F. Zhou, C. Xu, J. Ji and G. Yang, "SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries,"
in IEEE Transactions on Multimedia, doi: 10.1109/TMM.2020.3042067.
  
F. Hertlein, D. Münch and M. Arens, "Context Sensitivity of Spatio-Temporal Activity Detection using Hierarchical Deep Neural Networks in Extended Videos,"
2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 134-142, doi: 10.1109/WACVW50321.2020.9096934.
  
H. M. Nandini, H. K. Chethan and B. S. Rashmi, "Abrupt Shot Change Detection using Midhinge Local Binary Pattern,"
2020 IEEE-HYDCON, Hyderabad, India, 2020, pp. 1-5, doi: 10.1109/HYDCON48903.2020.9242841.
  
H. -Q. Vo et al., "Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database,"
2020 International Conference on Multimedia Analysis and Pattern Recognition (MAPR), Ha Noi, Vietnam, 2020, pp. 1-6, doi: 10.1109/MAPR49794.2020.9237781.~
  
Y. Hao, C. Ngo and B. Huet, "Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking,"
in IEEE Transactions on Multimedia, vol. 22, no. 1, pp. 188-200, Jan. 2020, doi: 10.1109/TMM.2019.2923121.
  
W. Liu et al., "Argus: Efficient Activity Detection System for Extended Video Analysis,"
2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), Snowmass, CO, USA, 2020, pp. 126-133, doi: 10.1109/WACVW50321.2020.9096929.
  

------------------------------------------------------------------
2019 (90)
------------------------------------------------------------------
L. Yao and Y. Qian, "Novel Activities Detection Algorithm in Extended Videos," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 9-15, doi: 10.1109/WACVW.2019.00009.

J. Dong et al., "Dual Encoding for Zero-Example Video Retrieval," 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 2019, pp. 9338-9347, doi: 10.1109/CVPR.2019.00957.

S. Aakur, D. Sawyer and S. Sarkar, "Fine-grained Action Detection in Untrimmed Surveillance Videos," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 38-40, doi: 10.1109/WACVW.2019.00014.

D. Francis, P. A. Nguyen, B. Huet and C. Ngo, "Fusion of Multimodal Embeddings for Ad-Hoc Video Search," 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea (South), 2019, pp. 1868-1872, doi: 10.1109/ICCVW.2019.00233.

S. H. Abdulhussain et al., "A Fast Feature Extraction Algorithm for Image and Video Processing," 2019 International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019, pp. 1-8, doi: 10.1109/IJCNN.2019.8851750.

Z. Zhou, J. Chen, C. Yang and X. Sun, "Video Copy Detection Using Spatio-Temporal CNN Features," in IEEE Access, vol. 7, pp. 100658-100665, 2019, doi: 10.1109/ACCESS.2019.2930173.

Y. Gao, Y. Lai and Y. Liu, "Fast Video Shot Boundary Detection Based on Visual Perception," 2019 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2019, pp. 1-4, doi: 10.1109/ICCE.2019.8662083.

Y. Wang, Y. Wang, H. Li, C. Shi and X. Li, "Systolic Cube: A Spatial 3D CNN Accelerator Architecture for Low Power Video Analysis," 2019 56th ACM/IEEE Design Automation Conference (DAC), Las Vegas, NV, USA, 2019, pp. 1-6.

R. Thomanek et al., "A Scalable System Architecture for Activity Detection with Simple Heuristics," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 27-34, doi: 10.1109/WACVW.2019.00012.

J. Chen et al., "Minding the Gaps in a Video Action Analysis Pipeline," 2019 IEEE Winter Applications of Computer Vision Workshops (WACVW), Waikoloa Village, HI, USA, 2019, pp. 41-46, doi: 10.1109/WACVW.2019.00015.

L. Yu, P. Chen, W. Liu, G. Kang and A. G. Hauptmann, "Training-free Monocular 3D Event Detection System for Traffic Surveillance," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 3838-3843, doi: 10.1109/BigData47090.2019.9006063.

X. Peng, R. Li, J. Wang and H. Shang, "User-Guided Clustering for Video Segmentation on Coarse-Grained Feature Extraction," in IEEE Access, vol. 7, pp. 149820-149832, 2019, doi: 10.1109/ACCESS.2019.2946889.

J. Gleason, R. Ranjan, S. Schwarcz, C. Castillo, J. Chen and R. Chellappa, "A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 141-150, doi: 10.1109/WACV.2019.00021.

A. Yazici, M. Koyuncu, S. A. Sert and T. Yilmaz, "A Fusion-Based Framework for Wireless Multimedia Sensor Networks in Surveillance Applications," in IEEE Access, vol. 7, pp. 88418-88434, 2019, doi: 10.1109/ACCESS.2019.2926206.

S. Lal, S. Duggal and I. Sreedevi, "Online Video Summarization: Predicting Future to Better Summarize Present," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 471-480, doi: 10.1109/WACV.2019.00056.

H. Zhang and C. Ngo, "A Fine Granularity Object-Level Representation for Event Detection and Recounting," in IEEE Transactions on Multimedia, vol. 21, no. 6, pp. 1450-1463, June 2019, doi: 10.1109/TMM.2018.2884478.

F. Markatopoulou, V. Mezaris and I. Patras, "Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 6, pp. 1631-1644, June 2019, doi: 10.1109/TCSVT.2018.2848458.

Z. Gao, L. Wang, N. Jojic, Z. Niu, N. Zheng and G. Hua, "Video Imprint," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 41, no. 12, pp. 3086-3099, 1 Dec. 2019, doi: 10.1109/TPAMI.2018.2866114.

S. S. Thomas, S. Gupta and V. K. Subramanian, "Context Driven Optimized Perceptual Video Summarization and Retrieval," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 10, pp. 3132-3145, Oct. 2019, doi: 10.1109/TCSVT.2018.2873185.

M. Elfeki and A. Borji, "Video Summarization Via Actionness Ranking," 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA, 2019, pp. 754-763, doi: 10.1109/WACV.2019.00085.

W. Xie, H. Yao, X. Sun, T. Han, S. Zhao and T. Chua, "Discovering Latent Discriminative Patterns for Multi-Mode Event Representation," in IEEE Transactions on Multimedia, vol. 21, no. 6, pp. 1425-1436, June 2019, doi: 10.1109/TMM.2018.2879749.

A. Dilawari and M. U. G. Khan, "ASoVS: Abstractive Summarization of Video Sequences," in IEEE Access, vol. 7, pp. 29253-29263, 2019, doi: 10.1109/ACCESS.2019.2902507.

Z. Lu, L. Wu, M. Jian, S. Zhang, D. Wang and X. Wang, "Shot Boundary Detection with Key Motion Estimation and Appearance Differentiation," 2019 IEEE International Conference on Signal, Information and Data Processing (ICSIDP), Chongqing, China, 2019, pp. 1-7, doi: 10.1109/ICSIDP47821.2019.9173023.

K. Liao et al., "IR Feature Embedded BOF Indexing Method for Near-Duplicate Video Retrieval," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 12, pp. 3743-3753, Dec. 2019, doi: 10.1109/TCSVT.2018.2884941.

M. Ma, S. Mei, S. Wan, Z. Wang and D. Feng, "Video Summarization via Nonlinear Sparse Dictionary Selection," in IEEE Access, vol. 7, pp. 11763-11774, 2019, doi: 10.1109/ACCESS.2019.2891834.

Y. Yuan, T. Mei, P. Cui and W. Zhu, "Video Summarization by Learning Deep Side Semantic Embedding," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 29, no. 1, pp. 226-237, Jan. 2019, doi: 10.1109/TCSVT.2017.2771247.

L. Wu, S. Zhang, M. Jian, Z. Lu and D. Wang, "Two Stage Shot Boundary Detection via Feature Fusion and Spatial-Temporal Convolutional Neural Networks," in IEEE Access, vol. 7, pp. 77268-77276, 2019, doi: 10.1109/ACCESS.2019.2922038.

H. Tao, C. Hou, D. Yi, J. Zhu and D. Hu, "Joint Embedding Learning and Low-Rank Approximation: A Framework for Incomplete Multiview Learning," in IEEE Transactions on Cybernetics, doi: 10.1109/TCYB.2019.2953564.

P. Gunawardena et al., "Interest-Oriented Video Summarization with Keyframe Extraction," 2019 19th International Conference on Advances in ICT for Emerging Regions (ICTer), Colombo, Sri Lanka, 2019, pp. 1-8, doi: 10.1109/ICTer48817.2019.9023769.

M. Gong, H. Li, D. Meng, Q. Miao and J. Liu, "Decomposition-Based Evolutionary Multiobjective Optimization to Self-Paced Learning," in IEEE Transactions on Evolutionary Computation, vol. 23, no. 2, pp. 288-302, April 2019, doi: 10.1109/TEVC.2018.2850769.

H. Li, M. Gong, C. Wang and Q. Miao, "Pareto Self-Paced Learning Based on Differential Evolution," in IEEE Transactions on Cybernetics, doi: 10.1109/TCYB.2019.2935762.

F. Yang and S. Satoh, "Burst-survive Temporal Matching Kernel with Fibonacci Periods," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 2062-2066, doi: 10.1109/ICASSP.2019.8682971.

Schoeffmann, Klaus. "Video browser showdown 2012-2019: A review." 2019 International Conference on Content-Based Multimedia Indexing (CBMI). IEEE, 2019.

Xirong Li, Chaoxi Xu, Gang Yang, Zhineng Chen, and Jianfeng Dong. 2019. W2VV++: Fully Deep Learning for Ad-hoc Video Search. In Proceedings of the 27th ACM International Conference on Multimedia. Association for Computing Machinery, New York, NY, USA, 1786-1794. DOI:https://doi.org/10.1145/3343031.3350906

Zheng Wang, Fan Yang, and Shin'ichi Satoh. 2019. Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series. In Proceedings of the ACM Multimedia Asia (MMAsia '19). Association for Computing Machinery, New York, NY, USA, Article 27, 1-6. DOI:https://doi.org/10.1145/3338533.3366594

Kashif Ahmad and Nicola Conci. 2019. How Deep Features Have Improved Event Recognition in Multimedia: A Survey. ACM Trans. Multimedia Comput. Commun. Appl. 15, 2, Article 39 (June 2019), 27 pages. DOI:https://doi.org/10.1145/3306240

Fabian Berns, Luca Rossetto, Klaus Schoeffmann, Christian Beecks, and George Awad. 2019. V3C1 Dataset: An Evaluation of Content Characteristics. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 334-338. DOI:https://doi.org/10.1145/3323873.3325051

Hung-Quoc Vo, Vu-Minh-Hieu Dang, Vinh-Tiep Nguyen, and Duy-Dinh Le. 2019. Noise Removal Based Query Pre-processing to Improve Face Search Performance in Large Scale Video Databases. In Proceedings of the Tenth International Symposium on Information and Communication Technology (SoICT 2019). Association for Computing Machinery, New York, NY, USA, 357-361. DOI:https://doi.org/10.1145/3368926.3369727

Xirong Li. 2019. Deep Learning for Video Retrieval by Natural Language. In Proceedings of the 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia (FAT/MM '19). Association for Computing Machinery, New York, NY, USA, 2-3. DOI:https://doi.org/10.1145/3347447.3350565

Mohamed Hamroun, Sonia Lajmi, Henri Nicolas, and Ikram Amous. 2019. Large-Scale Semantic Concept Detection Based On Visual Contents. In Proceedings of the 17th International Conference on Advances in Mobile Computing & Multimedia (MoMM2019). Association for Computing Machinery, New York, NY, USA, 165-174. DOI:https://doi.org/10.1145/3365921.3365925

Jakub Lokoc, Gregor Kovalcik, Bernd Münzer, Klaus Schöffmann, Werner Bailer, Ralph Gasser, Stefanos Vrochidis, Phuong Anh Nguyen, Sitapa Rujikietgumjorn, and Kai Uwe Barthel. 2019. Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018. ACM Trans. Multimedia Comput. Commun. Appl. 15, 1, Article 29 (February 2019), 18 pages. DOI:https://doi.org/10.1145/3295663

Madhushree Basavarajaiah and Priyanka Sharma. 2019. Survey of Compressed Domain Video Summarization Techniques. ACM Comput. Surv. 52, 6, Article 116 (January 2020), 29 pages. DOI:https://doi.org/10.1145/3355398

Mohamed Hamroun, Sonia Lajmi, Henri Nicolas, and Ikram Amous. 2019. VISEN: a video interactive retrieval engine based on semantic network in large video collections. In Proceedings of the 23rd International Database Applications & Engineering Symposium (IDEAS '19). Association for Computing Machinery, New York, NY, USA, Article 25, 1-10. DOI:https://doi.org/10.1145/3331076.3331094

Yujia Zhang, Michael Kampffmeyer, Xiaoguang Zhao, and Min Tan. 2019. DTR-GAN: dilated temporal relational adversarial network for video summarization. In Proceedings of the ACM Turing Celebration Conference - China (ACM TURC '19). Association for Computing Machinery, New York, NY, USA, Article 89, 1-6. DOI:https://doi.org/10.1145/3321408.3322622

Yongchen Wang, Ying Wang, Huawei Li, Cong Shi, and Xiaowei Li. 2019. Systolic Cube: A Spatial 3D CNN Accelerator Architecture for Low Power Video Analysis. In Proceedings of the 56th Annual Design Automation Conference 2019 (DAC '19). Association for Computing Machinery, New York, NY, USA, Article 210, 1-6. DOI:https://doi.org/10.1145/3316781.3317919

Junbo Wang, Wei Wang, Zhiyong Wang, Liang Wang, Dagan Feng, and Tieniu Tan. 2019. Stacked Memory Network for Video Summarization. In Proceedings of the 27th ACM International Conference on Multimedia (MM '19). Association for Computing Machinery, New York, NY, USA, 836-844. DOI:https://doi.org/10.1145/3343031.3350992

Xinyu Weng, Yongzhi Li, Lu Chi, and Yadong Mu. 2019. High-Capacity Convolutional Video Steganography with Temporal Residual Modeling. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 87-95. DOI:https://doi.org/10.1145/3323873.3325011

Evlampios Apostolidis, Alexandros I. Metsai, Eleni Adamantidou, Vasileios Mezaris, and Ioannis Patras. 2019. A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised Video Summarization. In Proceedings of the 1st International Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV '19). Association for Computing Machinery, New York, NY, USA, 17-25. DOI:https://doi.org/10.1145/3347449.3357482

Jakub Lokoc, Gregor Kovalcik, Tomáš Scek, Jaroslav Moravec, and Premysl cech. 2019. A Framework for Effective Known-item Search in Video. In Proceedings of the 27th ACM International Conference on Multimedia (MM '19). Association for Computing Machinery, New York, NY, USA, 1777-1785. DOI:https://doi.org/10.1145/3343031.3351046


Singh, Alok, Dalton Meitei Thounaojam, and Saptarshi Chakraborty. "A novel automatic shot boundary detection algorithm: robust to illumination and motion effect." Signal, Image and Video Processing (2019): 1-9.

Zhu, Yandong, et al. "A comprehensive solution for detecting events in complex surveillance videos." Multimedia Tools and Applications 78.1 (2019): 817-838.

Rossetto, Luca, et al. "V3c-a research video collection." International Conference on Multimedia Modeling. Springer, Cham, 2019.

Kavoosifar, Mohammad Reza, et al. "Effective video hyperlinking by means of enriched feature sets and monomodal query combinations." International Journal of Multimedia Information Retrieval (2019): 1-13.

Patil, Nita, and Sudhir Sawarkar. "Semantic Concept Detection for Multilabel Unbalanced Dataset Using Global Features." Intelligent Communication Technologies and Virtual Mobile Networks. Springer, Cham, 2019.

Saleem, Summra, et al. "Stateful human-centered visual captioning system to aid video surveillance." Computers & Electrical Engineering 78 (2019): 108-119.

Smeaton, Alan F., et al. "Exploring the Impact of Training Data Bias on Automatic Generation of Video Captions." International Conference on Multimedia Modeling. Springer, Cham, 2019.

Li, Zhihui, et al. "Zero-shot event detection via event-adaptive concept relevance mining." Pattern Recognition 88 (2019): 595-603.

Asha, D., and Y. Madhavee Latha. "Content-Based Video Shot Boundary Detection Using Multiple Haar Transform Features." Soft Computing and Signal Processing. Springer, Singapore, 2019. 703-713.

Chakraborty, Saptarshi, and Dalton Meitei Thounaojam. "A novel shot boundary detection system using hybrid optimization technique." Applied Intelligence 49.9 (2019): 3207-3220.

Nguyen, Vinh-Tiep, et al. "Video instance search via spatial fusion of visual words and object proposals." International Journal of Multimedia Information Retrieval 8.3 (2019): 181-192.

Yarmohammadi, Hadi, Hossein Marvi, and Hamid Hassanpour. "Application of 2-D fractal dimension in content based video summarization." International Journal of Nonlinear Analysis and Applications 10.2 (2019): 131-140.

Roschke, Christian, et al. "Adaptation of Machine Learning Frameworks for Use in a Management Environment." International Conference on Human-Computer Interaction. Springer, Cham, 2019.

Thomanek, Rico, et al. "Use of Multiple Distributed Process Instances for Activity Analysis in Videos." International Conference on Human-Computer Interaction. Springer, Cham, 2019.

Ji, Hyesung, et al. "A semantic-based video scene segmentation using a deep neural network." Journal of Information Science 45.6 (2019): 833-844.

Patil, Nita S., and Sudhir D. Sawarkar. "Semantic Concept Detection in Video Using Hybrid Model of CNN and SVM Classifiers." International Journal of Image Processing (IJIP) 13.2 (2019): 13-28.

Helm, Daniel, and Martin Kampel. "Shot boundary detection for automatic video analysis of historical films." International Conference on Image Analysis and Processing. Springer, Cham, 2019.

Hamroun, Mohamed, et al. "Descriptor Optimization for Semantic Concept Detection Using Visual Content." International Journal of Strategic Information Technology and Applications (IJSITA) 10.1 (2019): 40-59.

Zlitni, Tarek, and Walid Mahdi. "Extraction and Annotation of News Topics From TV Streams for Web Video Sharing: A Contribution to Produce Reliable Online Video News Content." Knowledge-Intensive Economies and Opportunities for Social, Organizational, and Technological Growth. IGI Global, 2019. 272-294.

Abdulhussain, Sadiq H., et al. "Shot boundary detection based on orthogonal polynomial." Multimedia Tools and Applications 78.14 (2019): 20361-20382.

Prabavathy, A. Kethsy, and J. Devi Shree. "Histogram difference with Fuzzy rule base modeling for gradual shot boundary detection in video cloud applications." Cluster Computing 22.1 (2019): 1211-1218.

Daudpota, Sher Muhammad, Atta Muhammad, and Junaid Baber. "Video genre identification using clustering-based shot detection algorithm." Signal, Image and Video Processing 13.7 (2019): 1413-1420.

Aote, Shailendra S., and Archana Potnurwar. "An automatic video annotation framework based on two level keyframe extraction mechanism." Multimedia Tools and Applications 78.11 (2019): 14465-14484.

Zhang, Dacheng, et al. "Shot boundary detection based on block-wise principal component analysis." Journal of Electronic Imaging 28.2 (2019): 023029

Liu, Mengyang, et al. "Video copy detection by conducting fast searching of inverted files." Multimedia Tools and Applications 78.8 (2019): 10601-10624.

Benuwa, Ben-Bright, et al. "Group sparse based locality-sensitive dictionary learning for video semantic analysis." Multimedia Tools and Applications 78.6 (2019): 6721-6744.

Benuwa, Ben-Bright, et al. "Video semantic analysis based kernel locality-sensitive discriminative sparse representation." Expert Systems with Applications 119 (2019): 429-440.

Bhattacharya, Paheli, et al. "Overview of the FIRE 2019 AILA Track: Artificial Intelligence for Legal Assistance." FIRE (Working Notes). 2019.

Nguyen, Phuong Anh, et al. "VIREO@ video browser showdown 2019." International Conference on Multimedia Modeling. Springer, Cham, 2019.

Markatopoulou, Foteini, et al. "Finding Semantically Related Videos in Closed Collections." Video Verification in the Fake News Era. Springer, Cham, 2019. 127-159.

Bhaumik, Hrishikesh, Siddhartha Bhattacharyya, and Susanta Chakraborty. "A vague set approach for identifying shot transition in videos using multiple feature amalgamation." Applied Soft Computing 75 (2019): 633-651.

Kim, Tae Soo, et al. "Safer: Fine-grained activity detection by compositional hypothesis testing."

Fu, Jianjing, and Jianwen Tao. "Robust multi-model adaptation regression with local feature space representation." Knowledge-Based Systems 174 (2019): 160-176.

Tao, Jianwen, and Wei Dai. "Discriminative multi-source adaptation multi-feature co-regression for visual classification." Neural Networks 114 (2019): 96-118.

Tao, Jianwen, et al. "Latent multi-feature co-regression for visual recognition by discriminatively leveraging multi-source models." Pattern Recognition 87 (2019): 296-316.

Bae, Gyujin, et al. "Dual-dissimilarity measure-based statistical video cut detection." Journal of Real-Time Image Processing 16.6 (2019): 1987-1997.

Ma, Mingyang, et al. "Robust video summarization using collaborative representation of adjacent frames." Multimedia Tools and Applications 78.20 (2019): 28985-29005.

Li, Yanping, et al. "Intraframe interpolation based on edge detection." Eleventh International Conference on Digital Image Processing (ICDIP 2019). Vol. 11179. International Society for Optics and Photonics, 2019.

Ji, Zhong, et al. "Query-aware sparse coding for web multi-video summarization." Information Sciences 478 (2019): 152-166.

Zhang, Yujia, et al. "Dilated temporal relational adversarial network for generic video summarization." Multimedia Tools and Applications 78.24 (2019): 35237-35261.

Bekhet, Saddam, and Amr Ahmed. "Video similarity detection using fixed-length statistical dominant colour profile (SDCP) signatures." Journal of Real-Time Image Processing (2019): 1-16.

-------------------------------------------------------------------
2018 (67)
-------------------------------------------------------------------
Anastasia Moumtzidou, Stelios Andreadis, Ilias Gialampoukidis, Anastasios Karakostas, Stefanos Vrochidis, and Ioannis Kompatsiaris. 2018. 
Flood Relevance Estimation from Visual and Textual Content in Social Media Streams. In Companion Proceedings of the The Web Conference 2018 (WWW ’18). 
International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 1621–1627. DOI:https://doi.org/10.1145/3184558.3191620

Andrea Ceroni, Chenyang Ma, and Ralph Ewerth. 2018. Mining Exoticism from Visual Content with Fusion-based Deep Neural Networks. 
In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval (ICMR ’18). Association for Computing Machinery, New York, NY, USA, 37–45. DOI:https://doi.org/10.1145/3206025.3206044

Marcelino, Gon�alo Barreto Ferreira. A computational approach to the art of visual storytelling. Diss. 2018.

Liu, Long, Lechao Yang, and Bin Zhu. "Sparse feature space representation: A unified framework for semi-supervised and domain adaptation learning." 
Knowledge-Based Systems 156 (2018): 43-61.

Gornishka, Iva. "Interactive Search and Exploration in Social Multimedia Networks." (2018).

Singh, Raahat Devender, and Naveen Aggarwal. "Video content authentication techniques: a comprehensive survey." Multimedia Systems 24.2 (2018): 211-240.

Tao, Jianwen, Di Zhou, and Bin Zhu. "Multi-source adaptation embedding with feature selection by exploiting correlation information." Knowledge-Based Systems 143 (2018): 208-224.

Abdulhussain, Sadiq H., et al. "Methods and challenges in shot boundary detection: a review." Entropy 20.4 (2018): 214.

Ye, Guangnan. "Large-Scale Video Event Detection Using Deep Neural Networks." Applied Cloud Deep Semantic Recognition. Auerbach Publications, 2018. 1-23

Gong, Maoguo, et al. "Decomposition-Based Evolutionary Multiobjective Optimization to Self-Paced Learning." 
IEEE Transactions on Evolutionary Computation 23.2 (2018): 288-302

Mahapatra, Debabrata, Ragunathan Mariappan, and Vaibhav Rajan. "Automatic Hierarchical Table of Contents Generation for Educational Videos." 
Companion Proceedings of the The Web Conference 2018. International World Wide Web Conferences Steering Committee, 2018.

Cirne, Marcos Vinicius Mussel, and Helio Pedrini. "VISCOM: A robust video summarization approach using color co-occurrence matrices." 
Multimedia Tools and Applications 77.1 (2018): 857-875.

Zhao, Zhicheng, et al. "A unified framework with a benchmark dataset for surveillance event detection." Neurocomputing 278 (2018): 62-74.

Bekhet, Saddam, and Amr Ahmed. "An integrated signature-based framework for efficient visual similarity detection and measurement in video shots." 
ACM Transactions on Information Systems (TOIS) 36.4 (2018): 37.

Ji, Hyesung, et al. "A semantic-based video scene segmentation using a deep neural network." Journal of Information Science (2018): 0165551518819964.

Yao, Li, and Ying Qian. "Dt-3dresnet-lstm: An architecture for temporal activity recognition in videos." Pacific Rim Conference on Multimedia. Springer, Cham, 2018.

Liu, Junqi, et al. "Discriminative self-adapted locality-sensitive sparse representation for video semantic analysis." 
Multimedia Tools and Applications 77.21 (2018): 29143-29162.

Rouhi, Amir H., and James A. Thom. "Encoder settings impact on intra-prediction-based descriptors for video retrieval." 
Journal of Visual Communication and Image Representation 50 (2018): 263-269.

Loko�, Jakub, Tom�š Sou�ek, and Gregor Koval��k. "Using an interactive video retrieval tool for lifelog data." 
Proceedings of the 2018 ACM Workshop on The Lifelog Search Challenge. ACM, 2018.

Benuwa, Ben-Bright, et al. "Sparsity Based Locality-Sensitive Discriminative Dictionary Learning for Video Semantic Analysis." 
Mathematical Problems in Engineering 2018 (2018).

Wu, Lifang, et al. "Shot Boundary Detection with Spatial-Temporal Convolutional Neural Networks." 
Chinese Conference on Pattern Recognition and Computer Vision (PRCV). Springer, Cham, 2018.

Xie, Wenlong, et al. "Event patches: Mining effective parts for event detection and understanding." Signal Processing 149 (2018): 82-87

Tang, Shitao, et al. "Fast Video Shot Transition Localization with Deep Structured Models." Asian Conference on Computer Vision. Springer, Cham, 2018.

Kletz, Sabrina, Andreas Leibetseder, and Klaus Schoeffmann. "Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown." 
International Conference on Multimedia Modeling. Springer, Cham, 2018.

Rouhi, A. "Near-duplicate video similarity detection in H. 264/AVC compressed domain." (2018)

Hirakawa, Koji, et al. "Ad-hoc Video Search Improved by the Word Sense Filtering of Query Terms." Asia Information Retrieval Symposium. Springer, Cham, 2018.

Sa, Qila, and Zhihui Wang. "Automatic video shot boundary detection using k-means clustering and improved adaptive dual threshold comparison." 
MIPPR 2017: Remote Sensing Image Processing, Geographic Information Systems, and Other Applications. Vol. 10611. International Society for Optics and Photonics, 2018.

Graham, Y., Awad, G., & Smeaton, A. (2018). Evaluation of automatic video captioning using direct assessment. PloS one, 13(9), e0202789.

Dong, Jianfeng, et al. "Dual dense encoding for zero-example video retrieval." arXiv preprint arXiv:1809.06181 (2018).

Jiang, D., & Kim, J. (2018). Video Searching and Fingerprint Detection by Using the Image Query and PlaceNet-Based 
Shot Boundary Detection Method. Applied Sciences, 8(10), 1735.

Leibetseder, Andreas, Sabrina Kletz, and Klaus Schoeffmann. "Sketch-based similarity search for collaborative feature maps." 
International Conference on Multimedia Modeling. Springer, Cham, 2018.

Girbau, A., Hinami, R., & Satoh, S. I. (2018, April). Tracked Instance Search. In 2018 IEEE International Conference on 
Acoustics, Speech and Signal Processing (ICASSP) (pp. 1663-1667). IEEE.
      
Liao, Kaiyang, et al. "IR Feature Embedded BOF Indexing Method for Near-Duplicate Video Retrieval."
IEEE Transactions on Circuits and Systems for Video Technology (2018).
      
Vagliano, Iacopo, et al. "Open Innovation in the Big Data Era With the MOVING Platform." IEEE MultiMedia 25.3 (2018): 8-21.
      
Xie, Wenlong, et al. "Discovering Latent Discriminative Patterns for Multi-Mode Event Representation." IEEE Transactions on Multimedia (2018).
      
Thomas, Sinnu Susan, Sumana Gupta, and Venkatesh K. Subramanian. "Context Driven Optimized Perceptual Video Summarization and Retrieval."
IEEE Transactions on Circuits and Systems for Video Technology (2018).
      
Gao, Zhanning, et al. "Video Imprint." IEEE transactions on pattern analysis and machine intelligence (2018).
      
Huang, Shao, et al. "Egocentric Temporal Action Proposals." IEEE Transactions on Image Processing 27.2 (2018): 764-777.
     
Lei, Jie, et al. "Action Parsing Driven Video Summarization Based on Reinforcement Learning."
IEEE Transactions on Circuits and Systems for Video Technology (2018).
      
Schoeffmann, Klaus, et al. "How Experts Search Different than Novices–An Evaluation of
the Divexplore Video Retrieval System at Video Browser Showdown 2018." 2018 IEEE International
Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 2018.
      
Zhang, Hao, and Chong-Wah Ngo. "A Fine Granularity Object-level Representation for Event
Detection and Recounting." IEEE Transactions on Multimedia (2018).
      
Chen, Zhixiang, et al. "Nonlinear structural hashing for scalable video search."
IEEE Transactions on Circuits and Systems for Video Technology 28.6 (2018): 1421-1433.
      
Lan, Shuyue, et al. "FFNet: Video Fast-Forwarding via Reinforcement Learning."
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.

Gialampoukidis, Ilias, et al. "Fusion of Compound Queries with Multiple Modalities for Known Item Video Search."
2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP). IEEE, 2018.

Gao, Wenhui, et al. "MMH: Multi-Modal Hash for Instant Mobile Video Search."
2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 2018.
      
Dilawari, Aniqa, et al. "Natural language description of video streams using task-specific feature encoding."
IEEE Access 6 (2018): 16639-16645.
      
Zhicheng Zhao, Rui Xiang, and Fei Su. 2018. Complex event detection via attention-based video representation and classification.
Multimedia Tools Appl. 77, 3 (February 2018), 3209-3227. DOI: https://doi.org/10.1007/s11042-017-5058-2

Yassine Himeur and Karima Ait Sadi. 2018. Robust video copy detection based on ring decomposition based
binarized statistical image features and invariant color descriptor (RBSIF-ICD).
Multimedia Tools Appl. 77, 13 (July 2018), 17309-17331. DOI: https://doi.org/10.1007/s11042-017-5307-4
      
Huan Liu, Qinghua Zheng, Zhihui Li, Tao Qin, and Lei Zhu. 2018. An efficient multi-feature SVM solver for complex event detection.
Multimedia Tools Appl. 77, 3 (February 2018), 3509-3532. DOI: https://doi.org/10.1007/s11042-017-5166-z

Rashmi B S and Nagendraswamy H S. 2018. Effective Video Shot Boundary Detection and Keyframe Selection using Soft Computing Techniques.
Int. J. Comput. Vis. Image Process. 8, 2 (April 2018), 27-48. DOI: https://doi.org/10.4018/IJCVIP.2018040102
      
Jaydeb Mondal, Malay Kumar Kundu, Sudeb Das, and Manish Chowdhury. 2018. Video shot boundary detection using multiscale
geometric analysis of nsct and least squares support vector machine. Multimedia Tools Appl. 77, 7 (April 2018), 8139-8161.
DOI: https://doi.org/10.1007/s11042-017-4707-9
      
Nitin J. Janwe and Kishor K. Bhoyar. 2018. Multi-label semantic concept detection in videos using fusion
of asymmetrically trained deep convolutional neural networks and foreground driven concept co-occurrence matrix.
Applied Intelligence 48, 8 (August 2018), 2047-2066. DOI: https://doi.org/10.1007/s10489-017-1033-x

Maaike H.T. de Boer. 2018. Semantic Mapping in Video Retrieval. SIGIR Forum 51, 3 (February 2018),
161-162. DOI: https://doi.org/10.1145/3190580.3190606
      
Tomokazu Murakami. 2018. Industrial Applications of Image Recognition and Retrieval Technologies
for Public Safety and IT Services. In Proceedings of the 2018 ACM on International Conference on
Multimedia Retrieval (ICMR '18). ACM, New York, NY, USA, 4-4. DOI: https://doi.org/10.1145/3206025.3210492
      
Klaus Schoeffmann, Werner Bailer, Cathal Gurrin, George Awad, and Jakub Lokoč. 2018.
Interactive Video Search: Where is the User in the Age of Deep Learning?. In Proceedings
of the 26th ACM international conference on Multimedia (MM '18). ACM, New York, NY, USA,
2101-2103. DOI: https://doi.org/10.1145/3240508.3241473

Nakamasa Inoue and Koichi Shinoda. 2018. Few-Shot Adaptation for Multimedia Semantic Indexing.
In Proceedings of the 26th ACM international conference on Multimedia (MM '18). ACM, New York,
NY, USA, 1110-1118. DOI: https://doi.org/10.1145/3240508.3240592

Ueki, Kazuya. "Latent Concept Extraction for Zero-Shot Video Retrieval."
2018 International Conference on Image and Vision Computing New Zealand (IVCNZ). IEEE, 2018.
      
Xu, Zijun, et al. "S2L: Single-Streamline For Complex Video Event Detection." 
2018 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). IEEE, 2018.

Ueki, Kazuya, et al. "Fine-grained Video Retrieval using Query Phrases—Waseda_Meisei TRECVID 2017 AVS System—." 
2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 2018.

Budnik, Mateusz, Mikail Demirdelen, and Guillaume Gravier. "A study on multimodal video hyperlinking with visual aggregation." 
2018 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2018.

Kar, Tejaswini, and Priyadarshi Kanungo. "Motion and illumination defiant cut detection based on Weber features." 
IET Image Processing (2018).

V. Vukotić, C. Raymond and G. Gravier, "A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking," 
in IEEE MultiMedia, vol. 25, no. 2, pp. 11-23, Apr.-Jun. 2018. doi: 10.1109/MMUL.2018.023121161
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8424826&isnumber=8424760

H. Li, J. Zhu, C. Ma, J. Zhang and C. Zong, "Read, Watch, Listen and Summarize: Multi-modal Summarization for Asynchronous Text, Image, Audio and Video," 
in IEEE Transactions on Knowledge and Data Engineering. doi: 10.1109/TKDE.2018.2848260
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8387512&isnumber=4358933

F. Markatopoulou, V. Mezaris and I. Patras, "Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation," 
in IEEE Transactions on Circuits and Systems for Video Technology. doi: 10.1109/TCSVT.2018.2848458
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8387768&isnumber=4358651

J. Lokoc, W. Bailer, K. Schoeffmann, B. Muenzer and G. Awad, "On influential trends in interactive video retrieval: Video Browser Showdown 2015-2017," 
in IEEE Transactions on Multimedia. doi: 10.1109/TMM.2018.2830110
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8352047&isnumber=4456689

J. Dong, X. Li and C. G. M. Snoek, "Predicting Visual Features from Text for Image and Video Caption Retrieval," 
in IEEE Transactions on Multimedia. doi: 10.1109/TMM.2018.2832602
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8353472&isnumber=4456689

R. Panda, S. K. Kuanar and A. S. Chowdhury, "Nyström Approximated Temporally Constrained Multisimilarity Spectral Clustering Approach for Movie Scene Detection," 
in IEEE Transactions on Cybernetics, vol. 48, no. 3, pp. 836-847, March 2018.
doi: 10.1109/TCYB.2017.2657692
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7845652&isnumber=8283862

--------------------------------------------------------------------
2017 (97)
--------------------------------------------------------------------
X. Luan, Y. Xie, Y. Guo, J. He, L. Zhang and X. Zhang, "A fast near-duplicate keyframe detection method based on local features," 
2017 IEEE 17th International Conference on Communication Technology (ICCT), Chengdu, China, 2017, pp. 1544-1547.
doi: 10.1109/ICCT.2017.8359890
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8359890&isnumber=8359466

M. Hmayda, R. Ejbali and M. Zaied, "Program Classification in a Stream TV Using Deep Learning," 
2017 18th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), Taipei, Taiwan, 2017, pp. 123-126.
doi: 10.1109/PDCAT.2017.00029
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8327077&isnumber=8326788

T. Kar and P. Kanungo, "Video shot boundary detection based on Hilbert and wavelet transform," 
2017 2nd International Conference on Man and Machine Interfacing (MAMI), Bhubaneswar, India, 2017, pp. 1-6.
doi: 10.1109/MAMI.2017.8307865
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8307865&isnumber=8307855

K. Zhou, Y. Zhu and Y. Zhao, "A spatio-temporal deep architecture for surveillance event detection based on ConvLSTM," 
2017 IEEE Visual Communications and Image Processing (VCIP), St. Petersburg, FL, USA, 2017, pp. 1-4.
doi: 10.1109/VCIP.2017.8305063
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8305063&isnumber=8305018

S. Keshavarz, I. Saleemi and G. Atia, "Exploiting probabilistic relationships between action concepts for complex event classification," 
2017 IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 1572-1576.
doi: 10.1109/ICIP.2017.8296546
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8296546&isnumber=8296222

Ke Wang, Jiayong Liu, and Daniel González. 2017. Domain transfer multi-instance dictionary learning. 
Neural Comput. Appl. 28, 1 (January 2017), 983-992. DOI: https://doi.org/10.1007/s00521-016-2406-5

Yilin Yan, Min Chen, Saad Sadiq, and Mei-Ling Shyu. 2017. Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters. 
Int. J. Multimed. Data Eng. Manag. 8, 1 (January 2017), 1-20. DOI: https://doi.org/10.4018/IJMDEM.2017010101

Jinlai Lv and Huiru Bai. 2017. Research on Shot Detection Algorithm of Self-adaptive Dual Thresholds Based on Multi-feature Fusion. 
In LNCS on Transactions on Edutainment XIII - Volume 10092, Zhigeng Pan, Adrian David Cheok, Wolfgang Müller, and Mingmin Zhang (Eds.), 
Vol. 10092. Springer-Verlag New York, Inc., New York, NY, USA, 247-261. DOI: https://doi.org/10.1007/978-3-662-54395-5_21

Stefanos Vrochidis, Ioannis Patras, and Ioannis Kompatsiaris. 2017. Gaze movement-driven random forests for query clustering in automatic video annotation. 
Multimedia Tools Appl. 76, 2 (January 2017), 2861-2889. DOI: https://doi.org/10.1007/s11042-015-3221-1

Hao Song, Xinxiao Wu, Wei Liang, and Yunde Jia. 2017. Recognizing key segments of videos for video annotation by learning from web image sets. 
Multimedia Tools Appl. 76, 5 (March 2017), 6111-6126. DOI: https://doi.org/10.1007/s11042-016-3253-1

Wei-Xin Li and Nuno Vasconcelos. 2017. Complex Activity Recognition Via Attribute Dynamics. 
Int. J. Comput. Vision 122, 2 (April 2017), 334-370. DOI: https://doi.org/10.1007/s11263-016-0918-1

Jiyun Fan, Shangbo Zhou, and Muhammad Abubakar Siddique. 2017. Fuzzy color distribution chart -based shot boundary detection. 
Multimedia Tools Appl. 76, 7 (April 2017), 10169-10190. DOI: https://doi.org/10.1007/s11042-016-3604-y

Muhammad Usman Khan and Yoshihiko Gotoh. 2017. Generating natural language tags for video information management. 
Mach. Vision Appl. 28, 3-4 (May 2017), 243-265. DOI: https://doi.org/10.1007/s00138-017-0825-7

Mateusz Budnik, Efrain-Leonardo Gutierrez-Gomez, Bahjat Safadi, Denis Pellerin, and Georges Quénot. 2017. Learned features versus engineered features 
for multimedia indexing. Multimedia Tools Appl. 76, 9 (May 2017), 11941-11958. 
DOI: https://doi.org/10.1007/s11042-016-4240-2 

Peng Wang, Lifeng Sun, Shiqiang Yang, and Alan F. Smeaton. 2017. Training-free indexing refinement for visual media via multi-semantics. 
Neurocomput. 236, C (May 2017), 39-47. DOI: https://doi.org/10.1016/j.neucom.2016.08.107

Zhenxing Zhang, Rami Albatal, Cathal Gurrin, and Alan F. Smeaton. 2017. Enhancing instance search with weak geometric correlation consistency. 
Neurocomput. 236, C (May 2017), 164-172. DOI: https://doi.org/10.1016/j.neucom.2016.09.104

Petra Galuš�áková, Michal Batko, Jan Čech, Jiří Matas, David Novák, and Pavel Pecina. 2017. Visual Descriptors in Methods for Video Hyperlinking. 
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 294-300. 
DOI: https://doi.org/10.1145/3078971.3079026

Damianos Galanopoulos, Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2017. Concept Language Models and Event-based Concept Number Selection 
for Zero-example Event Detection. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). 
ACM, New York, NY, USA, 397-401. DOI: https://doi.org/10.1145/3078971.3079043 

Chrysa Collyda, Evlampios Apostolidis, Alexandros Pournaras, Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2017. 
VideoAnalysis4ALL: An On-line Tool for the Automatic Fragmentation and Concept-based Annotation, and the Interactive Exploration 
of Videos. In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 470-474. 
DOI: https://doi.org/10.1145/3078971.3079015

Junwei Liang, Lu Jiang, Deyu Meng, and Alexander Hauptmann. 2017. Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in Noisy Web Data. 
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 32-40. 
DOI: https://doi.org/10.1145/3078971.3079003 

Foteini Markatopoulou, Damianos Galanopoulos, Vasileios Mezaris, and Ioannis Patras. 2017. Query and Keyframe Representations for Ad-hoc Video Search. 
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 407-411. 
DOI: https://doi.org/10.1145/3078971.3079041 

Omar Seddati, Stéphane Dupont, and Saïd Mahmoudi. 2017. Quadruplet Networks for Sketch-Based Image Retrieval. In Proceedings of the 2017 ACM on 
International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 184-191. DOI: https://doi.org/10.1145/3078971.3078985

Zhi-Qi Cheng, Hao Zhang, Xiao Wu, and Chong-Wah Ngo. 2017. On the Selection of Anchors and Targets for Video Hyperlinking. 
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 287-293. 
DOI: https://doi.org/10.1145/3078971.3079025

Luca Rossetto, Ivan Giangreco, Claudiu Tănase, and Heiko Schuldt. 2017. Multimodal Video Retrieval with the 2017 IMOTION System. 
In Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval (ICMR '17). ACM, New York, NY, USA, 457-460. 
DOI: https://doi.org/10.1145/3078971.3079012

Werner Bailer. 2017. Efficient Approximate Medoids of Temporal Sequences. In Proceedings of the 15th International Workshop on Content-Based 
Multimedia Indexing (CBMI '17). ACM, New York, NY, USA, Article 3, 6 pages. DOI: https://doi.org/10.1145/3095713.3095717

Xun Xu, Timothy Hospedales, and Shaogang Gong. 2017. Transductive Zero-Shot Action Recognition by Word-Vector Embedding. 
Int. J. Comput. Vision 123, 3 (July 2017), 309-333. DOI: https://doi.org/10.1007/s11263-016-0983-5

Tiziano Portenier, Qiyang Hu, Paolo Favaro, and Matthias Zwicker. 2017. SmartSketcher: sketch-based image retrieval with dynamic semantic re-ranking. 
In Proceedings of the Symposium on Sketch-Based Interfaces and Modeling (SBIM '17), Stephen N. Spencer (Ed.). 
ACM, New York, NY, USA, Article 1, 12 pages. DOI: https://doi.org/10.1145/3092907.3092910

Bendraou Youssef, Essannouni Fedwa, Aboutajdine Driss, and Salam Ahmed. 2017. Shot boundary detection via adaptive low rank and svd-updating. 
Comput. Vis. Image Underst. 161, C (August 2017), 20-28. DOI: https://doi.org/10.1016/j.cviu.2017.06.003

Huan Liu, Qinghua Zheng, Minnan Luo, Dingwen Zhang, Xiaojun Chang, and Cheng Deng. 2017. How unlabeled web videos help complex event detection?. 
In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Carles Sierra (Ed.). AAAI Press 4040-4046

Jia He, Changying Du, Changde Du, Fuzhen Zhuang, Qing He, and Guoping Long. 2017. Nonlinear maximum margin multi-view learning with adaptive kernel. 
In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI'17), Carles Sierra (Ed.). AAAI Press 1830-1836

Jingya Wang, Xiatian Zhu, and Shaogang Gong. 2017. Discovering visual concept structure with sparse and incomplete tags. 
Artif. Intell. 250, C (September 2017), 16-36. DOI: https://doi.org/10.1016/j.artint.2017.05.002 

Linchao Zhu, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. 2017. Uncovering the Temporal Context for Video Question Answering. 
Int. J. Comput. Vision 124, 3 (September 2017), 409-421. DOI: https://doi.org/10.1007/s11263-017-1033-7 

Jeonghwan Gwak. 2017. Multi-object tracking through learning relational appearance features and motion patterns. 
Comput. Vis. Image Underst. 162, C (September 2017), 103-115. DOI: https://doi.org/10.1016/j.cviu.2017.05.010

Maaike H. T. De Boer, Yi-Jie Lu, Hao Zhang, Klamer Schutte, Chong-Wah Ngo, and Wessel Kraaij. 2017. Semantic Reasoning in Zero Example Video Event Retrieval. 
ACM Trans. Multimedia Comput. Commun. Appl. 13, 4, Article 60 (October 2017), 17 pages. 
DOI: https://doi.org/10.1145/3131288

Ke Xia, Yuqing Ma, Xianglong Liu, Yadong Mu, and Li Liu. 2017. Temporal Binary Coding for Large-Scale Video Search. 
In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 333-341. 
DOI: https://doi.org/10.1145/3123266.3123273

Jianfeng Dong. 2017. Cross-media Relevance Computation for Multimedia Retrieval. In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). 
ACM, New York, NY, USA, 831-835. DOI: https://doi.org/10.1145/3123266.3123963 

Spencer Cappallo and Cees G.M. Snoek. 2017. Future-Supervised Retrieval of Unseen Queries for Live Video. In Proceedings of the 2017 ACM on 
Multimedia Conference (MM '17). ACM, New York, NY, USA, 28-36. DOI: https://doi.org/10.1145/3123266.3123437

Jiamei Lan, Jun Chen, Zheng Wang, Chao Liang, and Shin'ichi Satoh. 2017. P-S Instance Retrieval via Early Elimination and Late Expansion. 
In Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities (VSCC '17). ACM, New York, NY, USA, 41-49. 
DOI: https://doi.org/10.1145/3132734.3136609

Qin Jin, Shizhe Chen, Jia Chen, and Alexander Hauptmann. 2017. Knowing Yourself: Improving Video Caption via In-depth Recap. 
In Proceedings of the 2017 ACM on Multimedia Conference (MM '17). ACM, New York, NY, USA, 1906-1911. 
DOI: https://doi.org/10.1145/3123266.3127901

Nikolaos Gkalelis and Vasileios Mezaris. 2017. Incremental Accelerated Kernel Discriminant Analysis. In Proceedings of the 2017 ACM on 
Multimedia Conference (MM '17). ACM, New York, NY, USA, 1575-1583. DOI: https://doi.org/10.1145/3123266.3123401

Stevan Rudinac, Iva Gornishka, and Marcel Worring. 2017. Multimodal Classification of Violent Online Political Extremism Content with 
Graph Convolutional Networks. In Proceedings of the on Thematic Workshops of ACM Multimedia 2017 (Thematic Workshops '17). 
ACM, New York, NY, USA, 245-252. DOI: https://doi.org/10.1145/3126686.3126776

Zobeida Jezabel Guzman-Zavaleta, Claudia Feregrino-Uribe, Miguel Morales-Sandoval, and Alejandra Menendez-Ortiz. 2017. 
A robust and low-cost video fingerprint extraction method for copy detection. Multimedia Tools Appl. 76, 22 (November 2017), 24143-24163. 
DOI: https://doi.org/10.1007/s11042-016-4168-6

Ilias Gialampoukidis, Anastasia Moumtzidou, Dimitris Liparas, Theodora Tsikrika, Stefanos Vrochidis, and Ioannis Kompatsiaris. 2017. 
Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression. 
Multimedia Tools Appl. 76, 21 (November 2017), 22383-22403. DOI: https://doi.org/10.1007/s11042-017-4797-4

Maaike Boer, Geert Pingen, Douwe Knook, Klamer Schutte, and Wessel Kraaij. 2017. Improving video event retrieval by user feedback. 
Multimedia Tools Appl. 76, 21 (November 2017), 22361-22381. 
DOI: https://doi.org/10.1007/s11042-017-4798-3

Hao Liu, Qingjie Zhao, Hao Wang, Peng Lv, and Yanming Chen. 2017. An image-based near-duplicate video retrieval and localization 
using improved Edit distance. Multimedia Tools Appl. 76, 22 (November 2017), 24435-24456. 
DOI: https://doi.org/10.1007/s11042-016-4176-6

A. Kar, P. Mavin, Y. Ghaturle and V. M., "What Makes a Video Memorable?," 2017 IEEE International Conference on 
Data Science and Advanced Analytics (DSAA), Tokyo, Japan, 2017, pp. 373-381.
doi: 10.1109/DSAA.2017.37
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8259797&isnumber=8259747

S. Shekhar, D. Singal, H. Singh, M. Kedia and A. Shetty, "Show and Recall: Learning What Makes Videos Memorable," 
2017 IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, Italy, 2017, pp. 2730-2739.
doi: 10.1109/ICCVW.2017.321
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8265533&isnumber=8265191

W. Liu and H. Ma, "Hybrid Semantic Concept Temporal Pooling for Large-Scale Video Event Analysis," 
in Chinese Journal of Electronics, vol. 26, no. 6, pp. 1125-1131, 11 2017.
doi: 10.1049/cje.2017.09.010
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8128889&isnumber=8128699

P. Goyal, Z. Hu, X. Liang, C. Wang, E. P. Xing and C. Mellon, "Nonparametric Variational Auto-Encoders for Hierarchical 
Representation Learning," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 5104-5112.
doi: 10.1109/ICCV.2017.545
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237807&isnumber=8237262

A. Sasithradevi, S. M. M. Roomi and G. Maragatham, "Content based video retrieval via object based approach," 
TENCON 2017 - 2017 IEEE Region 10 Conference, Penang, Malaysia, 2017, pp. 781-787.
doi: 10.1109/TENCON.2017.8227965
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8227965&isnumber=8227816

H. T. Shen, C. Li, J. Cao, Z. Huang and L. Zhu, "Leveraging Weak Semantic Relevance for Complex Video Event Classification," 
2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 3667-3676.
doi: 10.1109/ICCV.2017.394
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237656&isnumber=8237262

R. Panda, A. Das, Z. Wu, J. Ernst and A. K. Roy-Chowdhury, "Weakly Supervised Summarization of Web Videos," 2017 IEEE International 
Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 3677-3686.
doi: 10.1109/ICCV.2017.395
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8237657&isnumber=8237262

J. C. SanMiguel and A. Cavallaro, "Energy Consumption Models for Smart Camera Networks," in IEEE Transactions on 
Circuits and Systems for Video Technology, vol. 27, no. 12, pp. 2661-2674, Dec. 2017.
doi: 10.1109/TCSVT.2016.2593598
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7517353&isnumber=8186326

M. Liu, C. Xu, Y. Luo, C. Xu, Y. Wen and D. Tao, "Cost-Sensitive Feature Selection by Optimizing F-measures," 
in IEEE Transactions on Image Processing, vol. PP, no. 99, pp. 1-1.
doi: 10.1109/TIP.2017.2781298
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8170306&isnumber=4358840

A. C. S. e Santos and H. Pedrini, "Shot boundary detection for video temporal segmentation based on the weber local descriptor," 
2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, Canada, 2017, pp. 1310-1315.
doi: 10.1109/SMC.2017.8122794
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8122794&isnumber=8122565

Z. Gao et al., "ER3: A Unified Framework for Event Retrieval, Recognition and Recounting," 2017 IEEE Conference on Computer 
Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 2107-2116.
doi: 10.1109/CVPR.2017.227
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099710&isnumber=8099483

N. Hussein, E. Gavves and A. W. M. Smeulders, "Unified Embedding and Metric Learning for Zero-Exemplar Event Detection," 
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 2087-2096.
doi: 10.1109/CVPR.2017.225
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099708&isnumber=8099483

S. Huang, W. Wang, S. He and R. W. H. Lau, "Egocentric Temporal Action Proposals," 
in IEEE Transactions on Image Processing, vol. 27, no. 2, pp. 764-777, Feb. 2018.
doi: 10.1109/TIP.2017.2772904
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8105826&isnumber=8103362

L. Zhu, Z. Xu and Y. Yang, "Bidirectional Multirate Reconstruction for Temporal Modeling in Videos," 
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 1339-1348.
doi: 10.1109/CVPR.2017.147
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8099630&isnumber=8099483

J. Hou, X. Wu, Y. Sun and Y. Jia, "Content-Attention Representation by Factorized Action-Scene Network for Action Recognition," 
in IEEE Transactions on Multimedia, vol. PP, no. 99, pp. 1-1.
doi: 10.1109/TMM.2017.2771462
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8101020&isnumber=4456689

C. Tzelepis, V. Mezaris and I. Patras, "Linear Maximum Margin Classifier for Learning from Uncertain Data," 
in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PP, no. 99, pp. 1-1.
doi: 10.1109/TPAMI.2017.2772235
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8103808&isnumber=4359286

N. Chesneau, K. Alahari and C. Schmid, "Learning from Web Videos for Event Classification," 
in IEEE Transactions on Circuits and Systems for Video Technology, vol. PP, no. 99, pp. 1-1.
doi: 10.1109/TCSVT.2017.2764624
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8076905&isnumber=4358651

B. Selbes and M. Sert, "Multimodal vehicle type classification using convolutional neural network and statistical 
representations of MFCC," 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, 2017, pp. 1-6.
doi: 10.1109/AVSS.2017.8078514
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8078514&isnumber=8078458

S. S. Thomas, S. Gupta and V. K. Subramanian, "Smart surveillance based on video summarization," 
2017 IEEE Region 10 Symposium (TENSYMP), Cochin, 2017, pp. 1-5. doi: 10.1109/TENCONSpring.2017.8070003
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8070003&isnumber=8069969

H. Song, X. Wu, W. Yu and Y. Jia, "Extracting Key Segments of Videos for Event Detection by Learning from Web Sources," 
in IEEE Transactions on Multimedia, vol. PP, no. 99, pp. 1-1.
doi: 10.1109/TMM.2017.2763322
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8068288&isnumber=4456689

X. Nie, Weizhen Jing, Lin Yuan Ma, Chaoran Cui and Y. Yin, "Two-layer video fingerprinting strategy for near-duplicate video detection," 
2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), Hong Kong, 2017, pp. 555-560.
doi: 10.1109/ICMEW.2017.8026322
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8026322&isnumber=8026209

A. Habibian, T. Mensink and C. G. M. Snoek, "Video2vec Embeddings Recognize Events When Examples Are Scarce," 
in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 10, pp. 2089-2103, Oct. 1 2017.
doi: 10.1109/TPAMI.2016.2627563
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7740886&isnumber=8024097

N. Putpuek, N. Cooharojananone and S. Satoh, "A modification of retake detection using simple signature and LCS algorithm," 
2017 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed 
Computing (SNPD), Kanazawa, 2017, pp. 257-261. doi: 10.1109/SNPD.2017.8022730
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8022730&isnumber=8022642

E. Boyaci and M. Sert, "Video classification based on ConvNet collaboration and feature selection," 2017 
25th Signal Processing and Communications Applications Conference (SIU), Antalya, 2017, pp. 1-4.
doi: 10.1109/SIU.2017.7960515
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7960515&isnumber=7960135

L. Yu; Z. Huang; F. Shen; J. Song; H. T. Shen; X. Zhou, "Bilinear Optimized Product Quantization for Scalable Visual Content Analysis," 
in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2017.2722224
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7964737&isnumber=4358840

B. C. Chen et al., "Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation," 
in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1595-1603, July 2017.
doi: 10.1109/TCSVT.2016.2538520
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7426412&isnumber=7963904

B. Selbes and M. Sert, "Multimodal video concept classification based on convolutional neural network and audio feature combination," 2017 
25th Signal Processing and Communications Applications Conference (SIU), Antalya, 2017, pp. 1-4.
doi: 10.1109/SIU.2017.7960723
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7960723&isnumber=7960135

O. Khalid, J. C. SanMiguel and A. Cavallaro, "Multi-Tracker Partition Fusion," 
in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 7, pp. 1527-1539, July 2017.
doi: 10.1109/TCSVT.2016.2542699
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7434028&isnumber=7963904

D. Francis, P. Pidou, B. Merialdo and B. Huet, "Natural Language Access to Video Databases," 
2017 IEEE Third International Conference on Multimedia Big Data (BigMM), Laguna Hills, CA, USA, 2017, pp. 78-81.
doi: 10.1109/BigMM.2017.34
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7966721&isnumber=7966694

W. Lu et al., "Unsupervised Sequential Outlier Detection With Deep Architectures," 
in IEEE Transactions on Image Processing, vol. 26, no. 9, pp. 4321-4330, Sept. 2017.
doi: 10.1109/TIP.2017.2713048
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7942034&isnumber=7956620

S. Tippaya, S. Sitjongsataporn, T. Tan, M. M. Khan and K. Chamnongthai, "Multi-Modal Visual Features-Based Video Shot Boundary Detection," 
in IEEE Access, vol. 5, no. , pp. 12563-12575, 2017. doi: 10.1109/ACCESS.2017.2717998
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7954599&isnumber=7859429

H. Li; Y. Huang; Z. Zhang, "An Improved Faster R-CNN for Same Object Retrieval," in IEEE Access , vol.PP, no.99, pp.1-1
doi: 10.1109/ACCESS.2017.2729943
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7986979&isnumber=6514899

Z. Ma; X. Chang; Z. Xu; N. Sebe; A. G. Hauptmann, "Joint Attributes and Event Analysis for Multimedia Event Detection," 
in IEEE Transactions on Neural Networks and Learning Systems , vol.PP, no.99, pp.1-10
doi: 10.1109/TNNLS.2017.2709308
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7949100&isnumber=6104215

J. Liang, L. Jiang and A. Hauptmann, "Temporal localization of audio events for conflict monitoring in social media," 
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 1597-1601.
doi: 10.1109/ICASSP.2017.7952426
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952426&isnumber=7951776

S. Tippaya; S. Sitjongsataporn; T. Tan; M. M. Khan; K. Chamnongthai, "Multi-modal Visual Features Based Video Shot 
Boundary Detection," in IEEE Access , vol.PP, no.99, pp.1-1. doi: 10.1109/ACCESS.2017.2717998
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7954599&isnumber=6514899

C. Ouali, P. Dumouchel and V. Gupta, "Robust video fingerprints using positions of salient regions," 
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 3041-3045.
doi: 10.1109/ICASSP.2017.7952715
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952715&isnumber=7951776

X. Han, B. Singh, V. I. Morariu and L. S. Davis, "VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products," 
in IEEE Transactions on Multimedia, vol. 19, no. 7, pp. 1583-1595, July 2017.
doi: 10.1109/TMM.2017.2671414
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858779&isnumber=7949123

Z. Ma, X. Chang, Y. Yang, N. Sebe and A. G. Hauptmann, "The Many Shades of Negativity," 
in IEEE Transactions on Multimedia, vol. 19, no. 7, pp. 1558-1568, July 2017.
doi: 10.1109/TMM.2017.2659221
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835107&isnumber=7949123

Y. N. Li and X. P. Chen, "Robust and compact video descriptor learned by deep neural network," 
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 2017, pp. 2162-2166.
doi: 10.1109/ICASSP.2017.7952539
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7952539&isnumber=7951776

W. Lu; Y. Cheng; C. Xiao; S. Chang; S. Huang; B. Liang; T. Huang, "Unsupervised Sequential Outlier Detection with Deep Architectures," 
in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2017.2713048
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7942034&isnumber=4358840

Y. Wang, W. Zhang, L. Wu, X. Lin and X. Zhao, "Unsupervised Metric Fusion Over Multiview Data by Graph Random Walk-Based 
Cross-View Diffusion," in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 1, pp. 57-70, Jan. 2017.
doi: 10.1109/TNNLS.2015.2498149
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7348699&isnumber=7797565

X. Chang, Z. Ma, Y. Yang, Z. Zeng and A. G. Hauptmann, "Bi-Level Semantic Representation Analysis for Multimedia Event Detection," 
in IEEE Transactions on Cybernetics, vol. 47, no. 5, pp. 1180-1197, May 2017.
doi: 10.1109/TCYB.2016.2539546
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7442559&isnumber=7898877

X. S. Wei, J. Wu and Z. H. Zhou, "Scalable Algorithms for Multi-Instance Learning," 
in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 4, pp. 975-987, April 2017.
doi: 10.1109/TNNLS.2016.2519102
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7398097&isnumber=7879455

X. Nie, Y. Yin, J. Sun, J. Liu and C. Cui, "Comprehensive Feature-Based Robust Video Fingerprinting Using Tensor Model," 
in IEEE Transactions on Multimedia, vol. 19, no. 4, pp. 785-796, April 2017.
doi: 10.1109/TMM.2016.2629758
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7745950&isnumber=7879458

K. Li; S. Li; S. Oh; Y. Fu, "Videography based Unconstrained Video Analysis," 
in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2017.2678800
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7872416&isnumber=4358840

Y. Xian, X. Rong, X. Yang and Y. Tian, "Evaluation of Low-Level Features for Real-World Surveillance Event Detection," 
in IEEE Transactions on Circuits and Systems for Video Technology, vol. 27, no. 3, pp. 624-634, March 2017.
doi: 10.1109/TCSVT.2016.2589838
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7514916&isnumber=7870721

X. Han; B. Singh; V. Morariu; L. S. Davis, "VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products," 
in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1
doi: 10.1109/TMM.2017.2671414
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858779&isnumber=4456689

C. Li; Z. Huang; Y. Yang; J. Cao; X. Sun; H. T. Shen, "Hierarchical Latent Concept Discovery for Video Event Detection," 
in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2017.2670782
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7858791&isnumber=4358840

Z. Ma; X. Chang; Y. Yang; N. Sebe; A. Hauptmann, "The Many Shades of Negativity," 
in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1
doi: 10.1109/TMM.2017.2659221
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835107&isnumber=4456689

D. Zhang; J. Han; L. Jiang; S. Ye; X. Chang, "Revealing Event Saliency in Unconstrained Video Collection," 
in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2017.2658957
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7835130&isnumber=4358840

C. L. Chou, H. T. Chen and S. Y. Lee, "Multimodal Video-to-Near-Scene Annotation," 
in IEEE Transactions on Multimedia, vol. 19, no. 2, pp. 354-366, Feb. 2017.
doi: 10.1109/TMM.2016.2614426
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7579212&isnumber=7820230

Y. Wang, W. Zhang, L. Wu, X. Lin and X. Zhao, "Unsupervised Metric Fusion Over Multiview Data by Graph Random Walk-Based Cross-View Diffusion," 
in IEEE Transactions on Neural Networks and Learning Systems, vol. 28, no. 1, pp. 57-70, Jan. 2017.
doi: 10.1109/TNNLS.2015.2498149
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7348699&isnumber=7797565


--------------------------------------------------------------------
2016 (119)
--------------------------------------------------------------------
Youxian Zheng and Yuan Zhang, "Abrupt shot boundary detection with combined features and SVM," 2016 2nd IEEE International Conference on 
Computer and Communications (ICCC), Chengdu, China, 2016, pp. 409-413. doi: 10.1109/CompComm.2016.7924733
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7924733&isnumber=7924647

C. Pingping, Y. Guan, X. Ding and Z. Yu, "Shot boundary detection with sparse presentation," 
2016 IEEE 13th International Conference on Signal Processing (ICSP), Chengdu, China, 2016, pp. 900-904.
doi: 10.1109/ICSP.2016.7877960
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7877960&isnumber=7877780

A. Dandashi, J. Aljaam and S. Foufou, "Audio-Visual Video Classification System Design: For Arabic News Domain," 
2016 International Conference on Computational Science and Computational Intelligence (CSCI), Las Vegas, NV, USA, 2016, pp. 745-751.
doi: 10.1109/CSCI.2016.0145
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7881438&isnumber=7881293

A. Mazaheri, B. Gong and M. Shah, "Learning a Multi-concept Video Retrieval Model with Multiple Latent Variables," 
2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 615-620.
doi: 10.1109/ISM.2016.0132
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823699&isnumber=7823367

N. Katayama, H. Mo and S. Satoh, "Unsupervised Estimation of Video Continuity Model from Large-Scale Video Archives 
and Its Application to Shot Boundary Detection," 2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 52-59.
doi: 10.1109/ISM.2016.0019
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823586&isnumber=7823367

J. Cao, L. Yu, M. Chen and X. Cui, "A Key Frame Selection Algorithm Based on Sliding Window and Image Features," 
2016 IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS), Wuhan, China, 2016, pp. 956-962.
doi: 10.1109/ICPADS.2016.0128
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823843&isnumber=7823715

Y. Yan and M. L. Shyu, "Enhancing Rare Class Mining in Multimedia Big Data by Concept Correlation," 
2016 IEEE International Symposium on Multimedia (ISM), San Jose, CA, USA, 2016, pp. 281-286.
doi: 10.1109/ISM.2016.0062
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7823629&isnumber=7823367

J. Xu, L. Song and R. Xie, "Shot boundary detection using convolutional neural networks," 2016 Visual Communications and Image Processing (VCIP), 
Chengdu, China, 2016, pp. 1-4. doi: 10.1109/VCIP.2016.7805554
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7805554&isnumber=7805413

D. O. Gorodnichy, D. Bissessar, E. Granger and R. Laganiére, "Recognizing People and Their Activities in Surveillance Video: 
Technology State of Readiness and Roadmap," 2016 13th Conference on Computer and Robot Vision (CRV), Victoria, BC, Canada, 2016, pp. 250-259.
doi: 10.1109/CRV.2016.43. http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7801529&isnumber=7801481

B. Miller and S. McCloskey, "Metric Feature Indexing for Interactive Multimedia Search," 2016 
13th Conference on Computer and Robot Vision (CRV), Victoria, BC, Canada, 2016, pp. 109-115.
doi: 10.1109/CRV.2016.22. http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7801510&isnumber=7801481

Dalton Meitei Thounaojam, Thongam Khelchandra, Kh. Manglem Singh, and Sudipta Roy. 2016. A Genetic Algorithm and Fuzzy Logic Approach 
for Video Shot Boundary Detection. Intell. Neuroscience 2016 (March 2016), 14-. DOI: http://dx.doi.org/10.1155/2016/8469428

JianWen Tao, Wenjun Hu, and Shiting Wen. 2016. Multi-source adaptation joint kernel sparse representation for visual classification. 
Neural Netw. 76, C (April 2016), 135-151. DOI: http://dx.doi.org/10.1016/j.neunet.2016.01.008

Yanan Liu, Xiaoqing Feng, and Zhiguang Zhou. 2016. Multimodal video classification with stacked contractive autoencoders. 
Signal Process. 120, C (March 2016), 761-766. DOI=http://dx.doi.org/10.1016/j.sigpro.2015.01.001

Mohammad A. Al-Jarrah and Faruq A. Al-Omari. 2016. Fast Video Shot Boundary Detection Technique based on Stochastic 
Model. Int. J. Comput. Vis. Image Process. 6, 2 (July 2016), 1-17. DOI: https://doi.org/10.4018/IJCVIP.2016070101

Christos Tzelepis, Damianos Galanopoulos, Vasileios Mezaris, and Ioannis Patras. 2016. 
Learning to detect video events from zero or very few video examples. Image Vision Comput. 53, C (September 2016), 35-44. 
DOI: https://doi.org/10.1016/j.imavis.2015.09.005

Sinnu Susan Thomas, Sumana Gupta, and Venkatesh K. Subramanian. 2016. Perceptual synoptic view of pixel, 
object and semantic based attributes of video. J. Vis. Comun. Image Represent. 38, C (July 2016), 367-377. 
DOI: http://dx.doi.org/10.1016/j.jvcir.2016.03.015 

Jiyin He, Pernilla Qvarfordt, Martin Halvey, and Gene Golovchinsky. 2016. Beyond actions. Inf. Process. 
Manage. 52, 6 (November 2016), 1200-1226. DOI: https://doi.org/10.1016/j.ipm.2016.05.007 

Lixia Hong, Qingyue Jin, Xusheng Li, and Yizhen Huang. 2016. Image and medical annotations using non-homogeneous 
2D ruler learning models. Comput. Electr. Eng. 50, C (February 2016), 102-110. 
DOI=http://dx.doi.org/10.1016/j.compeleceng.2016.01.011

Mohamed Elhoseiny, Jingen Liu, Hui Cheng, Harpreet Sawhney, and Ahmed Elgammal. 2016. Zero-shot Event Detection 
by multimodal distributional semantic embedding of videos. In Proceedings of the Thirtieth AAAI Conference on 
Artificial Intelligence (AAAI'16). AAAI Press 3478-3486. 

Chuang Gan, Ming Lin, Yi Yang, Gerard de Melo, and Alexander G. Hauptmann. 2016. Concepts not alone: 
exploring pairwise relationships for zero-shot video activity recognition. In Proceedings of the Thirtieth 
AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3487-3493.

Jingya Wang, Xiatian Zhu, and Shaogang Gong. 2016. Video semantic clustering with sparse and incomplete tags. 
In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). AAAI Press 3618-3624

Xiaojun Chang, Yi Yang, Guodong Long, Chengqi Zhang, and Alexander G. Hauptmann. 2016. Dynamic concept composition 
for zero-example event detection. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI'16). 
AAAI Press 3464-3470.

Diego Ortego, Juan C. SanMiguel, and José M. Martínez. 2016. Rejection based multipath reconstruction for background 
estimation in video sequences with stationary objects. Comput. Vis. Image Underst. 147, C (June 2016), 23-37. 
DOI=http://dx.doi.org/10.1016/j.cviu.2016.03.012

Ruben Fernandez-Beltran and Filiberto Pla. 2016. Latent topics-based relevance feedback for video retrieval. 
Pattern Recogn. 51, C (March 2016), 72-84. DOI=http://dx.doi.org/10.1016/j.patcog.2015.09.007 

JianWen Tao, Shiting Wen, and Wenjun Hu. 2016. Multi-source adaptation learning with global and local regularization 
by exploiting joint kernel sparse representation. Know.-Based Syst. 98, C (April 2016), 76-94. 
DOI: http://dx.doi.org/10.1016/j.knosys.2016.01.021

Yingying Zhu, Xiaoyan Huang, Qiang Huang, and Qi Tian. 2016. Large-scale video copy retrieval with 
temporal-concentration SIFT. Neurocomput. 187, C (April 2016), 83-91. DOI: http://dx.doi.org/10.1016/j.neucom.2015.09.114

Irfan Mehmood, Muhammad Sajjad, Seungmin Rho, and Sung Wook Baik. 2016. Divide-and-conquer based summarization 
framework for extracting affective video content. Neurocomput. 174, PA (January 2016), 393-403. 
DOI=http://dx.doi.org/10.1016/j.neucom.2015.05.126

Haojie Li, Lijuan Liu, Fuming Sun, Yu Bao, and Chenxin Liu. 2016. Multi-level feature representations 
for video semantic concept detection. Neurocomput. 172, C (January 2016), 64-70. 
DOI=http://dx.doi.org/10.1016/j.neucom.2014.09.096 

Lei Bao, Cao Juan, Jintao Li, and Yongdong Zhang. 2016. Boosted Near-miss Under-sampling on SVM 
ensembles for concept detection in large-scale imbalanced datasets. Neurocomput. 172, C (January 2016), 
198-206. DOI=http://dx.doi.org/10.1016/j.neucom.2014.05.096 

Haojie Li, Bin Liu, Lei Yi, Yue Guan, and Zhong-Xuan Luo. 2016. On the tag localization of web video. 
Multimedia Syst. 22, 4 (July 2016), 405-412. DOI: http://dx.doi.org/10.1007/s00530-014-0404-y

Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2016. Ordering of Visual Descriptors in a Classifier Cascade 
Towards Improved Video Concept Detection. In Proceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - 
Volume 9516 (MMM 2016), Springer-Verlag New York, Inc., New York, NY, USA, 874-885. DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_73 

Peng Wang, Lifeng Sun, Shiqang Yang, and Alan F. Smeaton. 2016. Towards Training-Free Refinement for Semantic Indexing 
of Visual Media. In Proceedings, Part I, of the 22nd International Conference on MultiMedia Modeling - Volume 9516 (MMM 2016), 
Springer-Verlag New York, Inc., New York, NY, USA, 251-263. DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_21

Christos Tzelepis, Vasileios Mezaris, and Ioannis Patras. 2016. Video Event Detection Using Kernel Support Vector 
Machine with Isotropic Gaussian Sample Uncertainty KSVM-iGSU. In Proceedings, Part I, of the 22nd International 
Conference on MultiMedia Modeling - Volume 9516 (MMM 2016), Qi Tian, Nicu Sebe, Guo-Jun Qi, Benoit Huet, Richang Hong, 
and Xueliang Liu (Eds.), Vol. 9516. Springer-Verlag New York, Inc., New York, NY, USA, 3-15. 
DOI: http://dx.doi.org/10.1007/978-3-319-27671-7_1

Xiao-Jun Chen, Yong-Zhao Zhan, Jia Ke, and Xiao-Bo Chen. 2016. Complex video event detection via pairwise fusion 
of trajectory and multi-label hypergraphs. Multimedia Tools Appl. 75, 22 (November 2016), 15079-15100.
DOI: http://dx.doi.org/10.1007/s11042-015-2514-8

Saddam Bekhet, Amr Ahmed, Amjad Altadmri, and Andrew Hunter. 2016. Compressed video matching: Frame-to-frame revisited. 
Multimedia Tools Appl. 75, 23 (December 2016), 15763-15778. DOI: https://doi.org/10.1007/s11042-015-2887-8 

Chuang Gan, Yi Yang, Linchao Zhu, Deli Zhao, and Yueting Zhuang. 2016. Recognizing an Action Using Its Name: 
A Knowledge-Based Approach. Int. J. Comput. Vision 120, 1 (October 2016), 61-77. 
DOI: http://dx.doi.org/10.1007/s11263-016-0893-6

Shyi-Chyi Cheng, Jui-Yuan Su, Kuei-Fang Hsiao, and Habib F. Rashvand. 2016. Latent semantic learning with time-series 
cross correlation analysis for video scene detection and classification. Multimedia Tools Appl. 75, 20 (October 2016), 
12919-12940. DOI: http://dx.doi.org/10.1007/s11042-015-2548-y 

Luis Herranz and Shuqiang Jiang. 2016. Scalable storyboards in handheld devices: applications and evaluation metrics. 
Multimedia Tools Appl. 75, 20 (October 2016), 12597-12625. DOI: http://dx.doi.org/10.1007/s11042-014-2421-4

Debabrata Dutta, Sanjoy Kumar Saha, and Bhabatosh Chanda. 2016. A shot detection technique using linear 
regression of shot transition pattern. Multimedia Tools Appl. 75, 1 (January 2016), 93-113.
DOI=http://dx.doi.org/10.1007/s11042-014-2273-y

Mohamed Zarka, Anis Ben Ammar, and Adel M. Alimi. 2016. Fuzzy reasoning framework to improve semantic 
video interpretation. Multimedia Tools Appl. 75, 10 (May 2016), 5719-5750. DOI=http://dx.doi.org/10.1007/s11042-015-2537-1

Gabriel Sargent, Karina R. Perez-Daniel, Andrei Stoian, Jenny Benois-Pineau, Sofian Maabout, Henri Nicolas, 
Mariko Nakano Miyatake, and Jean Carrive. 2016. A scalable summary generation method based on cross-modal 
consensus clustering and OLAP cube modeling. Multimedia Tools Appl. 75, 15 (August 2016), 9073-9094. 
DOI: http://dx.doi.org/10.1007/s11042-015-2863-3

Matthijs Douze, Jérôme Revaud, Jakob Verbeek, Hervé Jégou, and Cordelia Schmid. 2016. 
Circulant Temporal Encoding for Video Retrieval and Temporal Alignment. 
Int. J. Comput. Vision 119, 3 (September 2016), 291-306. DOI: http://dx.doi.org/10.1007/s11263-015-0875-0 

Heng Wang, Dan Oneata, Jakob Verbeek, and Cordelia Schmid. 2016. A Robust and Efficient Video Representation 
for Action Recognition. Int. J. Comput. Vision 119, 3 (September 2016), 219-238. 
DOI: http://dx.doi.org/10.1007/s11263-015-0846-5 

Tarek Zlitni, Bassem Bouaziz, and Walid Mahdi. 2016. Automatic topics segmentation for TV news video 
using prior knowledge. Multimedia Tools Appl. 75, 10 (May 2016), 5645-5672. 
DOI=http://dx.doi.org/10.1007/s11042-015-2531-7 

Abdelkader Hamadi, Philippe Mulhem, and Georges Quénot. 2016. A comparative study for multiple visual 
concepts detection in images and videos. Multimedia Tools Appl. 75, 15 (August 2016), 8973-8997. 
DOI: http://dx.doi.org/10.1007/s11042-015-2730-2 

Maaike Boer, Klamer Schutte, and Wessel Kraaij. 2016. Knowledge based query expansion in complex 
multimedia event detection. Multimedia Tools Appl. 75, 15 (August 2016), 9025-9043. 
DOI: http://dx.doi.org/10.1007/s11042-015-2757-4 

Chahid Ouali, Pierre Dumouchel, and Vishwa Gupta. 2016. A spectrogram-based audio fingerprinting system 
for content-based copy detection. Multimedia Tools Appl. 75, 15 (August 2016), 9145-9165. 
DOI: http://dx.doi.org/10.1007/s11042-015-3081-8 

Vinh-Tiep Nguyen, Minh-Triet Tran, Thanh Duc Ngo, Duy-Dinh Le, and Duc Anh Duong. 2016. 
Searching a specific person in a specific location using deep features. In Proceedings of 
the Seventh Symposium on Information and Communication Technology (SoICT '16). 
ACM, New York, NY, USA, 79-86. DOI: https://doi.org/10.1145/3011077.3011138 

S. Sadiq, Y. Yan, M. L. Shyu, S. C. Chen and H. Ishwaran, "Enhancing Multimedia Imbalanced Concept 
Detection Using VIMP in Random Forests," 2016 IEEE 17th International Conference on Information Reuse 
and Integration (IRI), Pittsburgh, PA, USA, 2016, pp. 601-608. doi: 10.1109/IRI.2016.87
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7785796&isnumber=7785148

A. Salvador, X. Giró-i-Nieto, F. Marqués and S. Satoh, "Faster R-CNN Features for Instance Search," 
2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Las Vegas, NV, USA, 2016, pp. 394-401.
doi: 10.1109/CVPRW.2016.56
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7789546&isnumber=7789490

X. Chang, Y. L. Yu, Y. Yang and E. P. Xing, "They are Not Equally Reliable: Semantic Event Search Using 
Differentiated Concept Classifiers," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 
Las Vegas, NV, USA, 2016, pp. 1884-1893. doi: 10.1109/CVPR.2016.208
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7780577&isnumber=7780329

C. Gan, T. Yao, K. Yang, Y. Yang and T. Mei, "You Lead, We Exceed: Labor-Free Video Concept Learning by 
Jointly Exploiting Web Videos and Images," 2016 IEEE Conference on Computer Vision and 
Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 923-932.
doi: 10.1109/CVPR.2016.106
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7780475&isnumber=7780329

K. Kikuchi, K. Ueki, T. Ogawa and T. Kobayashi, "Video semantic indexing using object detection-derived 
features," 2016 24th European Signal Processing Conference (EUSIPCO), Budapest, Hungary, 2016, pp. 1288-1292.
doi: 10.1109/EUSIPCO.2016.7760456
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7760456&isnumber=7760191

Y. Zheng and Y. Zhang, "GPU-accelerated abrupt shot boundary detection," 2016 16th International 
Symposium on Communications and Information Technologies (ISCIT), Qingdao, China, 2016, pp. 141-145.
doi: 10.1109/ISCIT.2016.7751609
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7751609&isnumber=7751579

Reinhard Sonnleitner and Gerhard Widmer. 2016. Robust quad-based audio fingerprinting. 
IEEE/ACM Trans. Audio, Speech and Lang. Proc. 24, 3 (March 2016), 409-421. 
DOI=http://dx.doi.org/10.1109/TASLP.2015.2509248

Anurag Kumar and Bhiksha Raj. 2016. Audio Event Detection using Weakly Labeled Data.  
In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). 
ACM, New York, NY, USA,  1038-1047. DOI: https://doi.org/10.1145/2964284.2964310

Vedran Vukotić, Christian Raymond, and Guillaume Gravier. 2016. 
Multimodal and Crossmodal Representation Learning from Textual and Visual 
Features with Bidirectional Deep Neural Networks for Video Hyperlinking.  
In Proceedings of the 2016 ACM workshop on Vision and Language Integration 
Meets Multimedia Fusion (iV&L-MM '16). 
ACM, New York, NY, USA,  37-44. DOI: https://doi.org/10.1145/2983563.2983567

Ilias Gialampoukidis, Anastasia Moumtzidou, Theodora Tsikrika, Stefanos Vrochidis, 
and Ioannis Kompatsiaris. 2016. Retrieval of Multimedia Objects by Fusing Multiple Modalities.  
In Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval (ICMR '16). 
ACM, New York, NY, USA,  359-362. DOI: http://dx.doi.org/10.1145/2911996.2912068

Xiaoshan Yang, Tianzhu Zhang, and Changsheng Xu. 2016. Semantic Feature Mining for 
Video Event Understanding. ACM Trans. Multimedia Comput. Commun. Appl. 12, 4, Article 55 
(August 2016), 22 pages. DOI: http://dx.doi.org/10.1145/2962719

Stavros Arestis-Chartampilas, Nikolaos Gkalelis, and Vasileios Mezaris. 2016. 
AKSDA-MSVM: A GPU-accelerated Multiclass Learning Framework for Multimedia.  
In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). 
ACM, New York, NY, USA,  461-465. DOI: https://doi.org/10.1145/2964284.2967263

Pascal Mettes. 2016. Weakly-Supervised Recognition, Localization, and Explanation 
of Visual Entities.  In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). 
ACM, New York, NY, USA,  1459-1463. DOI: https://doi.org/10.1145/2964284.2971479

K. Raghurama Holla and B. H. Shekar. 2016. Video Retrieval based on Patterns of 
Oriented Edge Magnitude.  In Proceedings of the Third International Symposium on 
Computer Vision and the Internet (VisionNet'16). ACM, New York, NY, USA,  115-120. 
DOI: http://dx.doi.org/10.1145/2983402.2983433

Lu Jiang. 2016. Web-scale Multimedia Search for Internet Video Content.  
In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining (WSDM '16). 
ACM, New York, NY, USA,  701-701. DOI: http://dx.doi.org/10.1145/2835776.2855081

Jiang, L., 2016, April. Web-scale multimedia search for internet video content. 
In Proceedings of the 25th International Conference Companion on World Wide Web (pp. 311-316). 
International World Wide Web Conferences Steering Committee.

Yi-Jie Lu. 2016. Zero-Example Multimedia Event Detection and Recounting with 
Unsupervised Evidence Localization. In Proceedings of the 2016 ACM on Multimedia 
Conference (MM '16). ACM, New York, NY, USA,  1464-1468. 
DOI: https://doi.org/10.1145/2964284.2971480

Chahid Ouali, Pierre Dumouchel, and Vishwa Gupta. 2016. Fast audio fingerprinting system 
using GPU and a clustering-based technique. IEEE/ACM Trans. Audio, Speech and Lang. 
Proc. 24, 6 (June 2016), 1106-1118.

Yi-Jie Lu, Hao Zhang, Maaike de Boer, and Chong-Wah Ngo. 2016. Event Detection with Zero Example: 
Select the Right and Suppress the Wrong Concepts.  In Proceedings of the 2016 ACM on 
International Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA,  127-134. 
DOI: http://dx.doi.org/10.1145/2911996.2912015

Zhiyong Cheng, Xuanchong Li, Jialie Shen, and Alexander G. Hauptmann. 2016. 
Which Information Sources are More Effective and Reliable in Video Search.  
In Proceedings of the 39th International ACM SIGIR conference on Research and 
Development in Information Retrieval (SIGIR '16). ACM, New York, NY, USA, 1069-1072. 
DOI: http://dx.doi.org/10.1145/2911451.2914765

Yuancheng Ye, Xuejian Rong, Xiaodong Yang, and YIngli Tian. 2016. Region Trajectories 
for Video Semantic Concept Detection.  In Proceedings of the 2016 ACM on International 
Conference on Multimedia Retrieval (ICMR '16). ACM, New York, NY, USA,  255-259. 
DOI: http://dx.doi.org/10.1145/2911996.2912046

Foteini Markatopoulou, Vasileios Mezaris, and Ioannis Patras. 2016. Deep Multi-task 
Learning with Label Correlation Constraint for Video Concept Detection.  In Proceedings 
of the 2016 ACM on Multimedia Conference (MM '16). 
ACM, New York, NY, USA,  501-505. DOI: https://doi.org/10.1145/2964284.2967271

Eva Mohedano, Kevin McGuinness, Noel E. O'Connor, Amaia Salvador, Ferran Marques, 
and Xavier Giro-i-Nieto. 2016. Bags of Local Convolutional Features for Scalable 
Instance Search.  In Proceedings of the 2016 ACM on International Conference on Multimedia 
Retrieval (ICMR '16). ACM, New York, NY, USA,  327-331. DOI: http://dx.doi.org/10.1145/2911996.2912061

Pascal Mettes, Dennis C. Koelma, and Cees G.M. Snoek. 2016. The ImageNet Shuffle: 
Reorganized Pre-training for Video Event Detection.  In Proceedings of the 2016 ACM on 
International Conference on Multimedia Retrieval (ICMR '16). 
ACM, New York, NY, USA,  175-182. DOI: http://dx.doi.org/10.1145/2911996.2912036

B. S. Rashmi and H. S. Nagendraswamy. 2016. Abrupt Shot Detection in Video using 
Weighted Edge Information.  In Proceedings of the International Conference on 
Informatics and Analytics (ICIA-16). ACM, New York, NY, USA, , Article 69 , 5 pages. 
DOI: http://dx.doi.org/10.1145/2980258.2980406

Nakamasa Inoue and Koichi Shinoda. 2016. Adaptation of Word Vectors using Tree 
Structure for Visual Semantics.  In Proceedings of the 2016 ACM on Multimedia Conference (MM '16). 
ACM, New York, NY, USA,  277-281. DOI: https://doi.org/10.1145/2964284.2967226

A. Habibian; T. Mensink; C. G. M. Snoek, "VideoStory Embeddings Recognize Events when Examples are Scarce," 
in IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PP, no.99, pp.1-1
doi: 10.1109/TPAMI.2016.2627563
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7740886&isnumber=4359286

X. Nie; Y. Yin; J. Sun; J. Liu; C. Cui, "Comprehensive Feature-based Robust Video Fingerprinting Using 
Tensor Model," in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1
doi: 10.1109/TMM.2016.2629758
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7745950&isnumber=4456689

B. H. Shekar, K. P. Uma and K. R. Holla, "Shot boundary detection using correlation based spectral 
residual saliency map," 2016 International Conference on Advances in Computing, Communications and 
Informatics (ICACCI), Jaipur, India, 2016, pp. 2242-2247. doi: 10.1109/ICACCI.2016.7732385
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732385&isnumber=7732013

B. S. Rashmi and H. S. Nagendraswamy, "Video shot boundary detection using midrange local binary pattern," 
2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, 
India, 2016, pp. 201-206. doi: 10.1109/ICACCI.2016.7732047
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732047&isnumber=7732013

K. P. Uma, B. H. Shekar and K. R. Holla, "Video clip retrieval using local phase quantization," 
2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 
Jaipur, India, 2016, pp. 1522-1527. doi: 10.1109/ICACCI.2016.7732264
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7732264&isnumber=7732013

C. L. Chou; H. T. Chen; S. Y. Lee, "Multi-Modal Video-to-Near-Scene Annotation," in
IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1
doi: 10.1109/TMM.2016.2614426
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7579212&isnumber=4456689

M. Yazdi and M. Fani, "Shot boundary detection with effective prediction of transitions' positions and 
spans by use of classifiers and adaptive thresholds," 2016 24th Iranian Conference on Electrical 
Engineering (ICEE), Shiraz, Iran, 2016, pp. 167-172. doi: 10.1109/IranianCEE.2016.7585511
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7585511&isnumber=7585374

Z. Li, X. Liu and S. Zhang, "Shot Boundary Detection based on Multilevel Difference of Colour Histograms," 
2016 First International Conference on Multimedia and Image Processing (ICMIP), Bandar Seri Begawan, 
Brunei Darussalam, 2016, pp. 15-22. doi: 10.1109/ICMIP.2016.24
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7573060&isnumber=7573029

X. Chang; Y. L. Yu; Y. Yang; E. P. Xing, "Semantic Pooling for Complex Event Analysis in Untrimmed Videos," 
in IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.PP, no.99, pp.1-1
doi: 10.1109/TPAMI.2016.2608901
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7565615&isnumber=4359286

M. Sang, Z. Sun and K. Jia, "Semantic Similarity Based Video Reranking," 2015 International Conference on 
Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 2015, pp. 1420-1423.
doi: 10.1109/CICN.2015.274
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7546332&isnumber=7546033

F. Markatopoulou, V. Mezaris and I. Patras, "Online multi-task learning for semantic concept detection 
in video," 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 2016, pp. 186-190.
doi: 10.1109/ICIP.2016.7532344
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7532344&isnumber=7532277

T. Sato, M. Iwamura, K. Kaneda and K. Kise, "Fast and Memory Saving Instance Search with Approximate 
Reverse Nearest Neighbor Search Using Reverse Lookup," 2016 IEEE Second International Conference on 
Multimedia Big Data (BigMM), Taipei, 2016, pp. 326-333. doi: 10.1109/BigMM.2016.76
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545045&isnumber=7544979

J. Hou, X. Wu, F. Yu and Y. Jia, "Multimedia event detection via deep spatial-temporal neural networks," 
2016 IEEE International Conference on Multimedia and Expo (ICME), Seattle, WA, USA, 2016, pp. 1-6.
doi: 10.1109/ICME.2016.7552981
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7552981&isnumber=7552854

D. Ren, L. Zhuo, H. Long, P. Qu and J. Zhang, "MPEG-2 Video Copy Detection Method Based on Sparse 
Representation of Spatial and Temporal Features," 2016 IEEE Second International Conference on
Multimedia Big Data (BigMM), Taipei, 2016, pp. 233-236. doi: 10.1109/BigMM.2016.21
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545028&isnumber=7544979

Y. Huo, Y. Wang and H. Hu, "Effective algorithms for video shot and scene boundaries detection," 
2016 IEEE/ACIS 15th International Conference on Computer and Information Science (ICIS), Okayama, 
Japan, 2016, pp. 1-6. doi: 10.1109/ICIS.2016.7550913
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7550913&isnumber=7550716

J. Pang et al., "Accelerate convolutional neural networks for binary classification 
via cascading cost-sensitive feature," 2016 IEEE International Conference on Image 
Processing (ICIP), Phoenix, AZ, USA, 2016, pp. 1037-1041. doi: 10.1109/ICIP.2016.7532515
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7532515&isnumber=7532277

R. B. Wang, H. Chen, J. L. Yao and Y. T. Guo, "Video Copy Detection Based On Temporal 
Contextual Hashing," 2016 IEEE Second International Conference on Multimedia Big Data 
(BigMM), Taipei, 2016, pp. 223-228. doi: 10.1109/BigMM.2016.12
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7545026&isnumber=7544979

A. Kumar and B. Raj, "Weakly supervised scalable audio content analysis," 2016 IEEE 
International Conference on Multimedia and Expo (ICME), Seattle, WA, USA, 2016, pp. 1-6.
doi: 10.1109/ICME.2016.7552989
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7552989&isnumber=7552854

T. Y. Chang, S. C. Tai and G. S. Lin, "Manipulation classification for near-duplicate videos," 
2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), Nantou County, 
Taiwan, 2016, pp. 1-2. doi: 10.1109/ICCE-TW.2016.7520976
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7520976&isnumber=7520694

W. Zhang, C. W. Ngo and X. Cao, "Hyperlink-Aware Object Retrieval," in IEEE Transactions 
on Image Processing, vol. 25, no. 9, pp. 4186-4198, Sept. 2016.
doi: 10.1109/TIP.2016.2590321
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7508952&isnumber=7502214

J. C. SanMiguel; A. Cavallaro, "Energy Consumption Models for Smart-Camera Networks," 
in IEEE Transactions on Circuits and Systems for Video Technology , vol.PP, no.99, pp.1-1
doi: 10.1109/TCSVT.2016.2593598
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7517353&isnumber=4358651

X. Liu; L. Huang; C. Deng; B. Lang; D. Tao, "Query-Adaptive Hash Code Ranking for Large-Scale 
Multi-View Visual Search," in IEEE Transactions on Image Processing , vol.PP, no.99, pp.1-1
doi: 10.1109/TIP.2016.2593344
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7516672&isnumber=4358840

Y. Xian; X. Rong; X. Yang; Y. Tian, "Evaluation of Low-Level Features for Real-World Surveillance Event Detection," 
in IEEE Transactions on Circuits and Systems for Video Technology , vol.PP, no.99, pp.1-1
doi: 10.1109/TCSVT.2016.2589838
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7514916&isnumber=4358651

B. Safadi, P. Mulhem, G. Quénot and J. P. Chevallet, "Lifelog Semantic Annotation using deep visual 
features and metadata-derived descriptors," 2016 14th International Workshop on Content-Based 
Multimedia Indexing (CBMI), Bucharest, Romania, 2016, pp. 1-6. doi: 10.1109/CBMI.2016.7500247
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7500247&isnumber=7500235

A. Moumtzidou, I. Gialampoukidis, T. Mironidis, D. Liparas, S. Vrochidis and I. Kompatsiaris, 
"A multimedia interactive search engine based on graph-based and non-linear multimodal fusion," 
2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI), Bucharest, Romania, 2016, pp. 1-4.
doi: 10.1109/CBMI.2016.7500276
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7500276&isnumber=7500235

C. Lyu et al., "Identifying group-wise consistent sub-networks via spatial sparse representation 
of natural stimulus FMRI data," 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI), 
Prague, Czech Republic, 2016, pp. 62-65. doi: 10.1109/ISBI.2016.7493211
 http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7493211&isnumber=7493185

E. Mezghani, M. Charfeddine, C. Ben Amar and H. Nicolas, "Audiovisual video characterization 
using audio watermarking scheme," 2015 15th International Conference on Intelligent Systems 
Design and Applications (ISDA), Marrakech, 2015, pp. 213-218.
doi: 10.1109/ISDA.2015.7489227
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489227&isnumber=7489153

M. Chakroun, A. Wali, Y. Aribi and A. M. Alimi, "Video event detection using auto-associative 
neural network and incremental SVM models," 2015 15th International Conference on Intelligent 
Systems Design and Applications (ISDA), Marrakech, 2015, pp. 563-568.
doi: 10.1109/ISDA.2015.7489178
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489178&isnumber=7489153

O. Ben Said, A. Wali and A. M. Alimi, "Interlinking video programs with Linked Open Data," 
2015 15th International Conference on Intelligent Systems Design and Applications (ISDA), 
Marrakech, 2015, pp. 462-467. doi: 10.1109/ISDA.2015.7489159
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7489159&isnumber=7489153

J. Geng, Z. Miao, Q. Liang and S. Wang, "Linear multimodal fusion in video concept 
analysis based on node equilibrium model," 2015 3rd IAPR Asian Conference on Pattern 
Recognition (ACPR), Kuala Lumpur, 2015, pp. 316-320. doi: 10.1109/ACPR.2015.7486517
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7486517&isnumber=7486438

A. Agharwal, R. Kovvuri, R. Nevatia and C. G. M. Snoek, "Tag-based video retrieval by 
embedding semantic content in a continuous word space," 2016 IEEE Winter Conference on 
Applications of Computer Vision (WACV), Lake Placid, NY, USA, 2016, pp. 1-8.
doi: 10.1109/WACV.2016.7477706
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7477706&isnumber=7477446

K. Ueki and T. Kobayashi, "Improving semantic video indexing: Efforts in Waseda 
TRECVID 2015 SIN system," 2016 IEEE International Conference on Acoustics, Speech 
and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 1184-1188.
doi: 10.1109/ICASSP.2016.7471863
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7471863&isnumber=7471614

Y. Wang, L. Neves and F. Metze, "Audio-based multimedia event detection using deep 
recurrent neural networks," 2016 IEEE International Conference on Acoustics, Speech 
and Signal Processing (ICASSP), Shanghai, China, 2016, pp. 2742-2746.
doi: 10.1109/ICASSP.2016.7472176
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7472176&isnumber=7471614

Q. Chen, W. Jiang, Y. Zhao and Z. Zhao, "Part-based deep network for pedestrian detection 
in surveillance videos," 2015 Visual Communications and Image Processing (VCIP), Singapore, 
Singapore, 2015, pp. 1-4. doi: 10.1109/VCIP.2015.7457855
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7457855&isnumber=7457773

C. Ouali, P. Dumouchel and V. Gupta, "Fast Audio Fingerprinting System Using GPU and a Clustering-Based 
Technique," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 6, pp. 
1106-1118, June 2016. doi: 10.1109/TASLP.2016.2541303
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7431948&isnumber=7463555

M. Mazloom; X. Li; C. Snoek, "TagBook: A Semantic Video Representation without Supervision for Event Detection," 
in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1
doi: 10.1109/TMM.2016.2559947
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7462268&isnumber=4456689

Q. Chen, W. Jiang, Y. Zhao and Z. Zhao, "Part-based deep network for pedestrian detection 
in surveillance videos," 2015 Visual Communications and Image Processing (VCIP), Singapore, 
Singapore, 2015, pp. 1-4. doi: 10.1109/VCIP.2015.7457855
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7457855&isnumber=7457773

P. Kanungo and T. Kar, "Cut detection using block based center symmetric local binary pattern," 
2015 International Conference on Man and Machine Interfacing (MAMI), Bhubaneswar, India, 2015, pp. 1-5.
doi: 10.1109/MAMI.2015.7456583
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7456583&isnumber=7456527

L. Yu; Z. Huang; J. Cao; H. T. Shen, "Scalable Video Event Retrieval by Visual State Binary Embedding," 
in IEEE Transactions on Multimedia , vol.PP, no.99, pp.1-1. doi: 10.1109/TMM.2016.2557059
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7457257&isnumber=4456689

S. Angadi and V. Naik, "Static video summarization - A minimum edge weight bipartite graph 
matching approach," 2015 IEEE International Conference on Computer Graphics, Vision and Information 
Security (CGVIS), Bhubaneshwar, Odisha, India, 2015, pp. 100-105.
doi: 10.1109/CGVIS.2015.7449901
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7449901&isnumber=7449869

Himeur, Yassine; Sadi, Karima Ait, "A Rotation Invariant BSIF Descriptor for Video 
Copy Detection Using a Ring Decomposition," in Signal-Image Technology & Internet-Based 
Systems (SITIS), 2015 11th International Conference on , vol., no., pp.300-305, 23-27 Nov. 2015
doi: 10.1109/SITIS.2015.71
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7400580&isnumber=7400513

Himeur, Yassine; Sadi, Karima Ait, "Joint color and texture descriptor using ring 
decomposition for robust video copy detection in large databases," in Signal Processing 
and Information Technology (ISSPIT), 2015 IEEE International Symposium on , 
vol., no., pp.495-500, 7-10 Dec. 2015. doi: 10.1109/ISSPIT.2015.7394386
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7394386&isnumber=7394243

Wei, X.-S.; Wu, J.; Zhou, Z.-H., "Scalable Algorithms for Multi-Instance Learning," 
in Neural Networks and Learning Systems, IEEE Transactions on , vol.PP, no.99, pp.1-13
doi: 10.1109/TNNLS.2016.2519102
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7398097&isnumber=6104215

Himeur, Yassine; Sadi, Karima Ait, "Joint color and texture descriptor using ring 
decomposition for robust video copy detection in large databases," in Signal 
Processing and Information Technology (ISSPIT), 2015 IEEE International Symposium on,
vol., no., pp.495-500, 7-10 Dec. 2015. doi: 10.1109/ISSPIT.2015.7394386
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7394386&isnumber=7394243

Zhang, X.; Zhang, H.; Zhang, Y.; Yang, Y.; Wang, M.; Luan, H.; Li, J.; Chua, T., 
"Deep Fusion of Multiple Semantic Cues for Complex Event Recognition," in Image Processing, 
IEEE Transactions on , vol.25, no.3, pp.1033-1046, March 2016
doi: 10.1109/TIP.2015.2511585
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7364255&isnumber=7383373

---------------------------------------------------------------------
2015(97)
---------------------------------------------------------------------
F. Markatopoulou, V. Mezaris, N. Pittaras and I. Patras, "Local Features and a Two-Layer Stacking Architecture for 
Semantic Concept Detection in Video," in IEEE Transactions on Emerging Topics in Computing, vol. 3, no. 2, pp. 193-204, June 2015.
doi: 10.1109/TETC.2015.2418714
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7073626&isnumber=7118282

X. Liu, Y. Mu, D. Zhang, B. Lang and X. Li, "Large-Scale Unsupervised Hashing with Shared Structure Learning," 
in IEEE Transactions on Cybernetics, vol. 45, no. 9, pp. 1811-1822, Sept. 2015.
doi: 10.1109/TCYB.2014.2360856
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6960876&isnumber=7203181

Y. Han, Y. Yang, Y. Yan, Z. Ma, N. Sebe and X. Zhou, "Semisupervised Feature Selection via Spline Regression for Video Semantic Recognition," 
in IEEE Transactions on Neural Networks and Learning Systems, vol. 26, no. 2, pp. 252-264, Feb. 2015.
doi: 10.1109/TNNLS.2014.2314123
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6786497&isnumber=7010866

Rouhi, Amir H., "Evaluating Spatio-Temporal Parameters in Video Similarity Detection by 
Global Descriptors," in Digital Image Computing: Techniques and Applications (DICTA), 
2015 International Conference on , vol., no., pp.1-8, 23-25 Nov. 2015
doi: 10.1109/DICTA.2015.7371255
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7371255&isnumber=7371204

Rouhi, Amir H., "Enhanced-IPMH as a Robust Visual Descriptor from H.264/AVC and Evaluation 
of Parameters Effects," in Digital Image Computing: Techniques and Applications (DICTA), 
2015 International Conference on , vol., no., pp.1-8, 23-25 Nov. 2015
doi: 10.1109/DICTA.2015.7371254
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7371254&isnumber=7371204

Jiang, L., Yu, S. I., Meng, D., Mitamura, T., & Hauptmann, A. G. (2015). Text-to-video: 
a semantic search engine for internet videos. International Journal of Multimedia 
Information Retrieval, 1-16.

Zhang, X.; Zhang, H.; Zhang, Y.D.; Yang, Y.; Wang, M.; Luan, H.; Li, J.T.; Chua, Tat-Seng, 
"Deep Fusion of Multiple Semantic Cues for Complex Event Recognition," in Image Processing, 
IEEE Transactions on , vol.PP, no.99, pp.1-1. doi: 10.1109/TIP.2015.2511585
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7364255&isnumber=4358840

Li, X.; Zhao, X.; Zhang, Z.; Wu, F.; Zhuang, Y.; Wang, J.; Li, X., "Joint Multilabel Classification 
With Community-Aware Label Graph Learning," in Image Processing, IEEETransactions on , vol.25, no.1, 
pp.484-493, Jan. 2016. doi: 10.1109/TIP.2015.2503700
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7337423&isnumber=7331739

Changyu Liu, Dapeng Li, Bin Lu, Juntao Xiong, Event Bank based Multimedia Representation via Latent 
Group Logistic Regression Minimization, Neurocomputing, Available online 15 December 2015, 
ISSN 0925-2312, http://dx.doi.org/10.1016/j.neucom.2015.12.002.
http://www.sciencedirect.com/science/article/pii/S0925231215018883

Juan Manuel Barrios and Jose Manuel Saavedra. 2015. Score Propagation Based on Similarity Shot Graph 
for Improving Visual Object Retrieval. In Proceedings of the Third Edition Workshop on Speech, 
Language & Audio in Multimedia (SLAM '15). ACM, New York, NY, USA, 19-22. 
DOI=http://dx.doi.org/10.1145/2802558.2814644

Andrea Ceroni, Vassilios Solachidis, Claudia Niederée, Olga Papadopoulou, Nattiya Kanhabua, and 
Vasileios Mezaris. 2015. To Keep or not to Keep: An Expectation-oriented Photo Selection Method 
for Personal Photo Collections. In Proceedings of the 5th ACM on International Conference on 
Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 187-194. 
DOI=http://dx.doi.org/10.1145/2671188.2749372

Sébastien Poullot, Shunsuke Tsukatani, Anh Phuong Nguyen, Hervé Jégou, and Shin'Ichi Satoh. 2015. 
Temporal Matching Kernel with Explicit Feature Maps. In Proceedings of the 23rd ACM international 
conference on Multimedia (MM '15). ACM, New York, NY, USA, 381-390. 
DOI=http://dx.doi.org/10.1145/2733373.2806228 

Julia Bernd, Damian Borth, Carmen Carrano, Jaeyoung Choi, Benjamin Elizalde, Gerald Friedland, 
Luke Gottlieb, Karl Ni, Roger Pearce, Doug Poland, Khalid Ashraf, David A. Shamma, and Bart Thomee. 2015. 
Kickstarting the Commons: The YFCC100M and the YLI Corpora. In Proceedings of the 2015 Workshop on 
Community-Organized Multimodal Mining: Opportunities for Novel Solutions (MMCommons '15). ACM, New York, NY, USA, 1-6. 
DOI=http://dx.doi.org/10.1145/2814815.2816986 

Shicheng Xu, Huan Li, Xiaojun Chang, Shoou-I Yu, Xingzhong Du, Xuanchong Li, Lu Jiang, Zexi Mao, 
Zhenzhong Lan, Susanne Burger, and Alexander Hauptmann. 2015. Incremental Multimodal Query Construction 
for Video Search. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). 
ACM, New York, NY, USA, 675-678. 
DOI=http://dx.doi.org/10.1145/2671188.2749413

Shoou-I Yu, Lu Jiang, Zhongwen Xu, Yi Yang, and Alexander G. Hauptmann. 2015. Content-Based Video 
Search over 1 Million Videos with 1 Core in 1 Second. In Proceedings of the 5th ACM on International 
Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 419-426. 
DOI=http://dx.doi.org/10.1145/2671188.2749398

Klaus Schoeffmann, Marco A. Hudelist, and Jochen Huber. 2015. Video Interaction Tools: A Survey of Recent Work. 
ACM Comput. Surv. 48, 1, Article 14 (September 2015), 34 pages. 
DOI=http://dx.doi.org/10.1145/2808796 

Guangnan Ye, Yitong Li, Hongliang Xu, Dong Liu, and Shih-Fu Chang. 2015. EventNet: A Large Scale 
Structured Concept Library for Complex Event Detection in Video. In Proceedings of the 23rd ACM 
international conference on Multimedia (MM '15). ACM, New York, NY, USA, 471-480. 
DOI=http://dx.doi.org/10.1145/2733373.2806221

Yonghong Tian, Mengren Qian, and Tiejun Huang. 2015. TASC: A Transformation-Aware Soft Cascading Approach
for Multimodal Video Copy Detection. ACM Trans. Inf. Syst. 33, 2, Article 7 (February 2015), 34 pages.
DOI=http://dx.doi.org/10.1145/2699662

Khalid Ashraf, Benjamin Elizalde, Forrest Iandola, Matthew Moskewicz, Julia Bernd, Gerald Friedland, 
and Kurt Keutzer. 2015. Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling. 
In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). 
ACM, New York, NY, USA, 611-614. DOI=http://dx.doi.org/10.1145/2671188.2749396 

Lu Jiang, Shoou-I Yu, Deyu Meng, Yi Yang, Teruko Mitamura, and Alexander G. Hauptmann. 2015. Fast and Accurate 
Content-based Semantic Search in 100M Internet Videos. In Proceedings of the 23rd ACM international 
conference on Multimedia (MM '15). ACM, New York, NY, USA, 49-58. 
DOI=http://dx.doi.org/10.1145/2733373.2806237 

Xiaojun Chang, Yao-Liang Yu, Yi Yang, and Alexander G. Hauptmann. 2015. Searching Persuasively: 
Joint Event Detection and Evidence Recounting with Limited Supervision. In Proceedings of the 23rd
ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 581-590. 
DOI=http://dx.doi.org/10.1145/2733373.2806218 

Sang Phan, Duy-Dinh Le, and Shin'ichi Satoh. 2015. Multimedia Event Detection Using Event-Driven Multiple 
Instance Learning. In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). 
ACM, New York, NY, USA, 1255-1258. DOI=http://dx.doi.org/10.1145/2733373.2806330

Lu Jiang, Shoou-I Yu, Deyu Meng, Teruko Mitamura, and Alexander G. Hauptmann. 2015. Bridging the Ultimate 
Semantic Gap: A Semantic Search Engine for Internet Videos. In Proceedings of the 5th ACM on International
Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 27-34. 
DOI=http://dx.doi.org/10.1145/2671188.2749399 

B. H. Shekar and K. P. Uma. 2015. Gabor Moments Based Shot Boundary Detection. In Proceedings of the Third 
International Symposium on Women in Computing and Informatics (WCI '15), Indu Nair (Ed.). ACM, New York, 
NY, USA, 359-364. DOI=http://dx.doi.org/10.1145/2791405.2791499

Stavros Arestis-Chartampilas, Nikolaos Gkalelis, and Vasileios Mezaris. 2015. GPU Accelerated Generalised
Subclass Discriminant Analysis for Event and Concept Detection in Video. In Proceedings of the 23rd 
ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA, 1219-1222.
DOI=http://dx.doi.org/10.1145/2733373.2806321 

Moitreya Chatterjee and Anton Leuski. 2015. A Novel Statistical Approach for Image and Video Retrieval
and Its Adaption for Active Learning. In Proceedings of the 23rd ACM international conference on 
Multimedia (MM '15). ACM, New York, NY, USA, 935-938. DOI=http://dx.doi.org/10.1145/2733373.2806368

M. L. Smitha and B. H. Shekar. 2015. Illumination Invariant Text Recognition System Based 
On Contrast Limit Adaptive Histogram Equalization in Videos/Images. In Proceedings of the 
Third International Symposium on Women in Computing and Informatics (WCI '15), Indu Nair (Ed.).
ACM, New York, NY, USA, 174-179. DOI=http://dx.doi.org/10.1145/2791405.2791498 

Moitreya Chatterjee and Anton Leuski. 2015. CRMActive: An Active Learning Based Approach for 
Effective Video Annotation and Retrieval. In Proceedings of the 5th ACM on International 
Conference on Multimedia Retrieval (ICMR '15). ACM, New York, NY, USA, 535-538. 
DOI=http://dx.doi.org/10.1145/2671188.2749342

Eva Mohedano, Kevin McGuinness, Graham Healy, Noel E. O'Connor, Alan F. Smeaton, Amaia Salvador, 
Sergi Porta, and Xavier Giró-i-Nieto. 2015. Exploring EEG for Object Detection and Retrieval. 
In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval (ICMR '15). 
ACM, New York, NY, USA, 591-594. DOI=http://dx.doi.org/10.1145/2671188.2749368 

Nakamasa Inoue and Koichi Shinoda. 2015. Vocabulary Expansion Using Word Vectors for Video Semantic Indexing. 
In Proceedings of the 23rd ACM international conference on Multimedia (MM '15). ACM, New York, NY, USA,
851-854. DOI=http://dx.doi.org/10.1145/2733373.2806347 

Jie Geng; Zhenjiang Miao; Xiao-Ping Zhang, "Efficient Heuristic Methods for Multimodal 
Fusion and Concept Fusion in Video Concept Detection," in Multimedia, IEEE Transactions on,
 vol.17, no.4, pp.498-511, April 2015
doi: 10.1109/TMM.2015.2398195
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7027217&isnumber=7060570

Pourian, N.; Manjunath, B.S., "PixNet: A Localized Feature Representation for Classification 
and Visual Search," in Multimedia, IEEE Transactions on , vol.17, no.5, pp.616-625, May 2015
doi: 10.1109/TMM.2015.2410734
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7055325&isnumber=7086387

Zhenzhong Lan; Ming Lin; Xuanchong Li; Hauptmann, A.G.; Raj, B., "Beyond Gaussian Pyramid: 
Multi-skip Feature Stacking for action recognition," in Computer Vision and Pattern Recognition 
(CVPR), 2015 IEEE Conference on , vol., no., pp.204-212, 7-12 June 2015
doi: 10.1109/CVPR.2015.7298616
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298616&isnumber=7298593

Markatopoulou, F.; Mezaris, V.; Pittaras, N.; Patras, I., "Local Features and a Two-Layer
Stacking Architecture for Semantic Concept Detection in Video," in Emerging Topics in 
Computing, IEEE Transactions on , vol.3, no.2, pp.193-204, June 2015
doi: 10.1109/TETC.2015.2418714
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7073626&isnumber=7118282

Yahong Han; Yi Yang; Yan Yan; Zhigang Ma; Sebe, N.; Xiaofang Zhou, "Semisupervised
Feature Selection via Spline Regression for Video Semantic Recognition," in Neural 
Networks and Learning Systems, IEEE Transactions on , vol.26, no.2, pp.252-264, Feb. 2015
doi: 10.1109/TNNLS.2014.2314123
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6786497&isnumber=7010866

Kalpakis, G.; Tsikrika, T.; Markatopoulou, F.; Pittaras, N.; Vrochidis, S.; Mezaris, V.;
Patras, I.; Kompatsiaris, I., "Concept Detection in Multimedia Web Resources About Home 
Made Explosives," in Availability, Reliability and Security (ARES), 2015 10th International 
Conference on , vol., no., pp.632-641, 24-27 Aug. 2015
doi: 10.1109/ARES.2015.85
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7299974&isnumber=7299862

Ling Shao; Fan Zhu; Xuelong Li, "Transfer Learning for Visual Categorization: A Survey," 
in Neural Networks and Learning Systems, IEEE Transactions on , vol.26, no.5, pp.1019-1034, May 2015
doi: 10.1109/TNNLS.2014.2330900
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6847217&isnumber=7086401

Nguyen, Vinh-Tiep; Nguyen, Dinh-Luan; Tran, Minh-Triet; Le, Duy-Dinh; Duong, Duc Anh; Satoh, Shin'ichi,
"Query-adaptive late fusion with neural network for instance search," in Multimedia Signal Processing
(MMSP), 2015 IEEE 17th International Workshop on , vol., no., pp.1-6, 19-21 Oct. 2015
doi: 10.1109/MMSP.2015.7340795
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7340795&isnumber=7340786

Yaowei Wang; Yonghong Tian; Limin Su; Xiaoyu Fang; Ziwei Xia; Tiejun Huang, "Detecting Rare 
Actions and Events from Surveillance Big Data with Bag of Dynamic Trajectories," in Multimedia 
Big Data (BigMM), 2015 IEEE International Conference on , vol., no., pp.128-135, 20-22 April 2015
doi: 10.1109/BigMM.2015.74
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153866&isnumber=7153824

Ting Yao; Yingwei Pan; Chong-Wah Ngo; Houqiang Li; Tao Mei, "Semi-supervised Domain Adaptation 
with Subspace Learning for visual recognition," in Computer Vision and Pattern Recognition 
(CVPR), 2015 IEEE Conference on , vol., no., pp.2142-2150, 7-12 June 2015
doi: 10.1109/CVPR.2015.7298826
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298826&isnumber=7298593

Markatopoulou, Foteini; Mezaris, Vasileios; Patras, Ioannis, "Cascade of classifiers based 
on binary, non-binary and deep convolutional network descriptors for video concept detection," 
in Image Processing (ICIP), 2015 IEEE International Conference on , vol., no., pp.1786-1790, 27-30 Sept. 2015
doi: 10.1109/ICIP.2015.7351108
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7351108&isnumber=7350743

Xishan Zhang; Yang Yang; Yongdong Zhang; Huanbo Luan; Jintao Li; Hanwang Zhang; 
Tat-Seng Chua, "Enhancing Video Event Recognition Using Automatically Constructed 
Semantic-Visual Knowledge Base," in Multimedia, IEEE Transactions on , vol.17, no.9,
pp.1562-1575, Sept. 2015. doi: 10.1109/TMM.2015.2449660
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7132742&isnumber=7182813

Jie Xu; Tekin, C.; Zhang, S.; van der Schaar, M., "Distributed Multi-Agent Online Learning Based 
on Global Feedback," in Signal Processing, IEEE Transactions on , vol.63, no.9, pp.2225-2238, May1, 2015
doi: 10.1109/TSP.2015.2403288
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7041172&isnumber=7067505

Inoue, N.; Shinoda, K., "Fast Coding of Feature Vectors using Neighbor-To-Neighbor Search," 
in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1
doi: 10.1109/TPAMI.2015.2481390
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7274762&isnumber=4359286

Arabaci, M.A.; Esen, E., "Video copy detection using motion co-occurrence feature," 
in Signal Processing and Communications Applications Conference (SIU), 2015 23th , 
vol., no., pp.1946-1949, 16-19 May 2015. doi: 10.1109/SIU.2015.7130243
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7130243&isnumber=7129794

Zhongwen Xu; Yi Yang; Hauptmann, A.G., "A discriminative CNN video representation for event detection,"
in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.1798-1807, 7-12 June 2015
doi: 10.1109/CVPR.2015.7298789
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298789&isnumber=7298593

Garci�a-Marti�n, A.; Marti�nez, J.M., "People detection in surveillance: classification and evaluation," 
in Computer Vision, IET , vol.9, no.5, pp.779-788, 10 2015
doi: 10.1049/iet-cvi.2014.0148
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7270482&isnumber=7270452

Han, J.; Ji, X.; Hu, X.; Guo, L.; Liu, T., "Arousal Recognition Using Audio-Visual Features 
and FMRI-Based Brain Response," in Affective Computing, IEEE Transactions on , 
vol.6, no.4, pp.337-347, Oct.-Dec. 1 2015. doi: 10.1109/TAFFC.2015.2411280
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7056522&isnumber=7335704

Chien-Li Chou; Hua-Tsung Chen; Suh-Yin Lee, "Pattern-Based Near-Duplicate Video Retrieval 
and Localization on Web-Scale Videos," in Multimedia, IEEE Transactions on , 
vol.17, no.3, pp.382-395, March 2015. doi: 10.1109/TMM.2015.2391674
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7008558&isnumber=7041253

Hyungtae Lee; Morariu, V.I.; Davis, L.S., "Clauselets: Leveraging Temporally Related Actions 
for Video Event Analysis," in Applications of Computer Vision (WACV), 2015 IEEE Winter 
Conference on , vol., no., pp.1161-1168, 5-9 Jan. 2015
doi: 10.1109/WACV.2015.159
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7046013&isnumber=7045853

Guozhu Liang; Shivakumara, P.; Tong Lu; Chew Lim Tan, "Multi-Spectral Fusion Based Approach 
for Arbitrarily Oriented Scene Text Detection in Video Images," in Image Processing, IEEE 
Transactions on , vol.24, no.11, pp.4488-4501, Nov. 2015
doi: 10.1109/TIP.2015.2465169
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7180356&isnumber=7131605

Hamadi, A.; Mulhem, P.; Quenot, G., "Temporal re-scoring vs. temporal descriptors for semantic 
indexing of videos," in Content-Based Multimedia Indexing (CBMI), 2015 13th International 
Workshop on , vol., no., pp.1-4, 10-12 June 2015
doi: 10.1109/CBMI.2015.7153626
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153626&isnumber=7153597

Ting-Chu Lin; Min-Chun Yang; Chia-Yin Tsai; Wang, Y.-C.F., "Query-Adaptive Multiple Instance
Learning for Video Instance Retrieval," in Image Processing, IEEE Transactions on , vol.24, 
no.4, pp.1330-1340, April 2015. doi: 10.1109/TIP.2015.2403236
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7041233&isnumber=7038243

Ouali, C.; Dumouchel, P.; Gupta, V., "Efficient spectrogram-based binary image feature for audio 
copy detection," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International 
Conference on , vol., no., pp.1792-1796, 19-24 April 2015
doi: 10.1109/ICASSP.2015.7178279
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7178279&isnumber=7177909

Jingxin Xu; Denman, S.; Sridharan, S.; Fookes, C., "An Efficient and Robust System for Multiperson
Event Detection in Real-World Indoor Surveillance Scenes," in Circuits and Systems for Video 
Technology, IEEE Transactions on , vol.25, no.6, pp.1063-1076, June 2015
doi: 10.1109/TCSVT.2014.2367352
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6948205&isnumber=7116636

Li, Xianfeng; Zhan, Yongzhao; Xu, Sen, "Video Shot Annotation Based on Hypergraph
Random Walk Algorithm," in Intelligent Human-Machine Systems and Cybernetics (IHMSC),
2015 7th International Conference on , vol.2, no., pp.167-170, 26-27 Aug. 2015
doi: 10.1109/IHMSC.2015.92
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7334942&isnumber=7334774

Yale Song; Vallmitjana, J.; Stent, A.; Jaimes, A., "TVSum: Summarizing web videos using titles,"
in Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., 
pp.5179-5187, 7-12 June 2015. doi: 10.1109/CVPR.2015.7299154
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7299154&isnumber=7298593

Ondel, L.; Anguera, X.; Luque, J., "MASK+: Data-driven regions selection for acoustic fingerprinting,"
in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , 
vol., no., pp.335-339, 19-24 April 2015. doi: 10.1109/ICASSP.2015.7177986
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177986&isnumber=7177909

Etter, D.; Domeniconi, C., "Multi2Rank: Multimedia Multiview Ranking," in Multimedia Big Data 
(BigMM), 2015 IEEE International Conference on , vol., no., pp.80-87, 20-22 April 2015
doi: 10.1109/BigMM.2015.47
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153859&isnumber=7153824

Chuang Gan; Naiyan Wang; Yi Yang; Dit-Yan Yeung; Hauptmann, A.G., "DevNet: A Deep Event
Network for multimedia event detection and evidence recounting," in Computer Vision and
Pattern Recognition (CVPR), 2015 IEEE Conference on , vol., no., pp.2568-2577, 7-12 June 2015
doi: 10.1109/CVPR.2015.7298872
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298872&isnumber=7298593

Nan Nan; Guizhong Liu, "Video Copy Detection Based on Path Merging and Query 
Content Prediction," in Circuits and Systems for Video Technology, IEEE Transactions on,
vol.25, no.10, pp.1682-1695, Oct. 2015. doi: 10.1109/TCSVT.2015.2395771
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7024903&isnumber=7284726

Wang, Jinzhuo; Wang, Wenmin; Wang, Ronggang; Gao, Wen, "A compact shot representation 
for video semantic indexing," in Image Processing (ICIP), 2015 IEEE International 
Conference on , vol., no., pp.3265-3269, 27-30 Sept. 2015
doi: 10.1109/ICIP.2015.7351407
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7351407&isnumber=7350743

Ceroni, A.; Solachidis, V.; Mingxin Fu; Kanhabua, N.; Papadopoulou, O.; Niederee, 
C.; Mezaris, V., "Investigating human behaviors in selecting personal photos to preserve
memories," in Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on,
vol., no., pp.1-6, June 29 2015-July 3 2015
doi: 10.1109/ICMEW.2015.7169750
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169750&isnumber=7169738

Kyoungmin Lee; Kolsch, M., "Shot Boundary Detection with Graph Theory Using Keypoint
Features and Color Histograms," in Applications of Computer Vision (WACV), 2015 IEEE
Winter Conference on , vol., no., pp.1177-1184, 5-9 Jan. 2015
doi: 10.1109/WACV.2015.161
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7046015&isnumber=7045853

Budnik, M.; Gutierrez-Gomez, E.-L.; Safadi, B.; Quenot, G., "Learned features versus
engineered features for semantic video indexing," in Content-Based Multimedia Indexing
(CBMI), 2015 13th International Workshop on , vol., no., pp.1-6, 10-12 June 2015
doi: 10.1109/CBMI.2015.7153637
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153637&isnumber=7153597

Huang, D.; Cabral, R.; de la Torre, F., "Robust Regression," in Pattern Analysis 
and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1
doi: 10.1109/TPAMI.2015.2448091
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7130636&isnumber=4359286

Kuan-Ting Lai; Dong Liu; Shih-Fu Chang; Ming-Syan Chen, "Learning Sample Specific 
Weights for Late Fusion," in Image Processing, IEEE Transactions on , vol.24, no.9,
pp.2772-2783, Sept. 2015
doi: 10.1109/TIP.2015.2423560
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7086303&isnumber=7110434

Kumar, A.; Raj, B., "A novel ranking method for multiple classifier systems,"
in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International 
Conference on , vol., no., pp.1931-1935, 19-24 April 2015
doi: 10.1109/ICASSP.2015.7178307
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7178307&isnumber=7177909

Pourian, N.; Manjunath, B.S., "Retrieval of Images with Objects of Specific Size, 
Location, and Spatial Configuration," in Applications of Computer Vision (WACV), 
2015 IEEE Winter Conference on , vol., no., pp.960-967, 5-9 Jan. 2015
doi: 10.1109/WACV.2015.133
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7045987&isnumber=7045853

Etter, D.; Domeniconi, C., "SemRank: Semantic rank learning for multimedia retrieval," 
in Semantic Computing (ICSC), 2015 IEEE International Conference on , vol., no., pp.57-64, 7-9 Feb. 2015
doi: 10.1109/ICOSC.2015.7050778
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7050778&isnumber=7050753

Amer, M.; Todorovic, S., "Sum Product Networks for Activity Recognition,"
in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, no.99, pp.1-1
doi: 10.1109/TPAMI.2015.2465955
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7182341&isnumber=4359286

Wenjing Tong; Li Song; Xiaokang Yang; Hui Qu; Rong Xie, "CNN-based shot boundary detection 
and video annotation," in Broadband Multimedia Systems and Broadcasting (BMSB), 2015 IEEE 
International Symposium on , vol., no., pp.1-5, 17-19 June 2015
doi: 10.1109/BMSB.2015.7177222
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177222&isnumber=7177182

Junwei Han; Changyuan Chen; Ling Shao; Xintao Hu; Jungong Han; Tianming Liu,
"Learning Computational Models of Video Memorability from fMRI Brain Imaging," 
in Cybernetics, IEEE Transactions on , vol.45, no.8, pp.1692-1703, Aug. 2015
doi: 10.1109/TCYB.2014.2358647
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6919270&isnumber=7156182

Hsin-Yu Ha; Shu-Ching Chen; Mei-Ling Shyu, "Utilizing Indirect Associations in Multimedia 
Semantic Retrieval," in Multimedia Big Data (BigMM), 2015 IEEE International Conference on,
vol., no., pp.72-79, 20-22 April 2015. doi: 10.1109/BigMM.2015.89
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153858&isnumber=7153824

Hsin-Yu Ha; Shu-Ching Chen; Mei-Ling Shyu, "Negative-Based Sampling for Multimedia Retrieval,"
in Information Reuse and Integration (IRI), 2015 IEEE International Conference on,
vol., no., pp.64-71, 13-15 Aug. 2015. doi: 10.1109/IRI.2015.20
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7300956&isnumber=7300933

Wei Zhang; Chong-Wah Ngo, "Topological Spatial Verification for Instance Search," 
in Multimedia, IEEE Transactions on , vol.17, no.8, pp.1236-1247, Aug. 2015
doi: 10.1109/TMM.2015.2440997
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7117400&isnumber=7159118

Bastan, M; Cam, H; Gudukbay, U; Ulusoy, O, "An MPEG-7 Compatible Video Retrieval System 
with Integrated Support for Complex Multimodal Queries," in MultiMedia, IEEE , 
vol.PP, no.99, pp.1-1. doi: 10.1109/MMUL.2009.74
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5306056&isnumber=5255202

Fang Liu; Yi Wan, "Improving the video shot boundary detection using the HSV 
color space and image subsampling," in Advanced Computational Intelligence (ICACI), 
2015 Seventh International Conference on , vol., no., pp.351-354, 27-29 March 2015
doi: 10.1109/ICACI.2015.7184728
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7184728&isnumber=7184712

Ville Viitaniemi, Mats Sjöberg, Markus Koskela, Satoru Ishikawa and Jorma Laaksonen, 
Chapter 12 - Advances in visual concept detection: Ten years of TRECVID, In Advances in 
Independent Component Analysis and Learning Machines, edited by Ella Bingham, Samuel Kaski,
Jorma Laaksonen and Jouko Lampinen, Academic Press, 2015, Pages 249-278,
ISBN 9780128028063, http://dx.doi.org/10.1016/B978-0-12-802806-3.00012-9.
http://www.sciencedirect.com/science/article/pii/B9780128028063000129

Ouali, C.; Dumouchel, P.; Gupta, V., "GPU implementation of an audio fingerprints 
similarity search algorithm," in Content-Based Multimedia Indexing (CBMI), 2015 13th 
International Workshop on , vol., no., pp.1-6, 10-12 June 2015
doi: 10.1109/CBMI.2015.7153625
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153625&isnumber=7153597

Safadi, B.; Quenot, G., "A factorized model for multiple SVM and multi-label 
classification for large scale multimedia indexing," in Content-Based Multimedia 
Indexing (CBMI), 2015 13th International Workshop on , vol., no., pp.1-6, 10-12 June 2015
doi: 10.1109/CBMI.2015.7153610
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7153610&isnumber=7153597

Xintao Hu; Lei Guo; Junwei Han; Tianming Liu, "Decoding Semantics Categorization 
during Natural Viewing of Video Streams," in Autonomous Mental Development, IEEE 
Transactions on , vol.7, no.3, pp.201-210, Sept. 2015
doi: 10.1109/TAMD.2015.2415413
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7070746&isnumber=7317835

Kaavya, S.; LakshmiPriya, G.G., "Multimedia Indexing and Retrieval: 
Recent research work and their challenges," in Signal Processing, Communication and 
Networking (ICSCN), 2015 3rd International Conference on , vol., no., pp.1-5, 26-28 March 2015
doi: 10.1109/ICSCN.2015.7219851
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7219851&isnumber=7219823

Yan Yan; Yi Yang; Deyu Meng; Gaowen Liu; Wei Tong; Hauptmann, A.G.; Sebe, N.,
"Event Oriented Dictionary Learning for Complex Event Detection," in Image Processing,
IEEE Transactions on , vol.24, no.6, pp.1867-1878, June 2015
doi: 10.1109/TIP.2015.2413294
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7061499&isnumber=7065385

Shinde, S.R.; Chiddarwar, G.G., "Recent advances in content based video copy detection,"
in Pervasive Computing (ICPC), 2015 International Conference on , vol., no., pp.1-6,
8-10 Jan. 2015. doi: 10.1109/PERVASIVE.2015.7087093
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7087093&isnumber=7086957

Vrochidis, S.; Kompatsiaris, I.; Casamayor, G.; Arapakis, I.; Busch, 
R.; Alexiev, V.; Jamin, E.; Jugov, M.; Heise, N.; Forrellat, T.; Liparas,
D.; Wanner, L.; Miliaraki, I.; Aleksic, V.; Simov, K.; Mas Soro, A.; Eckhoff,
M.; Wagner, T.; Puigbo, M., "MULTISENSOR: Development of multimedia content
integration technologies for journalism, media monitoring and international
exporting decision support," in Multimedia & Expo Workshops (ICMEW), 2015 IEEE
International Conference on , vol., no., pp.1-4, June 29 2015-July 3 2015
doi: 10.1109/ICMEW.2015.7169818
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7169818&isnumber=7169738

Tsai, T.J.; Friedland, G.; Anguera, X., "An information-theoretic metric of 
fingerprint effectiveness," in Acoustics, Speech and Signal Processing (ICASSP),
2015 IEEE International Conference on , vol., no., pp.340-344, 19-24 April 2015
doi: 10.1109/ICASSP.2015.7177987
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7177987&isnumber=7177909

Liu, X.; Lin, L.; Jin, H., "Contextualized Trajectory Parsing with Spatio-Temporal 
Graph," in Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.PP, 
no.99, pp.1-1. doi: 10.1109/TPAMI.2013.84
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6517196&isnumber=4359286

Xianglong Liu; Yadong Mu; Danchen Zhang; Bo Lang; Xuelong Li, "Large-Scale
Unsupervised Hashing with Shared Structure Learning," in Cybernetics, IEEE 
Transactions on , vol.45, no.9, pp.1811-1822, Sept. 2015
doi: 10.1109/TCYB.2014.2360856
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6960876&isnumber=7203181

Xintao Hu; Cheng Lv; Gong Cheng; Jinglei Lv; Lei Guo; Junwei Han; Tianming Liu,
"Sparsity-Constrained fMRI Decoding of Visual Saliency in Naturalistic Video Streams,"
in Autonomous Mental Development, IEEE Transactions on , vol.7, no.2, pp.65-75, June 2015
doi: 10.1109/TAMD.2015.2409835
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7056490&isnumber=7121039

Hajimirsadeghi, H.; Wang Yan; Vahdat, A.; Mori, G., "Visual recognition by counting instances:
A multi-instance cardinality potential kernel," in Computer Vision and Pattern Recognition (CVPR),
 2015 IEEE Conference on , vol., no., pp.2596-2605, 7-12 June 2015
doi: 10.1109/CVPR.2015.7298875
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7298875&isnumber=7298593

Mihir Jain, Jan C. van Gemert, Thomas Mensink, and Cees G. M. Snoek, "Objects2action: 
Classifying and localizing actions without any video example," in Proceedings of the 
IEEE International Conference on Computer Vision, Santiago, Chile, 2015.

Markus Nagel, Thomas Mensink, and Cees G. M. Snoek, "Event Fisher Vectors: Robust 
Encoding Visual Diversity of Visual Streams," in Proceedings of the British Machine 
Vision Conference, Swansea, UK, 2015.

Amirhossein Habibian, Thomas Mensink, and Cees G. M. Snoek, "Discovering Semantic 
Vocabularies for Cross-Media Retrieval," in Proceedings of the ACM International 
Conference on Multimedia Retrieval, Shanghai, China, 2015.

Masoud Mazloom, Amirhossein Habibian, Dong Liu, Cees G. M. Snoek, and Shih-Fu Chang, 
"Encoding Concept Prototypes for Video Event Detection and Summarization," in Proceedings 
of the ACM International Conference on Multimedia Retrieval, Shanghai, China, 2015.

Pascal Mettes, Jan C. van Gemert, Spencer Cappallo, Thomas Mensink, and Cees G. M. Snoek,
"Bag-of-Fragments: Selecting and encoding video fragments for event detection and recounting," 
in Proceedings of the ACM International Conference on Multimedia Retrieval, Shanghai, China, 2015.

Svetlana Kordumova, Xirong Li, and Cees G. M. Snoek, "Best Practices for Learning Video Concept 
Detectors from Social Media Examples," Multimedia Tools and Applications, 
vol. 74, iss. 4, pp. 1291-1315, 2015. 

---------------------------------------------------------------------
2014 (87)
---------------------------------------------------------------------

Amjad Altadmri and Amr Ahmed. 2014. A framework for automatic semantic
video annotation. Multimedia Tools Appl. 72, 2 (September 2014),
1167-1191. DOI=10.1007/s11042-013-1363-6
http://dx.doi.org/10.1007/s11042-013-1363-6

Amid, E.; Mesaros, A.; Palomaki, K.J.; Laaksonen, J.; Kurimo, M.,
"Unsupervised feature extraction for multimedia event detection and
ranking using audio content," Acoustics, Speech and Signal Processing
(ICASSP), 2014 IEEE International Conference on , vol., no.,
pp.5939,5943, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6854743 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854743&isnumber=6853544

Bhattacharya, Subhabrata; Kalayeh, Mahdi M.; Sukthankar, Rahul; Shah,
Mubarak, "Recognition of Complex Events: Exploiting Temporal Dynamics
between Underlying Concepts," Computer Vision and Pattern Recognition
(CVPR), 2014 IEEE Conference on , vol., no., pp.2243,2250, 23-28 June
2014 doi: 10.1109/CVPR.2014.287, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909684&isnumber=6909393

Bhattacharya, S.; Mehran, R.; Sukthankar, R.; Shah, M.,
"Classification of Cinematographic Shots Using Lie Algebra and its
Application to Complex Event Recognition," Multimedia, IEEE
Transactions on , vol.16, no.3, pp.686,696, April 2014 doi:
10.1109/TMM.2014.2300833, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6714427&isnumber=6766693

Subhabrata Bhattacharya, Felix X. Yu, and Shih-Fu
Chang. 2014. Minimally Needed Evidence for Complex Event Recognition
in Unconstrained Videos. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 105 ,
8 pages. DOI=10.1145/2578726.2578740
http://doi.acm.org/10.1145/2578726.2578740

Ethem F. Can and R. Manmatha. 2014. Modeling Concept Dependencies for
Event Detection. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 289 ,
8 pages. DOI=10.1145/2578726.2578763
http://doi.acm.org/10.1145/2578726.2578763

Ning Chen; Jun Zhu; Fuchun Sun; Bo Zhang, "Learning Harmonium Models
With Infinite Latent Features," Neural Networks and Learning Systems,
IEEE Transactions on , vol.25, no.3, pp.520,532, March 2014 doi:
10.1109/TNNLS.2013.2276398, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6741394&isnumber=6740874

Jiawei Chen, Yin Cui, Guangnan Ye, Dong Liu, and Shih-Fu
Chang. 2014. Event-Driven Semantic Concept Discovery by Exploiting
Weakly Tagged Internet Images. In Proceedings of International
Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA,
, Pages 1 , 8 pages. DOI=10.1145/2578726.2578729
http://doi.acm.org/10.1145/2578726.2578729

Yu Cheng; Brown, L.; Fan, Q.; Feris, R.; Pankanti, S.; Tao Zhang,
"RiskWheel: Interactive visual analytics for surveillance event
detection," Multimedia and Expo (ICME), 2014 IEEE International
Conference on , vol., no., pp.1,6, 14-18 July 2014 doi:
10.1109/ICME.2014.6890286, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890286&isnumber=6890121

Ozgun Cirakman, Bilge Gunsel, Neslihan Serap Sengor, and Sezer
Kutluk. 2014. Content-based copy detection by a subspace learning
based video fingerprinting scheme. Multimedia Tools Appl. 71, 3
(August 2014), 1381-1409. DOI=10.1007/s11042-012-1269-8
http://dx.doi.org/10.1007/s11042-012-1269-8

Dang, C.T.; Radha, H., "Heterogeneity Image Patch Index and Its
Application to Consumer Video Summarization," Image Processing, IEEE
Transactions on , vol.23, no.6, pp.2704,2718, June 2014 doi:
10.1109/TIP.2014.2320814 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6807803&isnumber=6807541

Dehghan, Afshin; Idrees, Haroon; Shah, Mubarak, "Improving Semantic
Concept Detection through the Dictionary of Visually-Distinct
Elements," Computer Vision and Pattern Recognition (CVPR), 2014 IEEE
Conference on , vol., no., pp.2585,2592, 23-28 June 2014 doi:
10.1109/CVPR.2014.331 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909727&isnumber=6909393

David Etter and Carlotta Domeniconi. 2014. Semi-Supervised Rank
Learning for Multimedia Known-Item Search. In Proceedings of
International Conference on Multimedia Retrieval (ICMR '14). ACM, New
York, NY, USA, , Pages 257 , 8 pages. DOI=10.1145/2578726.2578759
http://doi.acm.org/10.1145/2578726.2578759

Guangyu Gao and Huadong Ma. 2014. To accelerate shot boundary
detection by reducing detection region and scope. Multimedia Tools
Appl. 71, 3 (August 2014), 1749-1770. DOI=10.1007/s11042-012-1301-z
http://dx.doi.org/10.1007/s11042-012-1301-z

Zan Gao, Long-Fei Zhang, Ming-Yu Chen, Alexander Hauptmann, Hua Zhang,
and An-Ni Cai. 2014. Enhanced and hierarchical structure algorithm for
data imbalance problem in semantic extraction under massive video
dataset. Multimedia Tools Appl. 68, 3 (February 2014),
641-657. DOI=10.1007/s11042-012-1071-7
http://dx.doi.org/10.1007/s11042-012-1071-7

Nikolaos Gkalelis and Vasileios Mezaris. 2014. Video event detection
using generalized subclass discriminant analysis and linear support
vector machines. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 25 ,
8 pages. DOI=10.1145/2578726.2578745
http://doi.acm.org/10.1145/2578726.2578745

Amirhossein Habibian, Masoud Mazloom, and Cees
G. M. Snoek. 2014. On-the-Fly Video Event Search by Semantic
Signatures. In Proceedings of International Conference on Multimedia
Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 518 , 3
pages. DOI=10.1145/2578726.2582615
http://doi.acm.org/10.1145/2578726.2582615

Amirhossein Habibian, Thomas Mensink, and Cees
G. M. Snoek. 2014. Composite Concept Discovery for Zero-Shot Video
Event Detection. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 17 ,
8 pages. DOI=10.1145/2578726.2578746
http://doi.acm.org/10.1145/2578726.2578746

Amirhossein Habibian, Thomas Mensink, and Cees G. M. Snoek,
"VideoStory: A New Multimedia Embedding for Few-Example Recognition
and Translation of Events," in Proceedings of the ACM International
Conference on Multimedia, Orlando, Florida, USA, 2014, pp. 17-26.

Amirhossein Habibian and Cees G. M. Snoek, "Recommendations for
Recognizing Video Events by Concept Vocabularies," Computer Vision and
Image Understanding, vol. 124, pp. 110-122, 2014.

Amirhossein Habibian and Cees G. M. Snoek. 2014. Stop-Frame Removal
Improves Web Video Classification. In Proceedings of International
Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA,
, Pages 499 , 4 pages. DOI=10.1145/2578726.2578803
http://doi.acm.org/10.1145/2578726.2578803

Abdelkader Hamadi, Philippe Mulhem, and Georges
Quénot. 2014. Infrequent concept pairs detection in multimedia
documents. In Proceedings of International Conference on Multimedia
Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 435 , 4
pages. DOI=10.1145/2578726.2578787
http://doi.acm.org/10.1145/2578726.2578787

Junwei Han, Xiang Ji, Xintao Hu, Jungong Han, and Tianming
Liu. 2014. Clustering and retrieval of video shots based on natural
stimulus fMRI. Neurocomput. 144 (November 2014),
128-137. DOI=10.1016/j.neucom.2013.11.052
http://dx.doi.org/10.1016/j.neucom.2013.11.052

Junwei Han, Kaiming Li, Ling Shao, Xintao Hu, Sheng He, Lei Guo,
Jungong Han, and Tianming Liu. 2014. Video abstraction based on
fMRI-driven visual attention model. Inf. Sci. 281 (October 2014),
781-796. DOI=10.1016/j.ins.2013.12.039
http://dx.doi.org/10.1016/j.ins.2013.12.039

Nakamasa Inoue and Koichi Shinoda, "n-Gram Models for Video Semantic
Indexing," Proc. ACM Multimedia, pp. 777-780, 2014.

Jain, A.; Xujun Peng; Xiaodan Zhuang; Natarajan, P.; Huaigu Cao, "Text
detection and recognition in natural scenes and consumer videos,"
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE
International Conference on , vol., no., pp.1245,1249, 4-9 May 2014
doi: 10.1109/ICASSP.2014.6853796, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6853796&isnumber=6853544

Lu Jiang, Teruko Mitamura, Shoou-I Yu, and Alexander
G. Hauptmann. 2014. Zero-Example Event Search using MultiModal Pseudo
Relevance Feedback. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 297 ,
8 pages. DOI=10.1145/2578726.2578764
http://doi.acm.org/10.1145/2578726.2578764

Lu Jiang, Wei Tong, Deyu Meng, and Alexander
G. Hauptmann. 2014. Towards Efficient Learning of Optimal Spatial
Bag-of-Words Representations. In Proceedings of International
Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA,
, Pages 121 , 8 pages. DOI=10.1145/2578726.2578739
http://doi.acm.org/10.1145/2578726.2578739

I-Hong Jhuo, Guangnan Ye, Shenghua Gao, Dong Liu, Yu-Gang Jiang,
D. T. Lee, and Shih-Fu Chang. 2014. Discovering joint audio---visual
codewords for video event detection. Mach. Vision Appl. 25, 1 (January
2014), 33-47. DOI=10.1007/s00138-013-0567-0
http://dx.doi.org/10.1007/s00138-013-0567-0

Ilseo Kim and Chin-Hui Lee. 2014. An Efficient Gradient-based Approach
to Optimizing Average Precision Through Maximal Figure-of-Merit
Learning. J. Signal Process. Syst. 74, 3 (March 2014),
285-295. DOI=10.1007/s11265-013-0748-0
http://dx.doi.org/10.1007/s11265-013-0748-0

Semin Kim, Jae Young Choi, Seungwan Han, and Yong Man
Ro. 2014. Adaptive weighted fusion with new spatial and temporal
fingerprints for improved video copy detection. Image Commun. 29, 7
(August 2014), 788-806. DOI=10.1016/j.image.2014.05.002
http://dx.doi.org/10.1016/j.image.2014.05.002

Svetlana Kordumova, Christoph Kofler, Dennis C. Koelma, Bouke
Huurnink, Bauke Freiburg, Joris Kleinveld, Manuel van Rijn, Marco van
Deursen, Martha Larson, and Cees G. M. Snoek. 2014. SocialZap:
Catch-up on Interesting Television Fragments Discovered from Social
Media. In Proceedings of International Conference on Multimedia
Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 538 , 3
pages. DOI=10.1145/2578726.2582622
http://doi.acm.org/10.1145/2578726.2582622

Zhen-Zhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, and Alexander
G. Hauptmann. 2014. Multimedia classification and event detection
using double fusion. Multimedia Tools Appl. 71, 1 (July 2014),
333-347. DOI=10.1007/s11042-013-1391-2
http://dx.doi.org/10.1007/s11042-013-1391-2

Gaowen Liu, Yan Yan, Chenqiang Gao, Wei Tong, Alexander Hauptmann, and
Nicu Sebe. 2014. The Mystery of Faces: Investigating Face Contribution
for Multimedia Event Detection. In Proceedings of International
Conference on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA,
, Pages 467 , 4 pages. DOI=10.1145/2578726.2578795
http://doi.acm.org/10.1145/2578726.2578795

Tianming Liu; Xintao Hu; Xiaojin Li; Mo Chen; Junwei Han; Lei Guo,
"Merging Neuroimaging and Multimedia: Methods, Opportunities, and
Challenges," Human-Machine Systems, IEEE Transactions on , vol.44,
no.2, pp.270,280, April 2014 doi: 10.1109/THMS.2013.2296871, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6742574&isnumber=6766254

Xianglong Liu, Junfeng He, and Bo Lang. 2014. Multiple feature kernel
hashing for large-scale visual search. Pattern Recogn. 47, 2 (February
2014), 748-757. DOI=10.1016/j.patcog.2013.08.022
http://dx.doi.org/10.1016/j.patcog.2013.08.022

Zhigang Ma; Yi Yang; Sebe, N.; Hauptmann, A.G., "Knowledge Adaptation
with PartiallyShared Features for Event DetectionUsing Few Exemplars,"
Pattern Analysis and Machine Intelligence, IEEE Transactions on ,
vol.36, no.9, pp.1789,1802, Sept. 2014 doi: 10.1109/TPAMI.2014.2306419
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6740842&isnumber=6868318

Masoud Mazloom, Efstrastios Gavves, and Cees G. M. Snoek,
"Conceptlets: Selective Semantics for Classifying Video Events," IEEE
Transactions on Multimedia, vol. 16, iss. 8, pp. 2214-2228, 2014.

Masoud Mazloom, Xirong Li, and Cees G. M. Snoek. 2014. Few-Example
Video Event Retrieval using Tag Propagation. In Proceedings of
International Conference on Multimedia Retrieval (ICMR '14). ACM, New
York, NY, USA, , Pages 459 , 4 pages. DOI=10.1145/2578726.2578793
http://doi.acm.org/10.1145/2578726.2578793

Scott McCloskey and Jingchen Liu. 2014. Metadata-Weighted Score Fusion
for Multimedia Event Detection. In Proceedings of the 2014 Canadian
Conference on Computer and Robot Vision (CRV '14). IEEE Computer
Society, Washington, DC, USA, 299-305. DOI=10.1109/CRV.2014.47
http://dx.doi.org/10.1109/CRV.2014.47

Tao Mei, Yong Rui, Shipeng Li, and Qi Tian. 2014. Multimedia search
reranking: A literature survey. ACM Comput. Surv. 46, 3, Article 38
(January 2014), 38 pages. DOI=10.1145/2536798
http://doi.acm.org/10.1145/2536798

Tao Meng, Yang Liu, Mei-Ling Shyu, Yilin Yan, and Chi-Min
Shu. 2014. Enhancing Multimedia Semantic Concept Mining and Retrieval
by Incorporating Negative Correlations. In Proceedings of the 2014
IEEE International Conference on Semantic Computing (ICSC '14). IEEE
Computer Society, Washington, DC, USA, 28-35. DOI=10.1109/ICSC.2014.30
http://dx.doi.org/10.1109/ICSC.2014.30

Merialdo, B.; Niaz, U., "Uploader models for video concept detection,"
Content-Based Multimedia Indexing (CBMI), 2014 12th International
Workshop on , vol., no., pp.1,4, 18-20 June 2014 doi:
10.1109/CBMI.2014.6849847, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849847&isnumber=6849811

Metze, F.; Rawat, S.; Yipei Wang, "Improved audio features for
large-scale multimedia event detection," Multimedia and Expo (ICME),
2014 IEEE International Conference on , vol., no., pp.1,6, 14-18 July
2014 doi: 10.1109/ICME.2014.6890234 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890234&isnumber=6890121

Murata, M.; Nagano, H.; Mukai, R.; Kashino, K.; Satoh, S., "BM25 With
Exponential IDF for Instance Search," Multimedia, IEEE Transactions on
, vol.16, no.6, pp.1690,1699, Oct. 2014 doi: 10.1109/TMM.2014.2323945
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6820744&isnumber=6898894

Gregory K. Myers, Ramesh Nallapati, Julien Hout, Stephanie Pancoast,
Ramakant Nevatia, Chen Sun, Amirhossein Habibian, Dennis C. Koelma,
Koen E. Sande, Arnold W. Smeulders, and Cees
G. Snoek. 2014. Evaluating multimedia features and fusion for
example-based event detection. Mach. Vision Appl. 25, 1 (January
2014), 17-32. DOI=10.1007/s00138-013-0527-8
http://dx.doi.org/10.1007/s00138-013-0527-8

Niaz, U.; Merialdo, B., "Improving video concept detection through
label space partitioning," Multimedia and Expo (ICME), 2014 IEEE
International Conference on , vol., no., pp.1,6, 14-18 July 2014 doi:
10.1109/ICME.2014.6890258, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890258&isnumber=6890121

Usman Niaz and Bernard Merialdo. 2014. Selective Multi-cotraining for
Video Concept Detection. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 443 ,
4 pages. DOI=10.1145/2578726.2578789
http://doi.acm.org/10.1145/2578726.2578789

Sangmin Oh, Scott Mccloskey, Ilseo Kim, Arash Vahdat, Kevin
J. Cannons, Hossein Hajimirsadeghi, Greg Mori, A. G. Perera, Megha
Pandey, and Jason J. Corso. 2014. Multimedia event detection with
multimodal feature fusion and temporal concept
localization. Mach. Vision Appl. 25, 1 (January 2014),
49-69. DOI=10.1007/s00138-013-0525-x
http://dx.doi.org/10.1007/s00138-013-0525-x

Ouali, C.; Dumouchel, P.; Gupta, V., "A robust audio fingerprinting
method for content-based copy detection," Content-Based Multimedia
Indexing (CBMI), 2014 12th International Workshop on , vol., no.,
pp.1,6, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849814, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849814&isnumber=6849811

Sang Phan, Thanh Duc Ngo, Vu Lam, Son Tran, Duy-Dinh Le, Duc Anh
Duong, and Shin'ichi Satoh. 2014. Multimedia Event Detection Using
Segment-Based Approach for Motion Feature. J. Signal
Process. Syst. 74, 1 (January 2014),
19-31. DOI=10.1007/s11265-013-0825-4
http://dx.doi.org/10.1007/s11265-013-0825-4

Trung Quy Phan; Shivakumara, P.; Bhowmick, S.; Shimiao Li; Chew Lim
Tan; Pal, U., "Semiautomatic Ground Truth Generation for Text
Detection and Recognition in Video Images," Circuits and Systems for
Video Technology, IEEE Transactions on , vol.24, no.8, pp.1277,1287,
Aug. 2014 doi: 10.1109/TCSVT.2014.2305515, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6739120&isnumber=6869080

G.G. Lakshmi Priya and S. Domnic. 2014. Shot boundary-based keyframe
extraction for video summarisation. Int. J. Comput. Intell. Stud. 3,
2/3 (June 2014), 157-175. DOI=10.1504/IJCISTUDIES.2014.062728
http://dx.doi.org/10.1504/IJCISTUDIES.2014.062728

Mengren Qian; Luntian Mou; Jia Li; Yonghong Tian, "Video
picture-in-picture detection using spatio-temporal slicing,"
Multimedia and Expo Workshops (ICMEW), 2014 IEEE International
Conference on , vol., no., pp.1,6, 14-18 July 2014 doi:
10.1109/ICMEW.2014.6890580 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6890580&isnumber=6890528

Xueming Qian, Danping Guo, Xingsong Hou, Zhi Li, Huan Wang, Guizhong
Liu, and Zhe Wang. 2014. HWVP: hierarchical wavelet packet descriptors
and their applications in scene categorization and semantic concept
retrieval. Multimedia Tools Appl. 69, 3 (April 2014),
897-920. DOI=10.1007/s11042-012-1151-8
http://dx.doi.org/10.1007/s11042-012-1151-8

J. Rest, F. A. Grootjen, M. Grootjen, R. Wijn, O. Aarts,
M. L. Roelofs, G. J. Burghouts, H. Bouma, L. Alic, and
W. Kraaij. 2014. Requirements for multimedia metadata schemes in
surveillance applications for security. Multimedia Tools Appl. 70, 1
(May 2014), 573-598. DOI=10.1007/s11042-013-1575-9
http://dx.doi.org/10.1007/s11042-013-1575-9

C. Okan Sakar, Olcay Kursun, and Fikret Gurgen. 2014. Ensemble
canonical correlation analysis. Applied Intelligence 40, 2 (March
2014), 291-304. DOI=10.1007/s10489-013-0464-2
http://dx.doi.org/10.1007/s10489-013-0464-2

Yuan Shen; Zhenjiang Miao, "Multihuman Tracking Based on a
Spatial–Temporal Appearance Match," Circuits and Systems for Video
Technology, IEEE Transactions on , vol.24, no.3, pp.361,373, March
2014 doi: 10.1109/TCSVT.2013.2280073, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6587854&isnumber=6754192

Kimiaki Shirahama, Marcin Grzegorzek, and Kuniaki
Uehara. 2014. Multimedia Event Detection Using Hidden Conditional
Random Fields. In Proceedings of International Conference on
Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages 9 , 8
pages. DOI=10.1145/2578726.2578742
http://doi.acm.org/10.1145/2578726.2578742

Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2014. Hybrid
negative example selection using visual and conceptual
features. Multimedia Tools Appl. 71, 3 (August 2014),
967-989. DOI=10.1007/s11042-011-0886-y
http://dx.doi.org/10.1007/s11042-011-0886-y

Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I., "Video Tomographs and
a Base Detector Selection Strategy for Improving Large-Scale Video
Concept Detection," Circuits and Systems for Video Technology, IEEE
Transactions on , vol.24, no.7, pp.1251,1264, July 2014 doi:
10.1109/TCSVT.2014.2302554 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6727470&isnumber=6846390

Sjoberg, M.; Laaksonen, J., "Using semantic features to improve
large-scale visual concept detection," Content-Based Multimedia
Indexing (CBMI), 2014 12th International Workshop on , vol., no.,
pp.1,6, 18-20 June 2014 doi: 10.1109/CBMI.2014.6849817, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849817&isnumber=6849811

Sabin Tiberius Strat, Alexandre Benoit, Patrick Lambert, and Alice
Caplier. 2014. Retina enhanced SURF descriptors for spatio-temporal
concept detection. Multimedia Tools Appl. 69, 2 (March 2014),
443-469. DOI=10.1007/s11042-012-1280-0
http://dx.doi.org/10.1007/s11042-012-1280-0

Sun, Chen; Nevatia, Ram, "DISCOVER: Discovering Important Segments for
Classification of Video Events and Recounting," Computer Vision and
Pattern Recognition (CVPR), 2014 IEEE Conference on , vol., no.,
pp.2569,2576, 23-28 June 2014 doi: 10.1109/CVPR.2014.329 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909725&isnumber=6909393

Chen Sun, Brian Burns, Ram Nevatia, Cees Snoek, Bob Bolles, Greg
Myers, Wen Wang, and Eric Yeh. 2014. ISOMER: Informative Segment
Observations for Multimedia Event Recounting. In Proceedings of
International Conference on Multimedia Retrieval (ICMR '14). ACM, New
York, NY, USA, , Pages 241 , 8 pages. DOI=10.1145/2578726.2578757
http://doi.acm.org/10.1145/2578726.2578757

Fuming Sun; Jinhui Tang; Haojie Li; Guo-Jun Qi; Huang, T.S.,
"Multi-Label Image Categorization With Sparse Factor Representation,"
Image Processing, IEEE Transactions on , vol.23, no.3, pp.1028,1037,
March 2014 doi: 10.1109/TIP.2014.2298978, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6705666&isnumber=6717077

Jinhui Tang and Xian-Sheng Hua. 2014. Typicality ranking: beyond
accuracy for video semantic annotation. Multimedia Tools Appl. 70, 2
(May 2014), 647-660. DOI=10.1007/s11042-011-0892-0
http://dx.doi.org/10.1007/s11042-011-0892-0

Ran Tao, Efstratios Gavves, Cees G. M. Snoek, and Arnold
W. M. Smeulders, "Locality in Generic Instance Search from One
Example," in Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition, Columbus, Ohio, USA, 2014.

Wei Tong, Yi Yang, Lu Jiang, Shoou-I Yu, Zhenzhong Lan, Zhigang Ma,
Waito Sze, Ehsan Younessian, and Alexander G. Hauptmann. 2014. E-LAMP:
integration of innovative ideas for multimedia event
detection. Mach. Vision Appl. 25, 1 (January 2014),
5-15. DOI=10.1007/s00138-013-0529-6
http://dx.doi.org/10.1007/s00138-013-0529-6

Trichet, R.; Nevatia, R., "Video segmentation and feature
co-occurrences for activity classification," Applications of Computer
Vision (WACV), 2014 IEEE Winter Conference on , vol., no., pp.385,392,
24-26 March 2014 doi: 10.1109/WACV.2014.6836074 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6836074&isnumber=6835728

Chun-Yu Tsai, Michelle L. Alexander, Nnenna Okwara, and John
R. Kender. 2014. Highly Efficient Multimedia Event Recounting from
User Semantic Preferences. In Proceedings of International Conference
on Multimedia Retrieval (ICMR '14). ACM, New York, NY, USA, , Pages
419 , 4 pages. DOI=10.1145/2578726.2578783
http://doi.acm.org/10.1145/2578726.2578783

van Hout, J.; Yeh, E.; Koelma, D.C.; Snoek, C.G.M.; Chen Sun; Nevatia,
R.; Wong, J.; Myers, G.K., "Late fusion and calibration for multimedia
event detection using few examples," Acoustics, Speech and Signal
Processing (ICASSP), 2014 IEEE International Conference on , vol.,
no., pp.4598,4602, 4-9 May 2014 doi: 10.1109/ICASSP.2014.6854473 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854473&isnumber=6853544

Feng Wang; Zhanhu Sun; Yu-Gang Jiang; Chong-Wah Ngo, "Video Event
Detection Using Motion Relativity and Feature Selection," Multimedia,
IEEE Transactions on , vol.16, no.5, pp.1303,1315, Aug. 2014 doi:
10.1109/TMM.2014.2315780, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6783709&isnumber=6856249

Haidong Wang and Guizhong Liu. 2014. Priority and delay aware packet
management framework for real-time video transport over 802.11e
WLANs. Multimedia Tools Appl. 69, 3 (April 2014),
621-641. DOI=10.1007/s11042-012-1131-z
http://dx.doi.org/10.1007/s11042-012-1131-z

Mei Wang, Xiaoling Xia, Jiajin Le, and Xiangdong Zhou. 2014. Effective
automatic image annotation via integrated discriminative and
generative models. Inf. Sci. 262 (March 2014),
159-171. DOI=10.1016/j.ins.2013.11.005
http://dx.doi.org/10.1016/j.ins.2013.11.005

Wu, Shuang; Bondugula, Sravanthi; Luisier, Florian; Zhuang, Xiaodan;
Natarajan, Pradeep, "Zero-Shot Event Detection Using Multi-modal
Fusion of Weakly Supervised Concepts," Computer Vision and Pattern
Recognition (CVPR), 2014 IEEE Conference on , vol., no., pp.2665,2672,
23-28 June 2014 doi: 10.1109/CVPR.2014.341 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909737&isnumber=6909393

Shuang Wu; Xiaodan Zhuang; Natarajan, P., "Effective representations
for leveraging language content in multimedia event detection,"
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE
International Conference on , vol., no., pp.7123,7127, 4-9 May 2014
doi: 10.1109/ICASSP.2014.6854982, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6854982&isnumber=6853544

Hongtao Xie; Yongdong Zhang; Jianlong Tan; Li Guo; Jintao Li,
"Contextual Query Expansion for Image Retrieval," Multimedia, IEEE
Transactions on , vol.16, no.4, pp.1104,1114, June 2014 doi:
10.1109/TMM.2014.2305909, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6739088&isnumber=6814813

Xu, Zhongwen; Tsang, Ivor W.; Yang, Yi; Ma, Zhigang; Hauptmann,
Alexander G., "Event Detection Using Multi-level Relevance Labels and
Multiple Features," Computer Vision and Pattern Recognition (CVPR),
2014 IEEE Conference on , vol., no., pp.97,104, 23-28 June 2014 doi:
10.1109/CVPR.2014.20, URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6909414&isnumber=6909393

Bo Yang and Ramakant Nevatia. 2014. Multi-Target Tracking by Online
Learning a CRF Model of Appearance and Motion
Patterns. Int. J. Comput. Vision 107, 2 (April 2014),
203-217. DOI=10.1007/s11263-013-0666-4
http://dx.doi.org/10.1007/s11263-013-0666-4

Haojin Yang, Bernhard Quehl, and Harald Sack. 2014. A framework for
improved video text detection and recognition. Multimedia Tools
Appl. 69, 1 (March 2014), 217-245. DOI=10.1007/s11042-012-1250-6
http://dx.doi.org/10.1007/s11042-012-1250-6

Turgay Yilmaz, Adnan Yazici, and Masaru Kitsuregawa. 2014. RELIEF-MM:
effective modality weighting for multimedia information
retrieval. Multimedia Syst. 20, 4 (July 2014),
389-413. DOI=10.1007/s00530-014-0360-6
http://dx.doi.org/10.1007/s00530-014-0360-6

Ruijie Zhang, Fushan Wei, and Bicheng Li. 2014. E2LSH based multiple
kernel approach for object detection. Neurocomput. 124 (January 2014),
105-110. DOI=10.1016/j.neucom.2013.07.027
http://dx.doi.org/10.1016/j.neucom.2013.07.027

Xianguo Zhang; Tiejun Huang; Yonghong Tian; Wen Gao,
"Background-Modeling-Based Adaptive Prediction for Surveillance Video
Coding," Image Processing, IEEE Transactions on , vol.23, no.2,
pp.769,784, Feb. 2014 doi: 10.1109/TIP.2013.2294549 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6680670&isnumber=6685907

Cencen Zhong and Zhenjiang Miao. 2014. Graph regularized GM-pLSA and
its applications to video content analysis. Multimedia Syst. 20, 4
(July 2014), 429-445. DOI=10.1007/s00530-014-0378-9
http://dx.doi.org/10.1007/s00530-014-0378-9

Jun Zhu, Ning Chen, and Eric P. Xing. 2014. Bayesian inference with
posterior regularization and applications to infinite latent
SVMs. J. Mach. Learn. Res. 15, 1 (January 2014), 1799-1847.

Liang Zhuolin, Nakamasa Inoue, and Koichi Shinoda, "Velocity Pyramid
for Multimedia Event Detection," In Proc. MMM, 2014.

---------------------------------------------------------------------
2013 (94) 
---------------------------------------------------------------------

Robin Aly, Aiden Doherty, Djoerd Hiemstra, Franciska Jong, and Alan
F. Smeaton. 2013. The uncertain representation ranking framework for
concept-based video retrieval. Inf. Retr. 16, 5 (October 2013),
557-583. DOI=10.1007/s10791-012-9207-y
http://dx.doi.org/10.1007/s10791-012-9207-y

Arpit, D.; Shuang Wu; Natarajan, P.; Prasad, R.; Natarajan, P., "Ridge
Regression based classifiers for large scale class imbalanced
datasets," Applications of Computer Vision (WACV), 2013 IEEE Workshop
on , vol., no., pp.267,274, 15-17 Jan. 2013 doi:
10.1109/WACV.2013.6475028

Ilaria Bartolini, Marco Patella, and Corrado Romani. 2013. SHIATSU:
tagging and retrieving videos without worries. Multimedia Tools
Appl. 63, 2 (March 2013), 357-385. DOI=10.1007/s11042-011-0948-1
http://dx.doi.org/10.1007/s11042-011-0948-1

Subhabrata Bhattacharya. 2013. Recognition of complex events in
open-source web-scale videos: a bottom up approach. In Proceedings of
the 21st ACM international conference on Multimedia (MM '13). ACM, New
York, NY, USA, 1035-1038. DOI=10.1145/2502081.2502210
http://doi.acm.org/10.1145/2502081.2502210

Qiang Chen, Yang Cai, Lisa Brown, Ankur Datta, Quanfu Fan, Rogerio
Feris, Shuicheng Yan, Alex Hauptmann, and Sharath
Pankanti. 2013. Spatio-temporal fisher vector coding for surveillance
event detection. In Proceedings of the 21st ACM international
conference on Multimedia (MM '13). ACM, New York, NY, USA,
589-592. DOI=10.1145/2502081.2502155
http://doi.acm.org/10.1145/2502081.2502155

Michael G. Christel. 2013. Multimedia: from information source to
components of transformational games. In Proceedings of the 19th
Brazilian symposium on Multimedia and the web (WebMedia '13). ACM, New
York, NY, USA, 1-2. DOI=10.1145/2526188.2528279
http://doi.acm.org/10.1145/2526188.2528279

Roghayeh Dadashi and Hamidreza Rashidy Kanan. 2013. AVCD-FRA: A novel
solution to automatic video cut detection using fuzzy-rule-based
approach. Comput. Vis. Image Underst. 117, 7 (July 2013),
807-817. DOI=10.1016/j.cviu.2013.03.002
http://dx.doi.org/10.1016/j.cviu.2013.03.002

Jeffrey Dalton, James Allan, and Pranav Mirajkar. 2013. Zero-shot
video retrieval using content and concepts. In Proceedings of the 22nd
ACM international conference on Conference on information & knowledge
management (CIKM '13). ACM, New York, NY, USA,
1857-1860. DOI=10.1145/2505515.2507880
http://doi.acm.org/10.1145/2505515.2507880

Pradipto Das, Rohini K. Srihari, and Jason J. Corso. 2013. Translating
related words to videos and back through latent topics. In Proceedings
of the sixth ACM international conference on Web search and data
mining (WSDM '13). ACM, New York, NY, USA,
485-494. DOI=10.1145/2433396.2433456
http://doi.acm.org/10.1145/2433396.2433456

Del Fabro, M.; Schoeffmann, K.; Guggenberger, M.; Taschwer, M., "A
filtering tool to support interactive search in Internet video
archives," Content-Based Multimedia Indexing (CBMI), 2013 11th
International Workshop on , vol., no., pp.7,10, 17-19 June 2013 doi:
10.1109/CBMI.2013.6576544

Ajay Divakaran, Omar Javed, Saad Ali, Harpreet Sawhney, Qian Yu,
Jingen Liu, Hui Cheng, and Amir Tamrakar. 2013. Video event
recognition using concept attributes. In Proceedings of the 2013 IEEE
Workshop on Applications of Computer Vision (WACV) (WACV '13). IEEE
Computer Society, Washington, DC, USA,
339-346. DOI=10.1109/WACV.2013.6475038
http://dx.doi.org/10.1109/WACV.2013.6475038

de Rooij, O.; Worring, M., "Active Bucket Categorization for High
Recall Video Retrieval," Multimedia, IEEE Transactions on , vol.15,
no.4, pp.898,907, June 2013 doi: 10.1109/TMM.2013.2237894

Xiaohua Duan; Liang Lin; Hongyang Chao, "Discovering Video Shot
Categories by Unsupervised Stochastic Graph Partition," Multimedia,
IEEE Transactions on , vol.15, no.1, pp.167,180, Jan. 2013 doi:
10.1109/TMM.2012.2225029

Mennan Guder and Nihan Kesim Cicekli. 2013. Interactive Event
Recognition in Video. In Proceedings of the 2013 IEEE International
Symposium on Multimedia (ISM '13). IEEE Computer Society, Washington,
DC, USA, 100-101. DOI=10.1109/ISM.2013.24
http://dx.doi.org/10.1109/ISM.2013.24

Mennan Guder and Nihan Kesim Cicekli. 2013. Dichotomic Decision
Cascading for Video Shot Boundary Detection. In Proceedings of the
2013 IEEE International Symposium on Multimedia (ISM '13). IEEE
Computer Society, Washington, DC, USA,
227-230. DOI=10.1109/ISM.2013.43 http://dx.doi.org/10.1109/ISM.2013.43

Jinlin Guo; Zhengwei Qiu; Gurrin, C., "Exploring the optimal visual
vocabulary sizes for semantic concept detection," Content-Based
Multimedia Indexing (CBMI), 2013 11th International Workshop on ,
vol., no., pp.109,114, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576565

Amirhossein Habibian, Koen E. A. van de Sande, Cees G. M. Snoek.
Recommendations for Video Event Recognition Using Concept Vocabularies.
ACM International Conference on Multimedia Retrieval, 2013.

Amirhossein Habibian, Cees G. M. Snoek. Video2Sentence and Vice Versa.
ACM International Conference on Multimedia, 2013

Hamadi, A.; Mulhem, P.; Quenot, G., "Conceptual feedback for semantic
multimedia indexing," Content-Based Multimedia Indexing (CBMI), 2013
11th International Workshop on , vol., no., pp.53,58, 17-19 June 2013
doi: 10.1109/CBMI.2013.6576552

Hamadi, A.; Quenot, G.; Mulhem, P., "Clustering based rescoring for
semantic indexing of multimedia documents," Content-Based Multimedia
Indexing (CBMI), 2013 11th International Workshop on , vol., no.,
pp.41,46, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576550

Junwei Han; Xiang Ji; Xintao Hu; Dajiang Zhu; Kaiming Li; Xi Jiang;
Guangbin Cui; Lei Guo; Tianming Liu, "Representing and Retrieving
Video Shots in Human-Centric Brain Imaging Space," Image Processing,
IEEE Transactions on , vol.22, no.7, pp.2723,2736, July 2013 doi:
10.1109/TIP.2013.2256919

Xintao Hu; Tuo Zhang; Junwei Han; Lei Guo; Tianming Liu, "Functional
brain interactions during free viewing of video stream," Biomedical
Imaging (ISBI), 2013 IEEE 10th International Symposium on , vol., no.,
pp.1082,1085, 7-11 April 2013 doi: 10.1109/ISBI.2013.6556666

Chang Huang; Yuan Li; Nevatia, R., "Multiple Target Tracking by
Learning-Based Hierarchical Association of Detection Responses,"
Pattern Analysis and Machine Intelligence, IEEE Transactions on ,
vol.35, no.4, pp.898,910, April 2013 doi: 10.1109/TPAMI.2012.159

Nakamasa Inoue and Koichi Shinoda, "q-Gaussian Mixture Models for
Image And Video Semantic Indexing," Elsevier Journal of Visual
Communication and Image Representation, vol.24, no.8, pp.1450-1457,
2013.

Shuiwang Ji; Wei Xu; Ming Yang; Kai Yu, "3D Convolutional Neural
Networks for Human Action Recognition," Pattern Analysis and Machine
Intelligence, IEEE Transactions on , vol.35, no.1, pp.221,231,
Jan. 2013 doi: 10.1109/TPAMI.2012.59

Su Jiang, Yao Zhao, Shikui Wei, Rongrong Ni, and Zhenfeng
Zhu. 2013. Frame filtering and path verification for improving video
copy detection. In Proceedings of the Fifth International Conference
on Internet Multimedia Computing and Service (ICIMCS '13). ACM, New
York, NY, USA, 34-37. DOI=10.1145/2499788.2499829
http://doi.acm.org/10.1145/2499788.2499829

Ilseo Kim, Sangmin Oh, Arash Vahdat, Kevin Cannons, A.G. Amitha
Perera, and Greg Mori. 2013. Segmental multi-way local pooling for
video recognition. In Proceedings of the 21st ACM international
conference on Multimedia (MM '13). ACM, New York, NY, USA,
637-640. DOI=10.1145/2502081.2502167
http://doi.acm.org/10.1145/2502081.2502167

Yusuke Kamishima, Nakamasa Inoue, and Koichi Shinoda, "Event detection
in consumer videos using GMM supervectors and SVMs," EURASIP Journal
on Image and Video Processing, vol.2013, no.1 , pp.1-13, 2013.

Svetlana Kordumova, Xirong Li, Cees G. M. Snoek. Evaluating Sources and
Strategies for Learning Video Concepts from Social Media. International
Workshop on Content-Based Multimedia Indexing, 2013.

Guorong Li; Qingming Huang; Lei Qin; Shuqiang Jiang, "SSOCBT: A Robust
Semisupervised Online CovBoost Tracker That Uses Samples Differently,"
Circuits and Systems for Video Technology, IEEE Transactions on ,
vol.23, no.4, pp.695,709, April 2013 doi: 10.1109/TCSVT.2012.2221257

Jingen Liu; Qian Yu; Javed, O.; Ali, S.; Tamrakar, A.; Divakaran, A.;
Hui Cheng; Sawhney, H., "Video event recognition using concept
attributes," Applications of Computer Vision (WACV), 2013 IEEE
Workshop on , vol., no., pp.339,346, 15-17 Jan. 2013 doi:
10.1109/WACV.2013.6475038

Bo Lu, Guoren Wang, Ye Yuan, and Dong Han. 2013. Semantic concept
detection for video based on extreme learning
machine. Neurocomput. 102 (February 2013),
176-183. DOI=10.1016/j.neucom.2012.02.043
http://dx.doi.org/10.1016/j.neucom.2012.02.043

Masoud Mazloom, Amirhossein Habibian, and Cees
G.M. Snoek. 2013. Querying for video events by semantic signatures
from few examples. In Proceedings of the 21st ACM international
conference on Multimedia (MM '13). ACM, New York, NY, USA,
609-612. DOI=10.1145/2502081.2502160
http://doi.acm.org/10.1145/2502081.2502160

Kevin McGuinness, Noel E. O'Connor, Robin Aly, Franciska De Jong, Ken
Chatfield, Omkar M. Parkhi, Relja Arandjelovic, Andrew Zisserman,
Matthijs Douze, and Cordelia Schmid. 2013. The AXES PRO video search
system. In Proceedings of the 3rd ACM conference on International
conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA,
307-308. DOI=10.1145/2461466.2461519
http://doi.acm.org/10.1145/2461466.2461519

Sara Memar, Lilly Suriani Affendey, Norwati Mustapha, Shyamala
C. Doraisamy, and Mohammadreza Ektefa. 2013. An integrated
semantic-based approach in concept based video retrieval. Multimedia
Tools Appl. 64, 1 (May 2013), 77-95. DOI=10.1007/s11042-011-0848-4
http://dx.doi.org/10.1007/s11042-011-0848-4

Li, M.; Monga, V., "Compact Video Fingerprinting Via Structural
Graphical Models," Information Forensics and Security, IEEE
Transactions on , vol.PP, no.99, pp.1,1, 0 doi:
10.1109/TIFS.2013.2278100

Xirong Li, Cees G. M. Snoek, Marcel Worring, Dennis C. Koelma, Arnold W.
M. Smeulders. Bootstrapping Visual Categorization with Relevant
Negatives. IEEE Transactions on Multimedia, Volume 15 (4), page 933-945,
2013.

Zhenyang Li, Efstratios Gavves, Koen E. A. van de Sande, Cees G. M.
Snoek, Arnold W. M. Smeulders. Codemaps Segment, Classify and Search
Objects Locally. IEEE International Conference on Computer Vision, 2013.

Suzanne Little, Iveel Jargalsaikhan, Kathy Clawson, Marcos Nieto, Hao
Li, Cem Direkoglu, Noel E. O'Connor, Alan F. Smeaton, Jun Liu, Bryan
Scotney, Hui Wang, Seán Gaines, Aitor Rodriguez, Pedro Sanchez, Ana
Martínez Llorens, Karina Villarroel Peniza, Roberto Gimenez, Raúl
Santos de La Cámara, Anna Mereu, Celso Prados, and Emmanouil
Kafetzakis. 2013. Interactive surveillance event detection at
TRECVid2012. In Proceedings of the 3rd ACM conference on International
conference on multimedia retrieval (ICMR '13). ACM, New York, NY, USA,
301-302. DOI=10.1145/2461466.2461516
http://doi.acm.org/10.1145/2461466.2461516

Suzanne Little, Iveel Jargalsaikhan, Kathy Clawson, Marcos Nieto, Hao
Li, Cem Direkoglu, Noel E. O'Connor, Alan F. Smeaton, Bryan Scotney,
Hui Wang, and Jun Liu. 2013. An information retrieval approach to
identifying infrequent events in surveillance video. In Proceedings of
the 3rd ACM conference on International conference on multimedia
retrieval (ICMR '13). ACM, New York, NY, USA,
223-230. DOI=10.1145/2461466.2461503
http://doi.acm.org/10.1145/2461466.2461503

Hong Liu; Hong Lu; Xiangyang Xue, "A Segmentation and Graph-Based
Video Sequence Matching Method for Video Copy Detection," Knowledge
and Data Engineering, IEEE Transactions on , vol.25, no.8,
pp.1706,1718, Aug. 2013 doi: 10.1109/TKDE.2012.92

Jingen Liu; Qian Yu; Javed, O.; Ali, S.; Tamrakar, A.; Divakaran, A.;
Hui Cheng; Sawhney, H., "Video event recognition using concept
attributes," Applications of Computer Vision (WACV), 2013 IEEE
Workshop on , vol., no., pp.339,346, 15-17 Jan. 2013 doi:
10.1109/WACV.2013.6475038

Wu Liu, Feibin Yang, Yongdong Zhang, Qinghua Huang, and Tao
Mei. 2013. LAVES: an instant mobile video search system based on
layered audio-video indexing. In Proceedings of the 21st ACM
international conference on Multimedia (MM '13). ACM, New York, NY,
USA, 409-410. DOI=10.1145/2502081.2502244
http://doi.acm.org/10.1145/2502081.2502244

Liu, X.; Lin, L.; Jin, H., "Contextualized Trajectory Parsing with
Spatio-Temporal Graph," Pattern Analysis and Machine Intelligence,
IEEE Transactions on , vol.PP, no.99, pp.1,1, 0 doi:
10.1109/TPAMI.2013.84

Lu, Z.; Shi, Y., "Fast Video Shot Boundary Detection Based on SVD and
Pattern Matching," Image Processing, IEEE Transactions on , vol.PP,
no.99, pp.1,1, 0 doi: 10.1109/TIP.2013.2282081

Ma, Z.; Yang, Y.; Sebe, N.; Zheng, K.; Hauptmann, A.G., "Multimedia
Event Detection Using A Classifier-Specific Intermediate
Representation," Multimedia, IEEE Transactions on , vol.PP, no.99,
pp.1,1, 0 doi: 10.1109/TMM.2013.2264928

Zhigang Ma, Yi Yang, Zhongwen Xu, Nicu Sebe, and Alexander
G. Hauptmann. 2013. We are not equally negative: fine-grained labeling
for multimedia event detection. In Proceedings of the 21st ACM
international conference on Multimedia (MM '13). ACM, New York, NY,
USA, 293-302. DOI=10.1145/2502081.2502119
http://doi.acm.org/10.1145/2502081.2502119

Masoud Mazloom, Efstratios Gavves, Koen E. A. van de Sande, Cees G. M.
Snoek. Searching Informative Concept Banks for Video Event Detection.
ACM International Conference on Multimedia Retrieval, 2013.

Masoud Mazloom, Amirhossein Habibian, Cees G. M. Snoek. Querying for
Video Events by Semantic Signatures from Few Examples. ACM International
Conference on Multimedia, 2013.

Tao Mei, Lin-Xie Tang, Jinhui Tang, and Xian-Sheng
Hua. 2013. Near-lossless semantic video summarization and its
applications to video analysis. ACM Trans. Multimedia
Comput. Commun. Appl. 9, 3, Article 16 (July 2013), 23
pages. DOI=10.1145/2487268.2487269
http://doi.acm.org/10.1145/2487268.2487269

Gregory K. Meyers, Ramesh Nallapati, Julien van Hout, Stephanie
Pancoast, Ram Nevatia, Chen Sun, Amirhossein Habibian, Dennis C. Koelma,
Koen E. A. van de Sande, Arnold W. M. Smeulders, Cees G. M. Snoek.
Evaluating Multimedia Features and Fusion for Example-Based Event
Detection. Machine Vision and Applications, In press, 2013.

Davide Modolo, Cees G. M. Snoek. Can Object Detectors Aid Internet Video
Event Retrieval? IS&T/SPIE Symposium on Electronic Imaging, 2013.

Niaz, U.; Merialdo, B., "Leveraging from group classification for
video concept detection," Content-Based Multimedia Indexing (CBMI),
2013 11th International Workshop on , vol., no., pp.173,178, 17-19
June 2013 doi: 10.1109/CBMI.2013.6576577

Oneata, Dan and Verbeek, Jakob and Schmid, Cordelia, "Action and Event
Recognition with Fisher Vectors on a Compact Feature Set", in IEEE
International Conference on Computer Vision (ICCV), Dec 2013, Sydney,
Australia, http://hal.inria.fr/hal-00873662

Fabio Poiesi, Riccardo Mazzon, and Andrea
Cavallaro. 2013. Multi-target tracking on confidence maps: An
application to people tracking. Comput. Vis. Image Underst. 117, 10
(October 2013), 1257-1272. DOI=10.1016/j.cviu.2012.08.008
http://dx.doi.org/10.1016/j.cviu.2012.08.008

Priyadharssini, B.A.; Sivagami, S.V.; Muneeswaran, K., "Maximum a
posteriori adaptation method for video semantic indexing," Emerging
Trends in Computing, Communication and Nanotechnology (ICE-CCN), 2013
International Conference on , vol., no., pp.58,61, 25-26 March 2013
doi: 10.1109/ICE-CCN.2013.6528613

Vignesh Ramanathan, Percy Liang, and Li Fei-Fei. 2013. Video Event
Understanding Using Natural Language Descriptions. In Proceedings of
the 2013 IEEE International Conference on Computer Vision (ICCV
'13). IEEE Computer Society, Washington, DC, USA,
905-912. DOI=10.1109/ICCV.2013.117
http://dx.doi.org/10.1109/ICCV.2013.117

Vignesh Ramanathan, Bangpeng Yao, and Li Fei-Fei. 2013. Social Role
Discovery in Human Events. In Proceedings of the 2013 IEEE Conference
on Computer Vision and Pattern Recognition (CVPR '13). IEEE Computer
Society, Washington, DC, USA, 2475-2482. DOI=10.1109/CVPR.2013.320
http://dx.doi.org/10.1109/CVPR.2013.320

Miriam Redi and Bernard Merialdo. 2013. Direct modeling of image
keypoints distribution through copula-based image signatures. In
Proceedings of the 3rd ACM conference on International conference on
multimedia retrieval (ICMR '13). ACM, New York, NY, USA,
183-190. DOI=10.1145/2461466.2461498
http://doi.acm.org/10.1145/2461466.2461498

Ren, Y.J.; O'Gorman, L.; Wu, L.J.; Chang, F.; Wood, T.L.; Zhang, J.R.,
"Authenticating Lossy Surveillance Video," Information Forensics and
Security, IEEE Transactions on , vol.8, no.10, pp.1678,1687, Oct. 2013
doi: 10.1109/TIFS.2013.2279542

Safadi, B.; Quenot, G., "Descriptor optimization for multimedia
indexing and retrieval," Content-Based Multimedia Indexing (CBMI),
2013 11th International Workshop on , vol., no., pp.65,71, 17-19 June
2013 doi: 10.1109/CBMI.2013.6576554

Shen, Y.; Miao, Z., "Multi-Human Tracking Based on Spatial-Temporal
Appearance Match," Circuits and Systems for Video Technology, IEEE
Transactions on , vol.PP, no.99, pp.1,1, 0 doi:
10.1109/TCSVT.2013.2280073

Shinoda, K.; Inoue, N., "Reusing Speech Techniques for Video Semantic
Indexing [Applications Corner]," Signal Processing Magazine, IEEE ,
vol.30, no.2, pp.118,122, March 2013 doi: 10.1109/MSP.2012.2230520

Mats Sjoberg, Markus Koskela, Satoru Ishikawa, and Jorma
Laaksonen. Large-scale Visual Concept Detection with Explicit Kernel
Maps and Power Mean SVM.  In Proceedings of ACM International
Conference on Multimedia Retrieval (ICMR2013), pages 239--246, Dallas,
Texas, USA, April 2013. ACM.

Slimi, J.; Ben Ammar, A.; Alimi, A.M., "Interactive video data
visualization system based on semantic organization," Content-Based
Multimedia Indexing (CBMI), 2013 11th International Workshop on ,
vol., no., pp.161,166, 17-19 June 2013 doi: 10.1109/CBMI.2013.6576575

Strat, S.T.; Benoit, A.; Lambert, P., "Bags of Trajectory Words for
video indexing," Content-Based Multimedia Indexing (CBMI), 2014 12th
International Workshop on , vol., no., pp.1,6, 18-20 June 2014 doi:
10.1109/CBMI.2014.6849820 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6849820&isnumber=6849811

Strat, S.T.; Benoit, A.; Lambert, P., "Retina enhanced SIFT
descriptors for video indexing," Content-Based Multimedia Indexing
(CBMI), 2013 11th International Workshop on , vol., no., pp.201,206,
17-19 June 2013 doi: 10.1109/CBMI.2013.6576582

Chen Sun and Ram Nevatia. 2013. ACTIVE: Activity Concept Transitions
in Video Event Classification. In Proceedings of the 2013 IEEE
International Conference on Computer Vision (ICCV '13). IEEE Computer
Society, Washington, DC, USA, 913-920. DOI=10.1109/ICCV.2013.453
http://dx.doi.org/10.1109/ICCV.2013.453

Chen Sun; Nevatia, R., "Large-scale web video event classification by
use of Fisher Vectors," Applications of Computer Vision (WACV), 2013
IEEE Workshop on , vol., no., pp.15,22, 15-17 Jan. 2013 doi:
10.1109/WACV.2013.6474994

Kevin Tang, Bangpeng Yao, Li Fei-Fei, and Daphne
Koller. 2013. Combining the Right Features for Complex Event
Recognition. In Proceedings of the 2013 IEEE International Conference
on Computer Vision (ICCV '13). IEEE Computer Society, Washington, DC,
USA, 2696-2703. DOI=10.1109/ICCV.2013.335
http://dx.doi.org/10.1109/ICCV.2013.335

Yonghong Tian; Tiejun Huang; Menglin Jiang; Wen Gao, "Video
Copy-Detection and Localization with a Scalable Cascading Framework,"
MultiMedia, IEEE , vol.20, no.3, pp.72,86, July-Sept.

Christos Tzelepis, Nikolaos Gkalelis, Vasileios Mezaris, and Ioannis
Kompatsiaris. 2013. Improving event detection using related videos and
relevance degree support vector machines. In Proceedings of the 21st
ACM international conference on Multimedia (MM '13). ACM, New York,
NY, USA, 673-676. DOI=10.1145/2502081.2502176
http://doi.acm.org/10.1145/2502081.2502176

Tian, Y.; Wang, Y.; Hu, Z.; Huang, T., "Selective Eigenbackground for
Background Modeling and Subtraction in Crowded Scenes," Circuits and
Systems for Video Technology, IEEE Transactions on , vol.PP, no.99,
pp.1,1, 0 doi: 10.1109/TCSVT.2013.2248239

Arash Vahdat, Kevin Cannons, Greg Mori, Sangmin Oh, and Ilseo
Kim. 2013. Compositional Models for Video Event Detection: A Multiple
Kernel Learning Latent Variable Approach. In Proceedings of the 2013
IEEE International Conference on Computer Vision (ICCV '13). IEEE
Computer Society, Washington, DC, USA,
1185-1192. DOI=10.1109/ICCV.2013.463
http://dx.doi.org/10.1109/ICCV.2013.463

Carles Ventura. 2013. Visual object analysis using regions and
interest points. In Proceedings of the 21st ACM international
conference on Multimedia (MM '13). ACM, New York, NY, USA,
1075-1078. DOI=10.1145/2502081.2502220
http://doi.acm.org/10.1145/2502081.2502220

Zhiyong Wang, Genliang Guan, Yu Qiu, Li Zhuo, and Dagan
Feng. 2013. Semantic context based refinement for news video
annotation. Multimedia Tools Appl. 67, 3 (December 2013),
607-627. DOI=10.1007/s11042-012-1060-x
http://dx.doi.org/10.1007/s11042-012-1060-x

Xiao-Yong Wei; Zhen-Qun Yang, "Coaching the Exploration and
Exploitation in Active Learning for Interactive Video Retrieval,"
Image Processing, IEEE Transactions on , vol.22, no.3, pp.955,968,
March 2013 doi: 10.1109/TIP.2012.2222902

Song Wu and Michael Lew. 2013. Evaluation of salient point methods. In
Proceedings of the 21st ACM international conference on Multimedia (MM
'13). ACM, New York, NY, USA, 685-688. DOI=10.1145/2502081.2502179
http://doi.acm.org/10.1145/2502081.2502179

Zhongwen Xu, Yi Yang, Ivor Tsang, Nicu Sebe, and Alexander
G. Hauptmann. 2013. Feature Weighting via Optimal Thresholding for
Video Analysis. In Proceedings of the 2013 IEEE International
Conference on Computer Vision (ICCV '13). IEEE Computer Society,
Washington, DC, USA, 3440-3447. DOI=10.1109/ICCV.2013.427
http://dx.doi.org/10.1109/ICCV.2013.427

Yi Yang, Zhigang Ma, Zhongwen Xu, Shuicheng Yan, and Alexander
G. Hauptmann. 2013. How Related Exemplars Help Complex Event Detection
in Web Videos?. In Proceedings of the 2013 IEEE International
Conference on Computer Vision (ICCV '13). IEEE Computer Society,
Washington, DC, USA, 2104-2111. DOI=10.1109/ICCV.2013.456
http://dx.doi.org/10.1109/ICCV.2013.456

Yi Yang; Jingkuan Song; Zi Huang; Zhigang Ma; Sebe, N.; Hauptmann,
A.G., "Multi-Feature Fusion via Hierarchical Regression for Multimedia
Analysis," Multimedia, IEEE Transactions on , vol.15, no.3,
pp.572,581, April 2013 doi: 10.1109/TMM.2012.2234731

Yang Yang, Yi Yang, and Heng Tao Shen. 2013. Effective transfer
tagging from image to video. ACM Trans. Multimedia
Comput. Commun. Appl. 9, 2, Article 14 (May 2013), 20
pages. DOI=http://dx.doi.org/10.1145/2457450.2457456
http://doi.acm.org/http://dx.doi.org/10.1145/2457450.2457456

Ting Yao; Chong-Wah Ngo; Tao Mei, "Circular Reranking for Visual
Search," Image Processing, IEEE Transactions on , vol.22, no.4,
pp.1644,1655, April 2013 doi: 10.1109/TIP.2012.2236341

Yi, J.; Peng, Y.; Xiao, J., "Exploiting Semantic and Visual Context
for Effective Video Annotation," Multimedia, IEEE Transactions on ,
vol.15, no.6, pp.1400,1414, Oct. 2013 doi: 10.1109/TMM.2013.2250266

Qian Yu, Jingen Liu, Hui Cheng, Ajay Divakaran, and Harpreet
Sawhney. 2013. Semantic pooling for complex event detection. In
Proceedings of the 21st ACM international conference on Multimedia (MM
'13). ACM, New York, NY, USA, 733-736. DOI=10.1145/2502081.2502191
http://doi.acm.org/10.1145/2502081.2502191

Wei Zhang and Chong-Wah Ngo. 2013. Searching visual instances with
topology checking and context modeling. In Proceedings of the 3rd ACM
conference on International conference on multimedia retrieval (ICMR
'13). ACM, New York, NY, USA, 57-64. DOI=10.1145/2461466.2461477
http://doi.acm.org/10.1145/2461466.2461477

Zhang, X.; Huang, T.; Tian, Y.; Geng, M.; Ma, S.; Gao, W., "Fast and
Efficient Transcoding Based on Low-complexity Background Modeling and
Adaptive Block Classification," Multimedia, IEEE Transactions on ,
vol.PP, no.99, pp.1,1, 0 doi: 10.1109/TMM.2013.2280117

Wan-Lei Zhao; Chong-Wah Ngo, "Flip-Invariant SIFT for Copy and Object
Detection," Image Processing, IEEE Transactions on , vol.22, no.3,
pp.980,991, March 2013 doi: 10.1109/TIP.2012.2226043

Zheng-Jun Zha, Tao Mei, Richang Hong, and Zhiwei
Gu. 2013. Marginalized multi-layer multi-instance kernel for video
concept detection. Signal Process. 93, 8 (August 2013),
2119-2125. DOI=10.1016/j.sigpro.2012.08.026
http://dx.doi.org/10.1016/j.sigpro.2012.08.026

Xiangmin Zhou; Lei Chen, "ASVTDECTOR: A practical near duplicate video
retrieval system," Data Engineering (ICDE), 2013 IEEE 29th
International Conference on , vol., no., pp.1348,1351, 8-12 April 2013
doi: 10.1109/ICDE.2013.6544941

Cai-Zhi Zhu, Xiao Zhou, and Shin'Ichi Satoh. 2013. Bag-of-Words
Against Nearest-Neighbor Search for Visual Object Retrieval. In
Proceedings of the 2013 2nd IAPR Asian Conference on Pattern
Recognition (ACPR '13). IEEE Computer Society, Washington, DC, USA,
626-630. DOI=10.1109/ACPR.2013.56
http://dx.doi.org/10.1109/ACPR.2013.56

Xiaofeng Zhu; Zi Huang; Jiangtao Cui; Heng Tao Shen, "Video-to-Shot
Tag Propagation by Graph Sparse Group Lasso," Multimedia, IEEE
Transactions on , vol.15, no.3, pp.633,646, April 2013 doi:

Xiaodan Zhuang, Shuang Wu, and Pradeep Natarajan. 2013. Compact
bag-of-words visual representation for effective linear
classification. In Proceedings of the 21st ACM international
conference on Multimedia (MM '13). ACM, New York, NY, USA,
521-524. DOI=10.1145/2502081.2502138
http://doi.acm.org/10.1145/2502081.2502138


---------------------------------------------------------------------
2012 (81) 
---------------------------------------------------------------------

Aly, Robin and Doherty, Aiden and Hiemstra, Djoerd and de Jong,
Franciska and Smeaton, Alan. The uncertain representation ranking
framework for concept-based video retrieval. Information Retrieval
2012, pp.1-27, doi = {10.1007/s10791-012-9207-y}, Springer Netherlands

Aly, Robin; Hiemstra, Djoerd; de Jong, Franciska; Apers, Peter.
Simulating the future of concept-based video retrieval under improved
detector performance.  Multimedia Tools and Applications, 2012-09-01,
Springer Netherlands, pp. 203-231. Vol. 60, Issue. 1,
dx.doi.org/10.1007/s11042-011-0818-x, Doi: 10.1007/s11042-011-0818-x

Anguera, Xavier; Garzon, Antonio; Adamek, Tomasz; , "MASK: Robust
Local Features for Audio Fingerprinting," Multimedia and Expo (ICME),
2012 IEEE International Conference on , vol., no., pp.455-460, 9-13
July 2012 doi: 10.1109/ICME.2012.137 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298443&isnumber=6298237

Werner Bailer. 2012. Sequence kernels for clustering and visualizing
near duplicate video segments. In Proceedings of the 18th
international conference on Advances in Multimedia Modeling (MMM'12),
Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah
Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin,
Heidelberg, 383-394. DOI=10.1007/978-3-642-27355-1_36
http://dx.doi.org/10.1007/978-3-642-27355-1_36

Werner Bailer, "Learning Multiple Sequence-based Kernels for Video
Concept Detection," in IEEE International Symposium on Multimedia,
Irvine, CA, USA, Dec. 2012, pp. 73-77.

Mohammed Belkhatir and Bashar Tahayna. 2012. Near-duplicate video
detection featuring coupled temporal and perceptual visual structures
and logical inference based matching. Inf. Process. Manage. 48, 3 (May
2012), 489-501. DOI=10.1016/j.ipm.2011.03.003
http://dx.doi.org/10.1016/j.ipm.2011.03.003

Bredin, Herve; , "Community-driven hierarchical fusion of numerous
classifiers: Application to video semantic indexing," Acoustics,
Speech and Signal Processing (ICASSP), 2012 IEEE International
Conference on , vol., no., pp.2329-2332, 25-30 March 2012 doi:
10.1109/ICASSP.2012.6288381 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288381&isnumber=6287775

Andrei Bursuc, Titus Zaharia, and Francoise Prêteux. 2012. Retrieval
of multiple instances of objects in videos. In Proceedings of the 18th
international conference on Advances in Multimedia Modeling (MMM'12),
Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah
Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin,
Heidelberg, 358-369. DOI=10.1007/978-3-642-27355-1_34
http://dx.doi.org/10.1007/978-3-642-27355-1_34

Shu Chen; McGuinness, K.; Aly, R.; O'Connor, N.E.; de Jong, F.; , "The
AXES-lite video search engine," Image Analysis for Multimedia
Interactive Services (WIAMIS), 2012 13th International Workshop on ,
vol., no., pp.1-4, 23-25 May 2012 doi: 10.1109/WIAMIS.2012.6226778
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6226778&isnumber=6226742

Choudhury, A.; Medioni, G.; , "A Framework for Robust Online Video
Contrast Enhancement Using Modularity Optimization," Circuits and
Systems for Video Technology, IEEE Transactions on , vol.22, no.9,
pp.1266-1279, Sept. 2012 doi: 10.1109/TCSVT.2012.2198136 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6196206&isnumber=6291822

Codella, Noel C.F.; Natsev, Apostol; Hua, Gang; Hill, Matthew; Cao,
Liangliang; Gong, Leiguang; Smith, John R.; , "Video Event Detection
Using Temporal Pyramids of Visual Semantics with Kernel Optimization
and Model Subspace Boosting," Multimedia and Expo (ICME), 2012 IEEE
International Conference on , vol., no., pp.747-752, 9-13 July 2012
doi: 10.1109/ICME.2012.190 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298492&isnumber=6298237

Tiago O. Cunha, Flávio G. H. de Souza, Arnaldo de A. Araújo, and
Gisele L. Pappa. 2012. Rushes video summarization based on
spatio-temporal features. In Proceedings of the 27th Annual ACM
Symposium on Applied Computing (SAC '12). ACM, New York, NY, USA,
45-50. DOI=10.1145/2245276.2245287
http://doi.acm.org/10.1145/2245276.2245287

Lixin Duan; Tsang, I.W.; Dong Xu; , "Domain Transfer Multiple Kernel
Learning," Pattern Analysis and Machine Intelligence, IEEE
Transactions on , vol.34, no.3, pp.465-479, March 2012 doi:
10.1109/TPAMI.2011.114 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6136518&isnumber=6136512

Dumont, Emilie; Quenot, Georges; , "A Local Temporal Context-Based
Approach for TV News Story Segmentation," Multimedia and Expo (ICME),
2012 IEEE International Conference on , vol., no., pp.973-978, 9-13
July 2012 doi: 10.1109/ICME.2012.3 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298529&isnumber=6298237

Ralph Ewerth, Markus Muehling, and Bernd Freisleben. 2012. Robust Video
Content Analysis via Transductive Learning. ACM
Trans. Intell. Syst. Technol. 3, 3, Article 41 (May 2012), 26
pages. DOI=10.1145/2168752.2168755
http://doi.acm.org/10.1145/2168752.2168755

�Lvaro GarcíA-MartíN and José M. MartíNez. 2012. On collaborative
people detection and tracking in complex scenarios. Image Vision
Comput. 30, 4-5 (May 2012), 345-354. DOI=10.1016/j.imavis.2012.03.005
http://dx.doi.org/10.1016/j.imavis.2012.03.005

�lvaro García-Martín, José M. Martínez, and Jesús Bescós. 2012. A
corpus for benchmarking of people detection algorithms. Pattern
Recogn. Lett. 33, 2 (January 2012),
152-156. DOI=10.1016/j.patrec.2011.09.038
http://dx.doi.org/10.1016/j.patrec.2011.09.038

Efstratios Gavves, Cees G. M. Snoek, and Arnold W. M. Smeulders,
"Convex Reduction of High-Dimensional Kernels for Visual
Classification," in Proceedings of the IEEE Computer Society
Conference on Computer Vision and Pattern Recognition, Providence,
Rhode Island, USA, 2012.

Efstratios Gavves, Cees G. M. Snoek, and Arnold W. M. Smeulders,
"Visual Synonyms for Landmark Image Retrieval," Computer Vision and
Image Understanding, vol. 116, iss. 2, pp. 238-249, 2012.

Bo Geng; Yangxi Li; Dacheng Tao; Meng Wang; Zheng-Jun Zha; Chao Xu; ,
"Parallel Lasso for Large-Scale Video Concept Detection," Multimedia,
IEEE Transactions on , vol.14, no.1, pp.55-65, Feb. 2012 doi:
10.1109/TMM.2011.2174781 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069863&isnumber=6130620

Jinlin Guo, Colum Foley, Cathal Gurrin, and Songyang
Lao. 2011. Semantic concept detection in imbalanced datasets based on
different under-sampling strategies. In Proceedings of the 2011 IEEE
International Conference on Multimedia and Expo (ICME '11). IEEE
Computer Society, Washington, DC, USA,
1-6. DOI=10.1109/ICME.2011.6011923
http://dx.doi.org/10.1109/ICME.2011.6011923

Vishwa Nath Gupta, Gilles Boulianne, and Patrick
Cardinal. 2012. CRIM's content-based audio copy detection system for
TRECVID 2009. Multimedia Tools Appl. 60, 2 (September 2012),
371-387. DOI=10.1007/s11042-010-0608-x
http://dx.doi.org/10.1007/s11042-010-0608-x

Hamadi, A.; Quenot, G.; Mulhem, P.; , "Two-layers re-ranking approach
based on contextual information for visual concepts detection in
videos," Content-Based Multimedia Indexing (CBMI), 2012 10th
International Workshop on , vol., no., pp.1-6, 27-29 June 2012 doi:
10.1109/CBMI.2012.6269837 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269837&isnumber=6269791

R. Cameron Harvey and Mohamed Hefeeda. 2012. Spatio-temporal video
copy detection. In Proceedings of the 3rd Multimedia Systems
Conference (MMSys '12). ACM, New York, NY, USA,
35-46. DOI=10.1145/2155555.2155562
http://doi.acm.org/10.1145/2155555.2155562

Xintao Hu; Kaiming Li; Junwei Han; Xiansheng Hua; Lei Guo; Tianming
Liu; , "Bridging the Semantic Gap via Functional Brain Imaging,"
Multimedia, IEEE Transactions on , vol.14, no.2, pp.314-325, April
2012 doi: 10.1109/TMM.2011.2172201 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6046230&isnumber=6170997

Huang, Po-Sen; Mertens, Robert; Divakaran, Ajay; Friedland, Gerald;
Hasegawa-Johnson, Mark; , "How to put it into words - using random
forests to extract symbol level descriptions from audio content for
concept detection," Acoustics, Speech and Signal Processing (ICASSP),
2012 IEEE International Conference on , vol., no., pp.505-508, 25-30
March 2012 doi: 10.1109/ICASSP.2012.6287927 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6287927&isnumber=6287775

Bouke Huurnink, Cees G. M. Snoek, Maarten de Rijke, and Arnold W. M.
Smeulders, "Content-Based Analysis Improves Audiovisual Archive
Retrieval," IEEE Transactions on Multimedia, vol. 14, iss. 4, pp.
1166-1178, 2012.

Inoue, N.; Shinoda, K.; , "A Fast and Accurate Video Semantic-Indexing
System Using Fast MAP Adaptation and GMM Supervectors," Multimedia,
IEEE Transactions on , vol.14, no.4, pp.1196-1205, Aug. 2012 doi:
10.1109/TMM.2012.2191395 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6172243&isnumber=6239700

Nakamasa Inoue and Koichi Shinoda, "q-Gaussian Mixture Models Based on
Non-Extensive Statistics for Image And Video Semantic Indexing," In
Proc. ACCV, pp.499-510, 2012.

Jegou, Herve; Delhumeau, Jonathan; Yuan, Jiangbo; Gravier, Guillaume;
Gros, Patrick; , "BABAZ: A large scale audio search system for video
copy detection," Acoustics, Speech and Signal Processing (ICASSP),
2012 IEEE International Conference on , vol., no., pp.2369-2372, 25-30
March 2012 doi: 10.1109/ICASSP.2012.6288391 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288391&isnumber=6287775

Jiang, Menglin; Tian, Yonghong; Huang, Tiejun; , "Video Copy Detection
Using a Soft Cascade of Multimodal Features," Multimedia and Expo
(ICME), 2012 IEEE International Conference on , vol., no., pp.374-379,
9-13 July 2012 doi: 10.1109/ICME.2012.189 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298426&isnumber=6298237

Yusuke Kamishima, Nakamasa Inoue, Koichi Shinoda, and Shunsuke Sato,
"Multimedia Event Detection Using GMM Supervectors and SVMs," In
Proc. ICIP, pp.3089–3092, 2012.

Zhen-zhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, and Alexander
G. Hauptmann. 2012. Double fusion for multimedia event detection. In
Proceedings of the 18th international conference on Advances in
Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo,
Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos
(Eds.). Springer-Verlag, Berlin, Heidelberg,
173-185. DOI=10.1007/978-3-642-27355-1_18
http://dx.doi.org/10.1007/978-3-642-27355-1_18

Huan Li, Yuan Shi, Yang Liu, Alexander G. Hauptmann, and Zhang
Xiong. 2012. Cross-domain video concept detection: A joint
discriminative and generative active learning approach. Expert
Syst. Appl. 39, 15 (November 2012),
12220-12228. DOI=10.1016/j.eswa.2012.04.054
http://dx.doi.org/10.1016/j.eswa.2012.04.054

Xirong Li, Cees G. M. Snoek, Marcel Worring, and Arnold W. M. Smeulders, 
"Fusing Concept Detection and Geo Context for Visual Search," in 
Proceedings of the ACM International Conference on Multimedia Retrieval, 
Hong Kong, China, 2012.

Xirong Li, Cees G. M. Snoek, Marcel Worring, and Arnold
W. M. Smeulders, "Harvesting Social Images for Bi-Concept Search,"
IEEE Transactions on Multimedia, vol. 14, iss. 4, pp. 1091-1104, 2012.

Li, Zhi and Liu, Guizhong; "Video scene analysis in 3D wavelet
transform domain", Multimedia Tools and Applications, Springer
Netherlands,dx.doi.org/10.1007/s11042-010-0594-z, 2012.

Jingen Liu. 2012. Evaluation of low-level features and their
combinations for complex event detection in open source videos. In
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern
Recognition (CVPR) (CVPR '12). IEEE Computer Society, Washington, DC,
USA, 3681-3688.

Gjorgji Madjarov, Dragi Kocev, Dejan Gjorgjevikj, and SašO
Deroski. 2012. An extensive experimental comparison of methods for
multi-label learning. Pattern Recogn. 45, 9 (September 2012),
3084-3104. DOI=10.1016/j.patcog.2012.03.004
http://dx.doi.org/10.1016/j.patcog.2012.03.004

Mansencal, B.; Benois-Pineau, J.; Vieux, R.; Domenger, J.-P.; ,
"Search of objects of interest in videos," Content-Based Multimedia
Indexing (CBMI), 2012 10th International Workshop on , vol., no.,
pp.1-6, 27-29 June 2012 doi: 10.1109/CBMI.2012.6269809 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269809&isnumber=6269791

Meng, Tao; Shyu, Mei-Ling; , "Leveraging Concept Association Network
for Multimedia Rare Concept Mining and Retrieval," Multimedia and Expo
(ICME), 2012 IEEE International Conference on , vol., no., pp.860-865,
9-13 July 2012 doi: 10.1109/ICME.2012.134 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298511&isnumber=6298237

Merler, M.; Huang, B.; Lexing Xie; Gang Hua; Natsev, A.; , "Semantic
Model Vectors for Complex Video Event Recognition," Multimedia, IEEE
Transactions on , vol.14, no.1, pp.88-101, Feb. 2012 doi:
10.1109/TMM.2011.2168948 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6024471&isnumber=6130620

Hyun-seok Min; Jae Young Choi; De Neve, W.; Yong Man Ro; ,
"Near-Duplicate Video Clip Detection Using Model-Free Semantic Concept
Detection and Adaptive Semantic Distance Measurement," Circuits and
Systems for Video Technology, IEEE Transactions on , vol.22, no.8,
pp.1174-1187, Aug. 2012 doi: 10.1109/TCSVT.2012.2197080 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6193167&isnumber=6255841

Muehling, Markus and Ewerth, Ralph and Zhou, Jun and Freisleben, Bernd;
Multimodal Video Concept Detection via Bag of Auditory Words and
Multiple Kernel Learning; in Advances in Multimedia Modeling, Lecture
Notes in Computer Science}, Eds.: Schoeffmann, Klaus and Merialdo,
Bernard and Hauptmann, Alexander and Ngo, Chong-Wah and Andreopoulos,
Yiannis and Breiteneder, Christian; Springer Berlin / Heidelberg, isbn
978-3-642-27354-4, pp 40-50, vol. 7131,
http://dx.doi.org/10.1007/978-3-642-27355-1_7,2012.

Natarajan, P.; Shuang Wu; Vitaladevuni, S.; Xiaodan Zhuang;
Tsakalidis, S.; Unsang Park; Prasad, R.; Natarajan, P.; , "Multimodal
feature fusion for robust event detection in web videos," Computer
Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on , vol.,
no., pp.1298-1305, 16-21 June 2012 doi: 10.1109/CVPR.2012.6247814 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6247814&isnumber=6247647

Parkhi, O.M.; Vedaldi, A.; Zisserman, A.; , "On-the-fly specific
person retrieval," Image Analysis for Multimedia Interactive Services
(WIAMIS), 2012 13th International Workshop on , vol., no., pp.1-4,
23-25 May 2012 doi: 10.1109/WIAMIS.2012.6226775 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6226775&isnumber=6226742

Jesse Read, Albert Bifet, Geoff Holmes, and Bernhard
Pfahringer. 2012. Scalable and efficient multi-label classification
for evolving data streams. Mach. Learn. 88, 1-2 (July 2012),
243-272. DOI=10.1007/s10994-012-5279-6
http://dx.doi.org/10.1007/s10994-012-5279-6

Redi, Miriam; Merialdo, Bernard; , "Fitting Gaussian copulae for
efficient visual codebooks generation," Content-Based Multimedia
Indexing (CBMI), 2012 10th International Workshop on , vol., no.,
pp.1-6, 27-29 June 2012 doi: 10.1109/CBMI.2012.6269794 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6269794&isnumber=6269791

Redi, Miriam; Merialdo, Bernard;, "A Multimedia Retrieval Framework
Based on Automatic Graded Relevance Judgments" in Advances in
Multimedia Modeling, Lecture Notes in Computer Science, 2012, Springer
Berlin / Heidelberg,
pp. 300-311,dx.doi.org/10.1007/978-3-642-27355-1_29 Doi:
10.1007/978-3-642-27355-1_29

Miriam Redi and Bernard Merialdo. 2012. Exploring two spaces with one
feature: kernelized multidimensional modeling of visual alphabets. In
Proceedings of the 2nd ACM International Conference on Multimedia
Retrieval (ICMR '12). ACM, New York, NY, USA, , Article 20 , 8
pages. DOI=10.1145/2324796.2324821
http://doi.acm.org/10.1145/2324796.2324821

Jennifer Ren, Fangzhe Chang, Thomas Wood, and John
R. Zhang. 2012. Efficient video copy detection via aligning video
signature time series. In Proceedings of the 2nd ACM International
Conference on Multimedia Retrieval (ICMR '12). ACM, New York, NY, USA,
, Article 14 , 8 pages. DOI=10.1145/2324796.2324814
http://doi.acm.org/10.1145/2324796.2324814

Bahjat Safadi, Stephane Ayache, and Georges Quenot. 2012. Active
cleaning for video corpus annotation. In Proceedings of the 18th
international conference on Advances in Multimedia Modeling (MMM'12),
Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah
Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin,
Heidelberg, 518-528. DOI=10.1007/978-3-642-27355-1_48
http://dx.doi.org/10.1007/978-3-642-27355-1_48

Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2012. Event
retrieval in video archives using rough set theory and partially
supervised learning. Multimedia Tools Appl. 57, 1 (March 2012),
145-173. DOI=10.1007/s11042-011-0727-z
http://dx.doi.org/10.1007/s11042-011-0727-z

Mats Sjöberg, Markus Koskela, Satoru Ishikawa, and Jorma Laaksonen.
Real-time Large-scale Visual Concept Detection with Linear
Classifiers.  In Proceedings of 21st International Conference on
Pattern Recognition, Tsukuba, Japan, November 2012.

Tamrakar, A.; Ali, S.; Qian Yu; Jingen Liu; Javed, O.; Divakaran, A.;
Hui Cheng; Sawhney, H.; , "Evaluation of low-level features and their
combinations for complex event detection in open source videos,"
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference
on , vol., no., pp.3681-3688, 16-21 June 2012 doi:
10.1109/CVPR.2012.6248114 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6248114&isnumber=6247647

Claudiu Tănase and Bernard Merialdo. 2012. Efficient spatio-temporal
edge descriptor. In Proceedings of the 18th international conference
on Advances in Multimedia Modeling (MMM'12), Klaus Schoeffmann,
Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis
Andreopoulos (Eds.). Springer-Verlag, Berlin, Heidelberg,
210-221. DOI=10.1007/978-3-642-27355-1_21
http://dx.doi.org/10.1007/978-3-642-27355-1_21

Tang, K.; Li Fei-Fei; Koller, D.; , "Learning latent temporal
structure for complex event detection," Computer Vision and Pattern
Recognition (CVPR), 2012 IEEE Conference on , vol., no., pp.1250-1257,
16-21 June 2012 doi: 10.1109/CVPR.2012.6247808 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6247808&isnumber=6247647

Sheng Tang; Yan-Tao Zheng; Yu Wang; Tat-Seng Chua; , "Sparse Ensemble
Learning for Concept Detection," Multimedia, IEEE Transactions on ,
vol.14, no.1, pp.43-54, Feb. 2012 doi: 10.1109/TMM.2011.2168198 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6020805&isnumber=6130620

Tuan Hue Thi, Li Cheng, Jian Zhang, Li Wang, and Shinichi
Satoh. 2012. Editors Choice Article: Structured learning of local
features for human action classification and localization. Image
Vision Comput. 30, 1 (January 2012),
1-14. DOI=10.1016/j.imavis.2011.12.006
http://dx.doi.org/10.1016/j.imavis.2011.12.006

Xinmei Tian, Dacheng Tao, and Yong Rui. 2012. Sparse transfer learning
for interactive video search reranking. ACM Trans. Multimedia
Comput. Commun. Appl. 8, 3, Article 26 (August 2012), 19
pages. DOI=10.1145/2240136.2240139
http://doi.acm.org/10.1145/2240136.2240139

Mercan Topkara, Shimei Pan, Jennifer Lai, Ahmet Dirik, Steven Wood,
and Jeff Boston. 2012. "You've got video": increasing clickthrough
when sharing enterprise video with email. In Proceedings of the 2012
ACM annual conference on Human Factors in Computing Systems (CHI
'12). ACM, New York, NY, USA, 565-568. DOI=10.1145/2207676.2207755
http://doi.acm.org/10.1145/2207676.2207755

Valdés, Víctor; Martínez, José; "On-line video abstract generation of
multimedia news" in Multimedia Tools and Applications, 2012-08-01,
Springer Netherlands,pp. 795-832, vol. 59, No. 3
dx.doi.org/10.1007/s11042-011-0774-5 Doi: 10.1007/s11042-011-0774-5

Victor Valdes and Jose M. Martinez. 2012. Automatic evaluation of
video summaries. ACM Trans. Multimedia Comput. Commun. Appl. 8, 3,
Article 25 (August 2012), 21 pages. DOI=10.1145/2240136.2240138
http://doi.acm.org/10.1145/2240136.2240138

Robert Villa and Joemon M. Jose. 2012. A study of awareness in
multimedia search. Inf. Process. Manage. 48, 1 (January 2012),
32-46. DOI=10.1016/j.ipm.2011.03.005
http://dx.doi.org/10.1016/j.ipm.2011.03.005

Feng Wang; Chong-Wah Ngo; , "Summarizing Rushes Videos by Motion,
Object, and Event Understanding," Multimedia, IEEE Transactions on ,
vol.14, no.1, pp.76-87, Feb. 2012 doi: 10.1109/TMM.2011.2165531 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5993544&isnumber=6130620

Wang, Lezi; Dong, Yuan; Bai, Hongliang; Zhang, Jiwei; Huang, Chong;
Liu, Wei; , "Contented-Based Large Scale Web Audio Copy Detection,"
Multimedia and Expo (ICME), 2012 IEEE International Conference on ,
vol., no., pp.961-966, 9-13 July 2012 doi: 10.1109/ICME.2012.17 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298527&isnumber=6298237

Weng, Ming-Fang; Chuang, Yung-Yu; , "Cross-Domain Multicue Fusion for
Concept-Based Video Indexing," Pattern Analysis and Machine
Intelligence, IEEE Transactions on , vol.34, no.10, pp.1927-1941,
Oct. 2012 doi: 10.1109/TPAMI.2011.273 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6112775&isnumber=6269017

Shaoxi Xu, Sheng Tang, Yongdong Zhang, Jintao Li, and Yan-Tao
Zheng. 2012. Exploring multi-modality structure for cross domain
adaptation in video concept annotation. Neurocomput. 95 (October
2012), 11-21. DOI=10.1016/j.neucom.2011.05.041
http://dx.doi.org/10.1016/j.neucom.2011.05.041

Yang, J.; Tong, W.; Hauptmann, A. G.; , "A Framework for Classifier
Adaptation for Large-Scale Multimedia Data," Proceedings of the IEEE ,
vol.100, no.9, pp.2639-2657, Sept. 2012 doi:
10.1109/JPROC.2012.2204009 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6257407&isnumber=6269941

Turgay Yilmaz, Elvan Gulen, Adnan Yazici, and Masaru
Kitsuregawa. 2012. A RELIEF-based modality weighting approach for
multimodal information retrieval. In Proceedings of the 2nd ACM
International Conference on Multimedia Retrieval (ICMR '12). ACM, New
York, NY, USA, , Article 54 , 8 pages. DOI=10.1145/2324796.2324858
http://doi.acm.org/10.1145/2324796.2324858

Ehsan Younessian and Deepu Rajan. 2012. Scene signatures for
unconstrained news video stories. In Proceedings of the 18th
international conference on Advances in Multimedia Modeling (MMM'12),
Klaus Schoeffmann, Bernard Merialdo, Alexander G. Hauptmann, Chong-Wah
Ngo, and Yiannis Andreopoulos (Eds.). Springer-Verlag, Berlin,
Heidelberg, 77-88. DOI=10.1007/978-3-642-27355-1_10
http://dx.doi.org/10.1007/978-3-642-27355-1_10

Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun
Zha, and Tat-Seng Chua. 2012. Video browser showdown by NUS. In
Proceedings of the 18th international conference on Advances in
Multimedia Modeling (MMM'12), Klaus Schoeffmann, Bernard Merialdo,
Alexander G. Hauptmann, Chong-Wah Ngo, and Yiannis Andreopoulos
(Eds.). Springer-Verlag, Berlin, Heidelberg,
642-645. DOI=10.1007/978-3-642-27355-1_64
http://dx.doi.org/10.1007/978-3-642-27355-1_64


Zhang, John R.; Ren, Jennifer Y.; Chang, Fangzhe; Wood, Thomas L.;
Kender, John R.; , "Fast Near-Duplicate Video Retrieval via Motion
Time Series Matching," Multimedia and Expo (ICME), 2012 IEEE
International Conference on , vol., no., pp.842-847, 9-13 July 2012
doi: 10.1109/ICME.2012.111 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6298508&isnumber=6298237

Zheng-Jun Zha; Meng Wang; Yan-Tao Zheng; Yi Yang; Richang Hong;
Tat-Seng Chua; , "Interactive Video Indexing With Statistical Active
Learning," Multimedia, IEEE Transactions on , vol.14, no.1, pp.17-27,
Feb. 2012 doi: 10.1109/TMM.2011.2174782 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069865&isnumber=6130620

Zheng-Jun Zha, Tao Mei, Yan-Tao Zheng, Zengfu Wang, and Xian-Sheng
Hua. 2012. A comprehensive representation scheme for video semantic
ontology and its applications in semantic concept
detection. Neurocomput. 95 (October 2012),
29-39. DOI=10.1016/j.neucom.2011.05.044
http://dx.doi.org/10.1016/j.neucom.2011.05.044

Yongchao Zhang; Mingxing Xu; Pratt, E.; , "Energy
classification-assisted fingerprint system for content-based audio
copy detection," Communications (COMM), 2012 9th International
Conference on , vol., no., pp.35-38, 21-23 June 2012 doi:
10.1109/ICComm.2012.6262598 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6262598&isnumber=6262524

Zhong, Cencen; Miao, Zhenjiang; , "Data-specific concept correlation
estimation for video annotation refinement," Acoustics, Speech and
Signal Processing (ICASSP), 2012 IEEE International Conference on ,
vol., no., pp.961-964, 25-30 March 2012 doi:
10.1109/ICASSP.2012.6288044 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6288044&isnumber=6287775

Cencen Zhong; Zhenjiang Miao; , "A Two-View Concept Correlation Based
Video Annotation Refinement," Signal Processing Letters, IEEE ,
vol.19, no.5, pp.259-262, May 2012 doi: 10.1109/LSP.2012.2189386 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6159059&isnumber=6167335

Cai-Zhi Zhu and Shin'ichi Satoh. 2012. Large vocabulary quantization
for searching instances from videos. In Proceedings of the 2nd ACM
International Conference on Multimedia Retrieval (ICMR '12). ACM, New
York, NY, USA, , Article 52 , 8 pages. DOI=10.1145/2324796.2324856
http://doi.acm.org/10.1145/2324796.2324856



---------------------------------------------------------------------
2011 (75) 
---------------------------------------------------------------------

Almeida, J.; Leite, N.J.; da S Torres, R.; , "Comparison of video
sequences with histograms of motion patterns," Image Processing
(ICIP), 2011 18th IEEE International Conference on , vol., no.,
pp.3673-3676, 11-14 Sept. 2011 doi: 10.1109/ICIP.2011.6116516 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116516&isnumber=6115588

Xavier Anguera, Juan Manuel Barrios, Tomasz Adamek, and Nuria
Oliver. 2011. Multimodal fusion for video copy detection. In
Proceedings of the 19th ACM international conference on Multimedia (MM
'11). ACM, New York, NY, USA, 1221-1224. DOI=10.1145/2072298.2071979
http://doi.acm.org/10.1145/2072298.2071979

Baber, J.; Afzulpurkar, N.; Dailey, M.N.; Bakhtyar, M.; , "Shot
boundary detection from videos using entropy and local descriptor,"
Digital Signal Processing (DSP), 2011 17th International Conference on
, vol., no., pp.1-6, 6-8 July 2011 doi: 10.1109/ICDSP.2011.6004918
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6004918&isnumber=6004864

Chidansh Bhatt and Mohan Kankanhalli. 2011. Probabilistic temporal
multimedia data mining. ACM Trans. Intell. Syst. Technol. 2, 2,
Article 17 (February 2011), 19 pages. DOI=10.1145/1899412.1899421
http://doi.acm.org/10.1145/1899412.1899421

Werner Bailer. A Feature Sequence Kernel for Video Concept
Classification. Proceedings of 17th Multimedia Modeling Conference,
Taipei, TW, Jan. 2011, pp. 359-369.

Werner Bailer. Sequence-based Kernels for Online Concept Detection in
Video. AIEMPro '11: Proceedings of the 4th international workshop on
Automated information extraction in media production, Scottsdale, AZ,
USA, Dec. 2011.

Bali, O.; Karray, H.; Ben Ammar, A.; Alimi, A.M.; , "Toward
Interactive TV," Computational Intelligence and Intelligent
Informatics (ISCIII), 2011 5th International Symposium on , vol., no.,
pp.31-36, 15-17 Sept. 2011 doi: 10.1109/ISCIII.2011.6069737 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6069737&isnumber=6069732

Barrios, J.M.; Bustos, B.; , "P-VCD: A pivot-based approach for
Content-Based Video Copy Detection," Multimedia and Expo (ICME), 2011
IEEE International Conference on , vol., no., pp.1-6, 11-15 July 2011
doi: 10.1109/ICME.2011.6012212 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6012212&isnumber=6011826

Chaisorn, L.; Yan-Tao Zheng; Sim, K.; , "Known-item Search (KIS) in
video: Survey, experience and trend," Information, Communications and
Signal Processing (ICICS) 2011 8th International Conference on , vol.,
no., pp.1-4, 13-16 Dec. 2011 doi: 10.1109/ICICS.2011.6173547 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6173547&isnumber=6173124

Chao Chen; Lin Lin; Mei-Ling Shyu; , "Utilization of Co-occurrence
Relationships between Semantic Concepts in Re-ranking for Information
Retrieval," Multimedia (ISM), 2011 IEEE International Symposium on ,
vol., no., pp.53-60, 5-7 Dec. 2011 doi: 10.1109/ISM.2011.18 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6123325&isnumber=6123309

Tianlong Chen, Shuqiang Jiang, Lingyang Chu, and Qingming
Huang. 2011. Detection and location of near-duplicate video sub-clips
by finding dense subgraphs. In Proceedings of the 19th ACM
international conference on Multimedia (MM '11). ACM, New York, NY,
USA, 1173-1176. DOI=10.1145/2072298.2071967
http://doi.acm.org/10.1145/2072298.2071967

Xiangang Cheng and Liang-Tien Chia. 2011. Spatially-coherent pyramid
matching based on max-pooling. In Proceedings of the 19th ACM
international conference on Multimedia (MM '11). ACM, New York, NY,
USA, 1445-1448. DOI=10.1145/2072298.2072036
http://doi.acm.org/10.1145/2072298.2072036

Xiangang Cheng; Liang-Tien Chia; , "Stratification-Based Keyframe
Cliques for Effective and Efficient Video Representation," Multimedia,
IEEE Transactions on , vol.13, no.6, pp.1333-1342, Dec. 2011 doi:
10.1109/TMM.2011.2167222 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6009224&isnumber=6069890

Xiangang Cheng, Yiqun Hu, and Liang-Tien Chia. 2011. Exploiting local
dependencies with spatial-scale space (S-Cube) for near-duplicate
retrieval. Comput. Vis. Image Underst. 115, 6 (June 2011),
750-758. DOI=10.1016/j.cviu.2011.02.003
http://dx.doi.org/10.1016/j.cviu.2011.02.003

Daniyal, F.; Cavallaro, A.; , "Abnormal motion detection in crowded
scenes using local spatio-temporal analysis," Acoustics, Speech and
Signal Processing (ICASSP), 2011 IEEE International Conference on ,
vol., no., pp.1944-1947, 22-27 May 2011 doi:
10.1109/ICASSP.2011.5946889 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5946889&isnumber=5946226

Youdong Ding; Jianfei Zhang; Jun Li; Xiaocheng Wei; , "A
Bag-of-Feature Model for Video Semantic Annotation," Image and
Graphics (ICIG), 2011 Sixth International Conference on , vol., no.,
pp.696-701, 12-15 Aug. 2011 doi: 10.1109/ICIG.2011.135 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6005612&isnumber=6005527

F. Daniyal, A. Cavallaro. Abnormal motion detection in crowded
scenes using local spatio-temporal analysis, In Proc. of IEEE
Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pages
3913-3916. Prague, Czech Republic, 22-27 May 2011.

Pınar Duygulu and Muhammet Baştan. 2011. Multimedia translation for
linking visual data to semantics in videos. Mach. Vision Appl. 22, 1
(January 2011), 99-115. DOI=10.1007/s00138-009-0217-8
http://dx.doi.org/10.1007/s00138-009-0217-8

Nizar Elleuch, Mohamed Zarka, Anis Ben Ammar, and Adel
M. Alimi. 2011. A fuzzy ontology: based framework for reasoning in
visual vidBahjat Safadi and Georges Quénot. 2011. Re-ranking for
multimedia indexing and retrieval. In Proceedings of the 33rd European
conference on Advances in information retrieval (ECIR'11), Paul
Clough, Colum Foley, Cathal Gurrin, Hyowon Lee, and Gareth J. F. Jones
(Eds.). Springer-Verlag, Berlin, Heidelberg, 708-711. eo content
analysis and indexing. In Proceedings of the Eleventh International
Workshop on Multimedia Data Mining (MDMKDD '11). ACM, New York, NY,
USA, , Article 1 , 8 pages. DOI=10.1145/2237827.2237828
http://doi.acm.org/10.1145/2237827.2237828

Bailan Feng, Juan Cao, Xiuguo Bao, Lei Bao, Yongdong Zhang, Shouxun
Lin, and Xiaochun Yun. 2011. Graph-based multi-space semantic
correlation propagation for\&\#x00a0;video retrieval. Vis. Comput. 27,
1 (January 2011), 21-34. DOI=10.1007/s00371-010-0510-6
http://dx.doi.org/10.1007/s00371-010-0510-6

Huamin Feng; Chao Jiang; Xinghua Yang; , "An audio classification and
speech recognition system for video content analysis," Multimedia
Technology (ICMT), 2011 International Conference on , vol., no.,
pp.5272-5276, 26-28 July 2011 doi: 10.1109/ICMT.2011.6002093 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6002093&isnumber=6001647

Bauke Freiburg, Jaap Kamps, and Cees G. M. Snoek, "Crowdsourcing
Visual Detectors for Video Search," in Proceedings of the ACM
International Conference on Multimedia, Scottsdale, AZ, USA, 2011.

Garcia-Martin, A.; Hauptmann, A.; Martinez, J.M.; , "People detection
based on appearance and motion models," Advanced Video and
Signal-Based Surveillance (AVSS), 2011 8th IEEE International
Conference on , vol., no., pp.256-260, Aug. 30 2011-Sept. 2 2011 doi:
10.1109/AVSS.2011.6027333 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6027333&isnumber=6027273

Gkalelis, N.; Mezaris, V.; Kompatsiaris, I.; , "High-level event
detection in video exploiting discriminant concepts," Content-Based
Multimedia Indexing (CBMI), 2011 9th International Workshop on , vol.,
no., pp.85-90, 13-15 June 2011 doi: 10.1109/CBMI.2011.5972525 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5972525&isnumber=5972508

Jinlin Guo; Foley, Colum; Gurrin, Cathal; Songyang Lao; , "Semantic
concept detection in imbalanced datasets based on different
under-sampling strategies," Multimedia and Expo (ICME), 2011 IEEE
International Conference on , vol., no., pp.1-6, 11-15 July 2011 doi:
10.1109/ICME.2011.6011923 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6011923&isnumber=6011826

Hiep Van Hoang, Duy-Dinh Le, Shin'ichi Satoh, and Quang Hong
Nguyen. 2011. Improving retake detection by adding motion feature. In
Proceedings of the 16th international conference on Image analysis and
processing - Volume Part II (ICIAP'11), Giuseppe Maino and Gian Luca
Foresti (Eds.), Vol. Part II. Springer-Verlag, Berlin, Heidelberg,
150-157.

Marijn Huijbregts and Franciska de Jong. 2011. Robust
speech/non-speech classification in heterogeneous multimedia
content. Speech Commun. 53, 2 (February 2011),
143-153. DOI=10.1016/j.specom.2010.08.008
http://dx.doi.org/10.1016/j.specom.2010.08.008

Wolfgang Hürst, Cees G. M. Snoek, Willem-Jan Spoel, and Mate Tomin,
"Size Matters! How Thumbnail Number, Size, and Motion Influence Mobile
Video Retrieval," in International Conference on MultiMedia Modeling,
Taipei, Taiwan, 2011.

Nakamasa Inoue and Koichi Shinoda. 2011. A fast MAP adaptation
technique for gmm-supervector-based video semantic indexing
systems. In Proceedings of the 19th ACM international conference on
Multimedia (MM '11). ACM, New York, NY, USA,
1357-1360. DOI=10.1145/2072298.2072014
http://doi.acm.org/10.1145/2072298.2072014

Xiang Ji; Junwei Han; Xintao Hu; Kaiming Li; Fan Deng; Jun Fang; Lei
Guo; Tianming Liu; , "Retrieving video shots in semantic brain imaging
space using manifold-ranking," Image Processing (ICIP), 2011 18th IEEE
International Conference on , vol., no., pp.3633-3636, 11-14
Sept. 2011 doi: 10.1109/ICIP.2011.6116505 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116505&isnumber=6115588

Wei Jiang and Alexander Loui. 2011. Laplacian adaptive context-based
SVM for video concept detection. In Proceedings of the 3rd ACM SIGMM
international workshop on Social media (WSM '11). ACM, New York, NY,
USA, 15-20. DOI=10.1145/2072609.2072615
http://doi.acm.org/10.1145/2072609.2072615

Ilseo Kim; Chin-Hui Lee; , "Optimization of average precision with
Maximal Figure-of-Merit Learning," Machine Learning for Signal
Processing (MLSP), 2011 IEEE International Workshop on , vol., no.,
pp.1-6, 18-21 Sept. 2011 doi: 10.1109/MLSP.2011.6064638 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6064638&isnumber=6064496

Ksibi, A.; Elleuch, N.; Ben Ammar, A.; Alimi, A.M.; , "Semi-automatic
soft collaborative annotation for semantic video indexing," EUROCON -
International Conference on Computer as a Tool (EUROCON), 2011 IEEE ,
vol., no., pp.1-6, 27-29 April 2011 doi: 10.1109/EUROCON.2011.5929417
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5929417&isnumber=5929030

Duy-Dinh Le; Satoh, S.; , "A Comprehensive Study of Feature
Representations for Semantic Concept Detection," Semantic Computing
(ICSC), 2011 Fifth IEEE International Conference on , vol., no.,
pp.235-238, 18-21 Sept. 2011 doi: 10.1109/ICSC.2011.92 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061339&isnumber=6061289

Duy-Dinh Le; Satoh, S.; , "Indexing Faces in Broadcast News Video
Archives," Data Mining Workshops (ICDMW), 2011 IEEE 11th International
Conference on , vol., no., pp.519-526, 11-11 Dec. 2011 doi:
10.1109/ICDMW.2011.101 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6137423&isnumber=6137352

Lin Lin; Chao Chen; Mei-Ling Shyu; Shu-Ching Chen; , "Weighted
Subspace Filtering and Ranking Algorithms for Video Concept
Retrieval," MultiMedia, IEEE , vol.18, no.3, pp.32-43, March 2011 doi:
10.1109/MMUL.2011.35 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5765919&isnumber=5986505

Nan Liu; Yao Zhao; Zhenfeng Zhu; Hanqing Lu; , "Exploiting
Visual-Audio-Textual Characteristics for Automatic TV Commercial Block
Detection and Segmentation," Multimedia, IEEE Transactions on ,
vol.13, no.5, pp.961-973, Oct. 2011 doi: 10.1109/TMM.2011.2160334 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5928417&isnumber=6018340

Yuan Liu; Tao Mei; , "Optimizing Visual Search Reranking via Pairwise
Learning," Multimedia, IEEE Transactions on , vol.13, no.2,
pp.280-291, April 2011 doi: 10.1109/TMM.2010.2103931 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5680970&isnumber=5732768

Mezaris, V.; Sidiropoulos, P.; Kompatsiaris, I.; , "Improving
Interactive Video Retrieval by Exploiting Automatically-Extracted
Video Structural Semantics," Semantic Computing (ICSC), 2011 Fifth
IEEE International Conference on , vol., no., pp.224-227, 18-21
Sept. 2011 doi: 10.1109/ICSC.2011.29 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061337&isnumber=6061289

Min, Hyun-seok; Jae Young Choi; De Neve, Wesley; Ro, Yong Man; ,
"Leveraging an image folksonomy and the Signature Quadratic Form
Distance for semantic-based detection of near-duplicate video clips,"
Multimedia and Expo (ICME), 2011 IEEE International Conference on ,
vol., no., pp.1-6, 11-15 July 2011 doi: 10.1109/ICME.2011.6011937 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6011937&isnumber=6011826

Hyun-seok Min, Jae Young Choi, Wesley De Neve, and Yong Man
Ro. 2011. Bimodal fusion of low-level visual features and high-level
semantic features for near-duplicate video clip detection. Image
Commun. 26, 10 (November 2011),
612-627. DOI=10.1016/j.image.2011.04.001
http://dx.doi.org/10.1016/j.image.2011.04.001

Lin Pang, Juan Cao, Lei Bao, Yongdong Zhang, and Shouxun
Lin. 2011. Towards hierarchical context: unfolding visual community
potential for interactive video retrieval. Multimedia Tools Appl. 55,
1 (October 2011), 151-178. DOI=10.1007/s11042-010-0605-0
http://dx.doi.org/10.1007/s11042-010-0605-0

Sanjay Purushotham, Qi Tian, and C.-C. Jay
Kuo. 2011. Picture-in-picture copy detection using spatial coding
techniques. In Proceedings of the 2011 ACM international workshop on
Automated media analysis and production for novel TV services (AIEMPro
'11), Jean-Pierre Evain, Gerald Friedland, Masanori Sano, and Patrick
Gros (Eds.). ACM, New York, NY, USA,
25-30. DOI=10.1145/2072552.2072559
http://doi.acm.org/10.1145/2072552.2072559

Rajendran, D.; Shivakumara, P.; Bolan Su; Shijian Lu; Chew Lim Tan; ,
"A New Fourier-Moments Based Video Word and Character Extraction
Method for Recognition," Document Analysis and Recognition (ICDAR),
2011 International Conference on , vol., no., pp.1165-1169, 18-21
Sept. 2011 doi: 10.1109/ICDAR.2011.235 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065493&isnumber=6065247

Ranathunga, Lochandaka; Zainuddin, Roziati; Abdullah, Nor. Performance
evaluation of the combination of Compacted Dither Pattern Codes with
Bhattacharyya classifier in video visual concept depiction. Multimedia
Tools and Applications, 2011-08-01, Springer Netherlands, pp. 263-289,
Vol. 54, Issue 2, dx.doi.org/10.1007/s11042-010-0522-2,
10.1007/s11042-010-0522-2

Miriam Redi and Bernard Merialdo. 2011. Marginal-based visual
alphabets for local image descriptors aggregation. In Proceedings of
the 19th ACM international conference on Multimedia (MM '11). ACM, New
York, NY, USA, 1429-1432. DOI=10.1145/2072298.2072032
http://doi.acm.org/10.1145/2072298.2072032

Miriam Redi and Bernard Merialdo. 2011. Saliency moments for image
categorization. In Proceedings of the 1st ACM International Conference
on Multimedia Retrieval (ICMR '11). ACM, New York, NY, USA, , Article
39 , 8 pages. DOI=10.1145/1991996.1992035
http://doi.acm.org/10.1145/1991996.1992035

Reede Ren, John Collomosse, and Joemon Jose. 2011. A BOVW based query
generative model. In Proceedings of the 17th international conference
on Advances in multimedia modeling - Volume Part I (MMM'11), Kuo-Tien
Lee, Jun-Wei Hsieh, Wen-Hsiang Tsai, Hong-Yuan Mark Liao, and Tsuhan
Chen (Eds.), Vol. Part I. Springer-Verlag, Berlin, Heidelberg,
118-128.

Roopalakshmi, R.; Reddy, G.R.M.; , "A Novel CBCD Approach Using MPEG-7
Motion Activity Descriptors," Multimedia (ISM), 2011 IEEE
International Symposium on , vol., no., pp.179-184, 5-7 Dec. 2011 doi:
10.1109/ISM.2011.36 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6123343&isnumber=6123309

Roopalakshmi, R.; Reddy, G.R.M.; , "Towards a new approach to video
copy detection using acoustic features," Internet Multimedia Systems
Architecture and Application (IMSAA), 2011 IEEE 5th International
Conference on , vol., no., pp.1-5, 12-13 Dec. 2011 doi:
10.1109/IMSAA.2011.6156336 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6156336&isnumber=6156331

Bahjat Safadi and Georges Quénot. 2011. Re-ranking by local re-scoring
for video indexing and retrieval. In Proceedings of the 20th ACM
international conference on Information and knowledge management (CIKM
'11), Bettina Berendt, Arjen de Vries, Wenfei Fan, Craig Macdonald,
Iadh Ounis, and Ian Ruthven (Eds.). ACM, New York, NY, USA,
2081-2084. DOI=10.1145/2063576.2063895
http://doi.acm.org/10.1145/2063576.2063895

Bahjat Safadi and Georges Quénot. 2011. Re-ranking for multimedia
indexing and retrieval. In Proceedings of the 33rd European conference
on Advances in information retrieval (ECIR'11), Paul Clough, Colum
Foley, Cathal Gurrin, Hyowon Lee, and Gareth J. F. Jones
(Eds.). Springer-Verlag, Berlin, Heidelberg, 708-711.

Markus Seidl, Matthias Zeppelzauer, Dalibor Mitrović, and Christian
Breiteneder. 2011. Gradual transition detection in historic film
material—a systematic study. J. Comput. Cult. Herit. 4, 3, Article 10
(December 2011), 18 pages. DOI=10.1145/2069276.2069279
http://doi.acm.org/10.1145/2069276.2069279

Kimiaki Shirahama, Yuta Matsuoka, and Kuniaki Uehara. 2011. Video
event retrieval from a small number of examples using rough set
theory. In Proceedings of the 17th international conference on
Advances in multimedia modeling - Volume Part I (MMM'11), Kuo-Tien
Lee, Jun-Wei Hsieh, Wen-Hsiang Tsai, Hong-Yuan Mark Liao, and Tsuhan
Chen (Eds.), Vol. Part I. Springer-Verlag, Berlin, Heidelberg, 96-106.

Shirahama, K.; Uehara, K.; , "Query by Virtual Example: Video
Retrieval Using Example Shots Created by Virtual Reality Techniques,"
Image and Graphics (ICIG), 2011 Sixth International Conference on ,
vol., no., pp.829-834, 12-15 Aug. 2011 doi: 10.1109/ICIG.2011.158 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6005957&isnumber=6005527

Shirahama, K.; Uehara, K.; , "Utilizing Video Ontology for Fast and
Accurate Query-by-Example Retrieval," Semantic Computing (ICSC), 2011
Fifth IEEE International Conference on , vol., no., pp.395-402, 18-21
Sept. 2011 doi: 10.1109/ICSC.2011.88 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6061440&isnumber=6061289

Kimiaki Shirahama and Kuniaki Uehara. 2011. Effectiveness of video
ontology in query by example approach. In Proceedings of the 7th
international conference on Active media technology (AMT'11), Ning
Zhong, Vic Callaghan, Ali A. Ghorbani, and Bin Hu
(Eds.). Springer-Verlag, Berlin, Heidelberg, 49-58.

Shivakumara, P.; Trung Quy Phan; Shijian Lu; Chew Lim Tan; , "Video
Character Recognition through Hierarchical Classification," Document
Analysis and Recognition (ICDAR), 2011 International Conference on ,
vol., no., pp.131-135, 18-21 Sept. 2011 doi: 10.1109/ICDAR.2011.35
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065290&isnumber=6065247

Shivakumara, P.; Bhowmick, S.; Bolan Su; Tan, C.L.; Pal, U.; , "A New
Gradient Based Character Segmentation Method for Video Text
Recognition," Document Analysis and Recognition (ICDAR), 2011
International Conference on , vol., no., pp.126-130, 18-21 Sept. 2011
doi: 10.1109/ICDAR.2011.34 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6065289&isnumber=6065247

Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I.; Meinedo, H.; Bugalho,
M.; Trancoso, I.; , "Temporal Video Segmentation to Scenes Using
High-Level Audiovisual Features," Circuits and Systems for Video
Technology, IEEE Transactions on , vol.21, no.8, pp.1163-1177,
Aug. 2011 doi: 10.1109/TCSVT.2011.2138830 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5742987&isnumber=5970267

Mats Sjöberg and Jorma Laaksonen. 2011. Analysing the structure of
semantic concepts in visual databases. In Proceedings of the 8th
international conference on Advances in self-organizing maps
(WSOM'11), Jorma Laaksonen and Timo Honkela (Eds.). Springer-Verlag,
Berlin, Heidelberg, 338-347.

Takahashi, M.; Naemura, M.; Fujii, M.; Satoh, S.; , "Human action
recognition in crowded surveillance video sequences by using features
taken from key-point trajectories," Computer Vision and Pattern
Recognition Workshops (CVPRW), 2011 IEEE Computer Society Conference
on , vol., no., pp.9-16, 20-25 June 2011 doi:
10.1109/CVPRW.2011.5981713 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5981713&isnumber=5981671

Tapu, R.; Mocanu, B.; Raducanu, M.; Petrescu, T.; , "Multiresolution
median filtering based video temporal segmentation," Signals, Circuits
and Systems (ISSCS), 2011 10th International Symposium on , vol., no.,
pp.1-4, June 30 2011-July 1 2011 doi: 10.1109/ISSCS.2011.5978651 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5978651&isnumber=5978636

Yonghong Tian; Menglin Jiang; Luntian Mou; Xiaoyu Fang; Tiejun Huang;
, "A multimodal video copy detection approach with sequential pyramid
matching," Image Processing (ICIP), 2011 18th IEEE International
Conference on , vol., no., pp.3629-3632, 11-14 Sept. 2011 doi:
10.1109/ICIP.2011.6116504 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116504&isnumber=6115588

Ioannis Tsampoulatidis, Nikolaos Gkalelis, Anastasios Dimou, Vasileios
Mezaris, and Ioannis Kompatsiaris. 2011. High-level event detection
system based on discriminant visual concepts. In Proceedings of the
1st ACM International Conference on Multimedia Retrieval (ICMR
'11). ACM, New York, NY, USA, , Article 68 , 2
pages. DOI=10.1145/1991996.1992064
http://doi.acm.org/10.1145/1991996.1992064

Yusuke Uchida, Motilal Agrawal, and Shigeyuki Sakazawa, "Accurate
Content-Based Video Copy Detection with Efficient Feature Indexing,"
Proceedings of the 1st ACM International Conference on Multimedia
Retrieval, Trento, Italy, 2011.
http://dl.acm.org/citation.cfm?id=1992015

Vahdat, A.; Bo Gao; Ranjbar, M.; Mori, G.; , "A discriminative key
pose sequence model for recognizing human interactions," Computer
Vision Workshops (ICCV Workshops), 2011 IEEE International Conference
on , vol., no., pp.1729-1736, 6-13 Nov. 2011 doi:
10.1109/ICCVW.2011.6130458 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6130458&isnumber=6130192

Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek,
"Empowering Visual Categorization with the GPU," IEEE Transactions on
Multimedia, vol. 13, iss. 1, pp. 60-70, 2011.

Hung Thanh Vu; Thanh Duc Ngo; Thao Ngoc Nguyen; Duy-Dinh Le; Satoh,
S.; Bac Hoai Le; Duc Anh Duong; , "Fast face sequence matching in
large-scale video databases," Image Processing (ICIP), 2011 18th IEEE
International Conference on , vol., no., pp.2549-2552, 11-14
Sept. 2011 doi: 10.1109/ICIP.2011.6116183 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6116183&isnumber=6115588

Kong-Wah Wan, Yan-Tao Zheng, and Lekha Chaisorn. 2011. Known-item
video search via query-to-modality mapping. In Proceedings of the 19th
ACM international conference on Multimedia (MM '11). ACM, New York,
NY, USA, 1133-1136. DOI=10.1145/2072298.2071957
http://doi.acm.org/10.1145/2072298.2071957

Jingdong Wang, Yinghai Zhao, Xiuqing Wu, and Xian-Sheng Hua. 2011. A
transductive multi-label learning approach for video concept
detection. Pattern Recogn. 44, 10-11 (October 2011),
2274-2286. DOI=10.1016/j.patcog.2010.07.015
http://dx.doi.org/10.1016/j.patcog.2010.07.015

Lei Wang, Dawei Song, and Eyad Elyan. 2011. Words-of-interest
selection based on temporal motion coherence for video retrieval. In
Proceedings of the 34th international ACM SIGIR conference on Research
and development in Information Retrieval (SIGIR '11). ACM, New York,
NY, USA, 1197-1198. DOI=10.1145/2009916.2010117
http://doi.acm.org/10.1145/2009916.2010117

Lezi Wang; Yuan Dong; Hongliang Bai; Wei Liu; Kun Tao; , "A word-based
approach for duplicate picture in picture sequence detection,"
Broadband Network and Multimedia Technology (IC-BNMT), 2011 4th IEEE
International Conference on , vol., no., pp.286-290, 28-30 Oct. 2011
doi: 10.1109/ICBNMT.2011.6155942 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6155942&isnumber=6155882

Xiangyu Wang, Yong Rui, and Mohan S. Kankanhalli. 2011. Up-fusion: an
evolving multimedia decision fusion method. In Proceedings of the 19th
ACM international conference on Multimedia (MM '11). ACM, New York,
NY, USA, 1089-1092. DOI=10.1145/2072298.2071945
http://doi.acm.org/10.1145/2072298.2071945

Xiao-Yong Wei and Zhen-Qun Yang. 2011. Coached active learning for
interactive video search. In Proceedings of the 19th ACM international
conference on Multimedia (MM '11). ACM, New York, NY, USA,
443-452. DOI=10.1145/2072298.2072356
http://doi.acm.org/10.1145/2072298.2072356

Xin-Shun Xu, Yuan Jiang, Liang Peng, Xiangyang Xue, and Zhi-Hua
Zhou. 2011. Ensemble approach based on conditional random field for
multi-label image and video annotation. In Proceedings of the 19th ACM
international conference on Multimedia (MM '11). ACM, New York, NY,
USA, 1377-1380. DOI=10.1145/2072298.2072019
http://doi.acm.org/10.1145/2072298.2072019

Xin-Shun Xu, Xiangyang Xue, and Zhi-Hua Zhou. 2011. Ensemble
multi-instance multi-label learning approach for video annotation
task. In Proceedings of the 19th ACM international conference on
Multimedia (MM '11). ACM, New York, NY, USA,
1153-1156. DOI=10.1145/2072298.2071962
http://doi.acm.org/10.1145/2072298.2071962

Jian Yi, Yuxin Peng, and Jianguo Xiao. 2011. Mining concept
relationship in temporal context for effective video annotation. In
Proceedings of the 19th ACM international conference on Multimedia (MM
'11). ACM, New York, NY, USA, 1053-1056. DOI=10.1145/2072298.2071936
http://doi.acm.org/10.1145/2072298.2071936

Jin Yuan, Zheng-Jun Zha, Yao-Tao Zheng, Meng Wang, Xiangdong Zhou, and
Tat-Seng Chua. 2011. Learning concept bundles for video search with
complex queries. In Proceedings of the 19th ACM international
conference on Multimedia (MM '11). ACM, New York, NY, USA,
453-462. DOI=10.1145/2072298.2072357
http://doi.acm.org/10.1145/2072298.2072357

Jin Yuan; Zheng-Jun Zha; Yan-Tao Zheng; Meng Wang; Xiangdong Zhou;
Tat-Seng Chua; , "Utilizing Related Samples to Enhance Interactive
Concept-Based Video Search," Multimedia, IEEE Transactions on ,
vol.13, no.6, pp.1343-1355, Dec. 2011 doi: 10.1109/TMM.2011.2168813
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6022804&isnumber=6069890

Lu Zhang, Tao Mei, Yuan Liu, Dacheng Tao, and He-Qin
Zhou. 2011. Visual search reranking via adaptive particle swarm
optimization. Pattern Recogn. 44, 8 (August 2011),
1811-1820. DOI=10.1016/j.patcog.2011.01.016
http://dx.doi.org/10.1016/j.patcog.2011.01.016

Qiusha Zhu; Lin Lin; Mei-Ling Shyu; Shu-Ching Chen; , "Effective
supervised discretization for classification based on correlation
maximization," Information Reuse and Integration (IRI), 2011 IEEE
International Conference on , vol., no., pp.390-395, 3-5 Aug. 2011
doi: 10.1109/IRI.2011.6009579 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6009579&isnumber=6009494




---------------------------------------------------------------------
2010 (72)
---------------------------------------------------------------------

Aly, Robin and Doherty, Aiden and Hiemstra, Djoerd and Smeaton,
A. 2010.  Beyond Shot Retrieval: Searching for Broadcast News Items
Using Language Models of Concepts  in ECIR '10: Proceedings of the 32th
European Conference on IR Research on Advances in Information
Retrieval}, Lecture Notes in Computer Science, Vol 5993, pp. 241-252,
Springer Verlag.

Amiri, A.; Fathy, M.; Naseri, A.; , "Key-frame extraction and video
summarization using QR-Decomposition," Digital Content, Multimedia
Technology and its Applications (IDC), 2010 6th International
Conference on , vol., no., pp.134-139, 16-18 Aug. 2010 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5568717&isnumber=5568515

Asaidi, H.; Aarab, A.; , "Visual video retrieval using Multivariate
GARCH models," I/V Communications and Mobile Network (ISVC), 2010 5th
International Symposium on , vol., no., pp.1-4, Sept. 30 2010-Oct. 2
2010 doi: 10.1109/ISVC.2010.5656176 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5656176&isnumber=5654712

Ates, T.K.; Esen, E.; Saracoglu, A.; Soysal, M.; Turgut, Y.; Oktay,
O.; Alatan, A.A.; , "Content based video copy detection with local
descriptors," Signal Processing and Communications Applications
Conference (SIU), 2010 IEEE 18th , vol., no., pp.49-52, 22-24 April
2010 doi: 10.1109/SIU.2010.5654395 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5654395&isnumber=5648807

Stephane Ayache, Georges Quenot, Andy Tseng. 2010. The lIGVID
system for video retrieval and concept annotation. March 2010 MIR '10:
Proceedings of the international conference on Multimedia information
retrieval

Werner Bailer. Evaluating Detection of Near Duplicate Video
Segments. Proceedings of the ACM International Conference on Image and
Video Retrieval, Xian, China, July 2010.

Werner Bailer, Wolfgang Weiss, Gert Kienast, Georg Thallinger and
Werner Haas. A Video Browsing Tool for Content Management in
Post-production. International Journal of Digital Multimedia
Broadcasting, Mar. 2010.

Chantamunee, S.; Gotoh, Y.; , "Nearly-repetitive video synchronisation
using nonlinear manifold embedding," Acoustics Speech and Signal
Processing (ICASSP), 2010 IEEE International Conference on , vol.,
no., pp.2282-2285, 14-19 March 2010 doi: 10.1109/ICASSP.2010.5495925
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5495925&isnumber=5494886

Chayanurak, R.; Cooharojananone, N.; Satoh, S.; Lipikorn, R.; ,
"Carried object detection using star skeleton with adaptive centroid
and time series graph," Signal Processing (ICSP), 2010 IEEE 10th
International Conference on , vol., no., pp.736-739, 24-28 Oct. 2010
doi: 10.1109/ICOSP.2010.5655765 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5655765&isnumber=5654687


Juan Chen; , "Detection of video copies based on robust descriptors,"
Apperceiving Computing and Intelligence Analysis (ICACIA), 2010
International Conference on , vol., no., pp.303-306, 17-19 Dec. 2010
doi: 10.1109/ICACIA.2010.5709906 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5709906&isnumber=5709837
	
Shi Chen; Jinqiao Wang; Yi Ouyang; Bo Wang; Qi Tian; Hanqing Lu; ,
"Multi-level trajectory modeling for video copy detection," Acoustics
Speech and Signal Processing (ICASSP), 2010 IEEE International
Conference on , vol., no., pp.2378-2381, 14-19 March 2010 doi:
10.1109/ICASSP.2010.5496165 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5496165&isnumber=5494886

Xiaolin Chen; Xiaokang Yang; Rui Zhang; Anwen Liu; Shibao Zheng; ,
"Edge region color autocorrelogram: A new low-level feature applied in
CBIR," Broadband Multimedia Systems and Broadcasting (BMSB), 2010 IEEE
International Symposium on , vol., no., pp.1-4, 24-26 March 2010 doi:
10.1109/ISBMSB.2010.5463087 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5463087&isnumber=5463070

Xiangang Cheng, Liang-Tien Chia. 2010. Stratification-based keyframe
cliques for removal of near-duplicates in video search results. March
2010 MIR '10: Proceedings of the international conference on
Multimedia information retrieval

Cirakman, O.; Gunsel, B.; Sengor, N.S.; Gursoy, O.; , "Key-frame based
video fingerprinting by NMF," Image Processing (ICIP), 2010 17th IEEE
International Conference on , vol., no., pp.2373-2376, 26-29
Sept. 2010 doi: 10.1109/ICIP.2010.5652649 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5652649&isnumber=5648792

de Rooij, O.; Worring, M.; , "Browsing Video Along Multiple Threads,"
Multimedia, IEEE Transactions on , vol.12, no.2, pp.121-130, Feb. 2010
doi: 10.1109/TMM.2009.2037388 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5340554&isnumber=5379168

Ding, G.; Qin, K.; , "Semantic classifier based on compressed sensing
for image and video annotation," Electronics Letters , vol.46, no.6,
pp.417-419, March 18 2010 doi: 10.1049/el.2010.2295 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5434620&isnumber=5434595

Diou, C.; Stephanopoulos, G.; Panagiotopoulos, P.; Papachristou, C.;
Dimitriou, N.; Delopoulos, A.; , "Large-Scale Concept Detection in
Multimedia Data Using Small Training Sets and Cross-Domain Concept
Fusion," Circuits and Systems for Video Technology, IEEE Transactions
on , vol.20, no.12, pp.1808-1821, Dec. 2010 doi:
10.1109/TCSVT.2010.2087814 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5604666&isnumber=5704816

Douze, M.; Jegou, H.; Schmid, C.; , "An Image-Based Approach to Video
Copy Detection With Spatio-Temporal Post-Filtering," Multimedia, IEEE
Transactions on , vol.12, no.4, pp.257-266, June 2010 doi:
10.1109/TMM.2010.2046265 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5437235&isnumber=5463236

Feng, Y.; Ren, J.; Jiang, J.; , "Mixed ranking scheme for video
retrieval," Electronics Letters , vol.46, no.24, pp.1600-1601,
November 25 2010 doi: 10.1049/el.2010.8621 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5659664&isnumber=5659647

Ke Gao, Yongdong Zhang, Wei Zhang, Shouxun Lin. 2010. Affine Stable
Characteristic based sample expansion for object detection.  July 2010
CIVR '10: Proceedings of the ACM International Conference on Image and
Video Retrieval

Bo Geng; Linjun Yang; Chao Xu; Xian-Sheng Hua; , "Content-aware
Ranking for visual search," Computer Vision and Pattern Recognition
(CVPR), 2010 IEEE Conference on , vol., no., pp.3400-3407, 13-18 June
2010 doi: 10.1109/CVPR.2010.5540003 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5540003&isnumber=5539770

Gupta, V.; Boulianne, G.; Cardinal, P.; , "Content-based audio copy
detection using nearest-neighbor mapping," Acoustics Speech and Signal
Processing (ICASSP), 2010 IEEE International Conference on , vol.,
no., pp.261-264, 14-19 March 2010 doi: 10.1109/ICASSP.2010.5495963
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5495963&isnumber=5494886

Gupta, V.; Boulianne, G.; Cardinal, P.; , "Crim's content-based audio
copy detection system for TRECVID 2009," Content-Based Multimedia
Indexing (CBMI), 2010 International Workshop on , vol., no., pp.1-6,
23-25 June 2010 doi: 10.1109/CBMI.2010.5529908 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5529908&isnumber=5529836

Gürsoy, O.; Kutluk, S.; Günsel, B.; Şengör, N.; , "Negatif olmayan
matris ayrıştirma ile ikili video kiyimlama binary video hashing by
non-negative matrix factorization," Signal Processing and
Communications Applications Conference (SIU), 2010 IEEE 18th , vol.,
no., pp.894-897, 22-24 April 2010 doi: 10.1109/SIU.2010.5651268 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5651268&isnumber=5648807

Hongzhong Tang; Huixian Huang; Songhao Zhu; , "Video concept detection
based on spatio-temporal correlation," Computer Application and System
Modeling (ICCASM), 2010 International Conference on , vol.8, no.,
pp.V8-638-V8-642, 22-24 Oct. 2010 doi: 10.1109/ICCASM.2010.5620186
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5620186&isnumber=5619026

Wolfgang Huerst, Cees G. M. Snoek, Willem-Jan Spoel, and Mate Tomin,
"Keep Moving! Revisiting Thumbnails for Mobile Video Retrieval," in
Proceedings of the ACM International Conference on Multimedia,
Firenze, Italy, 2010.

Bouke Huurnink, Cees G. M. Snoek, Maarten de Rijke, and Arnold
W. M. Smeulders, "Today's and Tomorrow's Retrieval Practice in the
Audiovisual Archive," in Proceedings of the ACM International
Conference on Image and Video Retrieval, Xi'an, China, 2010,
pp. 18-25.

Nakamasa Inoue, Tatsuhiko Saito, Koichi Shinoda and Sadaoki Furui,
"High-Level Feature Extraction Using SIFT GMMs and Audio Models", In
Proceedings of the International Conference on Pattern Recognition,
pp. 3220-3223, Istanbul, Turkey, August 2010.

Yu-Gang Jiang; Jun Yang; Chong-Wah Ngo; Hauptmann, A.G.; ,
"Representations of Keypoint-Based Semantic Concept Detection: A
Comprehensive Study," Multimedia, IEEE Transactions on , vol.12, no.1,
pp.42-53, Jan. 2010 doi: 10.1109/TMM.2009.2036235 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5332300&isnumber=5353832

Huan Li; Yuan Shi; Mingyu Chen; Hauptmann, A.; Zhang Xiong; ,
"Joint-AL: Joint Discriminative and Generative Active Learning for
Cross-Domain Semantic Concept Classification," Semantic Computing
(ICSC), 2010 IEEE Fourth International Conference on , vol., no.,
pp.60-66, 22-24 Sept. 2010 doi: 10.1109/ICSC.2010.86 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5628856&isnumber=5628562

LI Li; Weiming Hu; Bing Li; Chunfeng Yuan; Pengfei Zhu; Wanqing Li; ,
"Event Recognition Based on Top-Down Motion Attention," Pattern
Recognition (ICPR), 2010 20th International Conference on , vol., no.,
pp.3561-3564, 23-26 Aug. 2010 doi: 10.1109/ICPR.2010.869 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5597831&isnumber=5595735
	
Yuanning Li, Yonghong Tian, Jingjing Yang, Ling-Yu Duan, Wen
Gao. 2010. Video retargeting with multi-scale trajectory optimization.
March 2010 MIR '10: Proceedings of the international conference on
Multimedia information retrieval

Yuanning Li; Yonghong Tian; Ling-Yu Duan; Jingjing Yang; Tiejun Huang;
Wen Gao; , "Sequence Multi-Labeling: A Unified Video Annotation Scheme
With Spatial and Temporal Context," Multimedia, IEEE Transactions on ,
vol.12, no.8, pp.814-828, Dec. 2010 doi: 10.1109/TMM.2010.2066960 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5549915&isnumber=5623225
	
Ke-yan Liu; Tong Zhang; Lei Wang; , "A new parallel video
understanding and retrieval system," Multimedia and Expo (ICME), 2010
IEEE International Conference on , vol., no., pp.679-684, 19-23 July
2010 doi: 10.1109/ICME.2010.5583873 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583873&isnumber=5582530

Zhu Liu, Tao Liu, David C. Gibbon, Behzad Shahraray. 2010. Effective
and scalable video copy detection March 2010 MIR '10: Proceedings of
the international conference on Multimedia information retrieval
	
Yang Liu, Wan-Lei Zhao, Chong-Wah Ngo, Chang-Sheng Xu, Han-Qing
Lu. 2010. Coherent bag-of audio words model for efficient large-scale
video copy detection. July 2010 CIVR '10: Proceedings of the ACM
International Conference on Image and Video Retrieval

Shiyang Lu; Zhiyong Wang; Meng Wang; Ott, M.; Dagan Feng; , "Adaptive
reference frame selection for near-duplicate video shot detection,"
Image Processing (ICIP), 2010 17th IEEE International Conference on ,
vol., no., pp.2341-2344, 26-29 Sept. 2010 doi:
10.1109/ICIP.2010.5649254 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5649254&isnumber=5648792

Mezaris, V.; Sidiropoulos, P.; Dimou, A.; Kompatsiaris, I.; , "On the
Use of Visual Soft Semantics for Video Temporal Decomposition to
Scenes," Semantic Computing (ICSC), 2010 IEEE Fourth International
Conference on , vol., no., pp.141-148, 22-24 Sept. 2010 doi:
10.1109/ICSC.2010.23 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5628943&isnumber=5628562

Dianting Liu; Mei-Ling Shyu; Chao Chen; Shu-Ching Chen; , "Integration
of global and local information in videos for key frame extraction,"
Information Reuse and Integration (IRI), 2010 IEEE International
Conference on , vol., no., pp.171-176, 4-6 Aug. 2010 doi:
10.1109/IRI.2010.5558944 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5558944&isnumber=5558895

Natsev, A.; Hill, M.; Smith, J.R.; , "Design and evaluation of an
effective and efficient video copy detection system," Multimedia and
Expo (ICME), 2010 IEEE International Conference on , vol., no.,
pp.1353-1358, 19-23 July 2010 doi: 10.1109/ICME.2010.5583216 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583216&isnumber=5582530

Thao Ngoc Nguyen, Thanh Duc Ngo, Duy-Dinh Le, Shin'ichi Satoh, Bac
Hoai Le, Duc Anh Duong. 2010. An efficient method for face retrieval
from large video datasets. July 2010 CIVR '10: Proceedings of the ACM
International Conference on Image and Video Retrieval

Lin Pang, Juan Cao, Yongdong Zhang, Shouxun Lin. 2010. Hierarchical
feedback algorithm based on visual community discovery for interactive
video retrieval. July 2010 CIVR '10: Proceedings of the ACM
International Conference on Image and Video Retrieval

Pinheiro, A.M.G.; , "Performance analysis of the Edge Pixel
Orientations Histogram," Image Analysis for Multimedia Interactive
Services (WIAMIS), 2010 11th International Workshop on , vol., no.,
pp.1-4, 12-14 April 2010 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5617641&isnumber=5617638

Yu Qiu; Genliang Guan; Zhiyong Wang; Dagan Feng; , "Improving News
Video Annotation with Semantic Context," Digital Image Computing:
Techniques and Applications (DICTA), 2010 International Conference on
, vol., no., pp.214-219, 1-3 Dec. 2010 doi: 10.1109/DICTA.2010.47 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5692567&isnumber=5692215

Ranathunga, L.; Zainuddin, R.; Abdullah, N.A.; , "Semantic visual
search with feature space reduction," Information and Automation for
Sustainability (ICIAFs), 2010 5th International Conference on , vol.,
no., pp.463-468, 17-19 Dec. 2010 doi: 10.1109/ICIAFS.2010.5715706 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5715706&isnumber=5715624

Roth, G.; Laganière, R.; Lambert, P.; Lakhmiri, I.; Janati, T.; , "A
Simple but Effective Approach to Video Copy Detection," Computer and
Robot Vision (CRV), 2010 Canadian Conference on , vol., no., pp.63-70,
May 31 2010-June 2 2010 doi: 10.1109/CRV.2010.15 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5479485&isnumber=5479157

Stevan Rudinac, Martha Larson, Alan Hanjalic. 2010. Visual
concept-based selection of query expansions for spoken content
retrieval. July 2010 SIGIR '10: Proceeding of the 33rd international
ACM SIGIR conference on Research and development in information
retrieval

Safadi, B.; Quenot, G.; , "Active learning with multiple classifiers
for multimedia indexing," Content-Based Multimedia Indexing (CBMI),
2010 International Workshop on , vol., no., pp.1-6, 23-25 June 2010
doi: 10.1109/CBMI.2010.5529910 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5529910&isnumber=5529836

Saracoğlu, A.; Tekin, M.; Esen, E.; Soysal, M.; Loğoğlu, K.B.; Ateş,
T.K.; Sevinç, A.M.; Sevimli, H.; Acar, B.O.; Zubari, U.; Ozan, E.C.;
Alatan, A.A.; , "Generalized visual concept detection," Signal
Processing and Communications Applications Conference (SIU), 2010 IEEE
18th , vol., no., pp.621-624, 22-24 April 2010 doi:
10.1109/SIU.2010.5650360 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5650360&isnumber=5648807

Klaus Schoeffmann, Frank Hopfgartner, Oge Marques, Laszlo
Boeszoermenyi, and Joemon M. Jose. 2010 Video browsing interfaces and
applications: a review SPIE Reviews 1, 018004 (2010),
DOI:10.1117/6.0000005

Markus Seidl, Matthias Zeppelzauer, and Christian Breiteneder. 2010. A
study of gradual transition detection in historic film material. In
Proceedings of the second workshop on eHeritage and digital art
preservation (eHeritage '10). ACM, New York, NY, USA,
13-18. DOI=10.1145/1877922.1877929
http://doi.acm.org/10.1145/1877922.1877929

Shirahama, Kimiaki; Lin Yanpeng; Matsuoka, Yuta; Uehara, Kuniaki; ,
"Query by example for large-scale video data by parallelizing rough
set theory based on MapReduce," Science and Social Research (CSSR),
2010 International Conference on , vol., no., pp.390-395, 5-7
Dec. 2010 doi: 10.1109/CSSR.2010.5773806 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5773806&isnumber=5773667

Cees G. M. Snoek and Arnold W. M. Smeulders, "Visual-Concept Search
Solved?," IEEE Computer, vol. 43, iss. 6, pp. 76-78, 2010.

Tahayna, B.; Belkhatir, M.; Alhashmi, S.M.; O'Daniel, T.; , "Human
action detection and classification using optimal bag-of-words
representation," Digital Content, Multimedia Technology and its
Applications (IDC), 2010 6th International Conference on , vol., no.,
pp.75-80, 16-18 Aug. 2010 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5568597&isnumber=5568515

Tuan Hue Thi; Jian Zhang; Li Cheng; Li Wang; Satoh, S.; , "Human
Action Recognition and Localization in Video Using Structured Learning
of Local Space-Time Features," Advanced Video and Signal Based
Surveillance (AVSS), 2010 Seventh IEEE International Conference on ,
vol., no., pp.204-211, Aug. 29 2010-Sept. 1 2010 doi:
10.1109/AVSS.2010.76 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5597147&isnumber=5597063

Jeff Ubois, Jamie Davidson, Marko Grobelnik, Paul Over, Hans
Westerhof. 2010. Video search: are algorithms all we need? April 2010
WWW '10: Proceedings of the 19th international conference on World
wide web

Uijlings, J.R.R.; Smeulders, A.W.M.; Scha, R.J.H.; , "Real-Time Visual
Concept Classification," Multimedia, IEEE Transactions on , vol.12,
no.7, pp.665-681, Nov. 2010 doi: 10.1109/TMM.2010.2052027 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5482156&isnumber=5601986

Adrian Ulges, Christian Schulze, Markus Koch, and Thomas
M. Breuel. 2010. Learning automatic concept detectors from online
video.  Computer Vision and Image Understanding Volume 114, Issue 4,
April 2010, Pages 429-438

David Vallet, Ivan Cantador, Joemon M. Jose. 2010. Exploiting
external knowledge to improve video retrieval.  March 2010 MIR '10:
Proceedings of the international conference on Multimedia information
retrieval

David Vallet, Frank Hopfgartner, Joemon M. Jose, and Pablo
Castells. 2011. Effects of Usage-Based Feedback on Video Retrieval: A
Simulation-Based Study. ACM Trans. Inf. Syst. 29, 2, Article 11 (April
2011), 32 pages. DOI=10.1145/1961209.1961214
http://doi.acm.org/10.1145/1961209.1961214

Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek,
"Evaluating Color Descriptors for Object and Scene Recognition," IEEE
Transactions on Pattern Analysis and Machine Intelligence, vol. 32,
iss. 9, pp. 1582-1596, 2010.

Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek,
"Accelerating Visual Categorization with the GPU," in ECCV Workshop on
Computer Vision on GPU, Crete, Greece, 2010.

Jan C. van Gemert, Cees G. M. Snoek, Cor J. Veenman, Arnold
W. M. Smeulders, and Jan-Mark Geusebroek, "Comparing Compact Codebooks
for Visual Categorization," Computer Vision and Image Understanding,
vol. 114, iss. 4, pp. 450-462, 2010.

Stefanos Vrochidis, Ioannis Kompatsiaris, Ioannis
Patras. 2010. Optimizing visual search with implicit user feedback in
interactive video retrieval July 2010 CIVR '10: Proceedings of the ACM
International Conference on Image and Video Retrieval

Kong-Wah Wan; Ah-Hwee Tan; Joo-Hwee Lim; Liang-Tien Chia; , "Faceted
topic retrieval of news video using joint topic modeling of visual
features and speech transcripts," Multimedia and Expo (ICME), 2010
IEEE International Conference on , vol., no., pp.843-848, 19-23 July
2010 doi: 10.1109/ICME.2010.5583061 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5583061&isnumber=5582530

Yaowei Wang; Yonghong Tian; Lingyu Duan; Zhipeng Hu; Guochen Jia; ,
"ESUR: A system for Events detection in SURveillance video," Image
Processing (ICIP), 2010 17th IEEE International Conference on , vol.,
no., pp.2317-2320, 26-29 Sept. 2010 doi: 10.1109/ICIP.2010.5654246
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5654246&isnumber=5648792
	
Peter Wilkins, Alan F. Smeaton, Paul Ferguson. 2010. Properties of
optimally weighted data fusion in CBMIR. July 2010 SIGIR '10:
Proceeding of the 33rd international ACM SIGIR conference on Research
and development in information retrieval

Yu Xiang; Xiangdong Zhou; Zuotao Liu; Tat-Seng Chua; Chong-Wah Ngo; ,
"Semantic context modeling with maximal margin Conditional Random
Fields for automatic image annotation," Computer Vision and Pattern
Recognition (CVPR), 2010 IEEE Conference on , vol., no., pp.3368-3375,
13-18 June 2010 doi: 10.1109/CVPR.2010.5540015 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5540015&isnumber=5539770

Xinxing Xu; Dong Xu; Tsang, I.W.; , "Video Concept Detection Using
Support Vector Machine with Augmented Features," Image and Video
Technology (PSIVT), 2010 Fourth Pacific-Rim Symposium on , vol., no.,
pp.381-385, 14-17 Nov. 2010 doi: 10.1109/PSIVT.2010.70 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5673969&isnumber=5673675

Jin Yuan, Zheng-Jun Zha, Zhengdong Zhao, Xiangdong Zhou, Tat-Seng
Chua. 2010. Utilizing related samples to learn complex queries in
interactive concept-based video search. July 2010 CIVR '10:
Proceedings of the ACM International Conference on Image and Video
Retrieval

Hui Zhang; Zhicheng Zhao; Anni Cai; Xiaohui Xie; , "A novel framework
for content-based video copy detection," Network Infrastructure and
Digital Content, 2010 2nd IEEE International Conference on , vol.,
no., pp.753-757, 24-26 Sept. 2010 doi: 10.1109/ICNIDC.2010.5657881
URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5657881&isnumber=5657774

Zhicheng Zhao; Xiaodan Liu; , "A segment-based advertisement search
method from TV stream," Future Computer and Communication (ICFCC),
2010 2nd International Conference on , vol.2, no., pp.V2-690-V2-693,
21-24 May 2010 doi: 10.1109/ICFCC.2010.5497581 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5497581&isnumber=5497284

Qiusha Zhu; Lin Lin; Mei-Ling Shyu; Shu-Ching Chen; , "Feature
Selection Using Correlation and Reliability Based Scoring Metric for
Video Semantic Detection," Semantic Computing (ICSC), 2010 IEEE Fourth
International Conference on , vol., no., pp.462-469, 22-24 Sept. 2010
doi: 10.1109/ICSC.2010.65 URL:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5629038&isnumber=5628562




---------------------------------------------------------------------
2009 (39)
---------------------------------------------------------------------

Aly, Robin and Hiemstra, Djoerd. 2009. Concept detectors: how good is
good enough? in MM '09: Proceedings of the Seventeenth ACM
International Conference on Multimedia, Beijing, China, pp. 233 - 242,
New York, NY, USA. ACM, doi:doi.acm.org/10.1145/1631272.1631306,
isbn:978-1-60558-608-3

Robin Aly, Djoerd Hiemstra, Arjen P. de Vries. 2009.  Reusing
Annotation Labor for Concept Selection. in CIVR '09: Proceedings of
the International Conference on Content-Based Image and Video
Retrieval 2009, Santorini

Ioannis Arapakis, Ioannis Konstas, Joemon M. Jose, Ioannis
Kompatsiaris. 2009. Modeling facial expressions and peripheral
physiological signals to predict topical relevance.  July 2009 SIGIR
'09: Proceedings of the 32nd international ACM SIGIR conference on
Research and development in information retrieval

Stéphane Ayache, Georges Quénot, Laurent Besacier. 2009. The LIG
multi-criteria system for video retrieval.  July 2009 CIVR '09:
Proceeding of the ACM International Conference on Image and Video
Retrieval

Bailer W., Lee F. and Thallinger G. A Distance Measure for Repeated
Takes of One Scene. The Visual Computer, 25(1):53-68, Jan. 2009.

Bailer W. and Rehatschek H. Comparing Fact Finding Tasks and User
Survey for Evaluating a Video Browsing Tool. Proceedings of ACM
Multimedia, Beijing, CN, Oct. 2009.

Bailer W. and Thallinger G. Summarizing Raw Video Material Using
Hidden Markov Models. Proceedings of 10th International Workshop on
Image Analysis for Multimedia Interactive Services (WIAMIS), London,
UK, May 2009, pp. 53-56.
	
Juan Cao, HongFang Jing, Chong-Wah Ngo, YongDong Zhang. 2009. 
Distribution-based concept selection for concept-based
video retrieval.  October 2009 MM '09: Proceedings of the seventeen
ACM international conference on Multimedia
	
Juan Cao, Yong-Dong Zhang, Jun-Bo Guo, Lei Bao, Jin-Tao
Li. 2009. VideoMap: an interactive video retrieval system of
MCG-ICT-CAS.  July 2009 CIVR '09: Proceeding of the ACM International
Conference on Image and Video Retrieval

Lixin Duan, Ivor W. Tsang, Dong Xu, Tat-Seng Chua. 2009. Domain
adaptation from multiple sources via auxiliary classifiers.  June 2009
ICML '09: Proceedings of the 26th Annual International Conference on
Machine Learning

Martin Halvey, Joemon M. Jose. 2009. The role of expertise in aiding
video search.  July 2009 CIVR '09: Proceeding of the ACM International
Conference on Image and Video Retrieval
	
Martin Halvey, David Vallet, David Hannah, Joemon
M. Jose. 2009. ViGOR: a grouping oriented interface for search and
retrieval in video libraries.  June 2009 JCDL '09: Proceedings of the
9th ACM/IEEE-CS joint conference on Digital libraries
	
Benoit Huet, Jinhui Tang, Alex Hauptmann. 2009 .ACM SIGMM the first
workshop on web-scale multimedia corpus (WSMC09).  October 2009 MM
'09: Proceedings of the seventeen ACM international conference on
Multimedia

Yu-Gang Jiang, Chong-Wah Ngo, Shih-Fu Chang. 2009. Semantic context
transfer across heterogeneous sources for domain adaptive video
search.  October 2009 MM '09: Proceedings of the seventeen ACM
international conference on Multimedia

Philip Kelly, Ciarán Ó Conaire, Noel
E. O'Connor. 2009. Exploiting contextual data for event retrieval in
surveillance video. July 2009 CIVR '09: Proceeding of the ACM
International Conference on Image and Video Retrieval

Duy-Dinh Le, Shin'ichi Satoh. 2009. Efficient concept detection by
fusing simple visual features.  March 2009 SAC '09: Proceedings of the
2009 ACM symposium on Applied Computing
	
Wei-Hao Lin, Alexander Haputmann. 2009. Identifying news videos'
ideological perspectives using emphatic patterns of visual concepts.
October 2009 MM '09: Proceedings of the seventeen ACM international
conference on Multimedia
	
Yuan Liu, Tao Mei, Xian-Sheng Hua. 2009. CrowdReranking: exploring
multiple search engines for visual search reranking.  July 2009 SIGIR
'09: Proceedings of the 32nd international ACM SIGIR conference on
Research and development in information retrieval

Paul Over, George Awad, Alan F. Smeaton, Colum Foley, James
Lanagan. 2009. Creating a web-scale video collection for research.
October 2009 WSMC '09: Proceedings of the 1st workshop on Web-scale
multimedia corpus

Yuxin Peng, Zhiwu Lu, Jianguo Xiao. 2009. Semantic concept annotation
based on audio PLSA model.  October 2009 MM '09: Proceedings of the
seventeen ACM international conference on Multimedia

Sébastien Poullot, Michel Crucianu, Shin'Ichi Satoh. 2009. Indexing
local configurations of features for scalable content-based video copy
detection.  October 2009 LS-MMRM '09: Proceedings of the First ACM
workshop on Large-scale multimedia retrieval and mining
	
P. Punitha, Joemon M. Jose, Anuj Goyal. 2009. Topic prerogative
feature selection using multiple query examples for automatic video
retrieval.  July 2009 SIGIR '09: Proceedings of the 32nd international
ACM SIGIR conference on Research and development in information
retrieval

Arjan T. Setz and Cees G. M. Snoek, "Can Social Tagged Images Aid
Concept-Based Video Search?," in Proceedings of the IEEE International
Conference on Multimedia & Expo, 2009.

Kimiaki Shirahama, Chieri Sugihara, Yuta Matsuoka, Kuniaki
Uehara. 2009. Query-based video event definition using rough set
theory.  October 2009 EiMM '09: Proceedings of the 1st ACM
international workshop on Events in multimedia

Alan F. Smeaton,. Paul Over, Aiden R. Doherty. Video shot boundary
detection: Seven years of TRECVid activity.  To appear in the IEEE
Computer Vision and Image Understanding. Online at
http://dx.doi.org/10.1016/j.cviu.2009.03.011

Cees G. M. Snoek and Marcel Worring, "Concept-Based Video Retrieval,"
Foundations and Trends in Information Retrieval, vol. 4, iss. 2, pp.
215-322, 2009.

Lin-Xie Tang, Tao Mei, Xian-Sheng Hua. 2009. Near-lossless video
summarization. October 2009 MM '09: Proceedings of the seventeen ACM
international conference on Multimedia

Pablo Toharia, Oscar D. Robles, Alan F. Smeaton and Angel Rodriguez:
Measuring the influence of Concept Detection on Video Retrieval.
Proceedings of the 13th International Conference on Computer Analysis
of Images and Patterns, pp. 581--589. Münster, Germany. Sept. 2009
ISBN: 978-3-6425-03766-5

Pablo Toharia, Alberto Sánchez, José Luis Bosque and Oscar
D. Robles: GCViR: Grid Content-Based Video Retrieval with work
allocation brokering Concurrency and Computation, Practices and
Experience, John Wiley & Sons.  ISSN: 1532-0626. DOI: 10.1002/cpe.1492

Pablo Toharia, Alberto Sánchez, José Luis Bosque and Oscar
D. Robles: Efficient Grid-Based Video Storage and Retrieval.
International Symposium on Grid computing, high-performance and
Distributed Applications (GADA '08) Proceedings of the GADA
08. Lecture Notes in Computer Science, Vol.  5331. Springer-Verlag
Berlin Heidelberg, pp. 833 -- 851 ISBN: 978-3-540-88870-3. Monterrey,
Mexico. Nov. 2008

Thierry Urruty, Frank Hopfgartner, David Hannah, Desmond Elliott,
Joemon M. Jose. 2009. Supporting aspect-based video browsing: analysis
of a user study.  July 2009 CIVR '09: Proceeding of the ACM
International Conference on Image and Video Retrieval
	
Stefanos Vrochidis, Paul King, Lambros Makris, Anastasia Moumtzidou,
Spiros Nikolopoulos, Anastasios Dimou, Vasileios Mezaris, Ioannis
Kompatsiaris. 2009. MKLab interactive video retrieval system.  July
2009 CIVR '09: Proceeding of the ACM International Conference on Image
and Video Retrieval

Wang, D., Wang, Z., Li, J., Zhang, B., and Li, X. 2009. Query
representation by structured concept threads with application to
interactive video retrieval. J. Vis. Comun. Image Represent. 20, 2
(Feb. 2009), 104-116. DOI=
http://dx.doi.org/10.1016/j.jvcir.2008.12.001
	
Xiao-Yong Wei, Yu-Gang Jiang, Chong-Wah Ngo. 2009. Exploring
inter-concept relationship with context space for semantic video
indexing.  July 2009 CIVR '09: Proceeding of the ACM International
Conference on Image and Video Retrieval

Peter Wilkins, Raphaël Troncy, Martin Halvey, Daragh Byrne, Alia
Amin, P. Punitha, Alan F. Smeaton, Robert Villa. 2009. User variance
and its impact on video retrieval benchmarking July 2009 CIVR '09:
Proceeding of the ACM International Conference on Image and Video
Retrieval
	
Zhipeng Wu, Shuqiang Jiang, Qingming Huang. 2009. Near-duplicate video
matching with transformation recognition.  October 2009 MM '09:
Proceedings of the seventeen ACM international conference on
Multimedia

Rong Yan, Marc-Olivier Fleury, Michele Merler, Apostol Natsev, John
R. Smith. 2009. Large-scale multimedia semantic concept modeling using
robust subspace bagging and MapReduce. October 2009 LS-MMRM '09:
Proceedings of the First ACM workshop on Large-scale multimedia
retrieval and mining

Ming Yang, Fengjun Lv, Wei Xu, Kai Yu, Yihong Gong. Human action
detection by boosting efficient motion features. IEEE Workshop on
Video-oriented Object and Event Classification in Conjunction with
ICCV, Kyoto, Japan, Sept.28, 2009, (VOEC'2009).

Guangyu Zhu, Ming Yang, Kai Yu, Wei Xu, Yihong Gong. Detecting video
events based on action recognition in complex scenes using spatio-
temporal descriptor. ACM International Conference on Multimedia,
Beijing, China, Oct.19-23, 2009, full paper, (ACM MM'2009).




---------------------------------------------------------------------
2008 (31)
---------------------------------------------------------------------

Aly, R.B.N. and Hiemstra, D. and de Vries, A.P. and de Jong, F.M.G.
(2008)  Probabilistic Ranking Framework using Unobservable Binary
Events for Video Search./ <http://eprints.eemcs.utwente.nl/12167/> In:
Proceedings of the 7th ACM International Conference on Content-based
Image and Video Retrieval, 7-9 July 2008, Niagara Falls. pp. 349-358.
ACM. ISBN 978-1-60558-070-8
	
Ioannis Arapakis, Ioannis Konstas, Joemon M. Jose. 2009. Using facial
expressions and peripheral physiological signals as implicit
indicators of topical relevance.  October 2009 MM '09: Proceedings of
the seventeen ACM international conference on Multimedia

Bailer W. A Comparison of Distance Measures for Clustering Video
Sequences. Proceedings of 1st Workshop on Automated Information
Extraction in Media Production, Turin, IT, Sept. 2008, pp. 595-599.

Bailer W., Dumont E., Essid S., and M�érialdo B. A collaborative
approach to automatic rushes video summarization. Proceedings of First
ICIP Workshop on Multimedia Information Retrieval, San Diego, CA, USA,
Oct. 2008.

Werner Bailer, Felix Lee and Georg Thallinger. Detecting and Clustering
Multiple Takes of One Scene. Proceedings of the 14th International
Multimedia Modeling Conference, Kyoto, Japan, 9-11 January 2008.

Bredin H, Byrne D, Lee H, O'Connor N and Jones G. Dublin City
University at the TRECVid 2008 BBC Rushes Summarisation Task.  TVS
2008 - TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2008,
Vancouver, Canada, 31 October 2008.

Byrne D, Doherty A, Snoek C.G.M, Jones G and Smeaton A.F. Validating
the Detection of Everyday Concepts in Visual Lifelogs. SAMT 2008 - 3rd
International Conference on Semantic and Digital Media Technologies,
Koblenz, Germany, 3-5 December 2008.

Daragh Byrne, Aiden R. Doherty, Cees G. M. Snoek, Gareth J. F. Jones,
and Alan F. Smeaton, "Everyday Concept Detection in Visual Lifelogs:
Validation, Relationships and Trends," Multimedia Tools and
Applications, vol. 49, iss. 1, pp. 119-144, 2010.

Byrne D, Wilkins P, Jones G, Smeaton A.F and O'Connor N. Measuring the
Impact of Temporal Context on Video Retrieval. CIVR 2008 - ACM
International Conference on Image and Video Retrieval, Niagara Falls,
Canada, 7-9 July 2008.

Doherty A, Byrne D, Smeaton A.F, Jones G, and Hughes M. Investigating
Keyframe Selection Methods in the Novel Domain of Passively Captured
Visual Lifelogs. CIVR 2008 - ACM International Conference on Image and
Video Retrieval, Niagara Falls, Canada, 7-9 July 2008.
 
Doherty A, O Conaire C, Blighe M, Smeaton A.F and O'Connor
N. Combining Image Descriptors to Effectively Retrieve Events from
Visual Lifelogs.  MIR 2008 - ACM International Conference on
Multimedia Information Retrieval 2008, Vancouver, Canada, 30-31
October 2008

Doherty A and Smeaton A.F. Automatically Segmenting Lifelog Data Into
Events. WIAMIS 2008 - 9th International Workshop on Image Analysis for
Multimedia Interactive Services, Klagenfurt, Austria, 7-9 May 2008.

Dumont E, Merialdo B, Essid S, Bailer W, Byrne D, Bredin H, O'Connor
N, Jones G, Haller M, Krutz A, Sikora T and Platrik T. A Collaborative
Approach to Video Summarization.  SAMT 2008 - 3rd International
Conference on Semantic and Digital Media Technologies, Koblenz,
Germany, 3-5 December 2008.

Dumont E, Merialdo B, Essid S, Bailer W, Rehatschek H, Byrne D, Bredin
H, O'Connor N, Jones G, Smeaton A.F, Haller M and Piatrick T. Video
Rushes Summarization Using a Collaborative Approach. . TVS 2008 -
TRECVID BBC Rushes Summarization Workshop, ACM Multimedia 2008,
Vancouver, Canada, 31 October 2008.

Zhiwei Gu, Tao Mei, Jinhui Tang, Xiuqing Wu, Xian-Sheng Hua. "MILC^2:
A Multi-Layer Multi-Instance Learning Approach to Video Concept
Detection," International Conference on Multi-Media Modeling (MMM),
Kyoto, Japan, Jan. 2008.

Gurrin C. Content-based Video Retrieval. Encyclopedia of Database
Systems, Springer, 2008.

Haubold, A. and Natsev, A. 2008. Web-based information content and its
application to concept-based video retrieval. In Proceedings of the
2008 international Conference on Content-Based Image and Video
Retrieval (Niagara Falls, Canada, July 07 - 09, 2008). CIVR '08. ACM,
New York, NY, 437-446. DOI= http://doi.acm.org/10.1145/1386352.1386408

Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek, "A
Comparison of Color Features for Visual Concept Classification," in
Proceedings of the ACM International Conference on Image and Video
Retrieval, Niagara Falls, Canada, 2008, pp. 141-149.

Koen E. A. van de Sande, Theo Gevers, and Cees G. M. Snoek,
"Evaluation of Color Descriptors for Object and Scene Recognition," in
Proceedings of the IEEE Computer Society Conference on Computer Vision
and Pattern Recognition, Anchorage, Alaska, 2008.

Ken-Hao Liu, Ming-Fang Weng, Chi-Yao Tseng, Yung-Yu Chuang and
Ming-Syan Chen. Association and Temporal Rule Mining for
Post-Processing of Semantic Concept Detection in Video. In IEEE
Transactions on Multimedia, special issue on Multimedia Data Mining,
volume 10, issue 2, page 240-251, February 2008.

Lee F. and Bailer W. Organizing Rushes Video by Visually Similar
Setting. Proceedings of ACM International Conference on Image and
Video Retrieval, Niagara Falls, CA, Jul. 2008, pp. 279-287.

Lee H, Gurrin C, Jones G and Smeaton A.F. Interaction Design for
Personal Photo Management on a Mobile Device. Handbook of Research on
User Interface Design and Evaluation for Mobile Technology,
2008. (pp69-85) IGI Publishing, ISBN: 978-1-59904-871-0.

Ork de Rooij, Cees G. M. Snoek, and Marcel Worring, "Balancing Thread
Based Navigation for Targeted Video Search," in Proceedings of the ACM
International Conference on Image and Video Retrieval, Niagara Falls,
Canada, 2008, pp. 485-494.

Ork de Rooij, Cees G. M. Snoek, and Marcel Worring, "MediaMill: Fast
and Effective Video Search using the ForkBrowser," in Proceedings of
the ACM International Conference on Image and Video Retrieval, Niagara
Falls, Canada, 2008, pp. 561-561.

Jeremy Pickens, Gene Golovchinsky, Chirag Shah, Pernilla Qvarfordt,
and Maribeth Back. Algorithmic Mediation for Collaborative Exploratory
Search. SIGIR 2008. (Singapore, Singapore, July 20 - 24, 2008). ACM,
New York, NY, 315-322., July 22, 2008

Smeaton A.F, Foley C, Byrne D and Jones G. iBingo Mobile Collaborative
Search.  CIVR 2008 - ACM International Conference on Image and Video
Retrieval. VideOlympics @ CIVR, Niagara Falls, Canada, 7-9 July 2008.

Smeaton A.F, Foley C, Byrne D and Jones G. Mobile, Ubiquitous
Information Seeking, as a Group:The iBingo Collaborative Video
Retrieval System.  MobiQuitous 2008 - The 5th Annual International
Conference on Mobile and Ubiquitous Systems: Computing, Networking and
Services, Dublin, Ireland, 21-25 July 2008.

Smeaton A.F, Over P and Kraaij W. High-level Feature Detection from
Video in TRECVid: a 5-Year Retrospective of Achievements.  Multimedia
Content Analysis:Theory and Applications (in press), 2008.

Smeaton A.F, Wilkins P, Worring N, de Rooij O, Chua T-S and Luan
H. Content-Based Video Retrieval: Three Example Systems from TRECVid.
International Journal of Imaging Systems and Technology, Special Issue
on Multimedia Information Retrieval (in press), 2008. 

Cees G. M. Snoek, Marcel Worring, Ork de Rooij, Koen E. A. van de
Sande, Rong Yan, and Alexander G. Hauptmann, "VideOlympics: Real-Time
Evaluation of Multimedia Retrieval Systems," IEEE Multimedia, vol. 15,
iss. 1, 2008.

Jinhui Tang, Xian-Sheng Hua, Yan Song, Tao Mei, Xiuqing
Wu. "Optimizing Training Set Construction for Video Semantic
Classification," EURASIP Journal on Advances in Signal Processing,
2008.

Wilkins P, Smeaton A.F, O'Connor N and Byrne D. K-Space Interactive
Search.  CIVR 2008 - ACM International Conference on Image and Video
Retrieval. VideOlympics @ CIVR, Niagara Falls, Canada, 7-9 July 2008.

Yan-Tao Zheng, Shi-Yong Neo, Tat-Seng Chua, Qi Tian, ’¡ÈObject-based
Image Retrieval Beyond Visual Appearances’¡É, MMM 2008, Kyoto, Japan,
Jan 2008



---------------------------------------------------------------------
2007 (59)
---------------------------------------------------------------------

Werner Bailer and Georg Thallinger. A Framework for Multimedia Content
Abstraction and its Application to Rushes Exploration. CIVR 2007 - ACM
International Conference on Image and Video Retrieval, Amsterdam, The
Netherlands, 9-11 July 2007.

Bosch, A., Zisserman, A. and Munoz, X.  Representing shape with a
spatial pyramid kernel Proceedings of the International Conference on
Image and Video Retrieval (2007)

Byrne D, Kehoe P, Lee H, O Conaire C, Smeaton A.F, O'Connor N and
Jones G. A User-Centered Approach to Rushes Summarisation Via
Highlight-Detected Keyframes. TVS 2007 - TRECVID BBC Rushes
Summarization Workshop, ACM Multimedia 2007, Augsburg, Germany, 24-29
September 2007. (pp35-39)

Christel, M. G. 2007. Establishing the utility of non-text search for
news video retrieval with real world users. In Proceedings of the 15th
international Conference on Multimedia (Augsburg, Germany, September
25 - 29, 2007). MULTIMEDIA '07. ACM, New York, NY, 707-716. DOI=
http://doi.acm.org/10.1145/1291233.1291395

Christel, M. Examining User Interactions with Video Retrieval Systems.
Proc. of SPIE Vol. 6506 Multimedia Content Access: Algorithms and
Systems (San Jose, CA, Feb. 2007).

Chum, O., Philbin, J., Isard, M. and Zisserman, A.  Scalable Near
Identical Image and Shot Detection Proceedings of the International
Conference on Image and Video Retrieval (2007)

Dayong Ding, Bo Zhang. Probabilistic Model Supported Rank Aggregation
for the Semantic Concept Detection in Video. in Intl. Conference of
Image and Video Retrieval (CIVR), Amsterdam, 2007.

Zhiwei Gu, Tao Mei, Xian-Sheng Hua, Jinhui Tang, Xiuqing
Wu. "Multi-Layer Multi-Instance Kernel for Video Concept Detection,"
Accepted by ACM International Conference on Multimedia (ACM MM),
Augsburg, Germany, Sept. 2007

Steven C.H. Hoi and Michael R Lyu. "A Multi-Modal and Multi-Level
Ranking Framework for Content-Based Video Retrieval," , In the 32nd
IEEE International Conference on Acoustics, Speech, and Signal
Processing (ICASSP2007), Special Session on "Web Image/Video Search
Technologies", Hawaii, USA, 15-20 April, 2007.

Winston H. Hsu, Lyndon Kennedy, Shih-Fu Chang. Reranking Methods for
Visual Search. IEEE Multimedia Magazine, 13(3), 2007.

Winston H. Hsu, Lyndon Kennedy, Shih-Fu Chang. Video Search Reranking
through Random Walk over Document-Level Context Graph. In ACM
Multimedia, Augsburg, Germany, September 2007.

Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Kernel Sharing With Joint
Boosting For Multi-Class Concept Detection. In IEEE CVPR Workshop on
Semantic Learning Application in Multimedia, Minneapolis, Minnesota,
June 2007.

Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Context-Based Concept
Fusion with Boosted Conditional Random Fields. In IEEE International
Conference on Acoustics, Speech, and Signal Processing (ICASSP),
Hawaii, USA, April 2007.

Yu-Gang Jiang, Chong-Wah Ngo, Jun Yang, Towards Optimal
Bag-of-Features for Object Categorization and Semantic Video
Retrieval, ACM International Conference on Image and Video Retrieval
(CIVR), 2007

Kehoe P and Smeaton A.F. Using Graphics Processor Units (GPUs) for
Automatic Video Structuring. Proceedings of the WIAMIS 2007 -
International Workshop on Image Analysis for Multimedia Interactive
Services, Santorini, Greece, 6-8 June 2007.

Lyndon Kennedy, Shih-Fu Chang. A Reranking Approach for Context-based
Concept Fusion in Video Indexing and Retrieval. In ACM International
Conference on Image and Video Retrieval, Amsterdam, Netherlands, July
2007.

Koskela M and Smeaton A.F. Measuring Concept Similarities in
Multimedia Ontologies: Analysis and Evaluations. IEEE Transactions on
Multimedia, 2007.

Koskela M and Smeaton A.F. An Empirical Study of Inter-Concept
Similarities in Multimedia Ontologies.  CIVR 2007 - ACM International
Conference on Image and Video Retrieval, Amsterdam, The Netherlands,
9-11 July 2007.

Xirong Li, Dong Wang, Jianmin Li and Bo Zhang, Video Search in Concept
Subspace: A Text-Like Paradigm, in Intl. Conference of Image and Video
Retrieval (CIVR), Amsterdam, 2007

Jingjing Liu, Wei Lai, Xian-Sheng Hua, Yalou Huang, Shipeng Li. Video
Search Re-Ranking via Multi-Graph Propagation. ACM International
Conference on Multimedia (ACM MM), Augsburg, Germany, Sept. 2007.

Xiaobing Liu, Dong Wang, Jianmin Li, Bo Zhang, The Feature and Spatial
Covariant Kernel: Adding Implicit Spatial Constraints to Histogram, in
Proc. ACM International Conference on Image and Video Retrieval
(CIVR), Amsterdam, Netherlands, July, 2007

Huan-Bo Luan, Shi-Yong Neo, Hai-Kiat Goh, Yong-Dong Zhang, Shou-Xun Lin,
Tat-Seng Chua. Segregated Feedback with Performance-based Adaptive
Sampling for Interactive News Video Retrieval, ACM MM 2007, Augsburg,
Germany, 23-29 Sep 2007.

Shi-Yong Neo, Yuanyuan Ran, Hai-Kiat Goh, Yantao Zheng, Tat-Seng Chua,
Jintao Li, ’¡ÈThe Use of Topic Evolution to help Users Browse and Find
Answers in News Video Corpus,’¡É ACM MM 2007, Augsburg, Germany, 23-29
Sep 2007.

Shi-Yong Neo, Yantao Zheng, Hai-Kiat Goh, Tat-Seng Chua, Sheng Tang, 
’¡ÈNews Video Retrieval Using Implicit Event Semantics,’¡É ICME 2007,
Beijing, China, 2-5 Jul 2007.

Chen-Ming Pan, Yung-Yu Chuang, Winston H. Hsu."NTU TRECVID-2007 Fast
Rushes Summarization System,", ACM Multimedia TRECVID BBC Rushes
Summarization Workshop (TVS 2007), Augsburg, Germany, September 23-29,
2007.

Ork de Rooij, Cees G.M. Snoek, and Marcel Worring. MediaMill: Semantic
Video Browsing using the RotorBrowser. In ACM CIVR 2007 -
International Conference on Image and Video Retrieval, Amsterdam, The
Netherlands, July 2007.

Ork de Rooij, Cees G.M. Snoek, and Marcel Worring. MediaMill: Video
Query on demand using the RotorBrowser. In Proceedings of the IEEE
International Conference on Multimedia & Expo, Beijing, China, July
2007.

Over P, Smeaton A.F and Kelly P. The TRECVID 2007 BBC Rushes
Summarization Evaluation Pilot. TVS 2007 - TRECVID BBC Rushes
Summarization Workshop, ACM Multimedia 2007, Augsburg, Germany, 24-29
September 2007. (pp1-15)

Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Jinhui Tang, Tao Mei, Hong-Jiang
Zhang. "Correlative Multi-Label Video Annotation," ACM International
Conference on Multimedia (ACM MM), Augsburg, Germany,
Sept. 2007. (Best paper award)

Frank J. Seinstra, Jan-Mark Geusebroek, Dennis Koelma, Cees
G.M. Snoek, Marcel Worring, and Arnold
W.M. Smeulders. High-Performance Distributed Image and Video Content
Analysis with Parallel-Horus. IEEE Multimedia. 2007. In press.

Smeaton A.F. Techniques Used and Open Challenges to the Analysis,
Indexing and Retrieval of Digital Video. Information Systems Journal,
Vol. 32, No. 4, 2007. (pp545-559)

Smeaton A.F. TRECVid - Video Evaluation. ASIST Bulletin,  2007.

Smeaton A.F. Video Summarisation: A new Challenge. Proceedings of the
MAR 2007 - Research Challenges in Multimedia Analysis and Retrieval,
Glasgow, Scotland, 20 July 2007.

Cees G.M. Snoek, Bouke Huurnink, Laura Hollink, Maarten de Rijke, Guus
Schreiber, and Marcel Worring. Adding Semantics to Detectors for Video
Retrieval. IEEE Transactions on Multimedia, August, 2007. In press.

Cees G.M. Snoek and Marcel Worring. Are Concept Detector Lexicons
Effective for Video Search? In Proceedings of the IEEE International
Conference on Multimedia & Expo, Beijing, China, July 2007.

Cees G.M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold
W.M. Smeulders. A Learned Lexicon-Driven Paradigm for Interactive
Video Retrieval. IEEE Transactions on Multimedia, 9(2):280-292,
February 2007.

Jinhui Tang, Xian-Sheng Hua, Guo-Jun Qi, Meng Wang, Tao Mei, Xiuqing
Wu. "Structure-Sensitive Manifold Ranking for Video Concept
Detection," ACM International Conference on Multimedia (ACM MM),
Augsburg, Germany, Sept. 2007.

Tesic, J., Natsev, A., and Smith, J. R. 2007. Cluster-based data
modeling for semantic video search. In Proceedings of the 6th ACM
international Conference on Image and Video Retrieval (Amsterdam, The
Netherlands, July 09 - 11, 2007). CIVR '07. ACM, New York, NY,
595-602. DOI= http://doi.acm.org/10.1145/1282280.1282365

Dong Wang, Jianmin Li, and Bo Zhang. The Importance of
Query-Concept-Mapping for Automatic Video Retrieval, ACM Multimedia
2007

Dong Wang, Xiaobing Liu, Linjie Luo, Jianmin Li and Bo Zhang. Video
Diver: Generic Video Indexing with Diverse Features. MIR workshop at
ACM Multimedia 2007

Dong Wang, Zhikun Wang, Xirong Li, Xiaobing Liu, Jianmin Li and Bo
Zhang. Mapping Query to Semantic Concepts: Leveraging Semantic Indices
for Automatic and Interactive Video Retrieval, Invited paper in
special session of "Closing the semantic gap: concept-based video
mining and retrieval" at Intl. Conference Semantic Computing (ICSC)
2007

Feng Wang, Chong-Wah Ngo, Rushes Video Summarization by Object and
Event Understanding, TRECVID BBC Rushes Summarization Workshop at ACM
Multimedia (TVS'07), Augsburg, Germany, Sep. 2007.

Meng Wang, Xian-Sheng Hua, Xun Yuan, Yan Song, Li-Rong Dai. Optimizing
Multi-Graph Learning: Towards A Unified Video Annotation Scheme. ACM
International Conference on Multimedia (ACM MM), Augsburg, Germany,
Sept. 2007.

Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, Ren-Hua Wang. An
Interactive Video Annotation Framework With Multiple
Modalities. International Conference on Acoustic, Speech, and Signal
Processing (ICASSP), April, 2007, Honolulu, Hawaii, USA.

Meng Wang, Xian-Sheng Hua, Yan Song, Jinhui Tang, Li-Rong Dai, 
’¡ÈMulti-Concept Multi-Modality Active Learning for Interactive Video
Annotation’¡É, to appear in First IEEE International Conference on
Semantic Computing (ICSC), Irvine, California, USA, September, 2007.

Xiao-Yong Wei, Chong-Wah Ngo, Ontology-Enriched Semantic Space for
Video Search, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007.

Wilkins P, Adamek T, Smeaton A.F. and O'Connor N. Inexpensive Fusion
Methods for Enhancing Feature Detection. CBMI 2007 - 5th International
Workshop on Content-Based Multimedia Indexing, Bordeaux, France, 25-27
June 2007.

Wilkins P, Adamek T, O'Connor N and Smeaton A.F. Inexpensive Fusion
Methods for Enhancing Feature Detection. Signal Processing: Image
Communication, Special Issue on Content-Based Multimedia Indexing and
Retrieval, Vol. 22, No. 7-8, 2007. (pp 635-650)

Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen, and
Arnold W.M. Smeulders. The MediaMill Semantic Video Search Engine. In
Proceedings of IEEE International Conference on Acoustics, Speech, and
Signal Processing, Honolulu, Hawaii, USA, April 2007.

Xiao Wu, Alexander G. Hauptmann, Chong-Wah Ngo, Novelty Detection for
Cross-Lingual News Stories with Visual Duplicates and Speech
Transcripts, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007.

Xiao Wu, Wan-Lei Zhao, Chong-Wah Ngo, Efficient Near-Duplicate
Keyframe Retrieval with Visual Language Models, International
Conference on Multimedia and Expo (ICME), 2007

Xiao Wu, Wan-Lei Zhao, Chong-Wah Ngo, Near-Duplicate Keyframe
Retrieval with Visual Keywords and Semantic Context, ACM International
Conference on Image and Video Retrieval (CIVR), 2007.

Yan, R. and Hauptmann, A. G. 2007. A review of text and image
retrieval approaches for broadcast news video. Inf. Retr. 10, 4-5
(Oct. 2007), 445-484. DOI= http://dx.doi.org/10.1007/s10791-007-9031-y

Jun Yang, Yu-Gang Jiang, Alexander G. Hauptmann, Chong-Wah Ngo,
Evaluating Bag-of-Visual-Words Representations in Scene
Classification, ACM SIGMM Int'l Workshop on Multimedia Information
Retrieval (MIR'07), Augsburg, Germany, Sep. 2007.

Jinhui Yuan, Huiyi Wang, Lan Xiao, Wujie Zheng, Jianmin Li, Fuzong
Lin, Bo Zhang: A Formal Study of Shot Boundary Detection. IEEE
Trans. Circuits Syst. Video Techn. 17(2): 168-186 (2007)

Jinhui Yuan, Jianmin Li, Bo Zhang. Gradual transitions detection with
conditional random fields. Proc. of ACM Multimedia. Augsburg,
Germany. ACM Press, September, 2007. pages 277-280.

Eric Zavesky, Zhu Liu, David Gibbon, Behzad Shahraray. Searching
Visual Semantic Spaces with Concept Filters. In IEEE International
Conference on Semantic Computing, Irvine, California, September 2007.

Zhengjun Zha, Tao Mei, Zengfu Wang, Xian-Sheng Hua. "Building a
Comprehensive Ontology to Refine Video Concept Detection," ACM SIGMM
International Conference Workshop on Multimedia Information Retrieval
(ACM MIR), In conjunction with ACM Multimedia, Augsburg, Germany,
Sept. 2007.

Wan-Lei Zhao, Chong-Wah Ngo, Hung-Khoon Tan, Xiao Wu, Near-Duplicate
Keyframe Identification with Interest Point Matching and Pattern
Learning, IEEE Trans. on Multimedia, vol. 9, pp. 1037-1048, Aug 2007.




---------------------------------------------------------------------
2006 (54)
---------------------------------------------------------------------

Christel, M. Evaluation and User Studies with Respect to Video
Summarization and Browsing. Proc. of SPIE Vol. 6073 Multimedia Content
Analysis, Management and Retrieval (San Jose, CA, Jan. 2006).

Christel, M., and Conescu, R. Mining Novice User Activity with TRECVID
Interactive Retrieval Tasks. In CIVR 2006 - International Conference
on Image and Video Retrieval, H. Sundaram et al. (Eds.), LNCS 4071,
pp.  21-30, Tempe, USA, 13-15 July 2006. Springer-Verlag.

Shahram Ebadollahi, Lexing Xie, Shih-Fu Chang, John R. Smith. Visual
Event Detection Using Multi-Dimensional Concept Dynamics. In IEEE
International Conference on Multimedia and Expo (ICME 06), Toronto,
Canada, 2006.

Ewerth, Ralph and Freisleben, Bernd: Self-Supervised Learning for
Robust Video Indexing. In: Proceedings of the IEEE International
Conference on Multimedia & Expo, Toronto, Canada, 2006, pp. 1749-1752.

Garnaud E, Smeaton A.F and Koskela M. Evaluation of a Video Annotation
Tool Based on the LSCOM Ontology. SAMT 2006 - Proceedings of The First
International Conference on Semantics And Digital Media Technology,
Athens, Greece, 6-8 December 2006.

Gurrin C, Johansen D and Smeaton A.F. Supporting Relevance Feedback in
Video Search. ECIR 2006 - European Conference on Information
Retrieval. Lalmas M et al. (Eds.): Lecture Notes in Computer Science
(LNCS Series 3936), London, U.K., 10-12 April 2006.

Hoashi, K., Sugano, M., Naito, M., Matsumoto, K., Sugaya, F. (2006)
Video story segmentation based on generic low-level features, Trans of
IEICE on Information and Systems, Vol. J89-D, No. 10, pp. 2305-2314,
2006.  (In Japanese)

Winston H. Hsu, Lyndon Kennedy, and Shih-Fu Chang. (2006) "Video Search
Reranking via Information Bottleneck Principle," ACM Multimedia 2006
(full paper), Santa Barbara, CA, October 22-27.

Winston H. Hsu and Shih-Fu Chang.(2006) "Topic Tracking across Broadcast
News Videos with Visual Duplicates and Semantic Concepts," The
International Conference on Image Processing (ICIP), Atlanta, GA,
October.

Wei Jiang, Shih-Fu Chang, Alexander C. Loui. Active Context-based
concept fusion with partial user labels. In IEEE International
Conference on Image Processing (ICIP 06), Atlanta, GA, USA, 2006.

Lyndon Kennedy, Shih-Fu Chang, Igor Kozintsev. To Search or To Label?:
Predicting the Performance of Search-Based Automatic Image
Classifiers. In Multimedia Information Retrieval Workshop (MIR), Santa
Barbara, CA, USA, 2006.

Koskela M and Smeaton A.F. Clustering-Based Analysis of Semantic
Concept Models for Video Shots. ICME 2006 - IEEE International
Conference on Multimedia and Expo, Toronto, Canada, 9-12 July 2006.

Koskela M, Smeaton A.F and Gaughan G. Semantic Analysis of Concept
Models for News Videos. VCIMS - Workshop on Visual Categorisation and
Image Management Systems, Sunderland, U.K., 28 June 2006.

Wei Lai, Xian-Sheng Hua, Wei-Ying Ma.  Towards Content-Based Relevance
Ranking for Video Search.  ACM Multimedia (ACM MM), Santa Barbara, CA,
USA, Oct 23-27 2006.

Matsumoto, K., Hoashi, K., Naito, M., Shishibori, M., Kita, K. (2006)
Report on TRECVID2005, Proc. of 12th Korea-Japan Joint Workshop on
Frontiers of Computer Vision(FCV2006), pp.65-70, Feb. 2006.

Matsumoto, K., Naito, M., Hoashi, K., Sugaya, F. (2006) SVM-based Shot
Boundary Detection with a Novel Feature.  In Proceedings of the IEEE
International Conference on Multimedia & Expo (ICME) 2006,
pp. 1837-1840, Toronto, Ontario, Canada, 9-12 July, 2006.

Naito, M., Matsumoto, K., Hoashi, K., Sugaya, F. (2006) Camera Motion
Detection using Video Mosaicing.  In Proceedings of the IEEE
International Conference on Multimedia & Expo (ICME) 2006,
pp. 1741-1744, Toronto, Ontario, Canada, 9-12 July, 2006.

Milind Naphade, John R. Smith, Jelena Tesic, Shih-Fu Chang, Winston
Hsu, Lyndon Kennedy, Alexander Hauptmann, Jon Curtis. Large-Scale
Concept Ontology for Multimedia. IEEE Multimedia Magazine, 13(3),
2006.

Shi-Yong Neo, Yantao Zheng, Tat-Seng Chua, Qi Tian ’¡ÈNews Video Search
with Fuzzy Event Clustering using High-level Features’¡É In ACM MM
2006, Santa Barbara, USA, 23-27 October 2006.

Shi-Yong Neo, Jin Zhao, Min-Yan Kan, Tat-Seng Chua ’¡ÈVideo Retrieval
Using High-level features: Exploiting Query-matching and
Confidence-based Weighting’¡É In CIVR 2006, Arizona, USA, 13-15 July
2006.

O'Connor N, Lee H, Smeaton A.F, Jones G, Cooke E, Le Borgne H and
Gurrin C. F�íschl�ár-TRECVid2004: Combined Text- and Image-Based
Searching of Video Archives. ISCAS 2006 - IEEE International Symposium
on Circuits and Systems, Kos, Greece, 21-24 May 2006.

Over P, Smeaton A.F and Docef A. Eval-ware: Digital Video Retrieval.
IEEE Signal Processing Magazine, 2006.

Sav S, Jones G, Lee H, O'Connor N and Smeaton A.F. Interactive
Experiments in Object-Based Retrieval. CIVR2006 - 5th International
Conference on Image and Video Retrieval. Springer Lecture Notes in
Computer Science Vol. 4071, Tempe, AZ, 13-15 July 2006.

Shishibori, M., Minamimoto, T., Matsumoto, K., Hoashi, K., Naito, M.,
Kita, K. (2006) Estimation of The Camera Motion based on Movement of
Interest Points between Images, Proc. of 12th Korea-Japan Joint
Workshop on Frontiers of Computer Vision (FCV2006),
pp. 145-150, Feb. 2006.

Smeaton A.F.  TrecVid.  CLEAR '06 (Classification of Events,
Activities and Relationships) Evaluation Workshop, Southampton, U.K.,
6-7 April 2006.

Smeaton A.F, Foley C, Gurrin C, Lee H and Mc Givney S. (2006)
Collaborative Searching for Video Using the Fishlar System and a
DiamondTouch Table.  TableTop2006 - The 1st IEEE International
Workshop on Horizontal Interactive Human-Computer Systems, Adelaide,
Australia, 5-7 January 2006.

Smeaton A.F, Gurrin C and Lee H. Interactive Searching and Browsing of
Video Archives: Using Text and Using Image Matching.  In: Hammoud,
Riad (Ed.), Interactive Video: Algorithms and Technologies, 2006, XVI,
250 p. 109 illus., Hardcover, ISBN: 3-540-33214-6 , 2006.

Smeaton A.F, Jones G, Lee H and O'Connor N and Sav S. Object-Based
Access to TV Rushes Video. ECIR 2006 - European Conference on
Information Retrieval. Lalmas M et al. (Eds.): Lecture Notes in
Computer Science (LNCS Series 3936), pp. 476-479., London, U.K., 10-12
April 2006.

Smeaton A.F, Lee H, Foley C, Mc Givney S and Gurrin C. (2006)
Fishlar-DiamondTouch: Collaborative Video Searching on a Table.  SPIE
Electronic Imaging - Multimedia Content Analysis, Management, and
Retrieval, San Jose, CA, 15-19 January 2006.

Smeaton A.F, Lee H, Foley C and Mc Givney S. Collaborative Video
Searching on a Tabletop. Multimedia Systems Journal, Vol. 12, No. 4-5,
2006.

Smeaton A.F, Over P and Kraaij W. Evaluation Campaigns and
TRECVid. MIR 2006 - 8th ACM SIGMM International Workshop on Multimedia
Information Retrieval, Santa Barbara, CA, 26-27 October 2006.

Arnold W.M. Smeulders, Jan C. van Gemert, Jan-Mark Geusebroek, Cees
G.M. Snoek, and Marcel Worring Browsing for the National Dutch Video
Archive In Proceedings of the 2nd IEEE-EURASIP International Symposium
on Communications, Control and Signal Processing, Marrakech, Morocco,
March 2006.

Cees G.M. Snoek, Marcel Worring, Jan-Mark Geusebroek, Dennis
C. Koelma, Frank J. Seinstra, and Arnold W.M. Smeulders The Semantic
Pathfinder: Using an Authoring Metaphor for Generic Multimedia
Indexing IEEE Transactions on Pattern Analysis and Machine
Intelligence, 28(10), October 2006.

Cees G.M. Snoek, Marcel Worring, Jan-Mark Geusebroek, Dennis
C. Koelma, Frank J. Seinstra, and Arnold W.M. Smeulders The Semantic
Pathfinder for Generic News Video Indexing In Proceedings of the IEEE
International Conference on Multimedia & Expo, pp. 1469-1472, Toronto,
Canada, July 2006.

Cees G.M. Snoek, Marcel Worring, and Alexander G. Hauptmann Learning
Rich Semantics from News Video Archives by Style Analysis ACM
Transactions on Multimedia Computing, Communications and Applications,
2(2):91-108, May 2006.

Cees G.M. Snoek, Marcel Worring, Dennis C. Koelma, and Arnold
W.M. Smeulders Learned Lexicon-driven Interactive Video Retrieval In
CIVR 2006 - International Conference on Image and Video Retrieval,
H. Sundaram et al. (Eds.), LNCS 4071, pp. 11-20, Tempe, USA, 13-15
July 2006. Springer-Verlag.

Cees G.M. Snoek, Marcel Worring, Bouke Huurnink, Jan C. van Gemert,
Koen E.A. van de Sande, Dennis C. Koelma, and Ork de Rooij. MediaMill:
Video Search using a Thesaurus of 500 Machine Learned Concepts. In
Proceedings of the 1st International Conference on Semantic and
Digital Media Technologies, Athens, Greece, December 2006.

Cees G.M. Snoek, Marcel Worring, Jan C. van Gemert, Jan-Mark
Geusebroek, and Arnold W.M. Smeulders The Challenge Problem for
Automated Detection of 101 Semantic Concepts in Multimedia In
Proceedings of ACM Multimedia, Santa Barbara, USA, October 2006.

Pablo Toharia, Oscar Robles, Jose Luis Bosque and A.
Rodriguez. "Towards a Parallel Video Segmentation on a Shared Memory
Architecture". Workshop 2006 on Computation Intensive Methods for
Computer Vision, held with ECCV 2006. Graz, Austria, May 2006.

P. Toharia, O. D. Robles, J. L. Bosque, A. Rodriguez. "Video shot
extraction on parallel architectures". In proceedings of the 2006
International Symposium on Parallel and Distributed Processing and
Applications (ISPA 2006). Sorrento, Italy, December 2006. Lecture
Notes in Computer Science, Vol. 4330, pp. 869-883. Springer
Verlag. ISBN: 978-3-540-68067-3.

Jan C. van Gemert, Jan-Mark Geusebroek, Cor J. Veenman, Cees
G.M. Snoek, and Arnold W.M. Smeulders Robust Scene Categorization by
Learning Image Statistics in Context In CVPR Workshop on Semantic
Learning Applications in Multimedia, New York, USA, June 2006.

Jan C. van Gemert, Cees G.M. Snoek, Cor Veenman, and Arnold
W.M. Smeulders The Influence of Cross-Validation on Video
Classification Performance In Proceedings of ACM Multimedia, Santa
Barbara, USA, October 2006.

Volkmer, Timo and Natsev, Apostol (Paul). (2006) Exploring Automatic
Query Refinement for Text-Based Video Retrieval. In Proceedings of the
IEEE International Conference on Multimedia & Expo (ICME) 2006,
Toronto, Ontario, Canada, 9-12 July, 2006. 

Dong Wang and Jianmin Li and Bo Zhang. (2006) "Relay Boost Fusion for
Learning Rare Concepts in Multimedia" in Proceedings of the Conference
on Image and Video Retrieval (CIVR 2006).

Meng Wang, Xian-Sheng Hua, Yan Song, Xun Yuan, Shipeng Li, and
Hong-Jiang Zhang. Automatic Video Annotation by Semi-supervised
Learning with Kernel Density Estimation.  ACM Multimedia (ACM MM),
Santa Barbara, CA, USA, Oct 23-27 2006.

Wilkins P, Ferguson P, Gurrin C and Smeaton A.F. Automatic
Determination of Feature Weights for Mult-Feature CBIR. ECIR 2006 -
European Conference on Information Retrieval. Lalmas M et al. (Eds.):
Lecture Notes in Computer Science (LNCS Series 3936), London, U.K.,
10-12 April 2006.

Wilkins P, Ferguson P and Smeaton A.F. Using Score Distributions for
Querytime Fusion in Multimedia Retrieval. MIR 2006 - 8th ACM SIGMM
International Workshop on Multimedia Information Retrieval, Santa
Barbara, CA, 26-27 October 2006.

Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen, and
Dennis C. Koelma Lexicon based browsers for searching in news video
archives In Proceedings of the International Conference on Pattern
Recognition, Hong Kong, China, August 2006.

Marcel Worring, Cees G.M. Snoek, Bouke Huurnink, Jan van Gemert,
Dennis Koelma, and Ork de Rooij The MediaMill Large-lexicon Concept
Suggestion Engine In Proceedings of ACM Multimedia, Santa Barbara,
USA, October 2006.

Marcel Worring, Cees G.M. Snoek, Ork de Rooij, Giang P. Nguyen,
Richard van Balen and Dennis C. Koelma MediaMill: Advanced Browsing in
News Video Archives In CIVR 2006 - International Conference on Image
and Video Retrieval, H. Sundaram et al. (Eds.), LNCS 4071,
pp. 533-536, Tempe, USA, 13-15 July 2006. Springer-Verlag.

Xiao Wu, Chong-Wah Ngo, and Qing Li. (2006). Threading and
Autodocumenting News Videos. IEEE Signal Processing Magazine, volume
23, issue 2, pp. 59-68, March 2006.

Lexing Xie, Dong Xu, Shahram Ebadollahi, Katya Scheinberg, Shih-Fu
Chang, John R. Smith. Detecting Generic Visual Events with Temporal
Cues. In Proc. 40th Asilomar Conference on Signals, Systems, and
Computers, Pacific Grove, CA, October 2006.

Lexing Xie, Shih-Fu Chang. Pattern Mining In Visual Concept
Streams. In IEEE International Conference on Multimedia and Expo (ICME
06), Toronto, Canada, 2006.

Akira Yanagawa, Winston Hsu, Shih-Fu Chang. Brief Descriptions of
Visual Features for Baseline TRECVID Concept Detectors. ADVENT
Technical Report #219-2006-5 Columbia University, July 2006.

Ming Zhao, Shi-Yong Neo, Hai-Kiat Goh, Tat-Seng Chua, ’¡ÈMulti-Faceted
Contextual Model for Person Identification in News Video’¡É In
Multimedia Modeling (MMM), Beijing, China 4-6 Jan, 2006.

Wujie Zheng and Jianmin Li and Zhangzhang Si and Fuzong Lin and and Bo
Zhang", "Using High-level Semantic Features in Video Retrieval" in the
Proceedings of the Confernce on Image and Video Retrieval. (CIVR
2006).



---------------------------------------------------------------------
2005 (42)
---------------------------------------------------------------------

John Adcock, Matthew Cooper, Andreas Girgensohn, and Lynn Wilcox.
(2005) Interactive Video Search Using Multilevel Indexing
International Conference on Image and Video Retrieval (CIVR) 2005
pp. 205-14

Amir, A., Berg, M., and Permuter, H. 2005. Mutual relevance feedback
for multimodal query formulation in video retrieval. In Proceedings of
the 7th ACM SIGMM international Workshop on Multimedia information
Retrieval (Hilton, Singapore, November 10 - 11, 2005). MIR '05. ACM,
New York, NY, 17-24. DOI= http://doi.acm.org/10.1145/1101826.1101832

Chen, M.-Y., Christel, M., Hauptmann, A., and Wactlar, H. Putting
Active Learning into Multimedia Applications: Dynamic Definition and
Refinement of Concept Classifiers. Proc. ACM Multimedia '05
(Singapore, November 2005), pp. 902-911.

Christel, M., and Conescu, R. (2005). Addressing the Challenge of
Visual Information Access from Digital Image and Video
Libraries. Proc. ACM/IEEE-CS Joint Conference on Digital Libraries
(Denver, CO, June 2005), 69-78.

Christel, M., and Hauptmann, A. (2005). The Use and Utility of
High-Level Semantic Features. Proc. International Conference on Image
and Video Retrieval (CIVR) (Singapore, July 2005), in Lecture Notes in
Computer Science 3568, 134-144.

Gaughan G and Smeaton A.F.(2005) Finding New News: Novelty Detection in
Broadcast News.AIRS 2005 - Second Asia Information Retrieval
Symposium, Jeju Island, Korea, 13-15 October 2005.

Demir Gokalp and Selim Aksoy. (2005) "Finding Faces in News Videos," in 4th
International Workshop on Content-Based Multimedia Indexing, Riga,
Latvia, June 21-23, 2005.

Andreas Girgensohn, John Adcock, Matthew Cooper, and Lynn
Wilcox. (2005) A Synergistic Approach to Efficient Interactive Video
Retrieval INTERACT 2005, LNCS 3585, pp. 781-794 .

Andreas Girgensohn, John Adcock, Matthew Cooper, and Lynn Wilcox.
(2005) Interactive Search in Large Video Collections CHI 2005 Extended
Abstracts, ACM Press, pp. 1395-1398

D Heesch and S R#N|ger. (2005) Image Browsing: Semantic Analysis of NNk
Networks.  Int'l Conf on Image and Video Retrieval (CIVR, Singapore,
Jul 2005), pp 609--618, Springer LNCS 3568

Hoashi, K., Sugano, M., Naito, M., Matsumoto, K., Sugaya, F. (2005)
Video Story Segmentation and its Application to Personal Video
Recorders, Proc. of International Conference on Image and Video
Retrieval 2005, LNCS3568, pp. 39-48, Jul 2005.

P Howarth and S R#N|ger. (2005) Trading Precision for Speed: Localised
Similarity Functions. Int'l Conf on Image and Video Retrieval (CIVR,
Singapore, Jul 2005), pp 415--424, Springer LNCS 3568, 2005

P Howarth and S R#N|ger. (2005) Fractional Distance Measures for Content-Based
Image Retrieval. 27th European Conference on Information Retrieval
(ECIR, Santiago de Compostela, Spain, Mar 2005), pp 447-456, Springer
LNCS 3408, 2005

Winston Hsu, Shih-Fu Chang. (2005) "Visual Cue Cluster Construction via
Information Bottleneck Principle and Kernel Density Estimation," In
International Conference on Content-Based Image and Video Retrieval
(CIVR), Singapore, 2005.

Nazli Ikizler and Pinar Duygulu, 2005) Person Search Made Easy. In
Proceedings of The Fourth International Conference on Image and Video
Retrieval (CIVR 2005), Singapore, July 20-22, 2005. 

Jaffre, G., and Joly, P. (2005) . Improvement of a Temporal Video
Index Produced by an Object Detector. In Proceedings of the 11th
International Conference on Computer Analysis of Images and Patterns
(CAIP), Rocquencourt, France, september 2005.

Malobabic J, Le Borgne H, Murphy N and O'Connor N. (2005) Detecting
The Presence of Large Buildings in Natural Images.  CBMI 2005 - 4th
International Workshop on Content-Based Multimedia Indexing, Riga,
Latvia, 21-23 June 2005.

Mc Donald K and Smeaton A.F. (2005) A Comparison of Score, Rank and
Probability-based Fusion Methods for Video Shot Retrieval. CIVR 2005 -
International Conference on Image and Video Retrieval, W-K Leow et
al. (Eds.), LNCS 3569, pp61-70, Singapore, 20-22 July 2005.  LNCS
Series 3569, (c) Springer-Verlag 2005.

Natsev, A. (., Naphade, M. R., and Te#%G�Å�¡#%@i#%G�Ć#%@,
J. 2005. Learning the semantics of multimedia queries and concepts
from a small number of examples. In Proceedings of the 13th Annual ACM
international Conference on Multimedia (Hilton, Singapore, November 06
- 11, 2005). MULTIMEDIA '05. ACM, New York, NY, 598-607. DOI=
http://doi.acm.org/10.1145/1101149.1101288

O'Connor N, Cooke E, Le Borgne H, Blighe M and Adamek T. (2005) The
AceToolbox: Low-Level Audiovisual Feature Extraction for Retrieval and
Classification.  2nd IEE European Workshop on the Integration of
Knowledge, Semantic and Digital Media Technologies, London, U.K., 30
November-1 December 2005.

Rautiainen M & Sepp#Ndnen T (2005) Comparison of visual features and
fusion techniques in automatic detection of concepts from news video.
Proc. 2005 IEEE International Conference on Multimedia & Expo,
Amsterdam, The Netherlands.

Rautiainen M, Ojala T, Sepp#Ndnen T (2005) Content-based browsing in
large news video databases.  Proc. 5th IASTED International Conference
on Visualization, Imaging and Image Processing, Benidorm, Spain.

Sav S, Lee H, Smeaton A.F, O'Connor N and Murphy N. (2005) Using Video
Objects and Relevance Feedback in Video Retrieval.  In Multimedia
Systems and Applications VIII, edited by Anthony Vetro, Chang Wen
Chen, C.-C. J. Kuo, Tong Zhang, Qi Tian and John R. Smith. Proceedings
of SPIE (SPIE, Bellingham, Wa) Vol. 6015, 601512 (2005), Boston, MA,
USA, 23-26 October 2005.

Sav S, Lee H, O'Connor N and Smeaton A.F. (2005) Interactive
Object-based Retrieval Using Relevance Feedback.  Acivs 2005 -
Advanced Concepts for Intelligent Vision Systems, Antwerp, Belgium,
20-23 September 2005.

Sav S, Lee H, Smeaton A.F. and O'Connor N. (2005) Using Segmented
Objects in Ostensive Video Shot Retrieval.  AMR 2005 - 3rd
International Workshop on Adaptive Multimedia Retrieval, Glasgow,
U.K., 28-29 July 2005.

Sav S, O'Connor N, Smeaton A.F and Murphy N. (2005) Associating
Low-level Features with Semantic Concepts using Video Objects and
Relevance Feedback.  WIAMIS 2005 - 6th International Workshop on Image
Analysis for Multimedia Interactive Services, Montreux, Switzerland,
13-15 April 2005.

F.J. Seinstra, C.G.M. Snoek, D. Koelma, J.M. Geusebroek, and
M. Worring. (2005) User Transparent Parallel Processing of the 2004 NIST
TRECVID Data Set. In International Parallel and Distributed Processing
Symposium, Denver, USA, April 2005.

Smeaton, A.F. (2005) TRECVid Evaluation and Related Work at Dublin
City University.  Smeaton A.F. VACE 18-Month Workshop, Baltimore,
Maryland, 26-28 April 2005.

Smeaton A.F. Large Scale Evaluations of Multimedia Information
Retrieval: The TRECVid Experience. (2005) CIVR 2005 - International
Conference on Image and Video Retrieval, W-K Leow et al. (Eds.), LNCS
3569, pp11-17, Singapore, 20-22 July 2005.  LNCS Series 3569, (c)
Springer- Verlag 2005.

C.G.M. Snoek (2005) The Authoring Metaphor to Machine Understanding of
Multimedia Ph.D. Thesis, University of Amsterdam, October 2005.

C.G.M. Snoek et al. (2005) Multimodal Video Indexing: Past, Present,
and Future, Workshop on Digital Media Monitoring and Management,
Fraunhofer Institute for Computer Graphics, Darmstadt, October 17-18,
2005. (Invited talk)

C.G.M. Snoek, M. Worring, J.M. Geusebroek, D.C. Koelma, and
F.J. Seinstra (2005) On the Surplus Value of Semantic Video Analysis
Beyond the Key Frame In Proceedings of the IEEE International
Conference on Multimedia & Expo (ICME), Amsterdam, The Netherlands,
July 2005.

C.G.M. Snoek, M. Worring, and A.W.M. Smeulders (2005) Early versus
Late Fusion in Semantic Video Analysis In Proceedings of ACM
Multimedia, Singapore, November 2005. (To appear)

S.M.M. Tahaghoghi, Hugh E. Williams, James A. Thom, and Timo Volkmer.
(2005) Video Cut Detection using Frame Windows. In Proceedings of the
28th Australasian Computer Science Conference (ACSC2005), 193-200,
Newcastle, Australia, 31 January - 3 February 2005. ISBN: 1 920 68220
1.

Paola Virga and, Pinar Duygulu (2005) Systematic Evaluation of Machine
Translation Methods for Image and Video Annotation, , In Proceedings
of The Fourth International Conference on Image and Video Retrieval
(CIVR 2005), Singapore, July 20-22, 2005.  

Timo Volkmer, John R. Smith, Apostol (Paul) Natsev, Murray Campbell,
Milind Naphade. (2005) "A web-based system for collaborative annotation of
large image and video collections", In Proceedings of the 13th ACM
international conference on Multimedia, Singapore, 6-11 November, 2005

Xiao Wu, Chong-Wah Ngo, and Qing Li (2005). Co-clustering of
Time-evolving News Story with Transcript and Keyframe. Proceedings of
IEEE International Conference on Multimedia & Expo (ICME'05),
Netherlands, Jul. 2005.

Z. Yu and G. Herman. (2005) "On the Earth Mover's Distance as a Histogram
Similarity Metric for Image Retrieval," IEEE International Conference on
Multimedia & Expo (ICME), Jul 2005.

Jinhui Yuan, Jianmin Li, Fuzong Lin and Bo Zhang. (2005) A Unified
Shot Boundary Detection Framework Based on Graph Partition Model
ACM Multimedia 2005, Singapore (to appear)

Zhai. Y. and Shah, M. (2005) "Tracking News Stories Across Different
Sources", 13-th ACMMM Multimedia Conference, Singapore, 2005.
 
Zhai, Y., Yilmaz, A. and Shah, M. (2005) "Story Segmentation in News Videos
Using Visual and Textual Cues", 4-th International Conference on Image
and Video Retrieval, Singapore, 2005.
 
Zhai, Y. and Shah, M. (2005) "A Multi-Level Framework for Video Shot
Structuring", International Conference on Image Analysis and
Recognition, Toronto, Canada, 2005.



---------------------------------------------------------------------
2004 (52)
---------------------------------------------------------------------

Amir A., Iyengar G., Lin C.-Y., Naphade M., Natsev A., Neti C., Nock
H.J., Smith J.R., Tseng B. (2004). Multimodal video search techniques:
late fusion of speech-based retrieval and visual content-based
retrieval. 2004 IEEE International Conference on Acoustics, Speech,
and Signal Processing Vol. III Pgs. 1048-51.

Liudmila Boldareva adn Djoerd Hjemstra. (2004). Interactive
Content-Based Retrieval Using Pre-computed Object-Object
Similarities. in P. Enser et al. (Eds.): CIVR 2004, LNCS 3115,
pp.308-316.

Ming-yu Chen, Alexander Hauptmann. (2004). Multi-modal classification in
digital news libraries. International Conference on Digital Libraries
archive Proceedings of the 2004 joint ACM/IEEE conference on Digital
libraries. Tuscon, AZ.  2004.  Pages: 212-213.

Christel, M., Huang, C., Moraveji, N., and Papernick,
N. (2004). Exploiting Multiple Modalities for Interactive Video
Retrieval. Proc. IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP) (Montreal, Canada, May 2004), Vol. III,
pp. 1032-1035.

Christel, M., and Moraveji, N. (2004). Finding the Right Shots:
Assessing Usability and Performance of a Digital Video Library
Interface. In Proceedings of ACM MM'04, October 10-16, 2004, New York,
NY, USA., pp. 732-739.

Christel, M., Moraveji, N., and Huang, C. (2004). Evaluating
Content-Based Filters for Image and Video Retrieval. Proc. ACM SIGIR
'04 (Sheffield, South Yorkshire, UK, July 2004), pp. 590-591.

T.S. Chua, L. Chaisorn. (2004). Story Boundary Detection in Large
Broadcast News Video Archives - Techniques, Experience, Trends.  In
Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA.,
pp. 656-659.

Cooper, M. (2004). Video Segmentation Combining Similarity Analysis
and Classification. In Proceedings of the ACM MM'04, October 10-16,
2004, New York, NY, USA.  Pages 252-255.

de Vries A.P., Westerveld T., Ianeva T.I. (2004). Combining multiple
representations on the TRECVID search task [video retrieval
system]. 2004 IEEE International Conference on Acoustics, Speech, and
Signal Processing Vol. III Pgs. 1052-5.

Pinar Duygulu and Alexander Hauptmann. (2004). What's News, What's Not?
Associating News Videos with Words. in P. Enser et al. (Eds.): CIVR
2004, LNCS 3115, pp.132-140.

P. Duygulu, J.-Y. Pan, D.A. Forsyth. (2004).Towards Auto-Documentary:
Tracking the Evolution of News Stories.  In Proceedings of the ACM
MM'04, October 10-16, 2004, New York, NY, USA., pp.820-827.

Gurrin C. (2004). Video Retrieval within the TREC Framework. 2004. 
Dagstuhl Seminars (04021) on Content-Based Retrieval, Schloss Dagstuhl, 
Germany, 4-9 January 2004.

C. Gurrin, H. Lee, A. F. Smeaton.  (2004). F#Nmschl#Nar @ TRECVID2003:
System Description (Paper and Accompanying Video).  In Proceedings of
the ACM MM'04, October 10-16, 2004, New York, NY, USA. pp. 938-939.

Hauptmann, A., and Christel, M. Successful Approaches in the TREC
Video Retrieval Evaluations. (2004). Proceedings of ACM Multimedia '04
(New York, NY, October 2004), pp. 668-675.

Daniel Heesch and Stefan Rueger. (2004). Three Interfaces for
Content-Based Access to Image Collections. in P. Enser et al. (Eds.):
CIVR 2004, LNCS 3115, pp. 491-499.

L. Hollink, G.P. Ngyuyen, D.C. Koelma, A.T. Schreiber, M. Worring. (2004).
User Strategies in Video Retrieval: A Case Study. in P. Enser et
al. (Eds.): CIVR 2004, LNCS 3115, pp.6-14.

Peter Howarth and Stefan Rueger. (2004). Evaluation of Texture Features
for Content-Based Image Retrieval. in P. Enser et al. (Eds.): CIVR
2004, LNCS 3115, pp.326-334.

Hsu W., Kennedy L., Huang C.-W., Chang S.-F., Lin C.-Y., Iyengar
G. (2004). News video story segmentation using fusion of multi-level
multi-modal features in TRECVID 2003. 2004 IEEE International
Conference on Acoustics, Speech, and Signal Processing Vol. III
Pgs.645-8.

Jarina R, O'Connor N, Murphy N and Marlow S. (2004) An Experiment in
Audio Classification from Compressed Data. Proc. of Int. Workshop on
Systems, Signals and Image Processing IWSSIP'04 , Poznan, Poland,
13-15 September 2004.

Jones, GJF. (2004). Adaptive systems for multimedia information
retrieval. ADAPTIVE MULTIMEDIA RETRIEVAL 3094: 1-18.

Kyperountas M., Cernekova Z., Kotropoulos C., Gavrielides M., Pitas
L. (2004). Audio PCA in a novel multimedia scheme for scene change
detection. 2004 IEEE International Conference on Acoustics, Speech,
and Signal Processing: Vol. iv.  Pgs. 353-6.

Lavrenko V., Feng S.L., Manmatha R. (2004). Statistical models for
automatic video annotation and retrieval. 2004 IEEE International
Conference on Acoustics, Speech, and Signal Processing Vol. III
Pgs.1044-7.

Malobabic J, O'Connor N, Murphy N, and Marlow S. (2004). Automatic
Detection and Extraction of Artificial Text in Video. WIAMIS 2004 -
5th International Workshop on Image Analysis for Multimedia
Interactive Services, Lisbon, Portugal, 21-23 April 2004

M.R. Naphade, J.R. Smith. (2004). On the Detection of Semantic
Concepts at TRECVID. In Proceedings of ACM MM'04, October 10-16, 2004,
New York, NY, USA., pp. 660-667.

Natsev, A., Naphade, M. R., and Smith, J. R. 2004. Semantic
representation: search and mining of multimedia content. In
Proceedings of the Tenth ACM SIGKDD international Conference on
Knowledge Discovery and Data Mining (Seattle, WA, USA, August 22 - 25,
2004). KDD '04. ACM, New York, NY, 641-646. DOI=
http://doi.acm.org/10.1145/1014052.1014133

O'Hare N, Smeaton A, Czirjek C, O'Connor N, and Murphy N. (2004). A
generic news story segmentation system and its evaluation. ICASSP 2004
- IEEE International Conference on Acoustics, Speech, and Signal
Processing, Montreal, Quebec, Canada, 17-21 May 2004.

Rautiainen M, Ojala T & Sepp#Ndnen T (2004) Cluster-temporal browsing of
large news video databases.  Proc. 2004 IEEE International Conference
on Multimedia and Expo, Taipei, Taiwan, 2:751-754.

Rautiainen M, Ojala T & Sepp#Ndnen T (2004) Analysing the performance of
visual, concept and text features in content-based video retrieval.
Proc. 6th ACM SIGMM International Workshop on Multimedia Information
Retrieval, New York, NY, 197-205.

Oscar David Robles Sanchez. Tecnicas de Recuperacion por Contenido
para Imagen y Video en Arquitecturas Paralelas [Techniques for
Content-based Image and Video Retrieval on Parallel
Architectures]. Universidad Politecnica de Madrid. Tesis
Doctoral. Diciembre 2004.

Oscar D. Robles, Pablo Toharia, Angel Rodriguez and Luis
Pastor. (2004) Towards a Content-Based Video Retrieval System using
Wavelet-Based Signatures. In proceedings of IASTED CGIM 2004, Kauai,
Hawaii, USA, Aug. 2004, pp. 344-349. ISBN: 0-88986-418-7

Oscar D. Robles, Pablo Toharia, Angel Rodriguez and Luis
Pastor. (2004) XML Specification for AVI Files in a Content-based
Video Retrieval System. In proceedings of IASTED VIIP 2004. Marbella,
Spain, Sep. 2004, pp. 374-378. ISBN: 0-88986-454-3

Smeaton A. (2004). Access to Archives of Digital Video
Information. The 9th Search Engine Meeting, The Hague, The
Netherlands, 19-20 April 2004.

Smeaton A, Lee H and Mc Donald K. (2004) Experiences of Creating Four
Video Library Collections with the F#Nmschl#Nar System. Journal of Digital
Libraries: Special Issue on Digital Libraries as Experienced by the
Editors of the Journal, Vol. 4, No. 1, pp 42-44, 2004.

A. F. Smeaton, P. Over and W. Kraaij. (2004). TRECVID: Evaluating the
Effectiveness of Information Retrieval Tasks on Digital Video. In
Proceedings of the ACM MM'04, October 10-16, 2004, New York, NY,
USA. Pages 652-655.

Alan F. Smeaton, Wessel Kraaij, and Paul Over. (2004). The TREC Video
Retrieval Evaluation (TRECVID): A Case Study and Status Report.  in
RIAO 2004 Conference Proceedings, Avignon, France. 26-28 April 2004.
Pgs 25-37.

Smith J.R., Over P., Leung C., Ip H., Grubinger M. (2004). Multimedia
retrieval benchmarks. IEEE Multimedia vol.11, no.2: 80-4.

C.G.M. Snoek, M. Worring, and A.G. Hauptmann. Detection of TV news
monologues by style analysis. In International Conference on
Multimedia and Expo, Taipei, Taiwan, June 2004.

Fabrice Souvannavong, Bernard Merialdo, Benoit Huet. (2004). Improved
Video Content Indexing by Multiple Latent Semantic Analysis. in
P. Enser et al. (Eds.): CIVR 2004, LNCS 3115, pp.483-490.

T. Ianeva, A.P. de Vries, and T. Westerveld (2004) A Dynamic
Probabilistic Multimedia Retrieval Model. 2004 IEEE International
Conference on Multimedia & Expo (ICME 2004), Taipei, Taiwan, June,
2004. http://www.uv.es/%7Etzveta/icme04.pdf

T. Volkmer, S.M.M. Tahaghoghi and H.E. Williams.(2004) Gradual Transition
Detection Using Average Frame Similarity. In Sadiye Guler, Alexander
G.  Hauptmann and Andreas Henrich editors, Proceedings of the Fourth
International Workshop on Multimedia Data and Document Engineering
(MDDE-04), in conjunction with the 2004 Computer Vision Pattern
Recognition Conference (CVPR-04), Washington D.C., USA, 2nd July
2004, IEEE Computer Society. [also published as: Proceedings of the
2004 Conference on Computer Vision and Pattern Recognition Workshop
(CVPRW'04), Volume 9, 27 June - 2 July 2004.]

Thijs Westerveld and Arjen P. de Vries. (2004). Multimedia Retrieval
Using Multiple Examples. in P. Enser et al. (Eds.): CIVR 2004, LNCS
3115, pp.344-352

Westerveld, Thijs. (2004). Using generative probabilistic models for
multimedia retrieval (Doctoral dissertation, Twente University, 2004).

M. Worring, G.P. Nguyen, L. Hollink, J.C. van Gemert, and
D.C. Koelma. Accessing video archives using interactive search. In
International Conference on Multimedia and Expo, Taipei, Taiwan, June
2004.

Y. Wu, E.Y. Chang, K.C-C. Chang, J.R. Smith. (2004). Optimal
Multimodal Fusion for Multimedia Data Analysis. In Proceedings of ACM
MM'04, October 10-16, 2004, New York, NY, USA., pp.

Rong Yan, Alexander G. Hauptmann. (2004). Co-retrieval: A Boosted
Reranking Approach for Video Retrieval. in P. Enser et al. (Eds.):
CIVR 2004, LNCS 3115, pp.60-69..

Yan, R., Yang, J. Hauptmann, A.  (2004). Learning Query-Class
Dependent Weights in Automatic Video Retrieval. In Proceedings of ACM
MM'04, October 10-16, 2004, New York, NY, USA., pp. 548-555.

Yang, J., Hauptmann, A. (2004). Naming Every Individual in News Video
Monologues.  In Proceedings of ACM MM'04, October 10-16, 2004, New
York, NY, USA., pp. 580-587.

Jun Yang, Ming-yu Chen, Alex Hauptmann. (2004). Finding Person X:
Correlating Names with Visual Appearances. in P. Enser et al. (Eds.):
CIVR 2004, LNCS 3115, pp.270-278.

M.Yang, B.M. Wildemuth, G.Marchionini. (2004). The Relative
Effectiveness of Concept-based Versus Content-based Video
Retrieval. In Proceedings of ACM MM'04, October 10-16, 2004, New York,
NY, USA., pp. 368-371.

Yavlinsky A., Pickering M.J., Heesch D., Ruger S. (2004). A
comparative study of evidence combination strategies. 2004 IEEE
International Conference on Acoustics, Speech, and Signal Processing
Vol. III Pgs.1040-3.

Ye J. and Smeaton A. Poster (2004) Aggregated Feature Retrieval for
MPEG-7 via Clustering.  presented at: SIGIR 2004 - the 27th Annual
International ACM SIGIR Conference, pp514-515, Sheffield, UK, 25-29
July 2004.

D.Q. Zhang, S.-F. Chang. (2004). Detecting Image Near-Duplicate by
Stochastic Attributed Relational Graph Matching with Learning. In
Proceedings of ACM MM'04, October 10-16, 2004, New York, NY, USA.,
pp. 877-884.



---------------------------------------------------------------------
2003
---------------------------------------------------------------------

Christel, M.G., and Huang, C. Enhanced Access to Digital Video through
Visually Rich Interfaces. (2003).Proceedings of the IEEE International
Conference on Multimedia and Expo (ICME) (Baltimore, MD, July 2003),
pp. III-21 - III-24.

Georgina Gaughan, Alan F. Smeaton, Cathal Gurrin, Hyowon Lee, Kieran
McDonald. (2003). Video retrieval: Design, implementation and testing
of an interactive video retrieval system. Proceedings of the 5th ACM
SIGMM international workshop on Multimedia information
retrieval. Berkeley, California. 2003. Pages: 23-30.

Hauptmann A.G., Rong Jin, Ng T.D. (2003). Video retrieval using speech
and image information. Proceedings of the SPIE - The International
Society for Optical Engineering vol.5021: 148-59.

G. Iyengar, H. J. Nock. (2003). Discriminative model fusion for
semantic concept detection and annotation in video Proceedings of the
eleventh ACM international conference on Multimedia Berkeley,
CA. November 2003.  Pages: 255-258.

C-Y. Lin, M. Naphade, A. Natsev, C. Neti, J. R. Smith, B. Tseng,
H. J. Nock, W. Adams. (2003). User-trainable video annotation using
multimodal cues Proceedings of the 26th annual international ACM SIGIR
conference on Research and development in informaion
retrieval. Toronto, Canada July 2003.  Pages: 403-404

Naphade, MR; Smith, JR. (2003). A hybrid framework for detecting the
semantics of concepts and context. IMAGE AND VIDEO RETRIEVAL,
PROCEEDINGS 2728: 196-205.

Naphade M.R., Smith J.R. (2003). Role of classifiers in multimedia
content management. Proceedings of the SPIE - The International
Society for Optical Engineering vol.5021: 89-99.

Natsev A., Naphade M.R., Smith J.R. (2003). Exploring semantic
dependencies for scalable concept detection. Proceedings 2003
International Conference on Image Processing Vol. III Pgs. 625-8.

Rautiainen M, Ojala T & Sepp#Ndnen T (2003) Cluster-temporal video
browsing with semantic filtering.  Proc. Advanced Concepts for
Intelligent Vision Systems, Ghent, Belgium, 116 - 123.

Rautiainen M, Sepp#Ndnen T, Penttil#Nd J & Peltola J (2003) Detecting
semantic concepts from video using temporal gradients and audio
classification.  Proc. International Conference on Image and Video
Retrieval, Urbana, IL, 260 - 270.

Rong Yan, Alexander G. Hauptmann, Rong Jin. (2003). Negative
pseudo-relevance feedback in content-based video retrieval Proceedings
of the eleventh ACM international conference on Multimedia Berkeley,
CA.  November 2003.  Pages: 343-346

A. Smeaton. (2003).Information Access to Digital Video Archives: A
Review of TREC, and the F#Nmschl#Nar System. Invited speech at: MIR2003 -
Workshop: Multimedia Information Retrieval in Business Applications,
Fraunhofer Institute for Computer Graphics (IGD), Darmstadt, Germany,
30-31 Jaunary 2003

Smeaton A, Lee H, O'Connor N, Marlow S and Murphy N. (2003). TV News
Story Segmentation, Personalisation and Recommendation AAAI 2003
Spring Symposium on Intelligent Multimedia Knowledge Management,
Stanford University, Palo Alto, CA, 24-26 March 2003.

Smeaton, AF; Over, P. (2003). TRECVID: Benchmarking the effectiveness of
information retrieval tasks on digital video. IMAGE AND VIDEO
RETRIEVAL, PROCEEDINGS 2728: 19-27.

T. Ianeva, A. P. de Vries, and H. R#Nvhrig (2003) Detecting cartoons: a
case study in automatic video-genre classification.  In Proceedings of
the IEEE International Conference on Multimeda & Expo (ICME),
pp. 1149-1452, Baltimore, MD, US,July
2003. http://www.uv.es/%7Etzveta/icme03.pdf

Thijs Westerveld, Arjen P. de Vries. (2003). Multimedia information
retrieval: Experimental result analysis for a generative probabilistic
image retrieval model Proceedings of the 26th annual international ACM
SIGIR conference on Research and development in information retrieval.
Toronto, Canada.  July 2003. Pages: 135-142.

Westerveld, T; de Vries, AP; van Ballegooij, A; de Jong, F; Hiemstra,
D. (2003). A Probabilistic multimedia retrieval model and its
evaluation. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING 2003 (2):
186-198.

Yanjun Qi, Hauptmann A., Ting Liu. (2003). Supervised classification for
video shot segmentation. Proceedings 2003 International Conference on
Multimedia and Expo.  Vol. II Pgs.689-92.



---------------------------------------------------------------------
2002
---------------------------------------------------------------------

Basu S., Naphade M., Smith J.R. (2002). A statistical modeling approach
to content based retrieval. 2002 IEEE International Conference on
Acoustics, Speech, and Signal Processing. Proceedings: IV 4080-3.

Hauptmann A.G., Christel M.G., Papernick N.D. (2002). Video retrieval
with multiple image search strategies. JCDL 2002. Proceedings of the
Second ACM/IEEE-CS Joint Conference on Digital Libraries: 376 edited
by Marchionini G., Hersh W.

Hauptmann A.G., Jin R., Ng T.D. (2002). Multimodal information retrieval
from broadcast video using OCR and speech recognition. JCDL
2002. Proceedings of the Second ACM/IEEE-CS Joint Conference on
Digital Libraries: 160-1 edited by Marchionini G., Hersh W.

Hauptmann A.G., Papernick N.D. (2002). Video-Cuebik: adapting image
search to video shots. JCDL 2002. Proceedings of the Second
ACM/IEEE-CS Joint Conference on Digital Libraries: 156-7 edited by
Marchionini G., Hersh W.

Naphade M.R., Basu S., Smith J.R., Ching-Yung Lin, Tseng
B. (2002). Modeling semantic concepts to support query by keywords in
video. Proceedings 2002 International Conference on Image Processing
Vol. I Pgs. 145-8.

Naphade M.R., Basu S., Smith J.R., Ching-Yung Lin, Tseng B. (2002). A
statistical modeling approach to content based video
retrieval. Proceedings 16th International Conference on Pattern
Recognition: Pgs. 953-6 edited by Kasturi R., Laurendeau D., Suen C.

H. J. Nock, G. Iyengar, C. Neti. (2002). Assessing face and speech
consistency for monologue detection in video Proceedings of the tenth
ACM international conference on Multimedia Juan-les-Pins,
France. December 2002. Pages: 303-306.

Rautiainen M., Doermann D. (2002). Temporal color correlograms for video
retrieval.  Proceedings 16th International Conference on Pattern
Recognition: 267-70 edited by Kasturi R., Laurendeau D., Suen C.

Rautiainen M & Ojala T (2002)
Color correlograms in image and video retrieval.
Proc. STeP 2002, The 10th Finnish Artificial Intelligence Conference,
Oulu, Finland, 203 - 212.

Smeaton A.F., Over P., Costello C.J., de Vries A.P., Doermann D.,
Hauptmann A., Rorvig M.E., Smith J.R., Wu L. (2002). The TREC2001 video
track: information retrieval on digital video information. Research
and Advanced Technology for Digital Libraries. 6th European
Conference, ECDL 2002. Proceedings (Lecture Notes in Computer Science
Vol. 2458 Pgs. 266-75 edited by Agosti M., Thanos C.



---------------------------------------------------------------------
2001
---------------------------------------------------------------------

Smeaton, A. (2001). Content-based access to digital video: the F#Nmschl#Nar
system and the TREC Video track. MMCBIR 2001 - Multimedia
Content-based Indexing and Retrieval, INRIA, Rocquencourt, France,
24-25 September 2001.