Auto encoder with mode‐based learning for keyframe extraction in video summarization

doi:10.1111/exsy.13437

DOI: 10.1111/exsy.13437 ISSN:

Auto encoder with mode‐based learning for keyframe extraction in video summarization

Prashant Giridhar Shambharkar, Ruchi Goel

Artificial Intelligence
Computational Theory and Mathematics
Theoretical Computer Science
Control and Systems Engineering

Show PDF Cite

Abstract

The exponential increase in video consumption has created new difficulties for browsing and navigating through video more effectively and efficiently. Researchers are interested in video summarization because it offers a brief but instructive video version that helps users and systems save time and effort when looking for and comprehending relevant content. Key frame extraction is a method of video summarization that only chooses the most important frames from a given video. In this article, a novel supervised learning method ‘TC‐CLSTM Auto Encoder with Mode‐based Learning’ using temporal and spatial features is proposed for automatically choosing keyframes or important sub‐shots from videos. The method was able to achieve an average F‐score of 84.35 on TVSum dataset. Extensive tests on benchmark data sets show that the suggested methodology outperforms state‐of‐the‐art methods.

Outline

Auto encoder with mode‐based learning for keyframe extraction in video summarization

Abstract

More from our Archive