Towards Automatic Textual Summarization of Movies

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review


With the rapidly increasing number of online video resources, the ability of automatically understanding those videos becomes more and more important, since it is almost impossible for people to watch all of the videos and provide textual descriptions. The duration of online videos varies in a extremely wide range, from several seconds to more than 5 h. In this paper, we focus on long videos, especially on full-length movies, and propose the first pipeline for automatically generating textual summaries of such movies. The proposed system takes an entire movie as input (including subtitles), splits it into scenes, generates a one-sentence description for each scene and summarizes those descriptions and subtitles into a final summary. In our initial experiment on a popular cinema movie (Forrest Gump), we utilize several existing algorithms and software tools for implementing the different components of our system. Most importantly, we use the S2VT (Sequence to Sequence—Video to Text) algorithm for scene description generation and MUSEEC (MUltilingual SEntence Extraction and Compression) for extractive text summarization. We present preliminary results from our prototype experimental framework. An evaluation of the resulting textual summaries for a movie made of 156 scenes demonstrates the feasibility of the approach—the summary contains the descriptions of three out of the four most important scenes/storylines in the movie. Although the summaries are far from satisfactory, we argue that the current results can be used to prove the merit of our approach.

Original languageAmerican English
Title of host publicationStudies in Fuzziness and Soft Computing
Place of PublicationCham
Number of pages11
ISBN (Electronic)978-3-030-47124-8
ISBN (Print)978-3-030-47123-1
StatePublished - 11 Jul 2021

Publication series

NameStudies in Fuzziness and Soft Computing

All Science Journal Classification (ASJC) codes

  • Computer Science (miscellaneous)
  • Computational Mathematics


Dive into the research topics of 'Towards Automatic Textual Summarization of Movies'. Together they form a unique fingerprint.

Cite this