Welcometo my personal page.
Adrian CHIFU
21 Nov 2020

SegChainW2V: Towards a generic automatic video segmentation framework, based on lexical chains of audio transcriptions and word embeddings

With the advances in multimedia broadcasting through a rich variety of channels and with the vulgarization of video production, it becomes essential to be able to provide reliable means of retrieving information within videos, not only the videos themselves. Research in this area has been widely focused on the context of TV news broadcasts, for which the structure itself provides clues for story segmentation. The systematic employment of these clues would lead to thematically driven systems that would not be easily adaptable in the case of videos of other types. The systems are therefore dependent on the type of videos for which they have been designed. In this paper we aim at introducing SegChainW2V, a generic unsupervised framework for story segmentation, based on lexical chains from transcriptions and their vectorization. SegChainW2V takes into account the topic changes by perceiving the fiuctuations of the most frequent terms throughout the video, as well as their semantics through the word embedding vectorization.

Comments Off on SegChainW2V: Towards a generic automatic video segmentation framework, based on lexical chains of audio transcriptions and word embeddings