Multimedia content segmentation based on speaker recognition

Jasine Babu; V. Pathari

doi:10.1109/ICSCN.2007.350672

Profiles Research Units Publications

Conferences

Multimedia content segmentation based on speaker recognition

, V. Pathari

Published in

2007

DOI: 10.1109/ICSCN.2007.350672

Pages: 16 - 19

Abstract

Many recent works attempt to index multimedia data based on characteristics such as speaker identify and emo- tional content. In this work, speaker segmentation is performed on movies to extract the shots in which the target actor is speaking. A case of speaker identification on conversational speech under noisy conditions - this work is organized into two phases; an audio classification phase, for the removal of non-speech content, followed by a speaker recognition phase. Along with the speaker models, Gaussian Mixture Models are constructed for sound effects like fight sequences and drum beats to refine the removal of non-speech sounds. Results prove the effectiveness of this deviation from the conventional methods. © 2007 IEEE.

Topics: Speaker diarisation (77)%, Speaker recognition (73)% and Audio signal processing (51)%

View more info for "Multimedia Content Segmentation Based on Speaker Recognition"

About the journal

Journal	Proceedings of ICSCN 2007: International Conference on Signal Processing Communications and Networking

Authors (1)

Jasine Babu
- Department of Computer Science and Engineering

About IIT Palakkad

Research & Development

Academics

Quick Find

About IIT Palakkad

Research & Development

Academics

Quick Find