Feature extraction for speech and music discrimination

Zhou, Huiyu and Sadka, Abdul and Jiang, Richard M. (2008) Feature extraction for speech and music discrimination. In: 2008 International Workshop on Content-Based Multimedia Indexing, CBMI 2008, Conference Proceedings. IEEE, GBR, pp. 170-173. ISBN 9781424420445

Full text not available from this repository.

Abstract

Motivated by the demands of information retrieval, video editing and human-computer interfaces, this paper proposes a novel spectral feature for music and speech discrimination. The scheme attempts to simulate a biological model using the averaged cepstrum, in which human perception tends to pick up areas of large cepstral change. Cepstrum data that lie away from the mean value are exponentially reduced in magnitude. We conduct music/speech discrimination experiments that compare the classification performance of the proposed feature with that of previously proposed features. Classification based on dynamic time warping shows that the proposed feature gives the best music/speech classification performance on the test database.
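
The abstract does not give the feature's exact formula, so the following is only a rough sketch of the pipeline it describes: frame-level cepstra, exponential attenuation of values far from the mean, and a dynamic time warping (DTW) distance for comparing clips. The weighting form exp(-alpha * |c - mean| / std), the parameter alpha, the frame sizes, and all function names are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def real_cepstrum(frame, eps=1e-10):
    """Real cepstrum of one frame: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.rfft(frame * np.hanning(len(frame)))
    return np.fft.irfft(np.log(np.abs(spectrum) + eps), n=len(frame))

def attenuate_far_from_mean(cepstra, alpha=1.0):
    """ASSUMED weighting (not from the paper): shrink each coefficient
    exponentially with its distance from that coefficient's mean over time."""
    mean = cepstra.mean(axis=0, keepdims=True)
    scale = cepstra.std(axis=0, keepdims=True) + 1e-10
    weights = np.exp(-alpha * np.abs(cepstra - mean) / scale)
    return cepstra * weights

def features(signal, frame_len=256, hop=128, n_coeffs=13):
    """Frame the signal, keep the first n_coeffs cepstral values per frame,
    then apply the assumed attenuation. Frame sizes are illustrative."""
    starts = range(0, len(signal) - frame_len + 1, hop)
    cepstra = np.array([real_cepstrum(signal[s:s + frame_len])[:n_coeffs]
                        for s in starts])
    return attenuate_far_from_mean(cepstra)

def dtw_distance(a, b):
    """Standard DTW cost between two sequences of shape (frames, dims)."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]
```

Under these assumptions, a clip could be labelled by computing `dtw_distance` between its features and those of labelled speech and music templates and choosing the nearest one; the paper's actual classifier setup may differ.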

Item Type:

Contribution in Book/Report/Proceedings

Subjects:

Computer graphics and computer-aided design; information systems; information systems and management

Departments:

Faculty of Science and Technology > School of Computing & Communications

ID Code:

134375

Deposited By:

ep_importer_pure

Deposited On:

22 Jun 2019 01:03

Refereed?:

Yes

Published?:

Published

Last Modified:

19 Sep 2024 10:01

URI:

https://eprints.lancs.ac.uk/id/eprint/134375