A temporal multi-view approach for audio key finding using adaboost

Ching-Hua Chuan

doi:10.1109/ICMEW.2013.6618295

Back

Conference proceeding

A temporal multi-view approach for audio key finding using adaboost

Ching-Hua Chuan

2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), pp.1-4

2013-07

DOI: https://doi.org/10.1109/ICMEW.2013.6618295

Abstract

Accuracy

AdaBoost

Adaptation models

Algorithm design and analysis

Arrays

Audio key finding

Center of Effect Generator

Decision trees

Fuzzy Analysis

Hidden Markov models

Spiral Array

Spirals

Audio key finding is an integral step in content-based music indexing and retrieval. In this paper, we present a system that combines ensemble learning with an existing model-based key finding algorithm: the Fuzzy Analysis Center of Effect Generator algorithm. We demonstrate the manner in which AdaBoost improves the accuracy of FACEG using a dataset containing 2785 audio excerpts of real performances composed by Bach and Mozart. Two sets of experiments were conducted: intra-system comparison examining the effect of different settings in FACEG/AdaBoost, and inter-system comparison comparing FACEG/AdaBoost with the key finding implementation in Music Information Retrieval (MIR) toolbox. When FACEG is executed to generate keys at multiple stopping points of the excerpt, AdaBoost with multi-views of tonal information improves key detection accuracy up to 35% on the challenging dataset and up to 21% on the entire dataset.

Metrics

7 Record Views

Details

Title: A temporal multi-view approach for audio key finding using adaboost
Creators: Ching-Hua Chuan - University of North Florida
Publication Details: 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), pp.1-4
Publisher: IEEE
Academic Unit: School of Communication; School of Communication - Cinema & Interactive Media
Language: English
Resource Type: Conference proceeding
Record Identifier: 991031713254402976