Learning optimal features for music transcription

Huaiping Ming, Dongyan Huang, Lei Xie, Haizhou Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper aims to design time-frequency representation (TFR) functions for automatic music transcription. It is desirable that the decomposition of those TFR functions are suitable for notes having variation of both pitch and spectral envelop over time. The Harmonic Adaptive Latent Component Analysis (HALCA) model adopted in this paper allows considering those two kinds of variations simultaneously. We evaluate the influence of three TFR functions including IIR, FIR filter bank semigram (FBSG) and constant-Q transform semigram in automatic music transcription task, on a database of popular and polyphonic classic music. The experiment results show that the filter bank based representations are suitable for multiple-instrument recordings and a CQT-based representation turns out to provide very accurate transcription for solo-instrument recordings.

源语言英语
主期刊名2014 IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
105-109
页数5
ISBN(电子版)9781479954032
DOI
出版状态已出版 - 3 9月 2014
活动2nd IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Xi'an, 中国
期限: 9 7月 201413 7月 2014

出版系列

姓名2014 IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014 - Proceedings

会议

会议2nd IEEE China Summit and International Conference on Signal and Information Processing, IEEE ChinaSIP 2014
国家/地区中国
Xi'an
时期9/07/1413/07/14

指纹

探究 'Learning optimal features for music transcription' 的科研主题。它们共同构成独一无二的指纹。

引用此