Evaluating The Performance of DWT-DCT Feature Extraction in Guitar Chord Recognition

Linggo Sumarno

doi:10.24071/ijasst.v6i2.9972

Evaluating The Performance of DWT-DCT Feature Extraction in Guitar Chord Recognition

Linggo Sumarno^(1*),

(1) Sanata Dharma University
(*) Corresponding Author

Abstract

This study presents advancements in audio signal processing techniques, specifically in enhancing the efficiency of guitar chord recognition. It is a continuation of the previous studies, which also aim at minimizing the feature extraction length with the intended performance. This study adopted two signal processing techniques that are common: Discrete Wavelet Transform (DWT) and Discrete Cosine Transform (DCT) for use in the feature extraction method. By conducting a systematic evaluation of two key parameters: frame blocking length and wavelet filter selection, a significant achievement could be achieved. The recognition system managed to obtain chord recognition with an accuracy of up to 91.43%, by using a feature extraction length of only three, which brought about smaller representation than the previous studies. The outcome of this study will help improve the data processing, which can be applied in real time, in this case in Field Programmable Gate Array (FPGA)-based chord recognition systems.

Keywords: chord recognition, Discrete Wavelet Transform, Discrete Cosine Transform, feature extraction

Full Text:

PDF

References

[1] T. Fujishima, “Realtime chord recognition of musical sound: a system using Common Lisp Music”, Proceeding of the International Computer Music Conference (ICMC), Beijing, pp. 464–467, 1999.

[2] K. Ma, “Automatic Chord Recognition”, Department of Computer Sciences, University of Wisconsin-Madison, May 2016. [Online]. Available : http://pages.cs.wisc.edu/~kma/projects.html. [Accessed Jul. 22, 2024].

[3] P. Rajparkur, B. Girardeau, and T. Migimatsu, “A Supervised Approach to Musical Chord Recognition”, Stanford Undergraduate Research Journal, Vol. 15, pp. 36-40, 2015.

[4] E. Demirel, B. Bozkurt, and X. Serra, “Automatic chord-scale recognition using harmonic pitch class profiles”, Proc. Sound Music Comput. Conf., pp. 72–79, 2019.

[5] L. Sumarno, “Chord Recognition using FFT Based Segment Averaging and Subsampling Feature Extraction,” Proceeding of 8th International Conference on Information and Communication Technology (ICoICT 2020), pp. 465–469. 2020.

[6] L. Sumarno, “Guitar chord recognition using MFCC based feature extraction with Kaiser windowing”, Proceeding of Transdisciplinary Symposium on Engineering and Technology (TSET 2022), Published 2024.

[7] L. Sumarno, “The performance of DST-Wavelet feature extraction for guitar chord recognition”, Proceeding of The 1st International Conference on Applied Sciences and Smart Technologies (InCASST 2023), Published 2024.

[8] K. Vaca, M. M. Jefferies, and X. Yang, “An Open Audio Processing Platform with Zync FPGA”, Proceeding of 22nd IEEE Int. Symp. Meas. Control Robot. Robot. Benefit Humanit. ISMCR 2019, pp. D1-2-1–D1-2-6, 2019.

[9] K. Vaca, A. Gajjar, and X. Yang, “Real-Time Automatic Music Transcription (AMT) with Zync FPGA”, Proceeding of IEEE Comput. Soc. Annu. Symp. VLSI, ISVLSI, Vol. 2019-July (2019), pp. 378–384, 2019.

[10] O.K. Hamid, “Frame Blocking and Windowing Speech Signal”, J. Information, Commun. Intell. Syst., Vol. 4, No. 5, pp. 87–94, 2018.

[11] H. Rakshit, and M. A. Ullah, “A comparative work on window functions for designing efficient FIR filter”, Proceeding of 2014 9th Int. Forum Strateg. Technol. (IFOST 2014), pp. 91–96, 2014.

[12] L. Sumarno, “Chord recognition using segment averaging feature extraction with simplified harmonic product spectrum and logarithmic scaling”, Int. J. Electr. Eng. Informatics, Vol. 10, No. 4, pp. 753–764, 2018.

[13] A.M. Noll, “Pitch Determination of Human Speech by the Harmonic Product Spectrum, the Harmonic Sum Spectrum and a Maximum Likelihood Estimate”, Proceeding of the Symposium on Computer Processing in Communications, Vol. 19, Polytechnic Press, Brooklyn, New York, pp. 779-797, 1970.

[14] I. Izonin, R. Tkachenko, N. Shakhovska, B. Ilchyshyn, and K.K. Singh, “A Two-Step Data Normalization Approach for Improving Classification Accuracy in the Medical Diagnosis Domain”, Mathematics, Vol. 10, No. 11, 2022

[15] A.K. Jain, R.P.W. Duin, and J. Mao, “Statistical pattern recognition: A review”, IEEE Trans. Pattern Anal. Mach. Intell. Vol. 22, No. 1, pp. 4–37, 2000.

[16] A. Massari, R.W. Clayton, and M. Kohler, “Damage detection by template matching of scattered waves”, Bull. Seismol. Soc. Am. Vol. 108, No. 5, pp. 2556–2564, 2018.

[17] H.U. Zhi-Qiang, Z. Jia-Qi, W. Xin, L.I.U. Zi-Wei, and L.I.U. Yong, “Improved algorithm of DTW in speech recognition”, in Proceeding of IOP Conf. Ser. Mater. Sci. Eng., Vol. 563, No. 5, 2019.

[18] S. Sohangir, and D. Wang, “Improved sqrt-cosine similarity measurement”, J. Big Data, Vol 4, No 1, 2017.

[19] H.R. Shahdoosti, and F. Mirzapour, “Spectral–spatial feature extraction using orthogonal linear discriminant analysis for classification of hyperspectral data”, Eur. J. Remote Sens., Vol. 50, No. 1, 2017.

[20] Y. Zhao, H. Lv, J. Li, and L. Zhu, “High performance and resource efficient FFT processor based on CORDIC algorithm”, EURASIP J. Adv. Signal Process, Vol. 23, 2022.

[21] M.A.M. Basiri, and P. Bharadwaja, "Efficient FPGA Implementations of Lifting based DWT using Partial Reconfiguration," 2023 36th International Conference on VLSI Design and 2023 22nd International Conference on Embedded Systems (VLSID), Hyderabad, India, pp. 319-324, 2023.

[22] C.A. Kumar, G.R. Poornima, R. Aruna, B.P.P. Kumar, S. Harish, & D.A.L. Vaishnavi, “Implementation of an Efficient and Reconfigurable Architecture for DCT on FPGA”, International Journal of Intelligent Systems and Applications in Engineering, Vol. 12, No. 10s, pp. 597–604, 2024.

[23] I. Bravo-Munoz, J.L. Lazaro-Galilea, and A. Gardel-Vicente, “FPGA and SoC Devices Applied to New Trends in Image/Video and Signal Processing Fields”, Electronics, Vol. 6, No. 25, 2017.

DOI: https://doi.org/10.24071/ijasst.v6i2.9972

Refbacks

There are currently no refbacks.

Publisher : Faculty of Science and Technology

Society/Institution : Sanata Dharma University

This work is licensed under a Creative Commons Attribution 4.0 International License.

Username
Password
Remember me