The simplest music file to transcribe is a single note. Here I use a bassoon playing a single note A (octave 2) to walk through the simplest transcription.
![](http://sound-analysis.com/wp-content/uploads/2024/03/image-1024x255.png)
From the signal and FFT result we can see that this is indeed a single note with a single dominant frequency.
![](http://sound-analysis.com/wp-content/uploads/2024/03/image-1.png)
The spectrogram confirms the simplicity of this example.
![](http://sound-analysis.com/wp-content/uploads/2024/03/image-2-1024x374.png)
To transcribe this note we will use the built in bassoon profile and the default options (i.e. correlation).
![](http://sound-analysis.com/wp-content/uploads/2024/03/image-3.png)
The result is as expected.
Alternatively we could have elected to use the built in Convolutional Neural Network for the bassoon.
![](http://sound-analysis.com/wp-content/uploads/2024/03/image-4.png)
The result is a little different. The note ends just after the forth beat. The CNN transcribes this as a 3 beat note, not 4 beats.
Leave a Reply