Dataset:

teticio/audio-diffusion-instrumental-hiphop-256


256x256 mel spectrograms of 5-second samples of instrumental hip hop. The code to convert audio to spectrograms and back can be found at https://github.com/teticio/audio-diffusion, along with scripts to train and run inference using Denoising Diffusion Probabilistic Models (DDPM).

x_res = 256
y_res = 256
sample_rate = 22050
n_fft = 2048
hop_length = 512
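
As a rough sanity check, the parameters above can be related to the clip length with a little arithmetic. This is only a sketch, not code from the audio-diffusion repository: it assumes that each spectrogram column corresponds to one hop of `hop_length` samples and that `y_res` is the number of mel bins. The derived names (`samples_per_image`, `seconds_per_image`, `window_ms`, `hop_ms`, `fft_bins`) are made up for illustration. Under those assumptions one image spans roughly 5.9 seconds of audio, so the 5-second figure is best read as nominal; the exact slicing used by the repository may differ.

```python
# Parameters as listed in this dataset card.
x_res = 256          # spectrogram width, in time frames
y_res = 256          # spectrogram height; assumed here to be the number of mel bins
sample_rate = 22050  # audio samples per second
n_fft = 2048         # FFT window length, in samples
hop_length = 512     # samples between the starts of successive frames

# Assumption: frame i starts at sample i * hop_length, so x_res frames
# cover about x_res * hop_length samples of audio.
samples_per_image = x_res * hop_length
seconds_per_image = samples_per_image / sample_rate

# Length of one FFT window and of one hop, in milliseconds.
window_ms = 1000 * n_fft / sample_rate
hop_ms = 1000 * hop_length / sample_rate

# Number of linear frequency bins a real FFT of size n_fft produces,
# before they are mapped onto the mel scale.
fft_bins = n_fft // 2 + 1

print(f"samples per image: {samples_per_image}")
print(f"seconds per image: {seconds_per_image:.2f}")
print(f"window length:     {window_ms:.1f} ms")
print(f"hop length:        {hop_ms:.1f} ms")
print(f"FFT bins -> mel:   {fft_bins} -> {y_res}")
```

Each frame overlaps its neighbour by `n_fft - hop_length` samples (75% of the window), which is a common choice for spectrograms that need to be inverted back to audio.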