DrumGAN - Adversarial synthesis of drum sounds

Falini, Victor

DrumGAN - Adversarial synthesis of drum sounds

Mark

Falini, Victor ^LU (2021) STAN40 20201
Department of Statistics

Abstract: This paper faces the problem of audio synthesis with Generative Adversarial Networks (GAN), with an attempt to create novel, original high-quality samples of drums that could be used within the realm of music production.
My results show that it is possible to create high-quality drum samples with architecture such as DCGAN (Deep Convolutional GAN) and that using causal and dilated convolutions is a viable approach, while not making clear if this approach is significantly better than the one of using standard convolutions. It also shows that 1-d transpose convolutions can be substituted with nearest neighbour upsampling followed by regular 1-d convolutions for GANs that generate one-dimensional data. In the final discussion the idea of... (More); This paper faces the problem of audio synthesis with Generative Adversarial Networks (GAN), with an attempt to create novel, original high-quality samples of drums that could be used within the realm of music production.
My results show that it is possible to create high-quality drum samples with architecture such as DCGAN (Deep Convolutional GAN) and that using causal and dilated convolutions is a viable approach, while not making clear if this approach is significantly better than the one of using standard convolutions. It also shows that 1-d transpose convolutions can be substituted with nearest neighbour upsampling followed by regular 1-d convolutions for GANs that generate one-dimensional data. In the final discussion the idea of initializing generator weights in a strategic way in order to increase GAN training stability is introduced. (Less)
Popular Abstract: This paper faces the problem of audio synthesis with Generative Adversarial Networks (GAN), with an attempt to create novel, original high-quality samples of drums that could be used within the realm of music production.
My results show that it is possible to create high-quality drum samples with architecture such as DCGAN (Deep Convolutional GAN) and that using causal and dilated convolutions is a viable approach, while not making clear if this approach is significantly better than the one of using standard convolutions. It also shows that 1-d transpose convolutions can be substituted with nearest neighbour upsampling followed by regular 1-d convolutions for GANs that generate one-dimensional data. In the final discussion the idea of... (More); This paper faces the problem of audio synthesis with Generative Adversarial Networks (GAN), with an attempt to create novel, original high-quality samples of drums that could be used within the realm of music production.
My results show that it is possible to create high-quality drum samples with architecture such as DCGAN (Deep Convolutional GAN) and that using causal and dilated convolutions is a viable approach, while not making clear if this approach is significantly better than the one of using standard convolutions. It also shows that 1-d transpose convolutions can be substituted with nearest neighbour upsampling followed by regular 1-d convolutions for GANs that generate one-dimensional data. In the final discussion the idea of initializing generator weights in a strategic way in order to increase GAN training stability is introduced. (Less)

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/9055621

author

Falini, Victor ^LU

supervisor

Peter Gustafsson ^LU

organization

Department of Statistics

course

STAN40 20201

year

2021

type

H1 - Master's Degree (One Year)

subject

Mathematics and Statistics

keywords

Generative Adversarial Network, Audio Generation, Audio Synthesis, Drums, Samples, GAN, DCGAN

language

English

id

9055621

date added to LUP

2022-06-03 09:31:24

date last changed

2022-06-03 11:41:28

@misc{9055621,
  abstract     = {{This paper faces the problem of audio synthesis with Generative Adversarial Networks (GAN), with an attempt to create novel, original high-quality samples of drums that could be used within the realm of music production. 
My results show that it is possible to create high-quality drum samples with architecture such as DCGAN (Deep Convolutional GAN) and that using causal and dilated convolutions is a viable approach, while not making clear if this approach is significantly better than the one of using standard convolutions. It also shows that 1-d transpose convolutions can be substituted with nearest neighbour upsampling followed by regular 1-d convolutions for GANs that generate one-dimensional data. In the final discussion the idea of initializing generator weights in a strategic way in order to increase GAN training stability is introduced.}},
  author       = {{Falini, Victor}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{DrumGAN - Adversarial synthesis of drum sounds}},
  year         = {{2021}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

DrumGAN - Adversarial synthesis of drum sounds