AudioGen: Textually Guided Audio Generation - Paper Explained

Aleksa Gordić - The AI Epiphany via YouTube

Classroom Contents

  1. Intro
  2. Why is text-to-audio hard?
  3. Comparison with VQ-GAN
  4. Comparison with SoundStream
  5. AudioGen overview
  6. Deep dive: audio representation, LSTM
  7. Losses explained
  8. Complex-valued STFTs
  9. Audio Language Modeling
  10. Multi-stream audio inputs
  11. Data and augmentations
  12. Results
  13. Outro
