Audio Course documentation
Check your understanding of the course material
Unit 0. Welcome to the course!
Unit 1. Working with audio data
What you'll learnIntroduction to audio dataLoad and explore an audio datasetPreprocessing audio dataStreaming audio dataQuizSupplemental reading and resources
Unit 2. A gentle introduction to audio applications
Unit 3. Transformer architectures for audio
Unit 4. Build a music genre classifier
Unit 5. Automatic Speech Recognition
Unit 6. From text to speech
Unit 7. Putting it all together
Unit 8. Finish line
Course Events
Check your understanding of the course material
1. What units is the sampling rate measured in?
2. When streaming a large audio dataset, how soon can you start using it?
3. What is a spectrogram?
4. What is the easiest way to convert raw audio data into log-mel spectrogram expected by Whisper?
A.
librosa.feature.melspectrogram(audio["array"])B.
feature_extractor = WhisperFeatureExtractor.from_pretrained("openai/whisper-small")
feature_extractor(audio["array"])C.
dataset.feature(audio["array"], model="whisper")5. How do you load a dataset from 🤗 Hub?
A.
from datasets import load_dataset
dataset = load_dataset(DATASET_NAME_ON_HUB)B.
import librosa
dataset = librosa.load(PATH_TO_DATASET)C.
from transformers import load_dataset
dataset = load_dataset(DATASET_NAME_ON_HUB)