selfclean_audio.datasets.base
Base audio dataset class providing common functionality for audio loading and preprocessing.
Members
BaseAudioDataset | Base class for audio datasets with common preprocessing functionality.
- class selfclean_audio.datasets.base.BaseAudioDataset(root: str | None = None, convert_mono: bool = True, sample_rate: int = 44100, target_duration_sec: float | None = None)
Base class for audio datasets with common preprocessing functionality.
Provides standardized audio loading, mono conversion, resampling, and duration handling.
Initialize base audio dataset.
- Parameters:
root – Root directory path for the dataset
convert_mono – Convert stereo audio to mono if True
sample_rate – Target sample rate in Hz (audio is resampled if its rate differs)
target_duration_sec – Target duration in seconds (audio is padded or trimmed to this length if specified)
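The three preprocessing steps named in the parameters (mono conversion, resampling, pad/trim) can be illustrated with a minimal, dependency-free sketch. This is not the implementation behind BaseAudioDataset: the helper names (`to_mono`, `resample_linear`, `fix_duration`, `preprocess`), the order of the steps, the use of linear interpolation for resampling, and zero-padding at the end are all assumptions made for illustration. Only the parameter names and defaults are taken from the signature above.

```python
from typing import List, Optional

# A waveform is represented here as a list of channels, each a list of samples.
Waveform = List[List[float]]


def to_mono(waveform: Waveform) -> Waveform:
    """Average all channels into one channel (assumed mono strategy)."""
    n_channels = len(waveform)
    n_samples = len(waveform[0])
    return [[sum(ch[i] for ch in waveform) / n_channels for i in range(n_samples)]]


def resample_linear(waveform: Waveform, orig_sr: int, target_sr: int) -> Waveform:
    """Resample by linear interpolation (a simple stand-in, not a
    band-limited resampler)."""
    if orig_sr == target_sr:
        return [list(ch) for ch in waveform]
    n_in = len(waveform[0])
    n_out = max(1, round(n_in * target_sr / orig_sr))
    resampled = []
    for ch in waveform:
        new_ch = []
        for j in range(n_out):
            pos = j * orig_sr / target_sr      # fractional index into the input
            lo = min(int(pos), n_in - 1)
            hi = min(lo + 1, n_in - 1)
            frac = pos - lo
            new_ch.append(ch[lo] * (1 - frac) + ch[hi] * frac)
        resampled.append(new_ch)
    return resampled


def fix_duration(
    waveform: Waveform, sample_rate: int, target_duration_sec: Optional[float]
) -> Waveform:
    """Trim or zero-pad each channel to the target duration, if one is given."""
    if target_duration_sec is None:
        return waveform
    target_len = int(round(target_duration_sec * sample_rate))
    return [
        ch[:target_len] + [0.0] * max(0, target_len - len(ch)) for ch in waveform
    ]


def preprocess(
    waveform: Waveform,
    orig_sr: int,
    convert_mono: bool = True,
    sample_rate: int = 44100,
    target_duration_sec: Optional[float] = None,
) -> Waveform:
    """Apply the three steps in an assumed order: mono, resample, pad/trim."""
    if convert_mono:
        waveform = to_mono(waveform)
    waveform = resample_linear(waveform, orig_sr, sample_rate)
    return fix_duration(waveform, sample_rate, target_duration_sec)
```

As a usage example, `preprocess(stereo, orig_sr=4, sample_rate=8, target_duration_sec=2.0)` on a two-channel, four-sample input returns a single channel of 16 samples: 8 resampled values followed by 8 zeros of padding. Real audio pipelines typically run these steps on arrays or tensors with a band-limited resampler; plain lists are used here only to keep the sketch self-contained.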