selfclean_audio.datasets.base#

Base audio dataset class providing common functionality for audio loading and preprocessing.

Members

BaseAudioDataset

Base class for audio datasets with common preprocessing functionality.

class selfclean_audio.datasets.base.BaseAudioDataset(root: str | None = None, convert_mono: bool = True, sample_rate: int = 44100, target_duration_sec: float | None = None)[source]#

Base class for audio datasets with common preprocessing functionality.

Provides standardized audio loading, mono conversion, resampling, and duration handling.

Initialize base audio dataset.

Parameters:
  • root – Root directory path for the dataset

  • convert_mono – Convert stereo audio to mono if True

  • sample_rate – Target sample rate for audio (will resample if needed)

  • target_duration_sec – Target duration in seconds (will pad/trim if specified)