selfclean_audio.datasets.csem#

Members

CSEMMembranePumps

CSEM Membrane Pump Audio Dataset loader.

class selfclean_audio.datasets.csem.CSEMMembranePumps(root: str | Path, convert_mono: bool = True, sample_rate: int = 16000, target_duration_sec: float | None = None, index_file: str | Path = 'index.csv', files_dir: str | Path = 'files')[source]#

CSEM Membrane Pump Audio Dataset loader.

Expects the following structure under root (see data/CSEM/README.md):

root/
    files/
        {guid}.wav
        ...
    index.csv   # columns: id, filename, label

The dataset returns tuples of (waveform, absolute_path, label). noisy_label is not known and will be set by the synthetic/noise wrappers when applicable, otherwise considered 0 by downstream code.

get_errors()[source]#

Return dummy ground truth for CSEM dataset (no ground truth available).

Returns empty lists to bypass scoring requirements while allowing ranking generation to proceed.