Status
Time
39s
Submitted
4 mo ago
Privacy
The model to be used for audio separation. Default is "htdemucs_ft". Check the README for more information on the available models.
Only seperate audio into stem and no_stem. the stem which is specified will be seperated from the rest of the stems.
option controls the amount of overlap between prediction windows. Default is 0.25 (i.e. 25%) which is probably fine. It can probably be reduced to 0.1 to improve a bit speed.
The number of shifts to be used. performs multiple predictions with random shifts (a.k.a the shift trick) of the input and average them. This makes prediction SHIFTS times slower. Default is 0
The format of the audio file to be returned. Default is "wav". You can choose from "mp3", "flac" or "wav".