Vocal & Instrumental Isolation

MVSep Wind (wind, other)

The MVSep Wind model produces high-quality separation of music into a wind part and everything else. It is available in two variants, based on the MelRoformer and SCNet Large architectures. Wind covers two categories of instruments: brass and woodwind. Specifically, the wind part includes: flute, saxophone, trumpet, trombone, horn, clarinet, oboe, harmonica, bagpipes, bassoon, tuba, kazoo, piccolo, flugelhorn, ocarina, shakuhachi, melodica, reeds, didgeridoo, musette, gaida.

Quality metrics

Wind dataset
Algorithm name	SDR Wind (dB)	SDR Other (dB)
MelBand Roformer	6.73	16.10
SCNet Large	6.76	16.13
MelBand + SCNet Ensemble	7.22	16.59
MelBand + SCNet Ensemble (+extract from Instrumental)	---	---
BS Roformer	9.82	19.19
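
The SDR values in the table are signal-to-distortion ratios in decibels, where higher is better. The page does not state the exact formula used for scoring, so the sketch below assumes the common energy-ratio definition, SDR = 10 * log10(sum(ref^2) / sum((ref - est)^2)). The function name `sdr_db` and the `eps` guard are illustrative choices, not part of the MVSep service.

```python
import math
from typing import Sequence


def sdr_db(reference: Sequence[float], estimate: Sequence[float], eps: float = 1e-12) -> float:
    """Signal-to-distortion ratio in dB (assumed energy-ratio definition).

    reference: samples of the ground-truth stem (e.g. the true wind part)
    estimate:  samples of the separated stem produced by a model
    eps:       small constant that avoids division by zero and log of zero
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    # Energy of the ground-truth signal.
    signal_energy = sum(r * r for r in reference)
    # Energy of the separation error (everything that differs from the truth).
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    return 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))


# Toy example: a 440 Hz tone at 44.1 kHz, and an estimate that is 10% too quiet.
reference = [math.sin(2 * math.pi * 440 * n / 44100) for n in range(1000)]
estimate = [0.9 * r for r in reference]
score = sdr_db(reference, estimate)  # approximately 20.0 dB
```

Under this definition, an error with one hundredth of the signal energy gives 20 dB, so a gain of about 0.5 dB between two rows corresponds to roughly 11% less error energy.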

DnR dataset (test)
Algorithm name	SDR Speech	SDR Music	SDR Effects
BandIt Plus	15.62	9.21	9.69

DnR v3 leaderboard
Algorithm name	SDR Music	SDR SFX	SDR Speech
SCNet Large	9.94	11.35	12.59
Mel Band Roformer	9.45	11.24	12.27
Ensemble (Mel + SCNet)	10.15	11.67	12.81
Bandit v2 (for reference)	9.06	10.82	12.29

Author	Architecture	Works with	SDR (no independent testing yet)
FoxJoy	MDX-B	Full track	~6.50
anvuew	MelRoformer	Only vocals	7.56
anvuew	BSRoformer	Only vocals	8.07
anvuew v2	MelRoformer	Only vocals	---
Sucial	MelRoformer	Only vocals	10.01
anvuew	BSRoformer	Only vocals (Room)	---

Algorithm name	Multisong SDR Bass	Multisong SDR Drums	Multisong SDR Other	Multisong SDR Vocals	Multisong SDR Instrumental	Synth SDR Vocals	Synth SDR Instrumental
Demucs3 (Model A)	9.50	8.97	4.40	7.21	13.52	---	---
Demucs3 (Model B)	10.69	10.27	5.35	8.13	14.44	9.78	9.48

MVSep Brass (brass, other)

MVSep Woodwind (woodwind, other)

MVSep Percussion (percussion, other)

BandIt Plus (speech, music, effects)

BandIt v2 (speech, music, effects)

MVSep DnR v3 (speech, music, effects)

Apollo Enhancers (by JusperLee, Lew, baicai1145)

Reverb Removal (noreverb)

AudioSR (Super Resolution)

FlashSR (Super Resolution)

Stable Audio Open Gen

Whisper (extract text from audio)

Parakeet (extract text from audio)

VibeVoice (Voice Cloning)

VibeVoice (TTS)

MVSep MultiSpeaker (MDX23C)

Aspiration (by Sucial)

Matchering (by sergree)

SOME (Singing-Oriented MIDI Extractor)

Demucs3 Model (vocals, drums, bass, other)

Vit Large 23 (vocals, instrum)

MVSep MelBand Roformer (vocals, instrum)

LarsNet (kick, snare, cymbals, toms, hihat)
