
| Mixed Audio Input | Reference audio for Speaker Embedding | VoiceFilter Output | Spectron Output | Clean audio (ground truth) |
|---|---|---|---|---|
| Mixed Audio Input | Reference audio for Speaker Embedding | X-tasnet Output | Spectron Output | Clean audio (ground truth) |
|---|---|---|---|---|
| Mixed Audio Input | Reference audio for Speaker Embedding |
Spectron Output |
|---|---|---|