cs.SD

该分类下的最新论文

cs.SD 2026-06-17
Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate w...

Michael Finkelson, Daniel Segal, Eitan Richardson 等