All Stories

  1. A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives
  2. Singing Timbre Popularity Assessment Based on Multimodal Large Foundation Model
  3. Multi-Accent Mandarin Dry-Vocal Singing Dataset: Benchmark for Singing Accent Recognition
  4. SongDriver: Real-time Music Accompaniment Generation without Logical Latency nor Exposure Bias