All Stories

  1. Dual Alignment-enhanced Fashion Vision-Language Pre-training
  2. MIRAGE25: ACM MM25 Multimodal Interleaved Reasoning and Generation Challenge
  3. Semantic-enhanced Co-attention Prompt Learning for Non-overlapping Cross-Domain Recommendation
  4. Mitigating Data Redundancy to Revitalize Transformer-based Long-Term Time Series Forecasting System
  5. Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition
  6. In-Context Learning for Zero-shot Medical Report Generation
  7. Voice-Face Homogeneity Tells Deepfake
  8. Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation
  9. Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking
  10. Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning
  11. A Comprehensive Survey of Neural Architecture Search
  12. Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations
  13. Self-weighted Robust LDA for Multiclass Classification with Edge Classes
  14. Grounding Visual Concepts for Zero-Shot Event Detection and Event Captioning
  15. Pair-based Uncertainty and Diversity Promoting Early Active Learning for Person Re-identification
  16. Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment
  17. Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks
  18. Learning-Based Multimedia Analyses and Applications
  19. Few-Shot Text and Image Classification via Analogical Transfer Learning
  20. Deep Semisupervised Zero-Shot Learning with Maximum Mean Discrepancy
  21. Refined Spectral Clustering via Embedded Label Propagation
  22. Learning Multiple Diagnosis Codes for ICU Patients with Local Disease Correlation Mining
  23. Robust Top- k Multiclass SVM for Visual Category Recognition
  24. Avoiding Optimal Mean ℓ2,1-Norm Maximization-Based Robust PCA for Reconstruction
  25. Convex Sparse PCA for Unsupervised Feature Learning
  26. Searching Persuasively
  27. Incremental Multimodal Query Construction for Video Search