All Stories

  1. Few-Shot Multimodal Explanation for Visual Question Answering
  2. Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval
  3. LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval
  4. Open-World Social Event Classification
  5. MMT: Image-guided Story Ending Generation with Multimodal Memory Transformer