All Stories

  1. StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA
  2. Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval
  3. LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval