All Stories

  1. Skynet-V1: Towards Early Warning of Video Abnormal Events via A Spatial-temporal Causal-enhanced MoE Framework
  2. OmniDoctor: Towards LLM-centric Lifelong Learning for New Emerging Medical VQA Tasks
  3. Omni-SILA: Towards <u>Omni</u>-scene Driven Visual <u>S</u>entiment <u>I</u>dentifying, <u>L</u>ocating and <u>A</u>ttributing in Videos
  4. Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM
  5. Towards Emotion-enriched Text-to-Motion Generation via LLM-guided Limb-level Emotion Manipulating
  6. Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanced Video Large Language Model