All Stories

  1. Conversational Agents: From RAG to LTM
  2. Gestura: A LVLM-Powered System Bridging Motion and Semantics for Real-Time Free-Form Gesture Understanding
  3. SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding
  4. Deep Hashing with Semantic Hash Centers for Image Retrieval
  5. AV-NAS: Audio-Visual Multi-Level Semantic Neural Architecture Search for Video Hashing
  6. BAKER: Bayesian Kernel Uncertainty in Domain-Specific Document Modelling
  7. AVHash: Joint Audio-Visual Hashing for Video Retrieval
  8. Empowering Smart Glasses with Large Language Models: Towards Ubiquitous AGI
  9. Unleashing the Power of Large Language Models for Legal Applications
  10. A Theoretical Analysis of Out-of-Distribution Detection in Multi-Label Classification
  11. Uncertainty Quantification for Text Classification
  12. Context-Aware Classification of Legal Document Pages
  13. Long-Tail Hashing
  14. Reinforcement Learning for Information Retrieval
  15. Factorized Q-Learning for Large-Scale Multi-Agent Systems
  16. Bootstrap Domain-Specific Sentiment Classifiers from Unlabeled Corpora
  17. Probabilistic Verb Selection for Data-to-Text Generation
  18. IRGAN
  19. A Probabilistic Multi-Touch Attribution Model for Online Advertising
  20. Bayesian Performance Comparison of Text Classifiers
  21. A Semantic Graph based Topic Model for Question Retrieval in Community Question Answering
  22. Estimating the Uncertainty of Average F1 Scores