All Stories

  1. Proceedings of the 31st Annual International Conference on Mobile Computing and Networking
  2. METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation
  3. CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving
  4. OneAdapt
  5. Machine Learning at the Network Edge: A Survey
  6. Towards memory-efficient inference in edge video analytics
  7. Geo-distributed and edge data analytics
  8. Multi-resource packing for cluster schedulers
  9. Wrangler