All Stories

  1. Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication
  2. High-Performance and Programmable Attentional Graph Neural Networks with Global Tensor Formulations
  3. FuzzyFlow: Leveraging Dataflow To Find and Squash Program Optimization Bugs
  4. Performance Embeddings: A Similarity-Based Transfer Tuning Approach to Performance Optimization
  5. Lifting C semantics for dataflow optimization
  6. Pebbles, Graphs, and a Pinch of Combinatorics
  7. NPBench