All Stories

  1. Multiple Resumptions and Local Mutable State, Directly
  2. Slotted E-Graphs: First-Class Support for (Bound) Variables in E-Graphs
  3. The MLIR Transform Dialect
  4. xDSL: Sidekick Compilation for SSA-Based Compilers
  5. Descend: A Safe GPU Systems Programming Language
  6. A shared compilation stack for distributed-memory parallelism in stencil DSLs
  7. Collection skeletons: Declarative abstractions for data collections
  8. Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming
  9. Guided Equality Saturation
  10. Shoggoth: A Formal Foundation for Strategic Rewriting
  11. Structural Subtyping as Parametric Polymorphism
  12. BaCO: A Fast and Portable Bayesian Compiler Optimization Framework
  13. Collection Skeletons: Declarative Abstractions for Data Collections
  14. Investigating magic numbers: improving the inlining heuristic in the Glasgow Haskell Compiler
  15. Systematically extending a high-level code generator with support for tensor cores
  16. Generating high performance code for irregular data structures using dependent types
  17. Efficient Auto-Tuning of Parallel Programs with Interdependent Tuning Parameters via Auto-Tuning Framework (ATF)
  18. Integrating a functional pattern-based IR into MLIR
  19. Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs
  20. DelayRepay: delayed execution for kernel fusion in Python
  21. Achieving high-performance the functional way
  22. High-level hardware feature extraction for GPU performance prediction of stencils
  23. Generating fast sparse matrix vector multiplication from a high level generic functional IR
  24. Tiling Optimizations for Stencil Computations Using Rewrite Rules in L ift
  25. High-level synthesis of functional patterns with Lift
  26. Position-dependent arrays and their application for high performance code generation
  27. Generating efficient FFT GPU code with Lift
  28. Automatic Matching of Legacy Code to Heterogeneous APIs
  29. Automatic Matching of Legacy Code to Heterogeneous APIs
  30. High performance stencil code generation with Lift
  31. Just-In-Time GPU Compilation for Interpreted Languages with Partial Evaluation
  32. Towards Composable GPU Programming
  33. Matrix multiplication beyond auto-tuning
  34. Multi-stage programming for GPUs in C++ using PACXX
  35. Performance portable GPU code generation for matrix multiplication
  36. Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code
  37. Runtime Code Generation and Data Management for Heterogeneous Computing in Java
  38. Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code
  39. Towards High-Level Programming for Systems with Many Cores
  40. High-Level Programming of Stencil Computations on Multi-GPU Systems Using the SkelCL Library
  41. gCUP: rapid GPU-based HIV-1 co-receptor usage prediction for next-generation sequencing
  42. SkelCL: a high-level extension of OpenCL for multi-GPU systems
  43. A Composable Array Function Interface for Heterogeneous Computing in Java
  44. dOpenCL: Towards uniform programming of distributed heterogeneous multi-/many-core systems
  45. Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems
  46. High-level Programming for Medical Imaging on Multi-GPU Systems Using the SkelCL Library
  47. SkelCL: Enhancing OpenCL for High-Level Programming of Multi-GPU Systems
  48. Using the SkelCL Library for High-Level GPU Programming of 2D Applications
  49. Towards High-Level Programming of Multi-GPU Systems Using the SkelCL Library
  50. dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems
  51. SkelCL - A Portable Skeleton Library for High-Level GPU Programming