All Stories

  1. Towards Pen-and-Paper-Style Equational Reasoning in Interactive Theorem Provers by Equality Saturation
  2. Multiple Resumptions and Local Mutable State, Directly
  3. Slotted E-Graphs: First-Class Support for (Bound) Variables in E-Graphs
  4. The MLIR Transform Dialect
  5. xDSL: Sidekick Compilation for SSA-Based Compilers
  6. Descend: A Safe GPU Systems Programming Language
  7. A shared compilation stack for distributed-memory parallelism in stencil DSLs
  8. Collection skeletons: Declarative abstractions for data collections
  9. Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming
  10. Guided Equality Saturation
  11. Shoggoth: A Formal Foundation for Strategic Rewriting
  12. Structural Subtyping as Parametric Polymorphism
  13. BaCO: A Fast and Portable Bayesian Compiler Optimization Framework
  14. Collection Skeletons: Declarative Abstractions for Data Collections
  15. Investigating magic numbers: improving the inlining heuristic in the Glasgow Haskell Compiler
  16. Systematically extending a high-level code generator with support for tensor cores
  17. Generating high performance code for irregular data structures using dependent types
  18. Efficient Auto-Tuning of Parallel Programs with Interdependent Tuning Parameters via Auto-Tuning Framework (ATF)
  19. Integrating a functional pattern-based IR into MLIR
  20. Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs
  21. DelayRepay: delayed execution for kernel fusion in Python
  22. Achieving high-performance the functional way
  23. High-level hardware feature extraction for GPU performance prediction of stencils
  24. Generating fast sparse matrix vector multiplication from a high level generic functional IR
  25. Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift
  26. High-level synthesis of functional patterns with Lift
  27. Position-dependent arrays and their application for high performance code generation
  28. Generating efficient FFT GPU code with Lift
  29. Automatic Matching of Legacy Code to Heterogeneous APIs
  30. Automatic Matching of Legacy Code to Heterogeneous APIs
  31. High performance stencil code generation with Lift
  32. Just-In-Time GPU Compilation for Interpreted Languages with Partial Evaluation
  33. Towards Composable GPU Programming
  34. Matrix multiplication beyond auto-tuning
  35. Multi-stage programming for GPUs in C++ using PACXX
  36. Performance portable GPU code generation for matrix multiplication
  37. Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code
  38. Runtime Code Generation and Data Management for Heterogeneous Computing in Java
  39. Generating performance portable code using rewrite rules: from high-level functional expressions to high-performance OpenCL code
  40. Towards High-Level Programming for Systems with Many Cores
  41. High-Level Programming of Stencil Computations on Multi-GPU Systems Using the SkelCL Library
  42. gCUP: rapid GPU-based HIV-1 co-receptor usage prediction for next-generation sequencing
  43. SkelCL: a high-level extension of OpenCL for multi-GPU systems
  44. A Composable Array Function Interface for Heterogeneous Computing in Java
  45. dOpenCL: Towards uniform programming of distributed heterogeneous multi-/many-core systems
  46. Introducing and Implementing the Allpairs Skeleton for Programming Multi-GPU Systems
  47. High-level Programming for Medical Imaging on Multi-GPU Systems Using the SkelCL Library
  48. SkelCL: Enhancing OpenCL for High-Level Programming of Multi-GPU Systems
  49. Using the SkelCL Library for High-Level GPU Programming of 2D Applications
  50. Towards High-Level Programming of Multi-GPU Systems Using the SkelCL Library
  51. dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems
  52. SkelCL - A Portable Skeleton Library for High-Level GPU Programming