All Stories

  1. Detailed Modeling of Heterogeneous and Contention-Constrained Point-to-Point MPI Communication
  2. Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation
  3. A cell-based framework for modeling cardiac mechanics
  4. On Memory Traffic and Optimisations for Low-order Finite Element Assembly Algorithms on Multi-core CPUs
  5. Resource-Efficient Use of Modern Processor Architectures For Numerically Solving Cardiac Ionic Cell Models
  6. iPUG for Multiple Graphcore IPUs: Optimizing Performance and Scalability of Parallel Breadth-First Search
  7. On the impact of heterogeneity-aware mesh partitioning and non-contributing computation removal on parallel reservoir simulations
  8. Efficient Numerical Solution of the EMI Model Representing the Extracellular Space (E), Cell Membrane (M) and Intracellular Space (I) of a Collection of Cardiac Cells
  9. Operator Splitting and Finite Difference Schemes for Solving the EMI Model
  10. Cache simulation for irregular memory traffic on multi-core CPUs: Case study on performance models for sparse matrix–vector multiplication
  11. Performance Optimization and Modeling of Fine-Grained Irregular Communication in UPC
  12. Combining Algorithmic Rethinking and AVX-512 Intrinsics for Efficient Simulation of Subcellular Calcium Signaling
  13. Panda: A Compiler Framework for Concurrent CPU $$+$$ + GPU Execution of 3D Stencil Computations on GPU-accelerated Supercomputers
  14. Accelerating Detailed Tissue-Scale 3D Cardiac Simulations Using Heterogeneous CPU-Xeon Phi Computing
  15. On the performance and energy efficiency of the PGAS programming model on multicore architectures
  16. Matlab2cpp: A Matlab-to-C++ code translator
  17. Solving 3D Time-Fractional Diffusion Equations by High-Performance Parallel Computing
  18. CPU+GPU Programming of Stencil Computations for Resource-Efficient Use of GPU Clusters
  19. Scalable Heterogeneous CPU-GPU Computations for Unstructured Tetrahedral Meshes
  20. Communication-hiding programming for clusters with multi-coprocessor nodes
  21. An analytical GPU performance model for 3D stencil computations from the angle of data traffic
  22. Parallel performance modeling of irregular applications in cell-centered finite volume methods over unstructured tetrahedral meshes
  23. Multi-GPU Implementations of Parallel 3D Sweeping Algorithms with Application to Geological Folding
  24. Towards Detailed Tissue-Scale 3D Simulations of Electrical Activity and Calcium Handling in the Human Cardiac Ventricle
  25. Effective multi-GPU communication using multiple CUDA streams and threads
  26. Heterogeneous CPU-GPU computing for the finite volume method on 3D unstructured meshes
  27. Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUs
  28. Performance modeling of serial and parallel implementations of the fractional Adams-Bashforth-Moulton method
  29. Time-fractional heat equations and negative absolute temperatures
  30. Utilizing Multiple Xeon Phi Coprocessors on One Compute Node
  31. Balancing efficiency and accuracy for sediment transport simulations
  32. Towards simulation of subcellular calcium dynamics at nanometre resolution
  33. On the GPU-CPU Performance Portability of OpenCL for 3D Stencil Computations
  34. High efficient sedimentary basin simulations on hybrid CPU-GPU clusters
  35. Resource-efficient utilization of CPU/GPU-based heterogeneous supercomputers for Bayesian phylogenetic inference
  36. On the GPU Performance of 3D Stencil Computations Implemented in OpenCL
  37. On the GPU performance of cell-centered finite volume method over unstructured tetrahedral meshes
  38. Performance of Sediment Transport Simulations on NVIDIA's Kepler Architecture
  39. Simulating Cardiac Electrophysiology in the Era of GPU-Cluster Computing
  40. Using 1000+ GPUs and 10000+ CPUs for Sedimentary Basin Simulations
  41. Accelerating a 3D Finite-Difference Earthquake Simulation with a C-to-CUDA Translator
  42. A New Parallel 3D Front Propagation Algorithm for Fast Simulation of Geological folds
  43. An OpenMP-enabled parallel simulator for particle transport in fluid flows
  44. Mint
  45. Simplifying the parallelization of scientific codes by a function-centric approach in Python
  46. Computational modelling of the initiation and development of spontaneous intracellular Ca2+ waves in ventricular myocytes
  47. Numerical Analysis of a Dual-Sediment Transport Model Applied to Lake Okeechobee, Florida
  48. A study on modified Szabo's wave equation modeling of frequency-dependent dissipation in ultrasonic medical imaging
  49. Towards a computational method for imaging the extracellular potassium concentration during regional ischemia
  50. Analysis of tracer tomography using temporal moments of tracer breakthrough curves
  51. Evolution of Intracellular Ca2 + Waves from about 10,000 RyR Clusters: Towards Solving a Computationally Daunting Task
  52. Simulating frequency-dependent dissipation in the CARI technique for breast tumors using the modified Szabo's wave model
  53. A view toward the future of subsurface characterization: CAT scanning groundwater basins
  54. On the possibility for computing the transmembrane potential in the heart with a one shot method: An inverse problem
  55. A unified framework of multi-objective cost functions for partitioning unstructured finite element meshes
  56. A note on the efficiency of the conjugate gradient method for a class of time-dependent problems
  57. An order optimal solver for the discretized bidomain equations
  58. On the Computational Complexity of the Bidomain and the Monodomain Models of Electrophysiology
  59. Improving the Performance of Large-Scale Unstructured PDE Applications
  60. Message from HPSEC Workshop Co-chairs
  61. Message from IWEC Workshop Co-chairs
  62. A parallel multi-subdomain strategy for solving Boussinesq water wave equations
  63. Using the parallel algebraic recursive multilevel solver in modern physical applications
  64. Parallel solution of the bidomain equations with high resolutions
  65. Applied Parallel Computing. New Paradigms for HPC in Industry and Academia
  66. Parallel multilevel methods with adaptivity on unstructured grids
  67. A Finite Element Method for Fully Nonlinear Water Waves
  68. Message from the Chairs
  69. Parallel Computing Engines for Subsurface Imaging Technologies
  70. A numerical study of some parallel algebraic preconditioners
  71. Parallel simulation of 3D nonlinear acoustic fields on a Linux-cluster