All Stories

  1. Time-series Forecasting for Network Utilization in Large-Scale Scientific Workflows
  2. BBRv3 Startup Behavior: Analysis and Fairness Enhancements
  3. Predicting Dataset Popularity for Improved Distributed Content Caching in Scientific Workflows
  4. Validating TCP Behavior in DISTRI: A Comparison of Simulated and Real-World Network Performance for Distributed Computing
  5. ScaleQsim: Highly Scalable Quantum Circuit Simulation Framework for Exascale HPC Systems
  6. Achieving Deterministic and Reliable Large-Scale Data Transfers in a Scientific Network
  7. Regen: An object layout regenerator on large-scale production HPC systems
  8. Toward Performance Prediction in Large-Scale Systems through Temporal System and Application Log Analysis
  9. Swiftn: Accelerating Quantum Circuit Simulation Through Tensor Optimization
  10. Improving Slow Transfer Predictions: Generative Methods Compared
  11. Conditional Recurrent Neural Networks for Enhancing Throughput Prediction and Slow File Transfers Detection in Large Science Workflows
  12. Comparing Cache Utilization Trends for Regional Data Caches
  13. Exploring Data Caching Policy with Data Access Patterns from dCache System
  14. TensorSearch: Parallel Similarity Search on Tensors
  15. A Study of a Deterministic Networking Framework for Latency Critical Large Scientific Data Transfers
  16. Imb-FinDiff: Conditional Diffusion Models for Class Imbalance Synthesis of Financial Tabular Data
  17. A2FL: Autonomous and Adaptive File Layout in HPC through Real-time Access Pattern Analysis
  18. Detecting Anomalies in Time Series Using Kernel Density Approaches
  19. Experiences in deploying in-network data caches
  20. Predicting Resource Utilization Trends with Southern California Petabyte Scale Cache
  21. Understanding Data Access Patterns for dCache System
  22. Automatic Data Transformation Using Large Language Model - An Experimental Study on Building Energy Data
  23. Counterfactual Analysis: A Case Study on Impact of External Events on Building Energy Consumption
  24. Saving network usage across the seas, saving network traffic volume by 97%
  25. Gender Gaps in Mode Usage, Vehicle Ownership, and Spatial Mobility When Entering Parenthood: A Life Course Perspective
  26. Leveraging History to Predict Infrequent Abnormal Transfers in Distributed Workflows
  27. Design and implementation of I/O performance prediction scheme on HPC systems through large-scale log analysis
  28. Effectiveness and predictability of in-network storage cache for Scientific Workflows
  29. Locating Partial Discharges in Power Transformers with Convolutional Iterative Filtering
  30. Design and Implementation of Burst Buffer Over-Subscription Scheme for HPC Storage Systems
  31. Design and implementation of dynamic I/O control scheme for large scale distributed file systems
  32. What Makes You Hold on to That Old Car? Joint Insights From Machine Learning and Multinomial Logit on Vehicle-Level Transaction Decisions
  33. Predicting Slow Network Transfers in Scientific Computing
  34. Studying Scientific Data Lifecycle in On-demand Distributed Storage Caches
  35. Access Trends of In-network Cache for Scientific Data
  36. SNTA'22
  37. LBNL Superfacility Project Report
  38. Enhancing IoT anomaly detection performance for federated learning
  39. Adaptive Optimization for Sparse Data on Heterogeneous GPUs
  40. Using Multi-Resolution Data to Accelerate Neural Network Training in Scientific Applications
  41. Performance of the Gold Standard and Machine Learning in Predicting Vehicle Transactions
  42. An In-Depth I/O Pattern Analysis in HPC Systems
  43. Asynchronous I/O Strategy for Large-Scale Deep Learning Applications
  44. Automated Feature Selection for Anomaly Detection in Network Traffic Data
  45. Adaptive Stochastic Gradient Descent for Deep Learning on Heterogeneous CPU+GPU Architectures
  46. Effective Missing Value Imputation Methods for Building Monitoring Data
  47. Botnet Detection Using Recurrent Variational Autoencoder
  48. Enhancing IoT Anomaly Detection Performance for Federated Learning
  49. Cross-facility science with the Superfacility Project at LBNL
  50. Software-Defined Network for End-to-end Networked Science at the Exascale
  51. Towards HPC I/O Performance Prediction through Large-scale Log Analysis
  52. Access Patterns to Disk Cache for Large Scientific Archive
  53. GPU-based Classification for Wireless Intrusion Detection
  54. understanding scientific data access patterns with data caching within the network
  55. Feature Selection Improves Tree-based Classification for Wireless Intrusion Detection
  56. Evaluation of Deep Learning Models for Network Performance Prediction for Scientific Facilities
  57. Transfer Learning Approach for Botnet Detection Based on Recurrent Variational Autoencoder
  58. HPC Workload Characterization Using Feature Selection and Clustering
  59. BBOS: Efficient HPC Storage Management via Burst Buffer Over-Subscription
  60. Predicting Resource Requirement in Intermediate Palomar Transient Factory Workflow
  61. Clustering Life Course to Understand the Heterogeneous Effects of Life Events, Gender, and Generation on Habitual Travel Modes
  62. A Reinforcement Learning Based Network Scheduler for Deadline-Driven Data Transfers
  63. Federated Wireless Network Intrusion Detection
  64. Machine Learning for Prediction of Mid to Long Term Habitual Transportation Mode Use
  65. Spatiotemporal Real-Time Anomaly Detection for Supercomputing Systems
  66. Evaluating the Effects of Missing Values and Mixed Data Types on Social Sequence Clustering Using t-SNE Visualization
  67. Co-optimizing Latency and Energy for IoT services using HMP servers in Fog Clusters
  68. DCA-IO: A Dynamic I/O Control Scheme for Parallel and Distributed File Systems
  69. A New Approach to Multivariate Network Traffic Analysis
  70. Multidimensional Compression with Pattern Matching
  71. Automatic Detection of Network Traffic Anomalies and Changes
  72. Performance Prediction for Data Transfers in LCLS Workflow
  73. Similarity-based Compression with Multidimensional Pattern Matching
  74. Understanding Parallel I/O Performance Trends Under Various HPC Configurations
  75. Detecting Anomalies in the LCLS Workflow
  76. Dynamic Online Performance Optimization in Streaming Data Compression
  77. Predicting Network Traffic Using TCP Anomalies
  78. Consensus Ensemble System for Traffic Flow Prediction
  79. SDN for End-to-End Networked Science at the Exascale (SENSE)
  80. Auto-Tuned Publisher in a Pub/Sub System: Design and Performance Evaluation
  81. Modeling Data Transfers: Change Point and Anomaly Detection
  82. Spatio-Temporal Analysis of HPC I/O and Connection Data
  83. Identifying Anomalous File Transfer Events in LCLS Workflow
  84. Towards Autonomic Science Infrastructure
  85. Multivariate network traffic analysis using clustered patterns
  86. Foreword
  87. Predicting baseline for analysis of electricity pricing
  88. Feature Engineering and Classification Models for Partial Discharge Events in Power Transformers
  89. Accurate signal timing from high frequency streaming data
  90. Data quality challenges with missing values and mixed types in joint sequence analysis
  91. Convolutional Filtering for Accurate Signal Timing from Noisy Streaming Data
  92. Statistical data reduction for streaming data
  93. A New Approach to Online, Multivariate Network Traffic Analysis
  94. Improving Statistical Similarity Based Data Reduction for Non-Stationary Data
  95. Parallel Variable Selection for Effective Performance Prediction
  96. Expanding Statistical Similarity Based Data Reduction to Capture Diverse Patterns
  97. A lightweight network anomaly detection technique
  98. An approach to online network monitoring using clustered patterns
  99. Towards Real-Time Detection and Tracking of Spatio-Temporal Features: Blob-Filaments in Fusion Plasma
  100. Novel Data Reduction Based on Statistical Similarity
  101. Machine learning based job status prediction in scientific clusters
  102. Time-Series Forecast Modeling on High-Bandwidth Network Measurements
  103. Visualization and Analysis for Near-Real-Time Decision Making in Distributed Workflows
  104. Performance Analysis Tool for HPC and Big Data Applications on Scientific Clusters
  105. Named Data Networking in Climate Research and HEP Applications
  106. PATHA: Performance Analysis Tool for HPC Applications
  107. Extracting Baseline Electricity Usage Using Gradient Tree Boosting
  108. Best Predictive Generalized Linear Mixed Model with Predictive Lasso for High-Speed Network Data Analysis
  109. Network bandwidth utilization forecast model on high bandwidth networks
  110. Statistical Overfitting and Backtest Performance
  111. Adaptation and Policy-Based Resource Allocation for Efficient Bulk Data Transfers in High Performance Computing Environments
  112. Efficient Data Staging Using Performance-Based Adaptation and Policy-Based Resource Allocation
  113. Statistical Overfitting and Backtest Performance
  114. Estimating and Forecasting Network Traffic Performance Based on Statistical Patterns Observed in SNMP Data
  115. What SNMP Data Can Tell Us about Edge-to-Edge Network Performance
  116. Adaptive Data Transfers that Utilize Policies for Resource Sharing
  117. Efficient Attribute-Based Data Access in Astronomy Analysis
  118. Experiences with 100Gbps network applications
  119. Adoption of a SAML-XACML Profile for Authorization Interoperability across Grid Middleware in OSG and EGEE
  120. StorNet: Integrated Dynamic Storage and Network Resource Provisioning and Management for Automated Data Transfers
  121. StorNet: Co-scheduling of end-to-end bandwidth reservation on storage and network systems for high-performance data transfers
  122. Efficient Bulk Data Replication for the EarthSystem Grid
  123. A Flexible Reservation Algorithm for Advance Network Provisioning
  124. Finding Tropical Cyclones on a Cloud Computing Cluster: Using Parallel Virtualization for Large-Scale Climate Simulation Analysis
  125. EFFIS: An End-to-end Framework for Fusion Integrated Simulation
  126. Lessons learned from moving earth system grid data sets over a 20 Gbps wide-area network
  127. Adaptive Transfer Adjustment in Efficient Bulk Data Transfer Management for Climate Datasets
  128. Scientific Data Management
  129. Dynamic Storage Management
  130. Hadoop distributed file system for the Grid
  131. Practical Grid Storage Interoperation
  132. FastBit: interactively searching massive data
  133. Interoperation of world-wide production e-Science infrastructures
  134. The Earth System Grid: Enabling Access to Multimodel Climate Simulation Data
  135. Efficient Operational Profiling of Systems Using Suffix Arrays on Execution Logs
  136. Grid data access on widely distributed worker nodes using scalla and SRM
  137. Storage resource manager version 2.2: design, implementation, and testing experience
  138. Data management and analysis for the Earth System Grid
  139. Toward a first-principles integrated simulation of tokamak edge plasmas
  140. Storage Resource Managers: Recent International Experience on Requirements and Multiple Co-Operating Implementations
  141. Building a global federation system for climate change research: the earth system grid center for enabling technologies (ESG-CET)
  142. Enabling worldwide access to climate simulation data: the earth system grid (ESG)
  143. The Earth System Grid: Supporting the Next Generation of Climate Modeling Research
  144. High-performance remote access to climate simulation data: a challenge problem for data grid technologies
  145. An ontology for scientific information in a Grid environment: the earth system Grid
  146. Grid collector: an event catalog with automated file management
  147. High-performance remote access to climate simulation data
  148. New capabilities in the HENP Grand Challenge Storage Access System and its application at RHIC
  149. Experience with using CORBA to implement a file caching coordination system
  150. Storage access coordination using CORBA
  151. Invariant representation and hierarchical network for inspection of nuts from X-ray images
  152. Coordinating simultaneous caching of file bundles from tertiary storage
  153. Multidimensional indexing and query coordination for tertiary storage management