Abe, Makito · more Makito Abe (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Ahmad, Zafar · more Zafar Ahmad (Stony Brook University) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
Ali, Ghanzanfar · more Ghanzanfar Ali (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Antoniu, Gabriel · more Gabriel Antoniu (Inria) | E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments · view |
Arslan, Engin · more Engin Arslan (University of Nevada, Reno) | Streaming File Transfer Optimization for Distributed Science Workflows · view |
Ayguadé, Eduard · more Eduard Ayguadé (Barcelona Supercomputing Center, Universitat Politècnica de Catalunya) | Evaluating Worksharing Tasks on Distributed Environments · view |
Bai, Yang · more Yang Bai (National University of Defense Technology) | SSP: Speeding up Small Flows for Proactive Transport in Datacenters. · view |
Bateman, Keith · more Keith Bateman (Illinois Institute of Technology Chicago) | HCL: Distributing Parallel Data Structures in Extreme Scales · view |
Beltran, Vicenç · more Vicenç Beltran (Barcelona Supercomputing Center) | Evaluating Worksharing Tasks on Distributed Environments · view Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications · view |
Benoit, Anne · more Anne Benoit (ENS Lyon) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Bhatele, Abhinav · more Abhinav Bhatele (University of Maryland) | Predicting MPI Collective Communication Performance Using Machine Learning · view |
Binyahib, Roba · more Roba Binyahib (University of Oregon) | Parallel Particle Advection Bake-Off For Scientific Visualization Workloads · view |
Bosilca, George · more George Bosilca (University of Tennessee) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view Flexible Data Redistribution in a Task-Based Runtime System · view Predicting MPI Collective Communication Performance Using Machine Learning · view |
Bouteiller, Aurelien · more Aurelien Bouteiller (University of Tennessee) | Flexible Data Redistribution in a Task-Based Runtime System · view |
Brinkmann, André · more André Brinkmann (Johannes Gutenberg University Mainz) | DelveFS - An event-driven semantic file system for object stores · view |
Bull, Mark · more Mark Bull (University of Edinburgh) | Evaluating Worksharing Tasks on Distributed Environments · view |
Cao, Qinglei · more Qinglei Cao (University of Tennessee) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view Flexible Data Redistribution in a Task-Based Runtime System · view |
Cappello, Franck · more Franck Cappello (Argonne National Laboratory) | DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training · view Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Casanova, Henri · more Henri Casanova (University of Hawai'i at Manoa) | Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers · view |
Chang, Chan-Jung · more Chan-Jung Chang (National Tsing Hua University) | ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO · view |
Chen, Jin-Kun · more Jin-Kun Chen (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |
Chen, Wei · more Wei Chen (Nvidia Corporation) | Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning · view |
Chen, Yong · more Yong Chen (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Chen, Zizhong · more Zizhong Chen (UC, Riverside) | Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Cheng, Dazhao · more Dazhao Cheng (UNC Charlotte) | Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning · view |
Chien, Steven W. D. · more Steven W. D. Chien (KTH Royal Institute of Technology) | tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads · view |
Childs, Hank · more Hank Childs (University of Oregon) | Parallel Particle Advection Bake-Off For Scientific Visualization Workloads · view |
Chou, Jerry · more Jerry Chou (National Tsing Hua University) | ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO · view |
Chou, Yu-Ching · more Yu-Ching Chou (H3 Platform Inc.) | ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO · view |
Choudhary, Alok N. · more Satoshi Matsuoka (RIKEN Center for Computational Science (R-CCS)) Satoshi Matsuoka from April 2018 has become the director of Riken CCS, the top-tier HPC center that represents HPC in Japan, developing and hosting Japan’s tier-one ‘Fugaku’ supercomputer which has become the fastest supercomputer in the world in all four major supercomputer rankings, along with multitudes of ongoing cutting edge HPC research being conducted, including investigating Post-Moore era computing. He was the leader of the TSUBAME series of supercomputers, at Tokyo Institute of Technology, where he still holds a Professor position, to continue his research activities in HPC as well as scalable Big Data and AI. His commendations include the ACM Gordon Bell Prize in 2011 and the IEEE Sidney Fernbach Award in 2014, both being one of the highest awards in the field of HPC, as well as being the Program Chair for ACM/IEEE Supercomputing 2013 (SC13). | AI for Science (Alok N. Choudhary) · view The Price Performance of Performance Models (Felix Wolf) · view Fugaku: the First `Exascale' Supercomputer (Satoshi Matsuoka) · view |
Chowdhury, Rezaul · more Rezaul Chowdhury (Stony Brook University) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
Chu, Ching-Hsiang · more Ching-Hsiang Chu (The Ohio State University) | Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters · view |
Chung, I-Hsin · more I-Hsin Chung (IBM T. J. Watson) | ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO · view |
Cook, Brandon · more Brandon Cook (Lawrence Berkeley National Laboratory) | Quantifying the impact of network congestion on application performance and network metrics · view |
Coskun, Ayse K. · more Ayse K. Coskun (Boston University) | Quantifying the impact of network congestion on application performance and network metrics · view |
Costan, Alexandru · more Alexandru Costan (IRISA, INSA Rennes) | E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments · view |
Da Silva, Rafael Ferreira · more Rafael Ferreira Da Silva (USC Information Sciences Institute) | Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers · view |
Dang, Tommy · more Tommy Dang (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Davis, Philip E. · more Philip E. Davis (Rutgers University) | A Staging Based Task Execution Framework for Data-driven Scientific Workflows · view |
Deelman, Ewa · more Ewa Deelman (USC Information Sciences Institute) | Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers · view |
Devarajan, Hariharan · more Hariharan Devarajan (Illinois Institute of Technology Chicago) | HCL: Distributing Parallel Data Structures in Extreme Scales · view |
Di, Sheng · more Sheng Di (Argonne National Laboratory) | Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Dong, Dezun · more Dezun Dong (National University of Defense Technology) | SSP: Speeding up Small Flows for Proactive Transport in Datacenters. · view |
Dongarra, Jack · more Jack Dongarra (University of Tennessee) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view Flexible Data Redistribution in a Task-Based Runtime System · view |
Dorier, Matthieu · more Matthieu Dorier (Argonne National Laboratory) | DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training · view A Staging Based Task Execution Framework for Data-driven Scientific Workflows · view |
Enes, Jonatan · more Jonatan Enes (Universidade da Coruña (UDC), CITIC) | Power Budgeting of Big Data Applications in Container-based Clusters · view |
Expósito, Roberto Rey · more Roberto Rey Expósito (Universidade da Coruña (UDC), CITIC) | Power Budgeting of Big Data Applications in Container-based Clusters · view |
Fieni, Guillaume · more Guillaume Fieni (University of Lille, INRIA) | Power Budgeting of Big Data Applications in Container-based Clusters · view |
Fujisawa, Katsuki · more Katsuki Fujisawa (Institute of Mathematics for Industry, Kyushu University) | Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500 · view |
Fujita, Norihisa · more Norihisa Fujita (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Gong, Lei · more Lei Gong (University of Science and Technology of China) | OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm · view |
Groves, Taylor · more Taylor Groves (Lawrence Berkeley National Laboratory) | Quantifying the impact of network congestion on application performance and network metrics · view |
Hanawa, Toshihiro · more Toshihiro Hanawa (The University of Tokyo, JCAHPC) | Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption · view |
Harrison, Robert · more Robert Harrison (Stony Brook University) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
Hass, Jon · more Jon Hass (Dell EMC Inc.) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Hatta, Kazuma · more Kazuma Hatta (IMAGICADIGITALSCAPE Co., Ltd.) | ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display · view |
Hegeman, Tim · more Tim Hegeman (Vrije Universiteit Amsterdam) | Grade10: A Framework for Performance Characterization of Distributed Graph Processing · view |
Huan, Shan · more Shan Huan (National University of Defense Technology) | SSP: Speeding up Small Flows for Proactive Transport in Datacenters. · view |
Hunold, Sascha · more Sascha Hunold (TU Wien) | Efficient Process-to-Node Mapping Algorithms for Stencil Computations · view Predicting MPI Collective Communication Performance Using Machine Learning · view Decomposing MPI Collectives for Exploiting Multi-lane Communication · view |
Huthmann, Jens · more Jens Huthmann (RIKEN R-CCS) | Extending High-Level Synthesis with High-Performance Computing Performance Visualization · view |
Imamura, Toshiyuki · more Toshiyuki Imamura (RIKEN R-CCS) | Prompt report on Exa-scale HPL-AI benchmark · view An FPGA-based Sound Field Rendering System · view |
Ina, Takuya · more Takuya Ina (RIKEN R-CCS) | Prompt report on Exa-scale HPL-AI benchmark · view |
Iosup, Alexandru · more Alexandru Iosup (Vrije Universiteit Amsterdam) | Grade10: A Framework for Performance Characterization of Distributed Graph Processing · view |
Jansson, Niclas · more Niclas Jansson (KTH Royal Institute of Technology) | A Hybrid MPI+PGAS Approach to Improve Strong Scalability Limits of Finite Element Solvers · view |
Javanmard, Mohammad Mahdi · more Mohammad Mahdi Javanmard (Stony Brook University) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
K G, Renga Bashyam · more Renga Bashyam K G (Indian Institute of Science) | Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data · view |
Kang, Ji-Hoon · more Ji-Hoon Kang (Korea Institute of Science and Technology Information) | An HPC-based Prediction on the Practicality of Long-distance Quantum Key Distributions · view |
Kawanabe, Tomohiro · more Tomohiro Kawanabe (RIKEN R-CCS) | ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display · view |
Kenny, Joseph · more Joseph Kenny (Sandia National Labs) | Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks · view |
Kim, Sejin · more Sejin Kim (Sookmyung Women's University) | Co-scheML: Interference-aware Container Co-scheduling Scheme using Machine Learning Application Profiles for GPU Clusters · view |
Kim, Yoonhee · more Yoonhee Kim (Sookmyung Women's University) | Co-scheML: Interference-aware Container Co-scheduling Scheme using Machine Learning Application Profiles for GPU Clusters · view |
Knees, Peter · more Peter Knees (TU Wien) | Predicting MPI Collective Communication Performance Using Machine Learning · view |
Kobayashi, Ryohei · more Ryohei Kobayashi (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Koch, Andreas · more Andreas Koch (TU Darmstadt) | Extending High-Level Synthesis with High-Performance Computing Performance Visualization · view |
Kodama, Yuetsu · more Yuetsu Kodama (RIKEN R-CCS) | Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500 · view |
Kougkas, Anthony · more Anthony Kougkas (Illinois Institute of Technology Chicago) | HCL: Distributing Parallel Data Structures in Extreme Scales · view |
Kremer-Herman, Nathaniel · more Nathaniel Kremer-Herman (University of Notre Dame) | Autoscaling High-Throughput Workloads on Container Orchestrators · view |
Kudo, Shuhei · more Shuhei Kudo (RIKEN R-CCS) | Prompt report on Exa-scale HPL-AI benchmark · view |
Kwon, Minseok · more Minseok Kwon (Rochester Institute of Technology) | CuVPP: Filter-based Longest Prefix Matching in Software Data Planes · view |
Le Fèvre, Valentin · more Valentin Le Fèvre (ENS Lyon) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Lehr, Markus · more Markus Lehr (TU wien/Faculty of Informatics) | Efficient Process-to-Node Mapping Algorithms for Stencil Computations · view |
Li, Dong · more Dong Li (University of California, Merced) | Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures · view |
Li, Jie · more Jie Li (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Li, Sihuan · more Sihuan Li (UC, Riverside) | Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Li, Yun · more Yun Li (Nanjing University of Posts and Telecommunications) | Estimating Power Consumption of Containers and Virtual Machines in Data Centers · view |
Liang, Xin · more Xin Liang (Oak Ridge National Laboratory) | Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Liao, Qing · more Qing Liao (HITSZ) | Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio · view |
Liao, Xiangke · more Xiangke Liao (National University of Defense Technology) | SSP: Speeding up Small Flows for Proactive Transport in Datacenters. · view |
Lin, James · more James Lin (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |
Liu, Zheng · more Zheng Liu (Nanjing University of Posts and Telecommunications) | Estimating Power Consumption of Containers and Virtual Machines in Data Centers · view |
Lou, Wenqi · more Wenqi Lou (University of Science and Technology of China) | OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm · view |
Lu, Gangzhao · more Gangzhao Lu (Harbin Institute of Technology) | Optimizing GPU Memory Transactions for Convolution Operations · view |
Luo, Xi · more Xi Luo (University of Tennessee, Knoxville) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view |
Magoutis, Kostas · more Kostas Magoutis (ICS - FORTH, University of Crete) | The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems · view |
Markidis, Stefano · more Stefano Markidis (KTH Royal Institute of Technology) | tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads · view |
Maroñas, Marcos · more Marcos Maroñas (Barcelona Supercomputing Center) | Evaluating Worksharing Tasks on Distributed Environments · view |
Marshall, John · more John Marshall (Cisco Systems, Inc.) | CuVPP: Filter-based Longest Prefix Matching in Software Data Planes · view |
Nakao, Masahiro · more Masahiro Nakao (RIKEN R-CCS) | Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500 · view |
Neupane, Krishna Prasad · more Krishna Prasad Neupane (Rochester Institute of Technology) | CuVPP: Filter-based Longest Prefix Matching in Software Data Planes · view |
Nguyen, Ngan · more Ngan Nguyen (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Nicolae, Bogdan · more Bogdan Nicolae (Argonne National Laboratory) | DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training · view |
Nitadori, Keigo · more Keigo Nitadori (RIKEN R-CCS) | Prompt report on Exa-scale HPL-AI benchmark · view |
Nonaka, Jorji · more Jorji Nonaka (RIKEN R-CCS) | Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption · view |
Ohmura, Itta · more Itta Ohmura (RIKEN BDR) | Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations · view |
Ono, Kenji · more Kenji Ono (Kyushu University) | ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display · view |
Panda, Dhabaleswar K. · more Dhabaleswar K. Panda (The Ohio State University) | Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters · view |
Papaioannou, Antonis · more Antonis Papaioannou (ICS - FORTH, University of Crete) | The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems · view |
Parashar, Manish · more Manish Parashar (Rutgers University) | A Staging Based Task Execution Framework for Data-driven Scientific Workflows · view |
Patinyasakdikul, Thananon · more Thananon Patinyasakdikul (Cray) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view |
Pei, Yu · more Yu Pei (University of Tennessee, Knoxville) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view |
Peng, Ivy B. · more Ivy B. Peng (Lawrence Livermore National Laboratory) | tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads · view |
Perotin, Lucas · more Lucas Perotin (ENS Lyon) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Podobas, Artur · more Artur Podobas (Royal Institute of Technology) | tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads · view |
Posner, Jonas · more Jonas Posner (University of Kassel, Germany) | System-Level vs. Application-Level Checkpointing · view |
Pottier, Loic · more Loic Pottier (USC Information Sciences Institute) | Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers · view |
Pouchet, Louis-Noël · more Louis-Noël Pouchet (Colorado State University) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
Pugmire, David · more David Pugmire (Oak Ridge National Laboratory) | Parallel Particle Advection Bake-Off For Scientific Visualization Workloads · view |
Rafique, M. Mustafa · more M. Mustafa Rafique (Rochester Institute of Technology) | CuVPP: Filter-based Longest Prefix Matching in Software Data Planes · view |
Raghavan, Padma · more Padma Raghavan (Vanderbilt University) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Rang, Wei · more Wei Rang (UNC Charlotte) | Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning · view |
Ren, Jie · more Jie Ren (University of California, Merced) | Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures · view |
Rico, Alejandro · more Alejandro Rico (Arm Research) | Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications · view |
Robert, Yves · more Yves Robert (ENS Lyon, University of Tennessee Knoxville) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Rosendo, Daniel · more Daniel Rosendo (Inria) | E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments · view |
Rouvoy, Romain · more Romain Rouvoy (University of Lille, INRIA) | Power Budgeting of Big Data Applications in Container-based Clusters · view |
Ryu, Hoon · more Hoon Ryu (Korea Institute of Science and Technology Information) | An HPC-based Prediction on the Practicality of Long-distance Quantum Key Distributions · view |
Sala, Kevin · more Kevin Sala (Barcelona Supercomputing Center (BSC)) | Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications · view |
Salkhordeh, Reza · more Reza Salkhordeh (Johannes Gutenberg University Mainz) | DelveFS - An event-driven semantic file system for object stores · view |
Sano, Kentaro · more Kentaro Sano (RIKEN R-CCS) | Profiling and Visualizing Performance of FPGAs in High-Performance Computing Environments · view |
Sato, Mitsuhisa · more Mitsuhisa Sato (RIKEN R-CCS) | Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500 · view |
Schulz, Christian · more Christian Schulz (University of Vienna/Faculty of Computer Science) | Efficient Process-to-Node Mapping Algorithms for Stencil Computations · view |
Shaffer, Tim · more Tim Shaffer (University of Notre Dame) | Autoscaling High-Throughput Workloads on Container Orchestrators · view |
Shafie Khorassani, Kawthar · more Kawthar Shafie Khorassani (The Ohio State University) | Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters · view |
Shen, Ziyu · more Ziyu Shen (Nanjing University of Posts and Telecommunications) | Estimating Power Consumption of Containers and Virtual Machines in Data Centers · view |
Shoji, Fumiyoshi · more Fumiyoshi Shoji (RIKEN R-CCS) | Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption · view |
Sill, Alan · more Alan Sill (Texas Tech University) | MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems · view |
Silva, Pedro · more Pedro Silva (Hasso-Plattner Institut) | E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments · view |
Simonin, Matthieu · more Matthieu Simonin (Inria) | E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments · view |
Smigielski, Jean-François · more Jean-François Smigielski (OpenIO) | DelveFS - An event-driven semantic file system for object stores · view |
Sommer, Lukas · more Lukas Sommer (TU-Darmstadt) | Profiling and Visualizing Performance of FPGAs in High-Performance Computing Environments · view |
Steiner, Rebecca · more Rebecca Steiner (Johannes Gutenberg University Mainz) | DelveFS - An event-driven semantic file system for object stores · view |
Steinkamp, Jörg · more Jörg Steinkamp (Johannes Gutenberg University Mainz) | DelveFS - An event-driven semantic file system for object stores · view |
Su, Xiao-Ming · more Xiao-Ming Su (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |
Subedi, Pradeep · more Pradeep Subedi (Rutgers University) | A Staging Based Task Execution Framework for Data-driven Scientific Workflows · view |
Subramoni, Hari · more Hari Subramoni (The Ohio State University) | Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters · view |
Sun, Hongyang · more Hongyang Sun (Vanderbilt University) | Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms · view |
Sun, Xian-He · more Xian-He Sun (Illinois Institute of Technology Chicago) | HCL: Distributing Parallel Data Structures in Extreme Scales · view |
Suo, Kun · more Kun Suo (Kennesaw State University) | Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning · view |
Taiji, Makoto · more Makoto Taiji (RIKEN BDR) | Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations · view |
Tan, Haoliang · more Haoliang Tan (HITSZ) | Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio · view |
TAN, YIYU · more YIYU TAN (RIKEN Center for Computational Science) | An FPGA-based Sound Field Rendering System · view |
Teruel, Xavier · more Xavier Teruel (Barcelona Supercomputing Center) | Evaluating Worksharing Tasks on Distributed Environments · view |
Thain, Douglas · more Douglas Thain (University of Notre Dame) | Autoscaling High-Throughput Workloads on Container Orchestrators · view |
Touriño, Juan · more Juan Touriño (Universidade da Coruña (UDC), CITIC) | Power Budgeting of Big Data Applications in Container-based Clusters · view |
Träff, Jesper Larsson · more Jesper Larsson Träff (TU Wien) | Efficient Process-to-Node Mapping Algorithms for Stencil Computations · view Decomposing MPI Collectives for Exploiting Multi-lane Communication · view |
Trivedi, Animesh · more Animesh Trivedi (Vrije Universiteit Amsterdam) | Grade10: A Framework for Performance Characterization of Distributed Graph Processing · view |
Ucar, Davut · more Davut Ucar (University of Nevada, Reno) | Streaming File Transfer Optimization for Distributed Science Workflows · view |
Ueno, Koji · more Koji Ueno (Fixstars Corporation) | Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500 · view |
Umemura, Masayuki · more Masayuki Umemura (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Vadhiyar, Sathish · more Sathish Vadhiyar (Indian Institute of Science) | Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data · view |
Vef, Marc-André · more Marc-André Vef (Johannes Gutenberg University Mainz) | DelveFS - An event-driven semantic file system for object stores · view |
Vennetier, Florent · more Florent Vennetier (OpenIO) | DelveFS - An event-driven semantic file system for object stores · view |
von Kirchbach, Konrad · more Konrad von Kirchbach (TU Wien/Faculty of Informatics) | Efficient Process-to-Node Mapping Algorithms for Stencil Computations · view |
Wang, Chao · more Chao Wang (University of Science and Technology of China) | OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm · view |
Wang, Jie · more Jie Wang (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |
Wang, Yi-Chao · more Yi-Chao Wang (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |
Wang, Zhe · more Zhe Wang (Rutgers University) | A Staging Based Task Execution Framework for Data-driven Scientific Workflows · view |
Wang, Zheng · more Zheng Wang (University of Leeds) | Optimizing GPU Memory Transactions for Convolution Operations · view |
Wilke, Jeremiah · more Jeremiah Wilke (Sandia National Labs) | Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks · view |
Wozniak, Justin · more Justin Wozniak (Argonne National Laboratory) | DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training · view |
Wright, Nicholas · more Nicholas Wright (Lawrence Berkeley National Laboratory) | Quantifying the impact of network congestion on application performance and network metrics · view |
Wu, Kai · more Kai Wu (University of California, Merced) | Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures · view |
Wu, Wei · more Wei Wu (Los Alamos National Laboratory) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view Flexible Data Redistribution in a Task-Based Runtime System · view |
Xia, Bin · more Bin Xia (Nanjing University of Posts and Telecommunications) | Estimating Power Consumption of Containers and Virtual Machines in Data Centers · view |
Xia, Wen · more Wen Xia (HITSZ) | Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio · view |
Yamaguchi, Yoshiki · more Yoshiki Yamaguchi (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Yang, Donglin · more Donglin Yang (UNC Charlotte) | Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning · view |
Yenpure, Abhishek · more Abhishek Yenpure (University of Oregon) | Parallel Particle Advection Bake-Off For Scientific Visualization Workloads · view |
Yoshikawa, Kohji · more Kohji Yoshikawa (University of Tsukuba) | Toward OpenACC-enabled GPU-FPGA Accelerated Computing · view |
Zeginis, Chrysostomos · more Chrysostomos Zeginis (ICS - FORTH) | The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems · view |
Zhang, Hao · more Hao Zhang (RIKEN BDR) | Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations · view |
Zhang, Weizhe · more Weizhe Zhang (Harbin Institute of Technology) | Optimizing GPU Memory Transactions for Convolution Operations · view |
Zhang, Xusheng · more Xusheng Zhang (Nanjing University of Posts and Telecommunications) | Estimating Power Consumption of Containers and Virtual Machines in Data Centers · view |
Zhang, Yijia · more Yijia Zhang (Boston University) | Quantifying the impact of network congestion on application performance and network metrics · view |
Zhang, Zhiyuan · more Zhiyuan Zhang (HITSZ) | Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio · view |
Zhao, Kai · more Kai Zhao (UC, Riverside) | Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression · view |
Zheng, Chao · more Chao Zheng (University of Notre Dame) | Autoscaling High-Throughput Workloads on Container Orchestrators · view |
Zhong, Dong · more Dong Zhong (University of Tennessee) | HAN: a Hierarchical AutotuNed Collective Communication Framework · view Flexible Data Redistribution in a Task-Based Runtime System · view |
Zhou, Qinghua · more Qinghua Zhou (The Ohio State University) | Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters · view |
Zhou, Xuehai · more Xuehai Zhou (University of Science and Technology of China) | OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm · view |
Zhou, Zejia · more Zejia Zhou (National University of Defense Technology) | SSP: Speeding up Small Flows for Proactive Transport in Datacenters. · view |
Zola, Jaroslaw · more Jaroslaw Zola (University at Buffalo) | Efficient Execution of Dynamic Programming Algorithms on Apache Spark · view |
Zou, Xiangyu · more Xiangyu Zou (HITSZ) | Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio · view |
Zuo, Si-Cheng · more Si-Cheng Zuo (Shanghai Jiao Tong University) | NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI · view |