IEEE Cluster 2020 Program


All times are in the JST timezone.



A
Abe, Makito · Toward OpenACC-enabled GPU-FPGA Accelerated Computing
Ahmad, Zafar · Efficient Execution of Dynamic Programming Algorithms on Apache Spark
Ali, Ghanzanfar · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Antoniu, Gabriel · E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments
Arslan, Engin · Streaming File Transfer Optimization for Distributed Science Workflows
Ayguadé, Eduard · Evaluating Worksharing Tasks on Distributed Environments

B
Bai, Yang · SSP: Speeding up Small Flows for Proactive Transport in Datacenters.
Bateman, Keith · HCL: Distributing Parallel Data Structures in Extreme Scales
Beltran, Vicenç · Evaluating Worksharing Tasks on Distributed Environments
Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications
Benoit, Anne · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Bhatele, Abhinav · Predicting MPI Collective Communication Performance Using Machine Learning
Binyahib, Roba · Parallel Particle Advection Bake-Off For Scientific Visualization Workloads
Bosilca, George · HAN: a Hierarchical AutotuNed Collective Communication Framework
Flexible Data Redistribution in a Task-Based Runtime System
Predicting MPI Collective Communication Performance Using Machine Learning
Bouteiller, Aurelien · Flexible Data Redistribution in a Task-Based Runtime System
Brinkmann, André · DelveFS - An event-driven semantic file system for object stores
Bull, Mark · Evaluating Worksharing Tasks on Distributed Environments

C
Cao, Qinglei · HAN: a Hierarchical AutotuNed Collective Communication Framework
Flexible Data Redistribution in a Task-Based Runtime System
Cappello, Franck · DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training
Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Casanova, Henri · Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers
Chang, Chan-Jung · ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
Chen, Jin-Kun · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI
Chen, Wei · Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning
Chen, Yong · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Chen, Zizhong · Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Cheng, Dazhao · Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning
Chien, Steven W. D. · tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads
Childs, Hank · Parallel Particle Advection Bake-Off For Scientific Visualization Workloads
Chou, Jerry · ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
Chou, Yu-Ching · ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
Choudhary, Alok N. · AI for Science (Alok N. Choudhary)
The Price Performance of Performance Models (Felix Wolf)
Fugaku: the First 'Exascale' Supercomputer (Satoshi Matsuoka)
Chowdhury, Rezaul · Efficient Execution of Dynamic Programming Algorithms on Apache Spark
Chu, Ching-Hsiang · Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters
Chung, I-Hsin · ECS2: A Fast Erasure Coding Library for GPU-Accelerated Storage Systems With Parallel & Direct IO
Cook, Brandon · Quantifying the impact of network congestion on application performance and network metrics
Coskun, Ayse K. · Quantifying the impact of network congestion on application performance and network metrics
Costan, Alexandru · E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments

D
Da Silva, Rafael Ferreira · Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers
Dang, Tommy · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Davis, Philip E. · A Staging Based Task Execution Framework for Data-driven Scientific Workflows
Deelman, Ewa · Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers
Devarajan, Hariharan · HCL: Distributing Parallel Data Structures in Extreme Scales
Di, Sheng · Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Dong, Dezun · SSP: Speeding up Small Flows for Proactive Transport in Datacenters.
Dongarra, Jack · HAN: a Hierarchical AutotuNed Collective Communication Framework
Flexible Data Redistribution in a Task-Based Runtime System
Dorier, Matthieu · DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training
A Staging Based Task Execution Framework for Data-driven Scientific Workflows

E
Enes, Jonatan · Power Budgeting of Big Data Applications in Container-based Clusters
Expósito, Roberto Rey · Power Budgeting of Big Data Applications in Container-based Clusters

F
Fieni, Guillaume · Power Budgeting of Big Data Applications in Container-based Clusters
Fujisawa, Katsuki · Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500
Fujita, Norihisa · Toward OpenACC-enabled GPU-FPGA Accelerated Computing

G
Gong, Lei · OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm
Groves, Taylor · Quantifying the impact of network congestion on application performance and network metrics

H
Hanawa, Toshihiro · Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption
Harrison, Robert · Efficient Execution of Dynamic Programming Algorithms on Apache Spark
Hass, Jon · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Hatta, Kazuma · ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display
Hegeman, Tim · Grade10: A Framework for Performance Characterization of Distributed Graph Processing
Huan, Shan · SSP: Speeding up Small Flows for Proactive Transport in Datacenters.
Hunold, Sascha · Efficient Process-to-Node Mapping Algorithms for Stencil Computations
Predicting MPI Collective Communication Performance Using Machine Learning
Decomposing MPI Collectives for Exploiting Multi-lane Communication
Huthmann, Jens · Extending High-Level Synthesis with High-Performance Computing Performance Visualization

I
Imamura, Toshiyuki · Prompt report on Exa-scale HPL-AI benchmark
An FPGA-based Sound Field Rendering System
Ina, Takuya · Prompt report on Exa-scale HPL-AI benchmark
Iosup, Alexandru · Grade10: A Framework for Performance Characterization of Distributed Graph Processing

J
Jansson, Niclas · A Hybrid MPI+PGAS Approach to Improve Strong Scalability Limits of Finite Element Solvers
Javanmard, Mohammad Mahdi · Efficient Execution of Dynamic Programming Algorithms on Apache Spark

K
K G, Renga Bashyam · Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data
Kang, Ji-Hoon · An HPC-based Prediction on the Practicality of Long-distance Quantum Key Distributions
Kawanabe, Tomohiro · ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display
Kenny, Joseph · Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks
Kim, Sejin · Co-scheML: Interference-aware Container Co-scheduling Scheme using Machine Learning Application Profiles for GPU Clusters
Kim, Yoonhee · Co-scheML: Interference-aware Container Co-scheduling Scheme using Machine Learning Application Profiles for GPU Clusters
Knees, Peter · Predicting MPI Collective Communication Performance Using Machine Learning
Kobayashi, Ryohei · Toward OpenACC-enabled GPU-FPGA Accelerated Computing
Koch, Andreas · Extending High-Level Synthesis with High-Performance Computing Performance Visualization
Kodama, Yuetsu · Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500
Kougkas, Anthony · HCL: Distributing Parallel Data Structures in Extreme Scales
Kremer-Herman, Nathaniel · Autoscaling High-Throughput Workloads on Container Orchestrators
Kudo, Shuhei · Prompt report on Exa-scale HPL-AI benchmark
Kwon, Minseok · CuVPP: Filter-based Longest Prefix Matching in Software Data Planes

L
Le Fèvre, Valentin · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Lehr, Markus · Efficient Process-to-Node Mapping Algorithms for Stencil Computations
Li, Dong · Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures
Li, Jie · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Li, Sihuan · Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Li, Yun · Estimating Power Consumption of Containers and Virtual Machines in Data Centers
Liang, Xin · Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Liao, Qing · Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio
Liao, Xiangke · SSP: Speeding up Small Flows for Proactive Transport in Datacenters.
Lin, James · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI
Liu, Zheng · Estimating Power Consumption of Containers and Virtual Machines in Data Centers
Lou, Wenqi · OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm
Lu, Gangzhao · Optimizing GPU Memory Transactions for Convolution Operations
Luo, Xi · HAN: a Hierarchical AutotuNed Collective Communication Framework

M
Magoutis, Kostas · The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems
Markidis, Stefano · tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads
Maroñas, Marcos · Evaluating Worksharing Tasks on Distributed Environments
Marshall, John · CuVPP: Filter-based Longest Prefix Matching in Software Data Planes

N
Nakao, Masahiro · Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500
Neupane, Krishna Prasad · CuVPP: Filter-based Longest Prefix Matching in Software Data Planes
Nguyen, Ngan · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Nicolae, Bogdan · DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training
Nitadori, Keigo · Prompt report on Exa-scale HPL-AI benchmark
Nonaka, Jorji · Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption

O
Ohmura, Itta · Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations
Ono, Kenji · ChOWDER: A New Approach for Viewing 3D Web GIS on Ultra-High-Resolution Scalable Display

P
Panda, Dhabaleswar K. · Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters
Papaioannou, Antonis · The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems
Parashar, Manish · A Staging Based Task Execution Framework for Data-driven Scientific Workflows
Patinyasakdikul, Thananon · HAN: a Hierarchical AutotuNed Collective Communication Framework
Pei, Yu · HAN: a Hierarchical AutotuNed Collective Communication Framework
Peng, Ivy B. · tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads
Perotin, Lucas · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Podobas, Artur · tf-Darshan: Understanding Fine-grained I/O Performance in Machine Learning Workloads
Posner, Jonas · System-Level vs. Application-Level Checkpointing
Pottier, Loic · Modeling the Performance of Scientific Workflow Executions on HPC Platforms with Burst Buffers
Pouchet, Louis-Noël · Efficient Execution of Dynamic Programming Algorithms on Apache Spark
Pugmire, David · Parallel Particle Advection Bake-Off For Scientific Visualization Workloads

R
Rafique, M. Mustafa · CuVPP: Filter-based Longest Prefix Matching in Software Data Planes
Raghavan, Padma · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Rang, Wei · Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning
Ren, Jie · Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures
Rico, Alejandro · Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications
Robert, Yves · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Rosendo, Daniel · E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments
Rouvoy, Romain · Power Budgeting of Big Data Applications in Container-based Clusters
Ryu, Hoon · An HPC-based Prediction on the Practicality of Long-distance Quantum Key Distributions

S
Sala, Kevin · Towards Data-Flow Parallelization for Adaptive Mesh Refinement Applications
Salkhordeh, Reza · DelveFS - An event-driven semantic file system for object stores
Sano, Kentaro · Profiling and Visualizing Performance of FPGAs in High-Performance Computing Environments
Sato, Mitsuhisa · Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500
Schulz, Christian · Efficient Process-to-Node Mapping Algorithms for Stencil Computations
Shaffer, Tim · Autoscaling High-Throughput Workloads on Container Orchestrators
Shafie Khorassani, Kawthar · Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters
Shen, Ziyu · Estimating Power Consumption of Containers and Virtual Machines in Data Centers
Shoji, Fumiyoshi · Analysis of Cooling Water Temperature Impact on Computing Performance and Energy Consumption
Sill, Alan · MonSTer: An Out-of-the-Box Monitoring Tool for High Performance Computing Systems
Silva, Pedro · E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments
Simonin, Matthieu · E2Clab: Exploring the Computing Continuum through Repeatable, Replicable and Reproducible Edge-to-Cloud Experiments
Smigielski, Jean-François · DelveFS - An event-driven semantic file system for object stores
Sommer, Lukas · Profiling and Visualizing Performance of FPGAs in High-Performance Computing Environments
Steiner, Rebecca · DelveFS - An event-driven semantic file system for object stores
Steinkamp, Jörg · DelveFS - An event-driven semantic file system for object stores
Su, Xiao-Ming · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI
Subedi, Pradeep · A Staging Based Task Execution Framework for Data-driven Scientific Workflows
Subramoni, Hari · Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters
Sun, Hongyang · Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms
Sun, Xian-He · HCL: Distributing Parallel Data Structures in Extreme Scales
Suo, Kun · Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning

T
Taiji, Makoto · Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations
Tan, Haoliang · Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio
Tan, Yiyu · An FPGA-based Sound Field Rendering System
Teruel, Xavier · Evaluating Worksharing Tasks on Distributed Environments
Thain, Douglas · Autoscaling High-Throughput Workloads on Container Orchestrators
Touriño, Juan · Power Budgeting of Big Data Applications in Container-based Clusters
Träff, Jesper Larsson · Efficient Process-to-Node Mapping Algorithms for Stencil Computations
Decomposing MPI Collectives for Exploiting Multi-lane Communication
Trivedi, Animesh · Grade10: A Framework for Performance Characterization of Distributed Graph Processing

U
Ucar, Davut · Streaming File Transfer Optimization for Distributed Science Workflows
Ueno, Koji · Performance Evaluation of Supercomputer Fugaku using Breadth-First Search Benchmark in Graph500
Umemura, Masayuki · Toward OpenACC-enabled GPU-FPGA Accelerated Computing

V
Vadhiyar, Sathish · Fast Scalable Approximate Nearest Neighbor Search for High-dimensional Data
Vef, Marc-André · DelveFS - An event-driven semantic file system for object stores
Vennetier, Florent · DelveFS - An event-driven semantic file system for object stores
von Kirchbach, Konrad · Efficient Process-to-Node Mapping Algorithms for Stencil Computations

W
Wang, Chao · OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm
Wang, Jie · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI
Wang, Yi-Chao · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI
Wang, Zhe · A Staging Based Task Execution Framework for Data-driven Scientific Workflows
Wang, Zheng · Optimizing GPU Memory Transactions for Convolution Operations
Wilke, Jeremiah · Opportunities and limitations of Quality-of-Service in Message Passing applications on adaptively routed Dragonfly and Fat Tree networks
Wozniak, Justin · DeepClone: Scalable Live Migration of Deep Learning Models for Data Parallel Training
Wright, Nicholas · Quantifying the impact of network congestion on application performance and network metrics
Wu, Kai · Exploring Non-Volatility of Non-Volatile Memory for High Performance Computing Under Failures
Wu, Wei · HAN: a Hierarchical AutotuNed Collective Communication Framework
Flexible Data Redistribution in a Task-Based Runtime System

X
Xia, Bin · Estimating Power Consumption of Containers and Virtual Machines in Data Centers
Xia, Wen · Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio

Y
Yamaguchi, Yoshiki · Toward OpenACC-enabled GPU-FPGA Accelerated Computing
Yang, Donglin · Data Life Aware Model Updating Strategy for Stream-based Online Deep Learning
Yenpure, Abhishek · Parallel Particle Advection Bake-Off For Scientific Visualization Workloads
Yoshikawa, Kohji · Toward OpenACC-enabled GPU-FPGA Accelerated Computing

Z
Zeginis, Chrysostomos · The Case for Better Integrating Scalable Data Stores and Stream-Processing Systems
Zhang, Hao · Implementing a Comprehensive Networks-on-Chip Generator with Optimal Configurations
Zhang, Weizhe · Optimizing GPU Memory Transactions for Convolution Operations
Zhang, Xusheng · Estimating Power Consumption of Containers and Virtual Machines in Data Centers
Zhang, Yijia · Quantifying the impact of network congestion on application performance and network metrics
Zhang, Zhiyuan · Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio
Zhao, Kai · Towards End-to-end SDC Detection for HPC Applications Equipped with Lossy Compression
Zheng, Chao · Autoscaling High-Throughput Workloads on Container Orchestrators
Zhong, Dong · HAN: a Hierarchical AutotuNed Collective Communication Framework
Flexible Data Redistribution in a Task-Based Runtime System
Zhou, Qinghua · Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters
Zhou, Xuehai · OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm
Zhou, Zejia · SSP: Speeding up Small Flows for Proactive Transport in Datacenters.
Zola, Jaroslaw · Efficient Execution of Dynamic Programming Algorithms on Apache Spark
Zou, Xiangyu · Exploring the Potential of Fast Delta Encoding: Marching to a Higher Compression Ratio
Zuo, Si-Cheng · NeoMPX: Characterizing and Improving Estimation of Multiplexing Hardware Counters for PAPI

Created 2020-9-2 6:22