Tuesday, January 14

Registration Desk (4F)
5:00pm-7:00pm Registration
(Drink and snacks will be provided.)

Wednesday, January 15

Registration Desk (4F)
9:00am-5:00pm Registration
International Conference Hall (4F) Conference Room B (7F)
9:30am-9:45am Opening
Chair: Daisuke Takahashi (Univ. of Tsukuba)

Takeshi Iwashita (HPC Asia 2020 General chair, Hokkaido Univ.)
9:50am-10:40am Invited Talk
Chair: Daisuke Takahashi (Univ. of Tsukuba)

Extreme-Scale Earthquake Simulation on Sunway TaihuLight
Prof. Haohuan Fu (Tsinghua University, National Supercomputing Center in Wuxi)

10:40am-11:00am ~ Coffee Break ~
11:00am-12:30pm Best Paper Finalists
Chair: Jaejin Lee (SNU)

Quantum Dynamics at Scale: Ultrafast Control of Emergent Functional Materials
Subodh Tiwari (USC), Aravind Krishnamoorthy (USC), Pankaj Rajak (ANL), Putt Sakdhnagool (NSTDA), Manaschai Kunaseth (NSTDA), Fuyuki Shimojo (Kumamoto Univ.), Shogo Fukushima (Kumamoto Univ.), Aiichiro Nakano (USC), Ye Luo (ANL), Rajiv Kalia (USC), Ken-Ichi Nomura (USC), Priya Vashishta (USC)


Scalable Direct-Iterative Hybrid Solver for Sparse Matrices on Multi-Core and Vector Architectures
Kenji Ono (Kyushu Univ.), Toshihiro Kato (NEC), Takeshi Nanri (Kyushu Univ.), Satoshi Ohshima (Nagoya Univ.)


Enhancing a Manycore-Oriented Compressed Cache for GPGPU
Keitaro Oka (Kyushu Univ.), Satoshi Kawakami (Kyushu Univ.), Teruo Tanimoto (Kyushu Univ.), Takatsugu Ono (Kyushu Univ.), Koji Inoue (Kyushu Univ.)
12:30pm-2:00pm ~ Lunch Break ~
2:00pm-3:30pm Deep Learning and GPU Computing
Chair: Rio Yokota (TITech)

Towards GPU Acceleration of Phonon Computation with ShengBTE
Yi Wei (Beihang Univ.), Xin You (Beihang Univ.), Hailong Yang (Beihang Univ.), Zhongzhi Luan (Beihang Univ.), Depei Qian (Beihang Univ.)


Tiling-Based Programming Model for Structured Grids on GPU Clusters
Burak Bastem (Koc Univ.), Didem Unat (Koc Univ.)


Rethinking the Value of Asynchronous Solvers for Distributed Deep Learning
Arissa Wongpanich (UC Berkeley), Yang You (UC Berkeley), James Demmel (UC Berkeley)
3:30pm-3:50pm ~ Coffee Break ~
3:50pm-5:20pm Novel Networks
Chair: Jens Domke (TITech)

Parallelization of All-Pairs-Shortest-Path Algorithms in Unweighted Graphs for Order/Degree Problem
Masahiro Nakao (RIKEN), Hitoshi Murai (RIKEN), Mitsuhisa Sato (RIKEN)


Dual-Plane Isomorphic Hypercube Network
Takeo Hosomi (NEC), Ryota Yasudo (Hiroshima Univ.), Michihiro Koibuchi (NII), Shinji Shimojo (Osaka Univ.)


Wavelength-Routing Interconnect "Optical Hub" for Parallel Computing Systems
Yutaka Urino (PETRA), Kenji Mizutani (PETRA) Tatsuya Usuki (PETRA), Shigeru Nakamura (PETRA)

Thursday, January 16

Registration Desk (4F)
9:30am-5:00pm Registration
International Conference Hall (4F) Conference Room B (7F)
9:50am-10:40am Invited Talk
Chair: Taisuke Boku (Univ. of Tsukuba)

Challenges of Heterogenous Acceleration in Future HPC and Datacenters
Dr. John Shalf (Lawrence Berkeley National Laboratory)

10:40am-11:00am ~ Coffee Break ~
11:00am-12:30pm Numerical Kernels
Chair: Fumihiko Ino (Osaka Univ.)

Effect of Mixed Precision Computing on H-matrix Vector Multiplication in BEM Analysis
Rise Ooi (Hokkaido Univ.), Takeshi Iwashita (Hokkaido Univ.), Takeshi Fukaya (Hokkaido Univ.), Akihiro Ida (UTokyo), Rio Yokota (TITech)


Diamond Matrix Powers Kernel
Emil Vatai (UTokyo), Utsav Singhal (IITD), Reiji Suda (UTokyo)


FFTE on SVE: SPIRAL-Generated Kernels
Daisuke Takahashi (Univ. of Tsukuba), Franz Franchetti (CMU)
Modern Processors
Chair: Kazuhiko Komatsu (Tohoku Univ.)

Integrating Cache Oblivious Approach with Modern Processor Architecture: The Case of Floyd-Warshall Algorithm
Toshio Endo (TITrch)


On the Correct Measurement of Application Memory Bandwidth and Memory Access Latency
Christian Helm (UTokyo), Kenjiro Taura (UTokyo)


Accuracy Improvement of Memory System Simulation for Modern Shared Memory Processor
Yuetsu Kodama (RIKEN), Tetsuya Odajima (RIKEN), Akira Asato (Fujitsu), Mitsuhisa Sato (RIKEN/Univ. of Tsukuba)
12:30pm-2:00pm ~ Lunch Break ~
2:00pm-3:30pm Advanced Systems
Chair: Balazs Gerofi (RIKEN)

Exploiting Spark for HPC Simulation Data: Taming the Ephemeral Data Explosion
Ming Jiang (LLNL), Brian Gallagher (LLNL), Albert Chu (LLNL), Ghaleb Abdulla (LLNL), Timothy Bender (LLNL)


Extended Hoeffding Adaptive Tree Based-Server Load Prediction in Cloud Computing environment
Hajer Toumi (FST), Zaki Brahmi (RIADI-Lab), Mohammd Mohsen Gammoudi (Manouba Univ.)


Effect of an Incentive Implementation for Specifying Accurate Walltime in Job Scheduling
Shinichiro Takizawa (AIST), Ryousei Takano (AIST)
Numerical Solvers
Chair: Takeshi Fukaya (Hokkaido Univ.)

A Scalable Matrix-Free Iterative Eigensolver for Studying Many-Body Localization
Roel Van Beeumen (LBNL), Gregory D. Meyer (UC Berkeley), Norman Y. Yao (UC Berkeley), Chao Yang (LBNL)


Adaptive Level Binning for Sparse Triangular Solvers
Buse Yilmaz (Koc Univ.), Bugra Sipahioglu (Koc Univ.), Najeeb Ahmad (Koc Univ.), Didem Unat(Koc Univ.)
3:30pm-5:30pm Poster session @ Conference Room A (7F)
6:30pm-8:30pm ~ Banquet @ Hotel Nikko Fukuoka (3F, Room: Tsukushi) ~

Friday, January 17

Registration Desk (4F)
9:30am-1:00pm Registration
International Conference Hall (4F) Conference Room B (7F)
9:50am-10:40am Invited Talk
Chair: Takahiro Katagiri (Nagoya Univ.)

Codesign for "Fugaku"
Dr. Mitsuhisa Sato (RIKEN)

10:40am-11:00am ~ Coffee Break ~
11:00am-12:00pm Power Optimization
Chair: Ryousei Takano (AIST)

The Effectiveness of Low-Precision Floating Arithmetic on Numerical Codes: A Case Study on Power Consumption
Ryuichi Sakamoto (UTokyo), Masaaki Kondo (RIKEN/UTokyo), Kohei Fujita (UTokyo), Tsuyoshi Ichimura (UTokyo), Kengo Nakajima (UTokyo)


Energy Efficient Runahead Execution on a Tightly-Coupled Heterogeneous Core
Susumu Mashimo (Kyushu Univ.), Ryota Shioya (UTokyo), Koji Inoue (Kyushu Univ.)
Sparse Matrix Computations
Chair: Takeshi Nanri (Kyushu Univ.)

Multiplicative Schwartz-Type Block Multi-Color Gauss-Seidel Smoother for Algebraic Multigrid Methods
Masatoshi Kawai (RIKEN), Akihiro Ida (UTokyo), Hiroya Matsuba (RIKEN), Kengo Nakajima (UTokyo), Matthias Bolten (BUW)


Performance Improvement of a Scalable High-Order Compressible Flow Solver on Unstructured Hexahedral Grids
Kazuma Tago (JAXA), Takanori Haga (JAXA), Seiji Tsutsumi (JAXA), Ryoji Takaki (JAXA)
12:00pm-12:10pm Closing

Takeshi Iwashita (HPC Asia 2020 General chair, Hokkaido Univ.)
1:00pm-4:30pm Workshop
IXPUG Workshop Asia 2020
Workshop
Multi-scale, Multi-physics and Coupled Problems on Highly Parallel Systems (MMCP)

Posters

A zipped file that contains all poster abstracts can be downloaded here.
No. Title and Author(s) Abstract Poster
1 Performance Measurement of Kinetic Code on Scalar Processors
Takayuki Umeda (Nagoya Univ.)
2 A Ship Detection Algorithm for SAR Image Based on Box-plot
Yingqi Zhao (OUC), Takeshi Iwashita (Hokkaido Univ.), Linjie Zhang (OUC)
3 Single-Precision Calculation of Iterative Refinement of Eigenpairs of a Real Symmetric-Definite Generalized Eigenproblem by Using a Filter Composed of a Single Resolvent
Hiroshi Murakami (TMU)
4 Optimization of x265 Encoder using ARM SVE
Ryosuke Aoki (UEC), Hirokazu Murao (UEC)
5 Sound Rendering and its Acceleration Using FPGA
Yiyu Tan (RIKEN), Toshiyuki Imamura (RIKEN)
6 Performance Evaluation of Accurate Matrix-matrix Multiplications on GPU Using Sparse Matrix Multiplications
Fumiya Ishiguro (Nagoya Univ.), Takahiro Katagiri (Nagoya Univ.), Satoshi Ohshima (Nagoya Univ.), Toru Nagai (Nagoya Univ.)
7 On a Relationship between the ∗-congruence Sylvester Equation and the Generalized Sylvester Equation
Yuki Satake (Nagoya Univ.), Tomohiro Sogabe (Nagoya Univ.), Tomoya Kemmochi (Nagoya Univ.), Shao-Liang Zhang (Nagoya Univ.)
8 h3-Open-BDEC: Innovative Software Platform for Scientific Computing in the Exascale Era by Integrations of (Simulation + Data + Learning)
Kengo Nakajima (UTokyo), Takeshi Iwashita (Hokkaido Univ.), Takashi Shimokawabe (UTokyo), Hiromichi Nagao (UTokyo), Takeshi Ogita (TWCU), Takahiro Katagiri (Nagoya Univ.), Hisashi Yashiro (NIES), Hiroya Matsuba (RIKEN)
9 Accurate DGEMM using Tensor Cores
Daichi Mukunoki (RIKEN), Katsuhisa Ozaki (SIT), Takeshi Ogita (TWCU)
10 AMR Framework to Realize Effective High-Resolution Simulations on Multiple GPUs
Takashi Shimokawabe (UTokyo), Naoyuki Onodera (JAEA)
11 More Accurate Computation for Double-Double Arithmetic without Additional Execution Time by Parallel Processing
Hotaka Yagi (TUS), Emiko Ishiwata (TUS), Hidehiko Hasegawa (Univ. of Tsukuba)
12 Task-Parallel Algorithm for Matrix Factorizations
Tomohiro Suzuki (Univ. of Yamanashi)
13 Communication-Hiding Pipelined BiCGStar-Plus Method and Its Application to GPU-based Numerical Simulation of Blood Flow
Viet Huynh Quang Huy (Tohoku Univ.), Hiroshi Suito (Tohoku Univ.)
14 Distributed Memory Task-Based Block Low Rank Direct Solver
Sameer Deshmukh (TITech), Rio Yokota (TITech)
15 Implementation and Performance Evaluation of Parallel OpenACC Climate Code City-LES on GPU Cluster
Daisuke Tsuji (Univ. of Tsukuba), Hiroto Tadano (Univ. of Tsukuba), Taisuke Boku (Univ. of Tsukuba), Ryosaku Ikeda (Univ. of Tsukuba), Takuto Sato (Univ. of Tsukuba), Hiroyuki Kusaka (Univ. of Tsukuba)
16 Numerical Linear Algebra Based on Lattice H-Matrices
Akihiro Ida (UTokyo), Ichitaro Yamazaki (SNL), Rio Yokota (TITech), Satoshi Ohshima (Nagoya Univ.), Tasuku Hiraishi (Kyoto Univ.), Takeshi Iwashita (Hokkaido Univ.), Tetsuya Hoshino (UTokyo), Toshihiro Hanawa (UTokyo)
17 Performance Improvement of Block Red-Black MILU(0) Preconditioner with Relaxation on GPU
Akemi Shioya (UEC), Yusaku Yamamoto (UEC)
18 Co-Desigining of FEM based CFD code FrontFlow/blue for the Supercomputer Fugaku
Kiyoshi Kumahata (RIKEN), Kazuo Minami (RIKEN)
19 Predicting the Convergence of an Iterative Method from Matrix Images using CNN
Ryo Ota (Univ. of Tsukuba), Hidehiko Hasegawa (Univ. of Tsukuba)
20 An Optimization Technology of Software Auto-Tuning Applied to Machine Learning Software
Toshiki Tabeta (Kogakuin Univ.), Naoto Seki (Kogakuin Univ.), Akihiro Fujii (Kogakuin Univ.), Teruo Tanaka (Kogakuin Univ.), Hiroyuki Takizawa (Tohoku Univ.)
21 QR Decomposition of Block Low-Rank Matrices
Muhammad Ridwan Apriansyah (TITech), Rio Yokota (TITech)
22 Dissection Sparse Direct Solver and Parallel Task Management
Atsushi Suzuki (Osaka Univ.)
23 Cross-Reference Simulation by Code-To-Code Adapter (CoToCoA) Library for the Study of Planetary Magnetospheres
Yuto Katoh (Tohoku Univ.), Keiichiro Fukazawa (Kyoto Univ.), Takeshi Nanri (Kyushu Univ.), Yohei Miyake (Kobe Univ.)
24 A Study for Optimizing Schedule based on Work Life Balance of Radiologist
Yusuke Gotoh (Okayama Univ.), Koji Sakai (KPUM), Jun Tazoe (KPUM), Hiroshi Miura (KPUM), Yu Ohara (KPUM), Akira Uchiyama (Osaka Univ.), Yoshinari Nomura (Okayama Univ.)
25 Acceleration of Hyper-Parameter Auto-Tuning with Parallelization and Time Constraints
Chaoyi Zhang (Tohoku Univ.), Ryusuke Egawa (Tohoku Univ.), Hiroyuki Takizawa (Tohoku Univ.)
26 A New Record of Graph Enumeration Enabled by Parallel Processing
Zhipeng Xu (SYSU, SBU), Xiaolong Huang (SBU), Yidan Zhang (SBU), Yuefan Deng (SBU)
27 An Optimization of H-matrix-vector Multiplication by Using Un-used Cores
Tetsuya Hoshino (UTokyo), Toshihiro Hanawa (UTokyo), Akihiro Ida (UTokyo)
28 DNN Training Using Multiple GPUs for Medical Image Recognition
Akitoshi Hashizume (UTokyo), Toshihiro Hanawa (UTokyo)
29 Steady Flow Prediction using Convolutional Neural Networks with Boundary Exchange
Sora Hatayama (UTokyo), Takashi Shimokawabe (UTokyo)
30 Acceleration of Numerical Turbine using the Red-Black Method
Yuta Hougi (Tohoku Univ.), Kazuhiko Komatsu (Tohoku Univ.), Osamu Watanabe (Tohoku Univ.), Masayuki Sato (Tohoku Univ.), Hiroaki Kobayashi (Tohoku Univ.)
31 Performance Evaluation of a Clustering Approach based on Thermophysical Properties by using Multiple Platform
Kou Murakami (Tohoku Univ.), Kazuhiko Komatsu (Tohoku Univ.), Masayuki Sato (Tohoku Univ.), Hiroaki Kobayashi (Tohoku Univ.)
32 Performance Tuning of Deep Learning Framework Chainer on the K computer
Akiyoshi Kuroda (RIKEN), Kiyoshi Kumahata (RIKEN), Syuichi Chiba (Fujitsu), Katsutoshi Takashina (Fujitsu), Kazuo Minami (RIKEN)
33 Autotuning by Changing Directives and Number of Threads in OpenMP using ppOpen-AT
Toma Sakurai (Nagoya Univ.), Takahiro Katagiri (Nagoya Univ.), Satoshi Ohshima (Nagoya Univ.), Toru Nagai (Nagoya Univ.)
34 BITFLEX: A Dynamic Runtime Library for Bit-Level Precision Manipulation and Approximate Computing
Ryan Barton (TITech, AIST), Mohamed Wahib (AIST), Artur Podobas (RIKEN), Satoshi Matsuoka (RIKEN)
35 Runtime Correctness Check for Co-working Parallel Programs
Miwako Tsuji (RIKEN), Hitoshi Murai (RIKEN), Mitsuhisa Sato (RIKEN, Univ. of Tsukuba), Thomas Dufaud (UVSQ), Nahid Emad (UVSQ), Joachim Protze (RWTH), Christian Terboven (RWTH), Matthias S. Müller (RWTH), Taisuke Boku (Univ. of Tsukuba), Serge G. Petiton (ULille)
36 Enabling OpenACC Programming on Multi-hybrid Accelerated with GPU and FPGA
Ryuta Tsunashima (Univ. of Tsukuba), Ryohei Kobayashi (Univ. of Tsukuba), Norihisa Fujita (Univ. of Tsukuba), Ayumi Nakamichi (Univ. of Tsukuba), Taisuke Boku (Univ. of Tsukuba), Seyong Lee (ORNL), Jeffrey Vetter (ORNL), Hitoshi Murai (RIKEN), Mitsuhisa Sato (RIKEN)
37 Preliminary Evaluation towards Task Priority Control in HPX
Suhang Jiang (Tohoku Univ.), Mulya Agung (Tohoku Univ.), Ryusuke Egawa (Tohoku Univ.), Hiroyuki Takizawa (Tohoku Univ.)
38 Implementing the Tascell Task-Parallel Language Tascell Using Multithreaded MPI
Daiki Kojima (Kyoto Univ.), Tasuku Hiraishi (Kyoto Univ.), Hiroshi Nakashima (Kyoto Univ.), Masahiro Yasugi (KYUTECH)
39 A Study on Compiler Dependent Performance Improvement
Ryoichi Shibata (Kogakuin Univ.), Akira Fukuda (Kyushu Univ.), Yusuke Sato (Kogakuin Univ.), Takeshi Kamiyama (Kyushu Univ.), Masato Oguchi (Ochanomizu Univ.), Saneyasu Yamaguchi (Kogakuin Univ.)
40 A Study on CPU Clock Frequency Optimization in Kernel
Yusuke Sato (Kogakuin Univ.), Masato Oguchi (Ochanomizu Univ.), Saneyasu Yamaguchi (Kogakuin Univ.)
41 Towards Cross-stack Dynamic Resource Affinity Management
Balazs Gerofi (RIKEN)
42 System Software Support for Fast and Flexible Task Management on a Large-scale FPGA cluster
Atsushi Koshiba (RIKEN), Kentaro Sano (RIKEN)
43 Job Feature Aware File Location Optimization
Makoto Nakagami (Kogakuin Univ.), Jose A.B. Fortes (UF), Saneyasu Yamaguchi (Kogakuin Univ.)
44 Building and Evaluation of ABCI Cloud Storage Service
Yusuke Tanimura (AIST), Shinichiro Takizawa (AIST), Hirotaka Ogawa (AIST), Takahiro Hamanishi (AIST)
45 ChOWDER: A VDA-Based Scalable Display System for Displaying High-Resolution Visualization Results
Tomohiro Kawanabe (RIKEN), Jorji Nonaka (RIKEN), Kenji Ono (Kyushu Univ.)
46 Interactive In-situ Visualization of GPU-accelerated Simulations using Particle-based Volume Rendering
Takuma Kawamura (JAEA), Yasuhiro Idomura (JAEA), Naoyuki Onodera (JAEA)
47 Improving I/O Performance in Container with OverlayFS with Optimized Synchronization
Naoki Mizusawa (Kogakuin Univ.), Seki Yuya (AIST), Jian Tao (Texas A&M), Saneyasu Yamaguchi (Kogakuin Univ.)
48 Object Storage Performance Analyzing System Based on Packet Transfers and Method Calls Visualization
Shunpei Hayakawa (Kogakuin Univ.), Saneyasu Yamaguchi (Kogakuin Univ.)
49 Introduction of HPCI Shared Storage that has Achieved Year-Round Non-Stop Operation
Hiroshi Harada (RIKEN), Osamu Tatebe (Univ. of Tsukuba), Toshihiro Hanawa (UTokyo), Isamu Koseda (UTokyo), Hidetomo Kaneyama (RIKEN), Noriyuki Soda (SRA), Akira Kondo (SOUM), Takahiro Yugawa (SOUM)
50 Optimizing Precision for High-Performance, Robust, and Energy-Efficient Computations
Roman Iakymchuk (Sorbonne Univ., Fraunhofer ITWM), Fabienne Jézéquel (Sorbonne Univ.), Stef Graillat (Sorbonne Univ.), Daichi Mukunoki (RIKEN), Toshiyuki Imamura (RIKEN), Yiyu Tan (RIKEN), Atsushi Koshiba (RIKEN), Jens Huthmann (RIKEN), Kentaro Sano (RIKEN), Norihisa Fujita (Univ. of Tsukuba), Taisuke Boku (Univ. of Tsukuba)
51 Toward Latency-Aware Data Arrangement on Many-Core Processors
Tomoya Yuki (TITech), Toshio Endo (TITech)
52 Performance Evaluation of Acoustic FDTD(2,4) Method Using Distributed Shared Memory System mSMS
Ryoya Tabata (KYUTECH), Hiroko Midorikawa (Seikei Univ.), Ki’nya Takahashi (KYUTECH)
53 High-Performance Custom Computing with FPGA Cluster as an Off-loading Engine
Takaaki Miyajima (RIKEN), Tomohiro Ueno (RIKEN), Atsushi Koshiba (RIKEN), Jens Huthmann (RIKEN), Kentaro Sano (RIKEN)
54 A Study on Performances Behaviors of TCP BBR and CUBIC TCP in Deep Buffer Network
Kanon Sasaki (Kogakuin Univ.), Saneyasu Yamaguchi (Kogakuin Univ.)
55 A Periodic Table of Graphs with Special Properties
Yidan Zhang (SBU), Xiaolong Huang (SBU), Zhipeng Xu (SYSU), Yuefan Deng (SBU)
56 Cyclic Performance Fluctuation of TCP BBR
Kouto Miyazawa (Kogakuin Univ.), Saneyasu Yamaguchi (Kogakuin Univ.), Aki Kobayashi (Kogakuin Univ.)