Delivering Efficient Parallel I/O
on Exascale Computing Systems

Exascale Computing Project (ECP)
Software Technology area



Quincey Koziol and Suren Byna
"Presentation: ExaHDF5 - ECP ST Project - Delivering Efficient Parallel I/O on Exascale Systems"
NERSC Data Seminar Presentation [Slides]

Suren Byna, Mohamad Chaarawi, Quincey Koziol, John Mainzer, and Frank Willmore
"Tuning HDF5 subfiling performance on parallel file systems"
CUG 2017 [Preprint version] [Presentation]

Jialin Liu, Quincey Koziol, Houjun Tang, François Tessier, Wahid Bhimji, Brandon Cook, Brian Austin, Suren Byna, Bhupender Thakur, Glenn Lockwood, Jack Deslippe, and Prabhat
"Understanding the I/O Performance Gap Between Cori KNL and Haswell"
CUG 2017 [Preprint version] [Presentation]

Cong Xu, Shane Snyder, Omkar Kulkarni, Vishwanath Venkatesan, Philip Carns, Suren Byna, Robert Sisneros, and Kalyana Chadalavada
"DXT: Darshan eXtended Tracing"
CUG 2017 [Preprint version] [Presentation]


Bin Dong, Suren Byna, Kesheng Wu, Prabhat, Hans Johansen, Jeffrey N. Johnson, and Noel Keen
"Data Elevator: Low-contention Data Movement in Hierarchical Storage System"
IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2016. [Preprint version] [Presentation]

Prior to ECP ExaHDF5; funded by DOE ASCR

Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter Sadowski, Evan Racah, Suren Byna, Craig Tull, Wahid Bhimji, Prabhat, and Pradeep Dubey
"PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures"
30th IEEE International Parallel & Distributed Processing Symposium (IPDPS) 2016, Chicago [Preprint version]

Houjun Tang, Suren Byna, Steven Harenberg, Xiaocheng Zou, Wenzhao Zhang, Kesheng Wu, Bin Dong, Oliver Rubel, Kristofer Bouchard, Scott Klasky, and Nagiza Samatova
"Usage Pattern-Driven Dynamic Data Layout Reorganization"
16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2016 [Preprint version]

Dharshi Devendran, Suren Byna, Bin Dong, Brian van Straalen, Hans Johansen, Noel Keen, and Nagiza Samatova
"Collective I/O Optimizations for Adaptive Mesh Refinement Data Writes on Lustre File System"
CUG 2016 [Preprint version]

Cong Xu, Suren Byna, Vishwanath Venkatesan, Robert Sisneros, Omkar Kulkarni, Mohamad Chaarawi, and Kalyana Chadalavada
"LIOProf: Exposing Lustre File System Behavior for I/O Middleware"
CUG 2016 [Preprint version]

Wahid Bhimji, Debbie Bard, Melissa Romanus, David Paul, Andrey Ovsyannikov, Brian Friesen, Matt Bryson, Joaquin Correa, Glenn K. Lockwood, Vakho Tsulaia, Suren Byna, Steve Farrell, Doga Gursoy, Chris Daley, Vince Beckner, Brian Van Straalen, Nicholas Wright, Katie Antypas, and Prabhat
"Accelerating Science with the NERSC Burst Buffer Early User Program"
CUG 2016


Md. Mostofa Ali Patwary, Suren Byna, Nadathur Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn, Michael Anderson, Yushu Yao, Prabhat, and Pradeep Dubey
"BD-CATS: Big Data Clustering at Trillion Particle Scale"
IEEE/ACM Supercomputing - SC15. [PDF] [News article in ASCR Discovery]

Babak Behzad, Suren Byna, Prabhat and Marc Snir
"Pattern-driven Parallel I/O Tuning"
10th Parallel Data Storage Workshop (PDSW) 2015, in conjunction with SC15, November 2015 [PDF]

Shane Snyder, Philip Carns, Robert Latham, Misbah Mubarak, Chris Carothers, Babak Behzad, Huong Vu Thanh Luu, Suren Byna, and Prabhat
"Techniques for Modeling Large-scale HPC I/O Workloads"
The 6th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS15), in conjunction with SC15, November 2015 [PDF] [ACM Digital Library]

P. Malakar and V. Vishwanath
"Hierarchical Read-write Optimizations for Scientific Applications with Multi-variable Structured Datasets"
Proceedings of the 12th Annual IFIP International Conference on Network and Parallel Computing (NPC), New York City, New York, USA, September 17-19, 2015
Also appears in IJPP [IJPP version]

Babak Behzad, Suren Byna, Stefan Wild, Prabhat and Marc Snir
"Dynamic Model-driven Parallel I/O Performance Tuning"
IEEE Cluster 2015 [PDF]

H. Luu, M. Winslett, W. Gropp, R. Ross, P. Carns, K. Harms, Prabhat, S. Byna, and Y. Yao
"A Multiplatform Study of I/O Behavior on Petascale Supercomputers"
The 24th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC) 2015 [Preprint (PDF)]

Kalyana Chadalavada, Rob Sisneros, Suren Byna, and Quincey Koziol
"Tuning Parallel I/O on Blue Waters for Writing 10 Trillion Particles"
Cray Users Group (CUG) meeting 2015 [PDF]


B. Behzad, S. Byna, S. Wild, Prabhat, and M. Snir
"Improving Parallel I/O Autotuning with Performance Modeling"
The 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC) 2014 [PDF]

Ted Habermann, Andrew Collette, Steve Vincena, Werner Benger, Jay Jay Billings, Matt Gerring, Konrad Hinsen, Pierre de Buyl, Mark Könnecke, Filipe Maia, and Suren Byna,
"The Hierarchical Data Format (HDF): A Foundation for Sustainable Data and Software"
2nd Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE2), in conjunction with Supercomputing 2014 (SC14)

Vishwanath Venkatesan, Mohamad Chaarawi, Quincey Koziol, and Edgar Gabriel
"Compactor: an Optimization Framework for Staging I/O"
Workshop on High Performance Data Intensive Computing (HPDIC), held in conjunction with IPDPS 2014 in Phoenix, AZ, May 2014.


B. Behzad, H. Luu, J. Huchette, S. Byna, Prabhat, R. Aydt, Q. Koziol, and M. Snir
"Taming Parallel I/O Complexity with Auto-Tuning" [PDF]
Supercomputing 2013 (SC13) Technical Program

Babak Behzad, Joey Huchette, Huong Luu, Ruth Aydt, Suren Byna, Quincey Koziol, Yushu Yao, and Prabhat
"An Auto-tuning framework for HDF5 applications", [PDF]
HPDC 2013 Short Paper

Suren Byna, Andrew Uselton, Prabhat, David Knaak, and Helen He
"Trillion Particles, 300TB, 120,000 cores: Lessons Learnt from a Hero I/O run on Hopper" [PDF]
Cray User Group 2013 [Best Paper Award]


Prabhat, Suren Byna, Babak Behzad, Joey Huchette, Huong Luu, Ruth Aydt, and Quincey Koziol
"Auto-Tuning Parallel I/O"
Review of the LBL Computational Research Division. Jan 23 2012. Poster.

Surendra Byna, Jerry Chou, Oliver Rübel, Prabhat, Homa Karimabadi, William S. Daughton, Vadim Roytershteyn, E. Wes Bethel, Mark Howison, Ke-Jou Hsu, Kuan-Wu Lin, Arie Shoshani, Andrew Uselton, and Kesheng Wu,
"Parallel I/O, Analysis, and Visualization of a Trillion Particle Simulation," [PDF]
SuperComputing 2012 (SC12), Salt Lake City, Utah, Nov. 10-16, 2012. LBNL-5832E.

S. Byna, J. Chou, O. Rübel, Prabhat, H. Karimabadi, W. S. Daughton, V. Roytershteyn, E. W. Bethel, M. Howison, K.-J. Hsu, K.-W. Lin, A. Shoshani, A. Uselton, and K. Wu.
"Parallel Data Storage, Analysis, and Visualization of a Trillion Particles", YouTube video
XLDB 2012 Poster.

Babak Behzad, Joey Huchette, Huong Luu, Ruth Aydt, Suren Byna, Mohamad Chaarawi, Quincey Koziol, Prabhat, Yushu
"Auto-tuning of Parallel I/O parameters for HDF5 Applications",
SuperComputing 2012 Poster [Best Poster Nominee]

Prabhat, Suren Byna, John Wu, Jerry Chou, Mark Howison, Joey Huchette, Wes Bethel, Quincey Koziol, Mohamad Chaarawi, Ruth Aydt, Babak Behzad, Huong Luu, Karen Schuchardt, Bruce Palmer
"Updates from the ExaHDF5 project: Trillion Particle Run, Auto-Tuning and the Virtual Object Layer"
ASCR Exascale meeting, Arlington, VA. Oct 2012. Poster.

Oliver Rübel, E. Wes Bethel, Prabhat, and Kesheng Wu
"Query-Driven Visualization and Analysis"
In E. Wes Bethel, Hank Childs, and Charles Hansen, editors, High Performance Visualization - Enabling Extreme Scale Scientific Insight, Chapman & Hall, CRC Computational Science, pages 117-144. CRC Press/Francis-Taylor Group, Boca Raton, FL, USA, November 2012.

Prabhat, Kesheng Wu, Jerry Chou, Suren Byna, Mark Howison, E. Wes Bethel, Quincey Koziol, Peter Cao, Mohamad Chaarawi, Christian Chilan, Mike McGreevy, Karen Schuchardt, Bruce Palmer
"Technical Highlights from the ExaHDF5 project"
DOE Exascale Research Meeting April 2012.


Jerry Chou, John Wu, Oliver Rübel, Mark Howison, Ji Qiang, Prabhat, Brian Austin, E. Wes Bethel, Rob Ryne, and Arie Shoshani,
"Parallel Index and Query for Large Data". [PDF]
SuperComputing 2011.

Jerry Chou, John Wu, and Prabhat,
"FastQuery: A Parallel Indexing System for Scientific Data" [PDF]
Workshop on Interfaces and Abstractions for Scientific Data Storage, IEEE Cluster 2011.

Jerry Chou, John Wu, and Prabhat,
"FastQuery: a general Index and Query system for scientific data", Springer Link
Scientific and Statistical Database Management Conference 2011, Poster.

Prabhat, Quincey Koziol, Karen Schuchardt, E. Wes Bethel, Jerry Chou, Mark Howison, Mike McGreevy, Bruce Palmer, Oliver Rübel and John Wu,
"ExaHDF5: An I/O Platform for Exascale Data Models, Analysis and Performance"
Scientific Discovery Through Advanced Computing 2011, Invited Paper. [PDF]