Publications
2020
- Jingqing Mu, Jerome Soumagne, Suren Byna, Quincey Koziol, Houjun Tang, and Richard Warren, "Interfacing HDF5 with A Scalable Object-centric Storage System on Hierarchical Storage", Journal of Concurrency and Computation: Practice and Experience (DOI: [Link])
- Houjun Tang, Suren Byna, Bin Dong, and Quincey Koziol, "Parallel Query Service for Object-centric Data Management Systems", The 6th IEEE International Workshop on High-Performance Big Data and Cloud Computing (HPBDC) 2020, in conjunction with IPDPS 2020. [Pre-print Paper]
- Tirthak Patel, Suren Byna, Glenn K. Lockwood, Nicholas J. Wright, Philip Carns, Rob Ross, and Devesh Tiwari, "Uncovering Access, Reuse, and Sharing Characteristics of I/O-Intensive Files on Large-Scale Production HPC Systems", FAST '20 [Link]
- Suren Byna, M. Scot Breitenfeld, Bin Dong, Quincey Koziol, Elena Pourmal, Dana Robinson, Jerome Soumagne, Houjun Tang, Venkatram Vishwanath, and Richard Warren, "ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems", Journal of Computer Science and Technology, 2020, 35(1): 145-160. DOI: 10.1007/s11390-020-9822-9 [Paper]
2019
- Houjun Tang, Suren Byna, Stephen Bailey, Zarija Lukic, Jialin Liu, Quincey Koziol, and Bin Dong, "Tuning Object-centric Data Management Systems for Large Scale Scientific Applications", 26th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) 2019 [Pre-print version]
- Richard Warren, Jerome Soumagne, Jingqing Mu, Houjun Tang, Suren Byna, Bin Dong, and Quincey Koziol, "Analysis in the Data Path of an Object-centric Data Management System", 26th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) 2019 [Pre-print version]
- Houjun Tang, Quincey Koziol, Suren Byna, John Mainzer, and Tonglin Li, "Enabling Transparent Asynchronous I/O using Background Threads", PDSW 2019, in conjunction with SC19. [Pre-print version]
- Megha Agarwal, Divyansh Singhvi, Preeti Malakar, and Suren Byna, "Active Learning-based Automatic Tuning and Prediction of Parallel I/O Performance", PDSW 2019, in conjunction with SC19. [Pre-print version] [Slides]
- Jingqing Mu, Jerome Soumagne, Suren Byna, Quincey Koziol, Houjun Tang, and Richard Warren, "Interfacing HDF5 with A Scalable Object-centric Storage System on Hierarchical Storage", Cray User Group (CUG) 2019 [Pre-print version]
- Babak Behzad, Suren Byna, Prabhat, and Marc Snir, "Optimizing I/O Performance of HPC Applications with Autotuning", ACM Transactions on Parallel Computing (TOPC), Volume 5 Issue 4, March 2019, Article No. 15, doi: 10.1145/3309205 [Link to ACM Digital Library] [Pre-print version]
2018
- Bin Dong, Teng Wang, Houjun Tang, Quincey Koziol, Kesheng Wu, and Suren Byna, "ARCHIE: Data Analysis Acceleration with Array Caching in Hierarchical Storage", IEEE International Conference on Big Data (IEEE BigData) 2018 [Pre-print version]
- Suren Byna, Quincey Koziol, Venkatram Vishwanath, Jerome Soumagne, Houjun Tang, Kimmy Mu, Richard Warren, François Tessier, Bin Dong, Teng Wang, and Jialin Liu, "Proactive Data Containers (PDC): An object-centric data store for large-scale computing systems", AGU Fall Meeting 2018 [Session][Slides]
- Wei Zhang, Houjun Tang, Suren Byna, and Yong Chen, "DART: Distributed Adaptive Radix Tree for Efficient Affix-based Keyword Search on HPC Systems", The 27th International Conference on Parallel Architectures and Compilation Techniques (PACT'18) [Pre-print version]
- Jialin Liu, Quincey Koziol, Gregory Butler, Neil Fortner, Mohamad Chaarawi, Houjun Tang, Suren Byna, Glenn Lockwood, Ravi Cheema, Kristy Kallback-Rose, Damian Hazen, and Prabhat, "Evaluation of HPC Application I/O on Object Storage Systems", 3rd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS), 2018 [Pre-print version]
- Fahim Chowdhury, Jialin Liu, Quincey Koziol, Thorsten Kurth, Steven Farrell, Suren Byna, Prabhat, and Weikuan Yu, "Initial Characterization of I/O in Large-Scale Deep Learning Applications", Work in Progress paper, 3rd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS), 2018 (held in conjunction with SC18) [Pre-print version]
- Teng Wang, Suren Byna, Bin Dong, and Houjun Tang, "UniviStor: Integrated Hierarchical and Distributed Storage for HPC", IEEE Cluster 2018, Belfast [Pre-print version]
- Kimmy Mu, Jerome Soumagne, Houjun Tang, Suren Byna, Quincey Koziol, and Richard Warren, "A Server-managed Transparent Object Storage Abstraction for HPC", IEEE Cluster 2018, Belfast [Pre-print version]
- Haoyuan Xing, Sofoklis Floratos, Spyros Blanas, Suren Byna, Prabhat, Kesheng Wu, and Paul Brown, "ArrayBridge: Interweaving declarative array processing with imperative high-performance computing", 34th IEEE International Conference on Data Engineering (ICDE) 2018, Paris [Pre-print version]
- Houjun Tang, Suren Byna, François Tessier, Teng Wang, Bin Dong, Jingqing Mu, Quincey Koziol, Jerome Soumagne, Venkatram Vishwanath, Jialin Liu, and Richard Warren, "Toward Scalable and Asynchronous Object-centric Data Management for HPC", 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2018 [Preprint version]
- Bharti Wadhwa, Suren Byna, and Ali R. Butt, "Toward Transparent Data Management in Multi-layer Storage Hierarchy for HPC Systems", IEEE International Conference on Cloud Engineering 2018 (IC2E 2018) [Preprint version]
2017
- Houjun Tang, Suren Byna, Bin Dong, Jialin Liu, and Quincey Koziol, "SoMeta: Scalable Object-centric Metadata Management for High Performance Computing", The IEEE Cluster Conference 2017 [Preprint version]
- François Tessier, Venkat Vishwanath, and Emmanuel Jeannot, "TAPIOCA: An I/O library for optimized topology-aware data aggregation on large-scale supercomputers", The IEEE Cluster Conference 2017 [Paper]
- Jialin Liu, Quincey Koziol, Houjun Tang, François Tessier, Wahid Bhimji, Brandon Cook, Brian Austin, Suren Byna, Bhupender Thakur, Glenn Lockwood, Jack Deslippe, and Prabhat, "Understanding the I/O Performance Gap Between Cori KNL and Haswell", Cray User Group Conference 2017 (CUG 2017) [Preprint version] [Presentation]
2016
- Bin Dong, Suren Byna, Kesheng Wu, Prabhat, Hans Johansen, Jeffrey N. Johnson, and Noel Keen, "Data Elevator: Low-contention Data Movement in Hierarchical Storage System", The 23rd annual IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2016. [Preprint version]
- François Tessier, Preeti Malakar, Venkatram Vishwanath, Emmanuel Jeannot, and Florin Isaila, "Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers", 1st Workshop on Optimization of Communication in HPC runtime systems (IEEE COM-HPC16), held in conjunction with ACM/IEEE SuperComputing'16 Conference [Preprint version] [Presentation]