Publications

2020


  • Jingqing Mu, Jerome Soumagne, Suren Byna, Quincey Koziol, Houjun Tang, and Richard Warren, "Interfacing HDF5 with A Scalable Object-centric Storage System on Hierarchical Storage", Journal of Concurrency and Computation: Practice and Experience (DOI: [Link])
  • Houjun Tang, Suren Byna, Bin Dong, and Quincey Koziol, "Parallel Query Service for Object-centric Data Management Systems", The 6th IEEE International Workshop on High-Performance Big Data and Cloud Computing (HPBDC) 2020, in conjunction with IPDPS 2020. [Pre-print Paper]
  • Tirthak Patel, Suren Byna, Glenn K. Lockwood, Nicholas J. Wright, Philip Carns, Rob Ross, and Devesh Tiwari, "Uncovering Access, Reuse, and Sharing Characteristics of I/O-Intensive Files on Large-Scale Production HPC Systems", FAST '20 [Link]
  • Suren Byna, M. Scot Breitenfeld, Bin Dong, Quincey Koziol, Elena Pourmal, Dana Robinson, Jerome Soumagne, Houjun Tang, Venkatram Vishwanath, and Richard Warren, "ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems", Journal of Computer Science and Technology, 2020, 35(1): 145-160. DOI: 10.1007/s11390-020-9822-9 [Paper]

2019


  • Houjun Tang, Suren Byna, Stephen Bailey, Zarija Lukic, Jialin Liu, Quincey Koziol, and Bin Dong, "Tuning Object-centric Data Management Systems for Large Scale Scientific Applications", 26th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) 2019 [Pre-print version]
  • Richard Warren, Jerome Soumagne, Jingqing Mu, Houjun Tang, Suren Byna, Bin Dong, and Quincey Koziol, "Analysis in the Data Path of an Object-centric Data Management System", 26th IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) 2019 [Pre-print version]
  • Houjun Tang, Quincey Koziol, Suren Byna, John Mainzer, and Tonglin Li, "Enabling Transparent Asynchronous I/O using Background Threads", PDSW 2019, in conjunction with SC19. [Pre-print version]
  • Megha Agarwal, Divyansh Singhvi, Preeti Malakar, and Suren Byna, "Active Learning-based Automatic Tuning and Prediction of Parallel I/O Performance", PDSW 2019, in conjunction with SC19. [Pre-print version] [Slides]
  • Jingqing Mu, Jerome Soumagne, Suren Byna, Quincey Koziol, Houjun Tang, and Richard Warren, "Interfacing HDF5 with A Scalable Object-centric Storage System on Hierarchical Storage", Cray User Group (CUG) 2019 [Pre-print version]
  • Babak Behzad, Suren Byna, Prabhat, and Marc Snir, "Optimizing I/O Performance of HPC Applications with Autotuning", ACM Transactions on Parallel Computing (TOPC), Volume 5 Issue 4, March 2019, Article No. 15, doi: 10.1145/3309205 [Link to ACM Digital Library] [Pre-print version]

2018


  • Bin Dong, Teng Wang, Houjun Tang, Quincey Koziol, Kesheng Wu, and Suren Byna, "ARCHIE: Data Analysis Acceleration with Array Caching in Hierarchical Storage", IEEE International Conference on Big Data (IEEE BigData) 2018 [Pre-print version]
  • Suren Byna, Quincey Koziol, Venkatram Vishwanath, Jerome Soumagne, Houjun Tang, Kimmy Mu, Richard Warren, François Tessier, Bin Dong, Teng Wang, and Jialin Liu, "Proactive Data Containers (PDC): An object-centric data store for large-scale computing systems", AGU Fall Meeting 2018 [Session] [Slides]
  • Wei Zhang, Houjun Tang, Suren Byna, and Yong Chen, "DART: Distributed Adaptive Radix Tree for Efficient Affix-based Keyword Search on HPC Systems", The 27th International Conference on Parallel Architectures and Compilation Techniques (PACT'18) [Pre-print version]
  • Jialin Liu, Quincey Koziol, Gregory Butler, Neil Fortner, Mohamad Chaarawi, Houjun Tang, Suren Byna, Glenn Lockwood, Ravi Cheema, Kristy Kallback-Rose, Damian Hazen, and Prabhat, "Evaluation of HPC Application I/O on Object Storage Systems", 3rd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS), 2018 [Pre-print version]
  • Fahim Chowdhury, Jialin Liu, Quincey Koziol, Thorsten Kurth, Steven Farrell, Suren Byna, Prabhat, and Weikuan Yu, "Initial Characterization of I/O in Large-Scale Deep Learning Applications", Work in Progress paper, 3rd Joint International Workshop on Parallel Data Storage and Data Intensive Scalable Computing Systems (PDSW-DISCS), 2018 (held in conjunction with SC18) [Pre-print version]
  • Teng Wang, Suren Byna, Bin Dong, and Houjun Tang, "UniviStor: Integrated Hierarchical and Distributed Storage for HPC", IEEE Cluster 2018, Belfast [Pre-print version]
  • Kimmy Mu, Jerome Soumagne, Houjun Tang, Suren Byna, Quincey Koziol, and Richard Warren, "A Server-managed Transparent Object Storage Abstraction for HPC", IEEE Cluster 2018, Belfast [Pre-print version]
  • Haoyuan Xing, Sofoklis Floratos, Spyros Blanas, Suren Byna, Prabhat, Kesheng Wu, and Paul Brown, "ArrayBridge: Interweaving declarative array processing with imperative high-performance computing", 34th IEEE International Conference on Data Engineering (ICDE) 2018, Paris [Pre-print version]
  • Houjun Tang, Suren Byna, François Tessier, Teng Wang, Bin Dong, Jingqing Mu, Quincey Koziol, Jerome Soumagne, Venkatram Vishwanath, Jialin Liu, and Richard Warren, "Toward Scalable and Asynchronous Object-centric Data Management for HPC", 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2018 [Preprint version]
  • Bharti Wadhwa, Suren Byna, and Ali R. Butt, "Toward Transparent Data Management in Multi-layer Storage Hierarchy for HPC Systems", IEEE International Conference on Cloud Engineering 2018 (IC2E 2018) [Preprint version]

2017


  • Houjun Tang, Suren Byna, Bin Dong, Jialin Liu, and Quincey Koziol, "SoMeta: Scalable Object-centric Metadata Management for High Performance Computing", The IEEE Cluster Conference 2017 [Preprint version]
  • François Tessier, Venkat Vishwanath, and Emmanuel Jeannot, "TAPIOCA: An I/O library for optimized topology-aware data aggregation on large-scale supercomputers", The IEEE Cluster Conference 2017 [Paper]
  • Jialin Liu, Quincey Koziol, Houjun Tang, François Tessier, Wahid Bhimji, Brandon Cook, Brian Austin, Suren Byna, Bhupender Thakur, Glenn Lockwood, Jack Deslippe, and Prabhat, "Understanding the I/O Performance Gap Between Cori KNL and Haswell", Cray User Group Conference 2017 (CUG 2017) [Preprint version] [Presentation]

2016


  • Bin Dong, Suren Byna, Kesheng Wu, Prabhat, Hans Johansen, Jeffrey N. Johnson, and Noel Keen, "Data Elevator: Low-contention Data Movement in Hierarchical Storage System", The 23rd annual IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2016. [Preprint version]
  • François Tessier, Preeti Malakar, Venkatram Vishwanath, Emmanuel Jeannot, and Florin Isaila, "Topology-Aware Data Aggregation for Intensive I/O on Large-Scale Supercomputers", 1st Workshop on Optimization of Communication in HPC runtime systems (IEEE COM-HPC16), held in conjunction with ACM/IEEE SuperComputing'16 Conference [Preprint version] [Presentation]

Presentations

  • Suren Byna, Quincey Koziol, Venkatram Vishwanath, Jerome Soumagne, Houjun Tang, Kimmy Mu, Richard Warren, François Tessier, Bin Dong, Teng Wang, and Jialin Liu, "Proactive Data Containers (PDC): An object-centric data store for large-scale computing systems", AGU Fall Meeting 2018 [Slides]
  • Suren Byna and Quincey Koziol, PDC – Proactive Data Containers for Next Generation Scientific Data Storage [NERSC Data Seminar, July 7, 2017]
  • Suren Byna, Proactive Data Containers for Next Generation HPC Storage [Dagstuhl Seminar, May 17, 2017]