SDM Group Publications

* Recent papers are here http://crd.lbl.gov/departments/data-science-and-technology/sdm/sdm-publications/

* 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000
Books Awards and Honors



2012

  • [AWF+12] Deb Agarwal, Arthur Wiedmer, Boris Faybishenko, Tad Whiteside, James Hunt, Gary Kushner, Alex Romosan, Arie Shoshani, A Methodology for Management Of Heterogeneous Site Characterization and Modeling Data, XIX International Conference on Computational Methods in Water Resources, CMWR 2012.

  • [BCR+12] Surendra Byna, Jerry Chou, Oliver Rubel, Prabhat, Homa Karimabadi, William S. Daughton, Vadim Roytershteyn, E. Wes Bethel, Mark Howison, Ke-Jou Hsu, Kuan-Wu Lin, Arie Shoshani, Andrew Uselton, and Kesheng Wu, Parallel I/O, Analysis, and Visualization of a Trillion Particle Simulation. SuperComputing conference, SC’12, November 2012.

  • [MKH+] Joerg Meyer, Harinarayan Krishnan, Jennifer Horsman, Alexandru Romosan, E. Wes Bethel, Visual Data Analysis as an Integral Part of Environmental Management, to be published in Transactions on Visualization and Computer Graphics (IEEE TVCG, October 2012).

  • [PRB+12] Prabhat, Oliver Rubel, Surendra Byna, Kesheng Wu, Fuyu Li, Michael Wehner and Wes Bethel, TECA: A Parallel Toolkit for Extreme Climate Analysis, International Conference on Computational Science (ICCS), June 2012.

  • [PSW12] Elaheh Pourabbas, Arie Shoshani, and Kesheng Wu, Minimizing Index Size by Reordering Rows and Columns, International Conference on Scientific and Statistical Database Management (SSDBM), June 2012.

  • [SAF+12] Karen L. Schuchardt, Deborah A. Agarwal, Stefan A. Finsterle, Carl W. Gable, Ian Gorton, Luke J. Gosink, Elizabeth H. Keating, Carina S. Lansing, Joerg Meyer, William A.M. Moeglein, George S.H. Pau, Ellen A. Porter, Sumit Purohit, Mark L. Rockhold, Arie Shoshani, Chandrika Sivaramakrishnan, Akuna – Integrated Toolsets Supporting Advanced Subsurface Flow and Transport Simulations for Environmental Management, XIX International Conference on Computational Methods in Water Resources, CMWR 2012.

2011

  • [BPW+11] S. Byna, Prabhat, M. Wehner, and K. Wu, "Detecting Atmospheric Rivers in Large Climate Datasets", 2nd International Workshop on Petascale Data Analytics: Challenges, and Opportunities (PDAC-11), Co-located with Supercomputing 2011

  • [CKP+11] Cummings, Klasky, Podhorszki, Barreto, Lofstead, Schwan, Docan, Parashar, Sim, Shoshani, “EFFIS: An End-to-end Framework for Fusion Integrated Simulation”, PDP 2011

  • [CKR11a] Jerry Chou, Jinoh Kim and Doron Rotem, “Energy-Aware Scheduling in Disk Storage Systems”, in ICDCS 2011

  • [CKR11b] Jerry Chou, Jinoh Kim and Doron Rotem. “Energy Saving Techniques for Disk Storage Systems”, Book chapter in Handbook of Energy-Aware and Green Computing, edited by Ishfaq Ahmad and Sanjay Ranka, 2011.

  • [CWP11] J. Chou, K. Wu, and Prabhat. FastQuery: A General Indexing and Querying System for Scientific Data. To appear in SSDBM 2011.

  • [CWR+11] J. Chou, K. Wu, O. Rubel, M. Howison, J. Qiang, Prabhat, B. Austin, E. W. Bethel, R. D. Ryne, and A. Shoshani. Parallel Index and Query for Large Scale Data Analysis. SuperComputing Conference, 2011.

  • [GKL+11] Junmin Gu, Dimitrios Katramatos, Xin Liu, Vijaya Natarajan, Arie Shoshani, Alex Sim, Dantong Yu, Scott Bradley, Shawn McKee, "StorNet: Co-Scheduling of End-to-End Bandwidth Reservation on Storage and Network Systems for High Performance Data Transfers", Proceedings of IEEE INFOCOM HSN 2011, Shanghai China, 2011.

  • [KAC+11] Jinoh Kim, Hasan Abbasi, Luis Chacon, Ciprian Docan, Scott Klasky, Qing Liu, Norbert Podhorszki, Arie Shoshani, Kesheng Wu, Parallel In Situ Indexing for Data-intensive Computing, IEEE Symposium on Large Data Analysis and Visualization, October 23 - 24, 2011. http://dx.doi.org/10.1109/LDAV.2011.6092319

  • [KCR11] Jinoh Kim, Jerry Chou and Doron Rotem, “Energy Proportionality and Performance in Data Parallel Computing Clusters,” SSDBM 2011.

  • [KR11] Jinoh Kim, Doron Rotem, “Energy proportionality for disk storage using replication,” EDBT 2011: 81-92.

  • [MW11] K. Madduri and K. Wu. Massive-scale RDF Processing Using Compressed Bitmap Indexes. To appear in SSDBM 2011

  • [Sho+11] A. Shoshani, et al., The Scientific Data Management Center: Available Technologies and Highlights, SciDAC 2011 conference.

  • [WS+11] D. Williams, A. Shoshani, et al., Earth System Grid Center for Enabling Technologies (ESG-CET): A Data Infrastructure for Data-Intensive Climate Research, SciDAC 2011 conference.

  • [WSJ+11] K. Wu, R. R. Sinha, C. Jones, S. Ethier, S. Klasky, K.-L. Ma, A. Shoshani and M. Winslett. Finding regions of interest on toroidal meshes. Computational Science & Discovery, Volume 4, page 015003. 2011.

2010

  • [BCS+10] M. Balman, E. Chaniotakis, A. Shoshani, A. Sim. “An Efficient Reservation Algorithm for Advanced Network Provisioning”, ACM/IEEE Supercomputing Conference 2010 (SC10).

  • [CLS+10] Julian Cummings, Jay Lofstead, Karsten Schwan, Alexander Sim, Arie Shoshani, Ciprian Docan, Manish Parashar, Scott Klasky, Norbert Podhorszki, Roselyne Barreto. “EFFIS: An End-to-end Framework for Fusion Integrated Simulation”, Parallel, Distributed, and Network-Based Processing, Euromicro Conference on, pp. 428-434, 2010, 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010

  • [GBC+10] G Garzoglio, J Bester, K Chadwick, D Dykstra, D Groep, J Gu, T Hesselroth, O Koeroo, T Levshina, S Martin, M Salle, N Sharma, F Siebenlist, A Sim, A Verstegen, Adoption of a SAML-XACML Profile for Authorization Interoperability across Grid Middleware in OSG and EGEE, Proceedings of the 18th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2010), 2010

  • [GKL+10a] Junmin Gu, Dimitrios Katramatos, Xin Liu, Vijaya Natarajan, Arie Shoshani, Alex Sim, Dantong Yu, Scott Bradley, Shawn McKee, "StorNet: Co-Scheduling of End-to-End Bandwidth Reservation on Storage and Network Systems for High Performance Data Transfers", Proceedings of IEEE INFOCOM HSN 2011, Shanghai China, 2011

  • [GKL+10b] Junmin Gu, Dimitrios Katramatos, Xin Liu, Vijaya Natarajan, Arie Shoshani, Alex Sim, Dantong Yu, Scott Bradley, Shawn McKee, "StorNet: Integrated Dynamic Storage and Network Resource Provisioning and Management for Automated Data Transfers", Proceedings of the 18th International Conference on Computing in High Energy and Nuclear Physics (CHEP 2010), Taipei Taiwan, 2010

  • [HSW+10] D. Hasenkamp, A. Sim, M. Wehner, K. Wu. Finding Tropical Cyclones on a Cloud Computing Cluster: Using Parallel Virtualization for Large-Scale Climate Simulation Analysis. CloudCom2010. 2010.

  • [HSW+10a] D. Hasenkamp, A. Sim, M. Wehner, K. Wu, Finding Tropical Cyclones on Clouds, Super Computing (New Orleans, LA), 2010. (Third place in ACM Student Research Poster Competition.)

  • [KR10] Jinoh Kim, Doron Rotem, “Using Replication for Energy Conservation in RAID Systems,” PDPTA 2010: 703-709

  • [ORT10] Ekow J. Otoo, Doron Rotem, Shih-Chiang Tsao: “Dynamic Data Reorganization for Energy Savings in Disk Storage Systems”. SSDBM 2010: 322-341

  • [PS10] E. Pourabbas and A. Shoshani, Improving Estimation Accuracy of Aggregate Queries on Data Cubes, Data & Knowledge Engineering 69 (2010) 50–72.

  • [SBW+10] A. Sim, M. Balman, D. Williams, A. Shoshani, and V. Natarajan, Adaptive Transfer Adjustment in Efficient Bulk Data Transfer Management for Climate Datasets, Parallel and Distributed Computing and Systems (PDCS2010), Nov. 2010.

  • [SGN+10] A. Sim, D. Gunter, V. Natarajan, A. Shoshani, D. Williams, J. Long, J. Hick, J. Lee, E. Dart, “Efficient Bulk Data Replication for the Earth System Grid”, LBNL-3821E, 2010

  • [SGN+10] A. Sim, D. Gunter, V. Natarajan, A. Shoshani, D. Williams, J. Long, J. Hick, J. Lee, E. Dart. “Efficient Bulk Data Replication for the Earth System Grid”, Proceedings of International Symposium on Grid Computing (ISGC), 2010

  • [SKR10] Arie Shoshani, Scott Klasky, Rob Ross, Scientific Data Management: Challenges and Approaches in the Extreme Scale Era, SciDAC Conference, 2010

  • [WMC10] K. Wu, K. Madduri, S. Canon. Multi-Level Bitmap Indexes for Flash Memory Storage. IDEAS'10. 2010.

  • [WSS10] K. Wu, A. Shoshani, and K. Stockinger. Analyses of Multi-Level and Multi-Component Compressed Bitmap Indexes. ACM TODS v35, Article 2, 2010

  • [YBS+10] I. Yamazaki, Z. Bai, H. Simon, L.-W. Wang, and K. Wu. 2010. Adaptive Projection Subspace Dimension for the Thick-Restart Lanczos Method. ACM Trans. Math. Softw. 37, 3, Article 27 (September 2010).

2009

  • [BJA+09] E. W. Bethel, C. Johnson, S. Ahern, K. Wu, et al., Occam's Razor and Petascale Visual Data Analysis. SciDAC 2009.

  • [BRK+09] E. Wes Bethel, Oliver Rubel, Prabhat, Kesheng Wu, et al, “Modern Scientific Visualization is More than Just Pretty Pictures.” 2009. LBNL-1450E

  • [GCE+09] C. G. R. Geddes, E Cormier-Michel, K. Wu, et al., Large Fields for Smaller Facility Sources. SciDAC Review, Number 13, Summer 2009.

  • [GWB+09] Luke J. Gosink, Kesheng Wu, E. Wes Bethel, John D. Owens, Kenneth I. Joy. “Data Parallel Bin-Based Indexing for Answering Queries on Multi-core Architectures.” SSDBM 2009: 110-129.

  • [HYS+09] Lifeng He, Yuyan Chao, Kenji Suzuki, Kesheng Wu. Fast connected-component labeling. Pattern Recognition 42(9): 1977-1987 (2009). DOI 10.1016/j.patcog.2008.10.013.

  • [MB09] K. Madduri and D.A. Bader, “Compact Graph Representations and Parallel Connectivity Algorithms for Massive Dynamic Network Analysis”, The 23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2009), Rome, Italy, May 25-29, 2009.

  • [ORT09a] Ekow Otoo, Doron Rotem and Shih-Chiang Tsao, Analysis of Trade-Off Between Power Saving and Response Time in Disk Storage Systems, The Fifth Workshop on High-Performance, Power-Aware Computing - May 2, 2009, Rome, Italy.

  • [ORT09b] Ekow J. Otoo, Doron Rotem, and Shih-Chiang Tsao. “Energy smart management of scientific data.” 21st Int'l. Conf. on Sc. and Stat. Database Management (SSDBM’2009), New Orleans, Louisiana, USA, Jun. 2009. LBNL Tech. Report No LBNL-2185E.

  • [ORT09c] Ekow J. Otoo, Doron Rotem, and Shih-Chiang Tsao. “Workload-adaptive management of energy-smart disk storage systems.” IASDS09: Workshop on Interfaces and Architectures for Scientific Data Storage, New Orleans, Louisiana, USA, Sept. 2009.

  • [OW09] Ekow J. Otoo and Kesheng Wu. “Accelerating queries on very large datasets”, Book Chapter in forthcoming book entitled Scientific Data Management: Challenges, Technology, and Deployment, Editors Arie Shoshani and Doron Rotem (Chapman & Hall/CRC Computational Science, 2009)

  • [RLS+09] Morris Riedel, Erwin Laure, Th. Soddemann, Alex Sim, Vijaya Natarajan, Arie Shoshani, Junmin Gu, et al., “Interoperation of world-wide production e-Science infrastructures”, Concurrency and Computation: Practice and Experience 21(8): 961-990 (2009)

  • [RPW+09a] Oliver Rübel, Prabhat, Kesheng Wu, Hank Childs, Jeremy Meredith, Cameron G. R. Geddes, Estelle Cormier-Michel, Sean Ahern, Gunther H. Weber, Peter Messmer, Hans Hagen, Bernd Hamann, E. Wes Bethel. “High performance multivariate visual data exploration for extremely large data”. SC 2008: 51. LBNL-716E.

  • [RPW+09b] Oliver Rubel, Prabhat, Kesheng Wu, Hank Childs, Jeremy Meredith, Cameron G.R. Geddes, Estelle Cormier-Michel, Sean Ahern, Gunther H. Weber, Peter Messmer, Hans Hagen, Bernd Hamann and E. Wes Bethel. Application of High-performance Visual Analysis Methods to Laser Wakefield Particle Acceleration Data. IEEE Visualization 2008. LBNL-952E.

  • [SBI+09] Per Svensson, Peter Boncz, Milena Ivanova, Martin Kersten, Niels Nes, Doron Rotem, “Emerging vertical database systems in support of scientific data”, Book Chapter in forthcoming book entitled Scientific Data Management: Challenges, Technology, and Deployment, Editors Arie Shoshani and Doron Rotem (Chapman & Hall/CRC Computational Science, 2009)

  • [SDG+09] Arie Shoshani, Flavia Donno, Junmin Gu, Jason Hick, Maarten Litmaath, Alex Sim, “Dynamic Storage Management”, Book Chapter in forthcoming book entitled Scientific Data Management: Challenges, Technology, and Deployment, Editors Arie Shoshani and Doron Rotem (Chapman & Hall/CRC Computational Science, 2009)

  • [SR09] Arie Shoshani and Doron Rotem, editors, Scientific Data Management: Challenges, Technology, and Deployment (book), Chapman & Hall/CRC Computational Science, Dec. 2009

  • [SWW09] Rishi Rakesh Sinha, Marianne Winslett, Kesheng Wu. Finding Regions of Interest in Large Scientific Datasets. SSDBM 2009: 130-147.

  • [WABS+09] D. N. Williams, R. Ananthakrishnan, D. E. Bernholdt, A. Shoshani, A. Sim, et al., “The Earth System Grid: Enabling Access to Multimodel Climate Simulation Data”, American Meteorological Society, Vol. 90, No. 2, 195-205, 2009

  • [WABG+09] Kesheng Wu, Sean Ahern, E. Wes Bethel, Junmin Gu, Ekow Otoo, Arie Shoshani, Alexander Sim, et al., FastBit: Interactively Searching Massive Data. SciDAC 2009. LBNL-2164E.

  • [WABC+09] Kesheng Wu, Sean Ahern, E. Wes Bethel, Jacqueline Chen, Hank Childs, Estelle Cormier-Michel, Cameron Geddes, Junmin Gu, Hans Hagen, Bernd Hamann, Wendy Koegler, Jerome Lauret, Jeremy Meredith, Peter Messmer, Ekow Otoo, Victor Perevoztchikov, Arthur Poskanzer, Prabhat, Oliver Rubel, Arie Shoshani, Alexander Sim, Kurt Stockinger, Gunther Weber, and Wei-Ming Zhang. FastBit: Interactively Searching Massive Data. In Proc. SciDAC 2009

  • [WSS09a] Kesheng Wu, Kurt Stockinger, and Arie Shoshani. Breaking the Curse of Cardinality on Bitmap Indexes. SSDBM 2008. Tech Report LBNL-173E. Submitted for publication.

  • [WSS09b] Kesheng Wu, Kurt Stockinger and Arie Shoshani. Analyses of Multi-Level and Multi-Component Compressed Bitmap Indexes. LBNL Tech Report LBNL-60891. 2009. in review.

  • [YBS+09] Ichitaro Yamazaki, Zhaojun Bai, Horst Simon, Lin-Wang Wang and Kesheng Wu. Adaptive Projection Subspace Dimension for the Thick-Restart Lanczos Method. 2008. LBNL-1059E. In review.

2008

  • [BDF+08] W. Betts, L. Didenko, T. Freeman, P. Jakl, L. Hajdu, E. Hjort, K. Keahey, J. Lauret, D. Olson, A. Rose, I. Sakrejda, A. Sim, “STAR Grid Activities, OSG and Beyond”, International Symposium on Grid Computing (ISGC), 2008

  • [CKC+08] C S Chang, S Klasky, J Cummings, R. Samtaney, A Shoshani, A Sim, et al., Toward a first-principles integrated simulation of tokamak edge plasmas, Journal of Physics: Conference Series 125 (2008) 012042.

  • [NVW+08] Meiyappan Nagappan, Mladen Vouk, Kesheng Wu, Alex Sim and Arie Shoshani, “Efficient Operational Profiling of Systems using Suffix Arrays on Execution Logs”, Proceedings of The 19th International Symposium on Software Reliability Engineering (ISSRE), 2008

  • [PS08] Elaheh Pourabbas, Arie Shoshani, Improving Estimation Accuracy of Aggregate Queries on Data Cubes, ACM Eleventh International Workshop on Data Warehousing and OLAP (DOLAP), October 2008. LBNL Tech Report No LBNL-72709.

  • [SCW+08] Kurt Stockinger, John Cieslewicz, Kesheng Wu, Doron Rotem, Arie Shoshani. Using Bitmap Indexing Technology for Combined Numerical and Text Queries. New Trends in Data Warehousing and Data Analysis, Annals of Information Systems. Vol 3. Pages 1-23. 2008. LBNL Tech. Report LBNL-61768. 2006.

  • [SS08] A. Sim, A. Shoshani (Editors) The Storage Resource Manager Interface Specification Version 2.2. Open Grid Forum, GFD.129, Feb. 2008.

  • [SWW+08] Rishi Rakesh Sinha, Marianne Winslett, Kesheng Wu, Kurt Stockinger, Arie Shoshani: Adaptive Bitmap Indexes for Space-Constrained Systems. Proceedings of the 24th International Conference on Data Engineering (ICDE) 2008, pp. 1418-1420.

  • [WOS08] Kesheng Wu, Ekow J. Otoo, and Kenji Suzuki. Optimizing two-pass connected component labeling algorithms. To appear in Pattern Analysis and Applications. LBNL Tech Report No LBNL-59102.

  • [WSS08a] Kesheng Wu, Kurt Stockinger and Arie Shoshani. Breaking Curse of Cardinality on Bitmap Indexes. 20th International Conference on Scientific and Statistical Database Management, (SSDBM) 2008. Tech Report LBNL-173E.

  • [WAB+08] D. N. Williams, R. Ananthakrishnan, D. E. Bernholdt, A. Shoshani, A. Sim, et al., Data Management and Analysis for the Earth System Grid, SciDAC Conference 2008, Seattle.

2007

  • [ABB+07] Lana Abadie, Paolo Badino, Jean-Philippe Baud, Ezio Corso, Shaun De Witt, Patrick Fuhrmann, Junmin Gu, Birger Koblitz, Sophie Lemaitre, Maarten Litmaath, Dmitry Litvintsev, Giuseppe Lo Presti, Luca Magnoni, Gavin McCance, Tigran Mkrtchan, Rémi Mollon, Vijaya Natarajan, Timur Perelmutov, Don Petravick, Arie Shoshani, Alex Sim, David Smith, Paolo Tedesco, Riccardo Zappi, Storage Resource Manager version 2.2: design, implementation, and testing experience, Proceedings of Computing in High Energy Physics (CHEP) 2007.

  • [ABB+07a] R Ananthakrishnan, D E Bernholdt, S Bharathi, D Brown, M Chen, A L Chervenak, L Cinquini, R Drach, I T Foster, P Fox, D Fraser, K Halliday, S Hankin, P Jones, C Kesselman, D E Middleton, J Schwidder, R Schweitzer, R Schuler, A Shoshani, F Siebenlist, A Sim, W G Strand, N Wilhelmi, M Su, D N Williams, Building a global federation system for climate change research: the earth system grid center for enabling technologies (ESG-CET), 2007 J. Phys.: Conf. Ser. 78 012050 (7pp), http://www.iop.org/EJ/article/1742-6596/78/1/012050/jpconf7_78_012050.pdf

  • [BBB+07] David E. Bernholdt, Shishir Bharathi, David Brown, Kasidit Chanchio, Meili Chen, Ann L. Chervenak, Luca Cinquini, Bob Drach, Ian T. Foster, Peter Fox, Jose Garcia, Carl Kesselman, Rob S. Markel, Don Middleton, Veronika Nefedova, Line Pouchard, Arie Shoshani, Alex Sim, Gary Strand, Dean Williams: The Earth System Grid: Supporting the Next Generation of Climate Modeling Research, The Computing Research Repository (CoRR) abs/0712.2262: (2007)

  • [JLH+07] Pavel Jakl, Jerome Lauret, Andrew Hanushevsky, Arie Shoshani, Alex Sim, Junmin Gu, Grid data storage on widely distributed worker nodes using Scalla and SRM, Proceedings of Computing in High Energy and Nuclear Physics (CHEP) 2007.

  • [OOW07] Elizabeth O'Neil, Patrick O'Neil and Kesheng Wu. Bitmap Index Design Choices and Their Performance Implications. Eleventh International Database Engineering & Applications Symposium (IDEAS) 2007.

  • [OR07] Ekow J. Otoo and Doron Rotem. Parallel storage and access of out-of-core extendible arrays. In Cluster Computing, Austin, Texas, Sept 2007.

  • [ORS07a] Ekow J. Otoo, Doron Rotem, and Sridhar Seshadri. Optimal chunking of large multidimensional arrays for data warehousing. DOLAP, Lisbon, Portugal, Nov 2007.

  • [PS07] Elaheh Pourabbas, Arie Shoshani, "Efficient Estimation of Joint Queries from Multiple OLAP Databases", ACM Transactions on Database Systems (TODS), March 2007.

  • [RSW+07] Frederick Reiss, Kurt Stockinger, Kesheng Wu, Arie Shoshani, Joseph M. Hellerstein. Enabling Real-Time Querying of Live and Historical Stream Data. International Conference on Scientific and Statistical Database Management (SSDBM) 2007.

  • [SAC+07] Arie Shoshani, Ilkay Altintas, Alok Choudhary, Terence Critchlow, Chandrika Kamath, Bertram Ludäscher, Jarek Nieplocha, Steve Parker, Rob Ross, Nagiza Samatova, Mladen Vouk, “SDM Center Technologies for Accelerating Scientific Discoveries,” SciDAC 2007 Proceedings Dec 2006, in Journal of Physics, Conference Series, Vol. 78, paper #012068, 5 pages.

  • [Sho+07] A. Shoshani, et al, Storage Resource Managers: Recent International Experience on Requirements and Multiple Co-Operating Implementations, 24th IEEE Conference on Mass Storage Systems and Technologies (MSST 2007), September 2007, San Diego, California, USA. IEEE Computer Society 2007.

  • [Sho+07b] Arie Shoshani, et al, Scientific Data Management: Essential Technology for Accelerating Scientific Discoveries, CTWatch Quarterly, Volume 3, Number 4, November 2007.

  • [SW07] Kurt Stockinger, Kesheng Wu. Bitmap Indices for Data Warehouses. In Wrembel R., Koncilia Ch.: Data Warehouses and OLAP: Concepts, Architectures and Solutions. Idea Group, Inc.

2006

  • [ABC+06] Ilkay Altintas, Oscar Barney, Zhengang Cheng, Terence Critchlow, Bertram Ludaescher, Steve Parker, Arie Shoshani and Mladen Vouk, "Accelerating the scientific exploration process with scientific workflows," SciDAC 2006, Journal of Physics: Conference Series 46 (2006), 468-478.

  • [BCD+06] E. Wes Bethel, Scott Campbell, Eli Dart, Kurt Stockinger, Kesheng Wu. Accelerating Network Traffic Analysis Using Query-Driven Visualization. In Symposium on Visual Analytics Science and Technology (VAST) 2006, IEEE Computer Society Press.

  • [GSS+06] Luke Gosink, John Shalf, Kurt Stockinger, Kesheng Wu, Wes Bethel, HDF5-FastQuery: Accelerating Complex Queries on HDF Datasets using Fast Bitmap Indices, International Conference on Scientific and Statistical Database Management (SSDBM 2006), IEEE Computer Society Press.

  • [HHL+06] E. Hjort, L. Hajdu, J. Lauret, D. Olson, A. Sim, A. Shoshani, Data and Computational Grid Coupling in RHIC/STAR – An Analysis Scenario using SRM Technology, Proceedings of Computing in High Energy Physics (CHEP) 2006.

  • [JLH+06] P. Jakl, J. Lauret, A. Hanushevsky, A. Shoshani, A. Sim, From rootd to Xrootd, from physical to logical files: experience on accessing and managing distributed data, Proceedings of Computing in High Energy Physics (CHEP) 2006.

  • [Oto06] Ekow J. Otoo, Parallel and Distributed Access of Dense Multidimensional Extendible Array Files, 7th Workshop on Distributed Data Structures (WDAS’06), Santa Clara, California, 2006.

  • [OR06a] Ekow J. Otoo and Doron Rotem. Efficient storage allocation of large-scale extendible multi-dimensional scientific datasets. In Proc. 18th Int’l. Conf. Scientific and Statistical Database Management (SSDBM’06), Vienna, Austria, Jul. 3 - 5 2006.

  • [OR06b] Ekow J. Otoo and Doron Rotem. A storage scheme for multi-dimensional databases using extendible array files. In Proc. 3rd Workshop on Spatio Temporal Database Management (STDBM’06), in conjunction with VLDB’2006, Seoul, Korea, Sept. 11 2006.

  • [ORS06] Ekow J. Otoo, Doron Rotem, and Sridhar Seshadri. Analysis of chunking of large multidimensional arrays. LBNL Technical report, July 2006.

  • [OSH01] E. J. Otoo, A. Shoshani, and S.W. Hwang. Clustering high dimensional massive scientific datasets. J. Intelligent Info Syst., 17(2/3):147 – 168, 2006, Sept. 11 2006.

  • [OWR06] Ekow J. Otoo, Kesheng Wu and Doron Rotem, HDP-Trie: A high dimensional index scheme based on PATRICIA trie, LBNL Tech. Report Number LBNL-579203, February 2006.

  • [PS06] Elaheh Pourabbas, Arie Shoshani: The Composite OLAP-Object Data Model: Removing an Unnecessary Barrier. International Conference on Scientific and Statistical Database Management (SSDBM) 2006, 291-300.

  • [RSW06] Doron Rotem, Kurt Stockinger, Kesheng Wu, Minimizing I/O Costs of Multi-Dimensional Queries with Bitmap Indices, International Conference on Scientific and Statistical Database Management (SSDBM) 2006, IEEE Computer Society Press.

  • [SBC+06] K. Stockinger, E. W. Bethel, S. Campbell, E. Dart, K. Wu. Detecting Distributed Scans Using High-Performance Query-Driven Visualization. SuperComputing 2006 (SC06).

  • [SBG+06] N. F. Samatova, M. Branstetter, A. R. Ganguly, R. Hettich, S. Khan, G. Kora, J. Li, X. Ma, C. Pan, and A. Shoshani. High Performance Statistical Computing With Parallel R: Applications to Biology and Climate Modelling. Journal of Physics, 46:505-509, 2006.

  • [SRS+06] Kurt Stockinger, Doron Rotem, Arie Shoshani, Kesheng Wu, Analyzing Enron Data: Bitmap Indexing Outperforms MySQL Queries by Several Orders of Magnitude, Technical Report, LBNL-59437, Berkeley, California, January 2006.

  • [SSS06] A. Shoshani, A. Sim, K. Stockinger. RRS: Replica Registration Service for Data Grids, Lecture Notes in Computer Science, Edited by Jean-Marc Pierson, Springer-Verlag GmbH Publisher, Volume 3836, Dec 2006, Pages 100-112.

  • [SW06] Kurt Stockinger, Kesheng Wu, Bitmap Indices for Data Warehouses, to appear in Wrembel R., Koncilia Ch.: Data Warehouses and OLAP: Concepts, Architectures and Solutions. Idea Group, Inc. 2006.

  • [SWB+06] Kurt Stockinger, Kesheng Wu, Rene Brun, Philippe Canal, Bitmap Indices for Fast End-User Physics Analysis in ROOT, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Elsevier.

  • [VAB+06] M. A. Vouk, I. Altintas, R. Barreto, J. Blondin, Z. Cheng, T. Critchlow, A. Khan, S. Klasky, J. Ligon, B. Ludaescher, P. A. Mouallem, S. Parker, N. Podhorszki, A. Shoshani, C. Silva, "Automation of Network-Based Scientific Workflows," Proc. of the IFIP WoCo 9 on Grid-based Problem Solving Environments: Implications for Development and Deployment of Numerical Software, IFIP WG 2.5 on Numerical Software, Prescott, AZ, 2006, printed in IFIP, Vol 239, "Grid-Based Problem Solving Environments", eds. Gaffney PW and Pool JCT (Boston: Springer), pp. 35-61, 2007

  • [WOS06] Kesheng Wu, Ekow Otoo, and Arie Shoshani. An Efficient Compression Scheme for Bitmap Indices. ACM Transactions on Database Systems (TODS), March 2006.

2005

  • [BBC05] D. Bernholdt, S. Bharathi, D. Brown, K. Chanchio, M. Chen, A. Chervenak, L. Cinquini, B. Drach, I. Foster, P. Fox, J. Garcia, C. Kesselman, R. Markel, D. Middleton, V. Nefedova, L. Pouchard, A. Shoshani, A. Sim, G. Strand, D. Williams. The Earth System Grid: Supporting the Next Generation of Climate Modeling Research. Proceedings of the IEEE, Vol. 93, No. 3, 485-495, 2005.

  • [BRR+05] John Bent, Doron Rotem, Alexandru Romosan and Arie Shoshani, Coordination of Data Movement with Computation Scheduling on a Cluster, Int'l. Workshop on Challenges of Large Applications in Distrib. Environments (CLADE), Research Triangle Park, NC, July 24, 2005.

  • [KBB+05] S. Klasky, M. Beck, V. Bhat, E. Feibush, B. Ludaescher, M. Parashar, A. Shoshani, D. Silver, and M. Vouk, Data Management on the Fusion Computational Pipeline, Journal of Physics: Conference Series 16 (2005), 510-520

  • [ORR+05] Ekow J. Otoo, Doron Rotem, Alexandru Romosan, Sridhar Seshadri: File Caching in Data Intensive Scientific Applications on Data-Grids. Workshop on Data Management in Grids (DMG) 2005: 85-99

  • [ORS05] Ekow J. Otoo, Doron Rotem, and Arie Shoshani. Impact of admission and cache replacement policies on response times of jobs on data grids. Cluster Computing: The Journal of Networks, Software Tools and Applications, 2005, 293–303.

  • [RRS+05] Alexandru Romosan, Doron Rotem, Arie Shoshani and Derek Wright, Co-Scheduling of Computation and Data on Computer Clusters, International Conference on Scientific and Statistical Data Base Management (SSDBM) 2005, IEEE Computer Society Press.

  • [RSW05a] Doron Rotem, Kurt Stockinger, Kesheng Wu, Optimizing Candidate Check Costs for Bitmap Indices, Conference on Information and Knowledge Management (CIKM 2005), Bremen, Germany, November 2005, ACM Press.

  • [RSW05b] Doron Rotem, Kurt Stockinger, Kesheng Wu: Optimizing I/O Costs of Multi-dimensional Queries Using Bitmap Indices. International Conference on Database and Expert Systems Applications (DEXA) 2005: 220-229.

  • [Sho05] Efficient Indexing Technology for Data Mining of Scientific Data, Keynote Talk, The Fifth IEEE International Conference on Data Mining, Houston, Texas, 27 - 30 November 2005.

  • [SSB+05] Kurt Stockinger, John Shalf, Wes Bethel, and Kesheng Wu, DEX: Increasing the Capability of Scientific Data Analysis Pipelines by Using Efficient Bitmap Indices to Accelerate Scientific Visualization, International Conference on Scientific and Statistical Database Management (SSDBM 2005), IEEE Computer Society Press.

  • [SSS05] Arie Shoshani, Alex Sim, Kurt Stockinger, RRS: Replica Registration Service for Data Grids, VLDB Workshop on Data Management in Grids, Trondheim, Norway, September 2005, Springer Verlag.

  • [SSW+05] Kurt Stockinger, John Shalf, Kesheng Wu, Wes Bethel, Query-Driven Visualization of Large Data Sets, IEEE Visualization, Minneapolis, Minnesota, USA, October 2005, IEEE Computer Society Press.

  • [SWC+05] Stockinger, K., Wu K., Campbell, S., Lau, S., Fisk, M., Gavrilov, E., Kent, A., Davis, C.E., Olinger, R., Young, R., Prewett, J., Weber, P., Caudell, T.P., Bethel, E.W., Smith, S., Network Traffic Analysis With Query Driven Visualization - SuperComputing 2005 HPC Analytics Challenge, Supercomputing 2005 (Received Honorable Mention).

  • [Wu05a] Kesheng Wu, FastBit: An Efficient Indexing Technology For Accelerating Data-Intensive Science. In Proceedings of SciDAC 2005, San Francisco, CA, USA.

  • [Wu05b] Kesheng Wu. FastBit: an efficient indexing technology for accelerating data-intensive science, J. Phys.: Conf. Ser. 16 556-560

  • [WGL+05] Kesheng Wu, Junmin Gu, Jerome Lauret, Arthur M. Poskanzer, Arie Shoshani, Alexander Sim, and Wei-Ming Zhang, Grid Collector: Facilitating Efficient Selective Access from Data Grids. In Proceedings of International Supercomputer Conference (ISC) 2005, Heidelberg, Germany. (Best Paper Award).

  • [WO05] Kesheng Wu and Ekow Otoo. A Simpler Proof of the Average Case Complexity of Union-Find with Path Compression. 2005. LBNL-57527

  • [WOS05] K. Wu, E. Otoo and A. Shoshani, Optimizing Connected Component Labeling Algorithms, In Proceedings of SPIE Medical Imaging Conference 2005.

2004

  • [KSA+04] William T. C. Kramer, Arie Shoshani, Deborah A. Agarwal, Brent R. Draney, Guojun Jin, Gregory F. Butler, John A. Hules: Deep scientific computing requires deep data. IBM Journal of Research and Development 48(2): 209-232 (2004)

  • [OR04] Ekow J. Otoo and Doron Rotem, Disk Caching with File Dependencies, Submitted to the 13th IEEE Int’l. Symp. on High-Performance Distributed Computing, HPDC’2004.

  • [ORR04] Ekow Otoo, Doron Rotem, Alex Romosan, Optimal File-Bundle Caching Algorithms for Data-Grids, SuperComputing 2004, Pittsburgh, Nov. 6-12, 2004

  • [ORS04] Ekow J. Otoo, Doron Rotem, Sridhar Seshadri: Efficient Algorithms for Multi-file Caching. DEXA 2004: 707-719

  • [RSW04] Doron Rotem, Kurt Stockinger, and Kesheng Wu, "Efficient binning for bitmap indices on high-cardinality attributes" (November 17, 2004). Lawrence Berkeley National Laboratory. Paper LBNL-56936. http://repositories.cdlib.org/lbnl/LBNL-56936

  • [SGS+04] Alex Sim, Junmin Gu, Arie Shoshani, Vijaya Natarajan, DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks, Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), Greece.

  • [SW04] Kurt Stockinger, Kesheng Wu, Improved Searching for Spatial Features in Spatio-Temporal Data, Technical Report, LBNL-56376, Berkeley, California, September 2004.

  • [SWS04] Kurt Stockinger, Kesheng Wu, Arie Shoshani, Evaluation Strategies for Bitmap Indices with Binning, International Conference on Database and Expert Systems Applications (DEXA), Zaragoza, Spain, September 2004, Springer-Verlag.

  • [WOS04a] Kesheng Wu, Ekow Otoo, and Arie Shoshani. An Efficient Compression Scheme For Bitmap Indices, Tech Report LBNL-49626.

  • [WOS04b] Kesheng Wu, Ekow Otoo, and Arie Shoshani. On the Performance of Bitmap Indices for High Cardinality Attributes. VLDB 2004, pages 24 - 35.

  • [WZP+04] Kesheng Wu, Wei-Ming Zhang, Victor Perevoztchikov, Jerome Lauret and Arie Shoshani. The Grid Collector: Using an Event Catalog to Speedup User Analysis in Distributed Environment, In Proceedings of Computing in High Energy and Nuclear Physics (CHEP) 2004.

2003

  • [ABB+03] I. Altintas, S. Bhagwanani, D. Buttler, S. Chandra, Z. Cheng, M. Coleman, T. Critchlow, A. Gupta, W. Han, L. Liu, B. Ludaescher, C. Pu, R. Moore, A. Shoshani, M.A. Vouk, "A Modeling and Execution Environment for Distributed Scientific Workflows," Proc. 15th IEEE International Conference on Scientific and Statistical Database Management (SSDBM 2003), Cambridge, Massachusetts, 2003, pp. 247-250

  • [CDK+03] Ann L. Chervenak, Ewa Deelman, Carl Kesselman, William E. Allcock, Ian T. Foster, Veronika Nefedova, Jason Lee, Alex Sim, Arie Shoshani, Bob Drach, Dean Williams, Don Middleton: High-performance remote access to climate simulation data: a challenge problem for data grid technologies, Parallel Computing 29(10): 1335-1356 (2003)

  • [OLR03] Frank Olken, Doron Rotem, "Workflow Execution History Data Management: A Framework", Proceedings of the International Conference on Web Services (ICWS) 2003, pp. 55-61.

  • [ORS03] Ekow J. Otoo, Doron Rotem, and Arie Shoshani, Impact of Admission and Cache Replacement Policies on Response Times of Jobs on Data Grids, Int'l. Workshop on Challenges of Large Applications in Distrib. Environments (CLADE) 2003.

  • [PCD+03] Line Pouchard, Luca Cinquini, Bob Drach, Don Middleton, David E. Bernholdt, Kasidit Chanchio, Ian T. Foster, Veronika Nefedova, David Brown, Peter Fox, Jose Garcia, Gary Strand, Dean Williams, Ann L. Chervenak, Carl Kesselman, Arie Shoshani, Alex Sim: An Ontology for Scientific Information in a Grid Environment: the Earth System Grid. CCGRID 2003: 626-632

  • [PS03] Elaheh Pourabbas, Arie Shoshani: Answering Joint Queries from Multiple Aggregate OLAP Databases. Data Warehousing and Knowledge Discovery (DaWaK) 2003: 24-34

  • [Sho03] A. Shoshani, Multidimensionality in Statistical, OLAP, and Scientific Databases, chapter in book: Multidimensional Databases: Problems and Solutions, Edited by Maurizio Rafanelli, Idea Group Publishing, 2003

  • [OS03] Otoo, E. J. and Shoshani, A. Accurate modeling of cache replacement policies in a data grid. 11th NASA Goddard Conf. on Mass Storage Syst. and Tech. / 20th IEEE Symposium on Mass Storage Systems, San Diego, California, April 2003.

  • [SSG03] A. Shoshani, A. Sim, J. Gu, Storage Resource Managers: Essential Components for Data Grids, Chapter in the book “Resource Management in Grids”, Kluwer Publishers, Fall, 2003, pp. 329-348.

  • [WKC+03] K. Wu, W. Koegler, J. Chen and A. Shoshani. Using Bitmap Index for Interactive Exploration of Large Datasets, Submitted to the International Conference on Scientific and Statistical Database Management, 2003 (SSDBM’03).

  • [WZS+03] K. Wu, W. Zhang, A. Sim, J. Gu and A. Shoshani. Grid Collector: an event catalog with automated file management. IEEE Nuclear Science Symposium (NSS) 2003.

2002

  • [PS02] Elaheh Pourabbas, Arie Shoshani: Joint Queries Estimation from Multiple OLAP Databases. In Proceedings of the International Conference on Scientific and Statistical Database Management, 2002 (SSDBM’02).

  • [OOS02] Otoo, E. J., Olken, F. and Shoshani, A. Disk cache replacement algorithm for storage resource managers in data grids. The 15th Annual Supercomputer Conf., SC2002. Baltimore, Maryland, Nov. 2002.

  • [SSG02] Arie Shoshani, Alex Sim, Junmin Gu, Storage Resource Managers: Middleware Components for Grid Storage, Nineteenth IEEE Symposium on Mass Storage Systems, 2002 (MSS '02).

  • [SWS02] K. Stockinger, K. Wu and A. Shoshani. Strategies for Processing ad hoc Queries on Large Data Warehouses. In Proceedings of the International Workshop on Data Warehousing and OLAP, 2002 (DOLAP'02).

  • [WOS02a] Kesheng Wu, Ekow J. Otoo, and Arie Shoshani, "A compression scheme of bitmap indexes", LBNL report LBNL-49626, 2002.

  • [WOS02b] K. Wu, E. J. Otoo and A. Shoshani. Compressing Bitmap Indexes for Faster Search Operations. In Proceedings of the International Conference on Scientific and Statistical Database Management, 2002 (SSDBM’02), Pages 99-108.

2001

  • [ACD01] B. Allcock, A. Chervenak, E. Deelman, B. Drach, I. Foster, C. Kesselman, J. Lee, V. Nefedova, A. Sim, A. Shoshani, D. Williams, "High-Performance Remote Access to Climate Simulation Data: A Challenge Problem for Data Grid Technologies", SuperComputing Conference, 2001 (SC ’01).

  • [BGM+01] L. Bernardo, B. Gibbard, D. Malon, H. Nordberg, D. Olson, R. Porter, A. Shoshani, A. Sim, A. Vaniachine, T. Wenaus, K. Wu, D. Zimmerman, “New Capabilities in the HENP Grand Challenge Storage Access System and its Application at RHIC”, the Journal of Computer Physics Communications, 2001 (CPC '01).

  • [GPS01] Junmin Gu, Torben Bach Pedersen, Arie Shoshani, “Performance Results on the Federation of an OLAP Database with an Object Database”, The 5th World Multi-Conference on Systemics, Cybernetics and Informatics, 2001, (SCI ’01).

  • [KS01] Jinbaek Kim, Arie Shoshani, “Simulation Analysis of the Optimal Storage Resource Allocation for Large HENP Databases”, CHEP 2001 Conference, China, December 2001.

  • [OOH+01] E. Otoo, D. Olson, E. Hjort, J. Lauret, M. Messer, A. Shoshani, A. Sim, “Non-shared disk cluster – a fault tolerant, commodity approach to high-bandwidth data analysis”, Proceedings of Computing in High Energy Physics (CHEP) 2001.

  • [HOS+01] E. Hjort, D. Olson, A. Sim, J. Yang, J. Lauret, M. Messer, “Data Grid Services in STAR, Initial Deployment: Site-to-Site File Replication”, Proceedings of Computing in High Energy Physics (CHEP) 2001.

  • [OSW01] Otoo, E. J., Shoshani, A. and Hwang, S-W., “Clustering High Dimensional Massive Scientific Datasets”, 13th Int’l. Conf. on Scientific and Statistical Database Management, 2001 (SSDBM ’01). Also selected for publication and published in Journal of Intelligent Information Systems, 17:2/3, 2001, 147-168.

  • [SNB+01] Alex Sim, Henrik Nordberg, Luis Bernardo, Arie Shoshani, Doron Rotem, Experience with Using CORBA to Implement a File Caching Coordination System, Journal of Concurrency: Practice and Experience, 2001; 13:1-15.

  • [WON01] Kesheng Wu, Ekow Otoo, Arie Shoshani and Henrik Nordberg. Notes on Design and Implementation of Compressed Bit Vectors. Tech report LBNL/PUB-3161. 2001.

  • [WOS01] K. Wu, E. J. Otoo and A. Shoshani, A Performance Comparison of Bitmap Indexes. In Proceedings of the Conference on Information and Knowledge Management, 2001 (CIKM’01), pages 559-561.

2000

  • [BSS+00] L. M. Bernardo, A. Shoshani, A. Sim, H. Nordberg, Access Coordination of Tertiary Storage for High Energy Physics Application, 17th IEEE Symposium on Mass Storage Systems (MSS) 2000.

  • [GPS00] Junmin Gu, Torben Bach Pedersen, Arie Shoshani, OLAP++: Powerful and Easy-to-Use Federations of OLAP and Object Databases, Proceedings of the 26th International Conference on Very Large Data Bases (VLDB) 2000, pp. 599-602.

  • [PSG+00] Torben Bach Pedersen, Arie Shoshani, Junmin Gu, Christian S. Jensen, Extending OLAP querying to external object databases, Proceedings of the ninth international conference on Information and knowledge management (CIKM) 2000, pp. 405-413.

  • [SSB+00] A. Shoshani, A. Sim, L. M. Bernardo, H. Nordberg, Coordinating Simultaneous Caching of File Bundles from Tertiary Storage, 12th International Conference on Scientific and Statistical Database Management (SSDBM) 2000.
Newer publications available on the new CRD web site, follow this link.