Analyzing Enron Data: Bitmap Indexing Outperforms MySQL Queries by Several Orders of Magnitude

Kurt Stockinger, Doron Rotem, Arie Shoshani, Kesheng Wu


FastBit is an efficient compressed bitmap indexing technology developed in our group. In this report we evaluate the performance of MySQL and FastBit for analyzing the email traffic in the Enron dataset. Our first finding is that materializing the join results of several tables significantly improves query performance. Our second finding is that FastBit outperforms MySQL by several orders of magnitude.

Full text of LBNL-59437 (PDF)

Closely related
LBNL-61083: Enron Data Revisited - Neighborhood Queries with FastBit Win over Popular Commercial Database System
LBNL-61768: Using Bitmap Indexing Technology for Combined Numerical and Text Queries
