Analyzing Enron Data: Bitmap Indexing Outperforms MySQL Queries by Several Orders of Magnitude

Kurt Stockinger, Doron Rotem, Arie Shoshani, Kesheng Wu


FastBit is an efficient, compressed bitmap indexing technology that was developed in our group. In this report we evaluate the performance of MySQL and FastBit for analyzing the email traffic of the Enron dataset. The first finding shows that materializing the join results of several tables significantly improves the query performance. The second finding shows that FastBit outperforms MySQL by several orders of magnitude.

full text of LBNL-59437 (PDF)

Closely related
LBNL-61083: Enron Data Revisited - Neighborhood Queries with FastBitWin over Popular Commercial Database System
LBNL-61768: Using Bitmap Indexing Technology for Combined Numerical and Text Queries
In the news
LBNL CS news article
Primeur article
More research work by John Wu
Bitmap Index
Connected Component Labeling
Eigenvalue Computation
Inforamtion available elsewhere on the web
Google Scholar
Contact us

John Wu