People Publications Projects

ArrayUDF: Custom Analyses with Automated Data Management

Open source at:


Scientific data analysis code spends a lot of effort on data management and other common tasks, but performs a wide variety of operations. Can we automate the data management without restricting analysis operations?


Demonstrate a novel scalable framework to perform user-defined custom data analysis on massive datasets on supercomputers

Signficance and Impact

Tens to thousands of times faster than the state-of-the-art Big Data systems

Research Details