Careers | Phone Book | A - Z Index

Sifting Through a Trillion Electrons


SDM's Surendra Byna and colleagues from Berkeley Lab’s Computational Research Division teamed up with researchers to develop novel software strategies for storing, mining, and analyzing massive datasets generated by a state-of-the-art plasma physics code called VPIC. » Read More

Catching Turbulence in the Solar Wind


Massive datasets plus modelling, visualization and analytics allow researchers to "see" the unseen: the turbulence in solar winds. » Read More

Arie Shoshani Earns Lifetime Achievement Award

Arie award

More than 25 years ago, Arie Shoshani realized that researchers were facing significant challenges in organizing, managing and analyzing their scientific data. He set out to develop computer applications to help them better meet the challenges and created the Scientific Data Management Group in the process. » Read More

The Scientific Data Management (SDM) group develops technologies and tools for efficient data access and storage management of massive scientific datasets. We are currently developing storage resource management tools, data querying technologies, in situ feature extraction algorithms, along with software platforms for exascale data. The group also works closely with application scientists to address their data processing challenges. These tools and application development activities are backed by active research efforts on novel algorithms for emerging hardware platforms.

Group Leader: John Wu

»Visit the Scientific Data Management (SDM) site.

SDM Publications

Life Course as a Contextual System to Investigate the Effects of Life Events, Gender, and Generation on Travel Mode Use

January 12, 2020

A Reinforcement Learning Based Network Scheduler For Deadline-Driven Data Transfers

December 10, 2019

Spatiotemporal Real-Time Anomaly Detection for Supercomputing Systems

December 10, 2019

Machine Learning for Prediction of Mid to LongTerm Habitual Transportation Mode Use

December 10, 2019

Federated Wireless Network Intrusion Detection

December 10, 2019

Life course as a contextual system to investigate the effects of life events, gender and generation on travel mode usage

November 19, 2019

SDN for End-to-End Networking at Exascale

November 4, 2019

Handling Missing Values in Joint Sequence Analysis

October 10, 2019

Terabyte-scale Particle Data Analysis: An ArrayUDF Case Study

July 23, 2019

Understanding Parallel I/O Performance Trends Under Various HPC Configurations

June 25, 2019

Performance Prediction for Data Transfers in LCLS Workflow

June 25, 2019

Similarity-based Compression with Multidimensional Pattern Matching

June 25, 2019

Automatic Detection of Network Traffic Anomalies and Changes

June 25, 2019

SLOPE: Structural Locality-aware Programming Model for Composing Array Data Analysis

June 16, 2019

Co-optimizing Latency and Energy for IoT services using HMP servers in Fog Clusters

June 11, 2019

DCA-IO: A Dynamic I/O Control Scheme for Parallel and Distributed File System

May 14, 2019

Multivariate Network Traffic Analysis using Clustered Patterns

April 1, 2019

A new approach to multivariate network traffic analysis

March 30, 2019

Multidimensional Compression with Pattern Matching

March 26, 2019

Evaluating the Effects of Missing Values and Mixed Data Types on Social Sequence Clustering Using t-SNE Visualization

March 6, 2019

More from SDM Publications »