ESS-DIVE
The U.S. Department of Energy’s (DOE) Environmental Systems Science Data Infrastructure for a Virtual Ecosystem (ESS-DIVE) is a data repository for Earth and environmental science data. ESS-DIVE store…
Science Search
A team of researchers from the Department of Energy’s Lawrence Berkeley National Laboratory (Berkeley Lab) and UC Berkeley are developing innovative machine learning tools to pull contextual informati…
Jupyter
Project Jupyter is an open, international collaboration that develops tools for interactive computing: a process of human computer interplay for scientific exploration and data analysis. The collabora…
About the Group
The Integrated Data Systems (IDS) group is focused on data integration and frameworks for computational and data analysis systems. We are involved in several important areas of data science including usable scientific workflow tools and data pipelines; intuitive interfaces and web science gateways to explore, analyze, process data and run computations on HPC and distributed systems; and data integration, standardization and curation, with an emphasis on Findable, Accessible, Interoperable, and Reusable (FAIR) data.
Overall, our approach is distinguished by a strong focus on partnerships and collaborations with scientific user communities. We have learned that designing the best systems can only occur when we have the best understanding of the constraints, needs, and culture of the scientists who will use it. The IDS group works closely with the Usable Data Systems group on research and development of user-centered solutions and the Sustainable Software Engineering group to promote robust, sustainable software solutions.
Group Leader: Shreyas Cholia