Daniel Gunter

Dan Gunter
Group Lead & Research Scientist
Phone: +1 510 495 2504
Fax: +1 510 486 6363

Biographical Sketch

Dan Gunter leads the Integrated Data Frameworks (IDF) group in the Data Science and Technology department of the Computational Research Division. Dan's research is in distributed and parallel systems, with a focus on performance and usability issues at the intersection of large databases and fast networks. Recent work includes distributed workflow performance analysis; NoSQL databases for materials science; networked application adaptation; and user-centered design approaches for complex APIs and interfaces. Past research includes the Template Interfaces for Agile Parallel Data-Intensive Science (Tigres), the NetLogger distributed systems analysis software, network measurement standards in the Global Grid Forum, scientific collaboration tools, and the Distributed Parallel Storage System, a predecessor to GridFTP.

Dan has been senior personnel on a number of projects, including the Carbon Capture Systems Initiative (CCSI), the DOE Systems Biology Knowledge Base (KBase), and the Institute for the Design of Advanced Energy Systems (IDAES). Dan is a Co-PI on the Materials Project Center for Functional Electronic Materials Design.

Journal Articles

Scott Callaghan, Ewa Deelman, Dan Gunter, Gideon Juve, Philip Maechling, Christopher Brooks, Karan Vahi, Kevin Milner, Robert Graves, Edward Field, David Okaya, Thomas Jordan, "Scaling up Workflow-based Applications", Journal of Computer and System Sciences, 2010, 76:428--446

Conference Papers

Lavanya Ramakrishnan, Sarah Poon, Gilberto Z. Pastorello, Daniel Gunter, Valerie Hendrix, Deborah Agarwal, "Scientist-Centered Design for eScience: A Tigres Case Study", IEEE eScience, 2014

Elif Dede, Madhusudhan Govindaraju, Daniel Gunter, Richard Canon, Lavanya Ramakrishnan, "Semi-Structured Data Analysis using MongoDB and MapReduce: A Performance Evaluation", Proceedings of the 4th International Workshop on Scientific Cloud Computing, 2013

Karan Vahi, Ian Harvey, Taghrid Samak, Dan Gunter, Kieran Evans, David Rogers, Ian Taylor, Monte Goode, Fabio Silva, Eddie Al-Shakarchi, Gaurang Mehta, Andrew Jones, Ewa Deelman, "A General Approach to Real-time Workflow Monitoring", The Seventh Workshop on Workflows in Support of Large-Scale Science (WORKS12), 2012

Daniel Gunter, Shreyas Cholia, Anubhav Jain, Michael Kocher, Kristin Persson, Lavanya Ramakrishnan, Shyue Ping Ong, Gerbrand Ceder, "Community Accessible Datastore of High-Throughput Calculations: Experiences from the Materials Project", 5th Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS), 2012

Dan Gunter, Raj Kettimuthu, Ezra Kissel, Martin Swany, Jun Yi, Jason Zurawski, "Exploiting Network Parallelism for Improving Data Transfer Performance", SC12 Companion, 2012

Taghrid Samak, Dan Gunter, Monte Goode, Ewa Deelman, Fabio Silva, Karan Vahi, "Failure Analysis of Distributed Scientific Workflows Executing in the Cloud", 8th International Conference on Network and Service Management (CNSM 2012), 2012

Taghrid Samak, Dan Gunter, Valerie Hendrix, "Scalable Analysis of Network Measurements with Hadoop and Pig", Fifth International IFIP/IEEE Workshop on Distributed Autonomous Network Management Systems (DANMS 2012), IEEE, 2012

Ezra Kissel, Ahmed El-Hassany, Guilherme Fernandes, Martin Swany, Dan Gunter, Taghrid Samak, Jennifer M. Schopf, "Scalable Integrated Performance Analysis of Multi-Gigabit Networks", Fifth International Workshop on Distributed Autonomous Network Management Systems 2012 (DANMS 12), 2012

Elif Dede, Zacharia Fadika, Jessica Hartog, Madhusudhan Govindaraju, Lavanya Ramakrishnan, Daniel Gunter, Richard Shane Canon, "MARISSA: MApReduce Implementation for Streaming Science Applications", IEEE eScience Conference, 2012

Taghrid Samak, Dan Gunter, Monte Goode, Ewa Deelman, Gaurang Mehta, Fabio Silva, Karan Vahi, "Failure Prediction and Localization in Large Scientific Workflows", The Sixth Workshop on Workflows in Support of Large-Scale Science (WORKS11), 2011

Dan Gunter, Taghrid Samak, Ewa Deelman, Christopher H. Brooks, Monte Goode, Gideon Juve, Gaurang Mehta, Priscilla Moraes, Fabio Silva, Martin Swany, Karan Vahi, "Online Workflow Management and Performance Analysis with STAMPEDE", 7th International Conference on Network and Service Management (CNSM 2011), Paris, France, 2011

Taghrid Samak, Dan Gunter, Ewa Deelman, Gideon Juve, Gaurang Mehta, Fabio Silva, Karan Vahi, "Online Fault and Anomaly Detection for Large-Scale Scientific Workflows", 13th IEEE Conference on High Performance Computing and Communications (HPCC-2011), 2011

Elif Dede, Madhusudhan Govindaraju, Daniel Gunter, Lavanya Ramakrishnan, "Riding the Elephant: Managing Ensembles with Hadoop", 4th Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS), 2011

Raj Kettimuthu, Alex Sim, Dan Gunter, Bill Allcock, Peer T. Bremer, John Bresnahan, Andrew Cherry, Lisa Childers, Eli Dart, Ian Foster, Kevin Harms, Jason Hick, Jason Lee, Michael Link, Jeff Long, Keith Miller, Vijaya Natarajan, Valerio Pascucci, Ken Raffenetti, David Ressman, Dean Williams, Loren Wilson, Linda Winkler, "Lessons learned from moving earth system grid data sets over a 20 Gbps wide-area network", HPDC 10, New York, NY, USA, ACM, 2010, 316--319, doi: 10.1145/1851476.1851519

Shoaib Kamil, Pinar, Gunter, Lijewski, Oliker, John Shalf, "Reconfigurable hybrid interconnection for static and dynamic scientific applications", Conf. Computing Frontiers, 2007, 183--194, LBNL 60060

Dan Gunter, Brian L. Tierney, Aaron Brown, Martin Swany, John Bresnahan, Jennifer M. Schopf, "Log summarization and anomaly detection for troubleshooting distributed systems", Grid Computing, 2007 8th IEEE/ACM International Conference on, Washington, DC, USA, IEEE Computer Society, 2007, 226--234, doi: 10.1109/GRID.2007.4354137

Book Chapters

A. Sim, D. Gunter, V. Natarajan, A. Shoshani, D. Williams, J. Long, J. Hick, J. Lee, E. Dart, "Efficient Bulk Data Replication for the Earth System Grid", Data Driven E-science: Use Cases and Successful Applications of Distributed Computing Infrastructures (ISGC 2010), (Springer-Verlag New York Inc: 2010) Pages: 435