Careers | Phone Book | A - Z Index

Oliver Rübel

OliverRubel.jpg
Oliver Rübel
Research Scientist
Phone: +1 510 486 4064
1 Cyclotron Road
Mail Stop 50F1650
Berkeley, CA 94720 US

Education

  • October 2014 -- present Computer Research Scientist, Visualization Group, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
  • March 2011--2014 Computer Systems Engineer, Visualization Group, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
  • 2010--2011: Post-doctoral researcher, Data Analysis Group, Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA, USA.
  • November 2009: Dr. rer. nat (equivilant Ph.D), Department of Computer Science, University of Kaiserslautern, Germany. Thesis title: "Linking Automated Data Analysis and Visualziation with Applications in Developmental Biology and High-energy Physics" Advisors: Dr. Hans Hagen (University of Kaiserslautern, Germany), Dr. Bernd Hamann (University of California, Davis, CA, U.S.A.), and Dr. Gunther H. Weber (Lawrence Berkeley National Laboratory, Berkeley, CA, U.S.A.). During my Ph.D. I was a fellow of the International Research Training Group 1131 of the University of Kaiserslautern and Student Assistant at the Visualization Group at the Lawrence Berkeley National Laboratory.
  • Janurary 2006: Diploma (equivalent M.S.), Computer Science, Department of Computer Science, University of Kaiserslautern, Germany. Thesis title: "Integrating Data Analysis and Visualization for the Exploration of Three-dimensional Gene Expression Data." Advisors: Dr. Hans Hagen (University of Kaiserslautern, Germany), Dr. Bernd Hamann (University of California, Davis, CA, U.S.A.), and Dr. Gunther H. Weber (University of California, Davis, CA, U.S.A; currently at LBNL). Received award by the Sparkassenstiftung for outstanding work in diploma thesis (Kaiserslautern, Germany, 2007).

Research Highlights

Oliver Rübel is the computing lead of OpenMSI. The OpenMSI Science Gateway interface lets scientists view and retrieve images easily and with unprecedented access speeds. For more information see here.

 

Oliver Rübel is the lead developer of BrainFormat a data standarization framework and library used to develop a novel file format for management and storage of neuro-science data. For more information see here.
(Image Credit: Wikimedia Commons)

 

Oliver Rübel is a co-developer WarpIV an advanced application for in situ visualization and analysis of particle-in-cell simulations using the Warp simulation framework. For more information see WarpIV.

 

Visualization and Analysis of Extremely Large Data

 

Coupling Visualization and Data Analysis for Knowledge Discovery from Multi-dimensional Scientific Data

Journal Articles

Kesheng Wu, Wes Bethel, Ming Gu, David, Oliver R\ ubel, "A Big Data Approach to Analyzing Market Volatility", Algorithmic Finance, 2013, 2:241--267, LBNL LBNL-6382E, doi: 10.3233/AF-13030

Understanding the microstructure of the financial market requires the processing of a vast amount of data related to individual trades, and sometimes even multiple levels of quotes. Analyzing such a large volume of data requires tremendous computing power that is not easily available to financial academics and regulators. Fortunately, public funded High Performance Computing (HPC) power is widely available at the National Laboratories in the US. In this paper we demonstrate that the HPC resource and the techniques for data-intensive sciences can be used to greatly accelerate the computation of an early warning indicator called Volume-synchronized Probability of Informed trading (VPIN). The test data used in this study contains five and a half year's worth of trading data for about 100 most liquid futures contracts, includes about 3 billion trades, and takes 140GB as text files. By using (1) a more efficient file format for storing the trading records, (2) more effective data structures and algorithms, and (3) parallelizing the computations, we are able to explore 16,000 different ways of computing VPIN in less than 20 hours on a 32-core IBM DataPlex machine. Our test demonstrates that a modest computer is sufficient to monitor a vast number of trading activities in real-time -- an ability that could be valuable to regulators.

Our test results also confirm that VPIN is a strong predictor of liquidity-induced volatility. With appropriate parameter choices, the false positive rates are about 7% averaged over all the futures contracts in the test data set. More specifically, when VPIN values rise above a threshold (CDF > 0.99), the volatility in the subsequent time windows is higher than the average in 93% of the cases.

Jihan Kim, Richard L. Martin, Oliver Rübel, Maciej Haranczyk & Berend Smit, "High-throughput Characterization of Porous Materials Using Graphics Processing Units", Journal of Chemical Theory and Computation, March 16, 2012, 8:1684–1693, LBNL 5409E, doi: 10.1021/ct200787v

We have developed a high-throughput graphics processing unit (GPU) code that can characterize a large database of crystalline porous materials. In our algorithm, the GPU is utilized to accelerate energy grid calculations, where the grid values represent interactions (i.e., Lennard-Jones + Coulomb potentials) between gas molecules (i.e., CH4 and CO2) and materials’ framework atoms. Using a parallel flood fill central processing unit (CPU) algorithm, inaccessible regions inside the framework structures are identified and blocked, based on their energy profiles. Finally, we compute the Henry coefficients and heats of adsorption through statistical Widom insertion Monte Carlo moves in the domain restricted to the accessible space. The code offers significant speedup over a single core CPU code and allows us to characterize a set of porous materials at least an order of magnitude larger than those considered in earlier studies. For structures selected from such a prescreening algorithm, full adsorption isotherms can be calculated by conducting multiple Grand Canonical Monte Carlo (GCMC) simulations concurrently within the GPU.

Samuel Gerber, Oliver Rübel, Peer-Timo Bremer, Valerio Pascucci and Ross T. Whitaker, "Morse-Smale Regression", Journal of Computational and Graphical Statistics, January 2012, doi: 10.1080/10618600.2012.657132

  • Download File: MSR.pdf (pdf: 292 KB)

E. W. Bethel and D. Leinweber and O. Rubel and K. Wu, "Federal Market Information Technology in the Post Flash Crash Era: Roles of Supercomputing", The Journal of Trading, 2012, 7:9-24, LBNL 5263E, doi: 10.3905/jot.2012.7.2.009

O. Rübel, G. H. Weber, M-Y Huang, E. W. Bethel, M. D. Biggin, C. C. Fowlkes, C. Luengo Hendriks, S. V. E. Keränen, M. Eisen, D. Knowles, J. Malik, H. Hagen and B. Hamann, "Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data", IEEE Transactions on Computational Biology and Bioinformatics, March 2010, 7:64-79, LBNL 382E,

O. Rübel, C.G.R. Geddes, E. Cormier-Michel, K. Wu, Prabhat, G.H. Weber, D.M. Ushizima, P. Messmer, H. Hagen, B. Hamann, and E.W. Bethel, "Automatic Beam Path Analysis of laser Wakefield Particle Acceleration Data", IOP Computational Science & Discovery, November 2009, 2, LBNL 2734E,

G. H. Weber, O. Rübel, M.-Y. Huang, A. H. DePace, C. C. Fowlkes, S. V. E. Keränen, C. L. Luengo Hendriks, H. Hagen, D. W. Knowles, J. Malik, M. D. Biggin and B. Hamann, "Visual exploration of three-dimensional gene expression using physical views and linked abstract views", IEEE Transactions on Computational Biology and Bioinformatics, 2009, 6:296-309, LBNL 63776, doi: 10.1109/TCBB.2007.70249

C. C. Fowlkes, C. L. Luengo Hendriks, S. V. E. Keränen, G. H. Weber, O. Rübel, M.-Y. Huang, S. Chatoor, A. H. DePace, L. Simirenko, C. Henriquez, A. Beaton, R. Weiszmann, S. Celniker, B. Hamann, D. W. Knowles, M. D. Biggin, M. B. Eisen, J. Malik, "A Quantitative Spatio-temporal Atlas of Gene Expression in the Drosophila Blastoderm", Cell, April 18, 2008, 133:364-374,

Conference Papers

E. Wes Bethel, Prabhat, Suren Byna, Oliver Rübel, K. John Wu, and Michael Wehner, "Why High Performance Visual Data Analytics is both Relevant and Difficult", Proceedings of Visualization and Data Analysis 2013, IS&T/SPIE Electronic Imaging 2013, San Francisco, CA, USA, SPIE, February 2013, LBNL LBNL-6063E,

Surendra Byna, Jerry Chou, Oliver Rübel, Prabhat, Homa Karimabadi, William S. Daughton, Vadim Roytershteyn, E. Wes Bethel, Mark Howison, Ke-Jou Hsu, Kuan-Wu Lin, Arie Shoshani, Andrew Uselton, and Kesheng Wu, "Parallel I/O, Analysis, and Visualization of a Trillion Particle Simulation", SuperComputing 2012 (SC12), Salt Lake City, Utah, November 2012,

Prabhat, Oliver Rübel, Surendra Byna, Kesheng Wu, Fuyu Li, Michael Wehner and E. Wes Bethel, "TECA: A Parallel Toolkit for Extreme Climate Analysis", Procedia Computer Science, Proceedings of the International Conference on Computational Science, ICCS 2012, Presented at Third Worskhop on Data Mining in Earth System Science (DMESS 2012), Omaha, Nebraska, June 2012, 9:866–876, LBNL 5352E, doi: 10.1016/j.procs.2012.04.093

We present TECA, a parallel toolkit for detecting extreme events in large climate datasets. Modern climate datasets expose parallelism across a number of dimensions: spatial locations, timesteps and ensemble members. We design TECA to exploit these modes of parallelism and demonstrate a prototype implementation for detecting and tracking three classes of extreme events: tropical cyclones, extra-tropical cyclones and atmospheric rivers. We process a modern TB-sized CAM5 simulation dataset with TECA, and demonstrate good runtime performance for the three case studies.

Allen R. Sanderson, Brad Whitlock, Oliver, Hank Childs, Gunther H. Weber, , Kesheng Wu, "A System for Query Based Analysis and Visualization", Third International Eurovis Workshop on Visual EuroVA 2012, Vienna, Austria, January 2012, LBNL 5507E,

E. W. Bethel, Surendra Byna, Jerry Chou, Cormier-Michel, Cameron G. R. Geddes, Howison, Fuyu Li, Prabhat, Ji Qiang, R\ ubel, Rob D. Ryne, Michael Wehner, Wu, "Big Data Analysis and Visualization: What Do LINACS Tropical Storms Have In Common?", 11th International Computational Accelerator Physics ICAP 2012, Germany, 2012,

E. Wes Bethel, David Leinweber, Oliver Rübel, Kesheng Wu, "Federal Market Information Technology in the Post Flash Crash Era: Roles of Supercomputing", Workshop on High Performance Computational Finance at SC11, Seattle, WA, USA, November 2011, LBNL 5263E,

Jerry Chou, Kesheng Wu, Oliver Rübel, Mark Howison, Ji Qiang, Prabhat, Brian Austin, E. Wes Bethel, Rob D. Ryne, and Arie Shoshani, "Parallel Index and Query for Large Scale Data Analysis", In Proceedings of Supercomputing 2011, Seattle, WA, USA, 2011, 1-11, LBNL 5317E, doi: 10.1145/2063384.2063424

Prabhat, Quincey Koziol, Karen Schuchardt, E. Wes Bethel, Jerry Chuo, Mark Howison, Mike McGreevy, Bruce Palmer, Oliver Ruebel and John Wu, "ExaHDF5: An I/O Platform for Exascale Data Models, Analysis and Performance", Scientific Discovery Through Advanced Computing 2011, 2011,

J. Chou, K. Wu, O. R\ ubel, M. Howison, Qiang, Prabhat, B. Austin, E. W. Bethel, D. Ryne, A. Shoshani, "Parallel Index and Query for Large Scale Data", SC11, 2011, doi: 10.1145/2063384.2063424

Oliver Rübel, Sean Ahern, E. Wes Bethel, Mark. D Biggin, Hank Childs, Estelle Cormier-Michel, Angela DePace, Michael B. Eisen, Charless C. Fowlkes, Cameron G. R. Geddes, Hans Hagen, Bernd Hamann, Min-Yu Huang, Soile V. E. Keränen, David W. Knowles, Cris L. Luengo Hendriks, Jitendra Malik, Jeremy Meredith, Peter Messmer, Prabhat, Daniela Ushizima, Gunther H. Weber, and Kesheng Wu, "Coupling Visualization and Data Analysis for Knowledge Discovery from Multi-dimensional Scientific Data", Procedia Computer Science, Proceedings of International Conference on Computational Science, ICCS 2010, June 2010, LBNL 3669E,

G. H. Weber, S. Ahern, E.W. Bethel, S. Borovikov, H.R. Childs, E. Deines, C. Garth, H. Hagen, B. Hamann, K.I. Joy, D. Martin, J. Meredith, Prabhat, D. Pugmire, O. Rübel, B. Van Straalen and K. Wu, "Recent Advances in VisIt: AMR Streamlines and Query-Driven Visualization", Numerical Modeling of Space Plasma Flows: Astronum-2009 (Astronomical Society of the Pacific Conference Series, 3185E, 2010, 429:329-334,

E. W. Bethel, C. Johnson, S. Ahern, J. Bell, P.-T. Bremer, H. Childs, E. Cormier-Michel, M. Day, E. Deines, T. Fogal, C. Garth, C. G. R. Geddes, H. Hagen, B. Hamann, C. Hansen, J. Jacobsen, K. Joy, J. Kruger, J. Meredith, P. Messmer, G. Ostrouchov, V. Pascucci, K. Potter, Prabhat, D. Pugmire, O. Rubel, A. Sanderson, C. Silva, D. Ushizima, G. Weber, B. Whitlock, K. Wu, "Occam's Razor and Petascale Visual Data Analysis", SciDAC 2009, J. of Physics: Conference Series, San Diego, California, July 2009, LBNL 2210E,

E. Wes Bethel, Oliver Rübel, Prabhat, Kesheng Wu, Gunther H. Weber, Valerio Pascucci, Hank Childs, Ajith Mascarenhas, Jeremy Meredith, and Sean Ahern, "Modern Scientific Visualization is More than Just Pretty Pictures", Numerical Modeling of Space Plasma Flows: Astronum-2008 (Astronomical Society of the Pacific Conference Series, St. Thomas, USVI, June 2009, 301-317, LBNL 1450E,

K Wu et al., "FastBit: Interactively Searching Massive Data", SciDAC 2009, 2009, LBNL 2164E, doi: 10.1088/1742-6596/180/1/012053

O. Rübel, Prabhat, K. Wu, H. Childs, J. Meredith, C.G.R. Geddes, E. Cormier-Michel, S. Ahern, G.H. Weber, P. Messmer, H. Hagen, B. Hamann and E.W. Bethel, "High Performance Multivariate Visual Data Exploration for Extemely Large Data", Supercomputing (SC), Austin, Texas, USA, November 2008, LBNL 716E,

Daniela Ushizima, Oliver Rübel, Prabhat, Gunther Weber, E. Wes Bethel, Cecilia Aragon, Cameron Geddes, Estelle Cormier-Michel, Bernd Hamann, Peter Messmer, Hans Hagen, "Automated Analysis for Detecting Beams in Laser Wakefield Simulations", 2008 Seventh International Conference on Machine Learning and Applications, Proceedings of IEEE ICMLA'08, 2008, 382-387, LBNL 960E,

Books

O. Rübel, Linking Automated Data Analysis and Visualization with Applications in Developmental Biology and High-energy Physics, Schriftenreihe Informatik, (Der Dekan (hrsg), Fachbereich Informatik, Technische Universität Kaiserslautert: December 2010) LBNL 3155E,

Book Chapters

Hank Childs, Eric Brugger, Brad Whitlock, Jeremy Meredith, Sean Ahern, David Pugmire, Kathleen Biagas, Mark Miller, Cyrus Harrison, Gunther H. Weber, Hari Krishnan, Thomas Fogal, Allen Sanderson, Christoph Garth, E. Wes Bethel, David Camp, Oliver Rubel, Marc Durant, Jean M. Favre, Paul Navratil, "VisIt: An End-User Tool For Visualizing and Analyzing Very Large Data", High Performance Visualization---Enabling Extreme-Scale Scientific Insight, ( October 2012) Pages: 357--372

O. Rübel, S.V.E. Keränen, M.D. Biggin, D.W. Knowles, G.H. Weber, H. Hagen, B. Hamann, and E.W. Bethel, "Linking Advanced Visualization and MATLAB for the Analysis of 3D Gene Expression Data", Mathematics and Visualization, Visualization in Medicine and Life Sciences II, Progress and New Challenges, edited by L. Linsen and B. Hamann and H. Hagen and H.-C. Hege, (Springer Verlag: 2012) Pages: 267-285, LBNL 4891E,

Daniela Ushizima, Cameron Geddes, Estelle Cormier-Michel, E. Wes Bethel, Janet Jacobsen, Prabhat, Oliver Rubel, Gunther Weber, Bernard Hamann, Peter Messmer, Hans Hagen, "Automated detection and analysis of particle beams in laser-plasma accelerator simulations", Machine Learning, edited by Yagang Zhang, (In-Teh: February 2010) Pages: 367-389, LBNL 3845E,

O. Rübel, G. H. Weber, M-Y Huang, E. W. Bethel, S. V. E. Keränen, C. C. Fowlkes, C. L. Luengo Hendriks, A. H. DePace, L. Simirenko, M. B. Eisen, M. D. Biggin, H. Hagen, J. Malik, D. W. Knowles and B. Hamann, "PointCloudXplore 2: Visual Exploration of 3D Gene Expression", Visualization of Large and Unstructured Data Sets, edited by C. Garth, H. Hagen, M. Hering-Bertram, (Gesellschaft fuer Informatik (GI): 2008) LBNL 249E,

M.-Y. Huang, O. Rübel, G.H. Weber, C.L. Luengo Hendriks, M.D. Biggin, H. Hagen, B. Hamann, "Segmenting Gene Expression Patterns of Early-stage Drosophila Embryos.", Mathematical Methods for Visualization in Medicine and Life Sciences, edited by L. Linsen, H. Hagen, B. Hamann, (Springer-Verlag: January 2008) Pages: 313--327, LBNL 62450,

Reports

E. Wes Bethel, David Camp, Hank Childs, Mark Howison, Hari Krishnan, Burlen Loring, Joerg Meyer, Prabhat, Oliver Ruebel, Daniela Ushizima, Gunther Weber, "Towards Exascale: High Performance Visualization and Analytics – Project Status Report. Technical Report", DOE Exascale Research Conference, April 2012,

Posters

Oliver Rübel, Cameron, G. R. Geddes, Min Chen, Estelle Cormier-Michel, and E. Wes Bethel, "Query-driven Analysis of Plasma-based Particel Acceleration Data", Poster Abstracts of IEEE VisWeek, October 2012,

Soile V.E. Keränen, Oliver Rübel, David W. Knowles and Mark D. Biggin, "Computational modeling of cis-regulatory modules from 3D exprression data in Drosophila blastoderm atlas", Drosophila Genetics, March 2012,

R. Ryne, B. Austin, J. Byrd, J. Corlett, E. Esarey, C. G. R. Geddes, W. Leemans, X. Li, Prabhat, J. Qiang, O. Rübel, J.-L. Vay, M. Venturini, K. Wu, B. Carlsten, D. Higdon and N. Yampolsky, "High Performance Computing in Accelerator Science: Past Successes, Future Challenges", Workshop on Data and Communications in Basic Energy Sciences: Creating a Pathway for Scientific Discovery, October 2011,

O. Rübel, Prabhat, K. Wu, H. Childs, J. Meredith, C.G.R. Geddes, E. Cormier-Michel, S. Ahern, G.H. Weber, P. Messmer, H. Hagen, B. Hamann and E.W. Bethel, "Application of High-performance Visual Analysis Methods to Laser Wakefield Particle Acceleration Data", IEEE Visualization 2008, October 2008,

Others

Kesheng Wu, Wes Bethel, Ming Gu, David, Oliver R\ ubel, Testing VPIN on Big Data, Available at SSRN 2318259, 2013,

C. G. R. Geddes, E Cormier-Michel, E. H. Esarey, C. B. Schroeder, J.-L. Vay, W. P. Leemans, D. L.. Bruhwiler, J. R. Cary, B. Cowan, M. Durant, P. Hamill, P. Messmer, P. Mullowney, C. Nieter, K. Paul, S. Shasharina, S. Veitzer, G. Weber, O. Rübel, D. Ushizima, Prabhat, E. W.Bethel, K. Wu, Large Fields for Smaller Facility Sources, SciDAC Review, Pages: 13-21, 2009,