In my work and research I focus on tools, methodology, and applications in exploratory analysis, visualization, computational statistics, statistical model building, and machine learning on large, complex datasets.
Prior to consulting, I was at PNNL, working on applications in the fields of cyber security, power systems engineering, nuclear forensics, high energy physics, and biological sciences.
Selected R packages (see GitHub for more).
rmote enables locally viewing visualizations that are created in an R session on a remote server.
stlplus extends base-R STL time series seasonal trend decomposition, including handling missing data and smoothing with higher order polynomials.
Aside from building tools for data analysis, I spend a significant amount of time analyzing data. I gained a lot of experience analyzing large complex datasets during my graduate studies at Purdue with Bill Cleveland and while a research scientist at PNNL. I have been working as an independent consultant since 2014.
As a consultant, I have worked on two DARPA programs, XDATA and D3M, building open source tools for analyzing and visualizing big data, tools for helping domain experts build models, as well as using the tools to analyze interesting datasets such as the Bitcoin blockchain and high frequency trading data. I also work with the Bill & Melinda Gates Foundation, applying visualization, exploratory analysis, and collaborative tool development to the field of global health.
I released an R package over 9 months ago called geofacet, and have long promised a blog post about the approach. This is the first …
It’s been over a year since I have written a blog post. At the end of 2016 I meant to write a post looking back on the year, but …
I’m always looking for ways to spark my kid’s interest in computers, data, etc. This has proven to be more difficult than …