I am a data scientist working as an independent consultant, a consultant with Preva Group, and a (remote) adjunct assistant professor at Purdue University.
In my work and research I focus on tools, methodology, and applications in exploratory analysis, visualization, computational statistics, statistical model building, and machine learning on large, complex datasets.
Prior to consulting, I was at Pacific Northwest National Laboratory (PNNL), working on applications in the fields of cyber security, power systems engineering, nuclear forensics, high energy physics, and biological sciences.
I’m active in the data science open source community, mainly working on projects in R and JavaScript.
Selected R packages (see GitHub for more).
rmote enables local viewing of visualizations created in an R session on a remote server.
GitHub
stlplus extends the base R STL seasonal-trend decomposition of time series, adding support for missing data and smoothing with higher-order polynomials.
GitHub
geovis is an interactive geo-temporal data exploration R package and corresponding JavaScript library, built on Mapbox.
GitHub
For TrelliscopeJS and GeoVis, I have written corresponding JavaScript libraries using React, for which the R packages serve as interfaces. The projects for these can be found here and here.
Aside from building tools for data analysis, I spend a significant amount of time analyzing data. I gained a lot of experience analyzing large, complex datasets during my graduate studies at Purdue with Bill Cleveland and while working as a research scientist at PNNL. I have been working as an independent consultant since 2014.
As a consultant, I have worked on two DARPA programs, XDATA and D3M, building open source tools for analyzing and visualizing big data and tools that help domain experts build models, and using those tools to analyze interesting datasets such as the Bitcoin blockchain and high-frequency trading data. I also work with a large philanthropic foundation, applying visualization, exploratory analysis, and collaborative tool development in the field of global health.
I released an R package over 9 months ago called geofacet, and have long promised a blog post about the approach. This is the first …
It’s been over a year since I have written a blog post. At the end of 2016 I meant to write a post looking back on the year, but …
I’m always looking for ways to spark my kid’s interest in computers, data, etc. This has proven to be more difficult than …
In response to a user’s request and after a short conversation with Carson Sievert (creator / maintainer of the plotly R …