Higher order integrators do not help too much
Higher order integrators do not help too much
Automatically finding a scale for your sampler
Motivation, and an implementation in about 20 lines
Plot the paths of all your runs from a year. Using Strava.
A PyMC3 implementation of the algorithms from: Validating Bayesian Inference Algorithms with Simulation-Based Calibration (Talts, Betancourt, Simpson, Vehtari, Gelman).
A twitter bot inspired by the wonderful @NYT_first_said, this project uses the Media Cloud project to find the earliest mention of a word in major English language newspapers. It uses this to issue a smugly superior tweet.
imcmc
(im-sea-em-sea) is a small library for turning 2d images into probability distributions and then sampling from them to create images and gifs.
A library to extract a publication date from a web page, along with a measure of the accuracy. Built with the support and help of the Center for Civic Media at the MIT Media Lab.
A library for finding atom, rss, rdf, and xml feeds from web pages. Produced at the mediacloud project. An incremental improvement over feedfinder2
, which was itself based on feedfinder
, written by Mark Pilgrim, and maintained by Aaron Swartz until his untimely death.
A web app for generating samples from a sketched probability distribution function.
I am a contributor to PyMC3, a “Python package for Bayesian statistical modeling and Probabilistic Machine Learning which focuses on advanced Markov chain Monte Carlo and variational fitting algorithms.”
An offshoot of another project, this allows you to compare times between most collegiate cross country courses.
A demonstration of using Flask, React, and d3js to visualize machine learning models. This is a port of a previous project from Angular to React.
Using hundreds of thousands of historical cross country running results to make predictions about future meets. The page is updated more than weekly during the season.
A small project which once a day checks for new words on Futility Closet and keeps a searchable, downloadable list of the curious words it finds there. Click the random button a few times, or make an Anki study deck.
For the second year in a row, I took part in Kaggle’s contest to predict March Madness winners. The code for the actual model is not very expository (get in touch if you are interested), but I also built a friendlier page to query predictions interactively at the link.
An interactive page for demonstrating and experimenting with the beta distribution. Built with AngularJS and d3js.
A command line utility for automatically compiling $\LaTeX$, and eliminating auxiliary files. Try it out with pip install tidytex
.
A small script built on top of d3js allowing you to define simple animated parametric equations.
A demonstration of linear regression, overfitting, normalization, and regularization. Allows you to choose data from a distribution, and interactively fit polynomials to the data using least squares, ridge, or Lasso regression.
I am a software engineer and mathematician in Cambridge, MA, working on machine learning and natural language processing. I am a contributor to the popular PyMC3 library. I received my doctorate studying geometric measure theory with Dr. Robert Hardt at Rice University.
In my spare time I run, walk in the woods with Pete the pup, and launch balloons into [near] space.
PhD in Mathematics, 2012
Rice University
MA in Mathematics and Economics, 2007
Williams College