The Yhat Blog


machine learning, data science, engineering




  • Scraping and Analyzing Baseball Data with R

    by Greg | Dec 23 2014

    A quick howto on scraping and analyzing MLB data using R.


  • Reducing your R memory footprint by 7000x

    by Greg | Dec 17 2014

    R can be a bit bloated someitmes. Learn how to make your R models more effecient.


  • Naive Bayes in Python

    by Greg | Dec 11 2014

    How to implement your own naive bayes classifier in Python and a detailed explanation of how it all works.


  • Introducing db.r

    by Greg | Dec 04 2014

    db.py but for R. A database library that makes working with SQL in R a little more enjoyable.


  • How Yhat Does Cloud Balancing: A Case Study

    by Ryan J. O'Neil | Nov 10 2014

    How we use optimization to minimize our server costs without impacting server up-time.


  • Introducing db.py

    by Greg Lamp | Nov 05 2014

    Our latest contribution to the open source community: db.py. A database library for working with SQL in pandas/python.


  • Using data science to build better products

    by Colin Ristig | Sep 17 2014

    How data science and machine learning can be embedded into products to make them better.


  • Analysing your e-commerce funnel with R

    by Justin Marciszewski | Aug 05 2014

    Case study using R to evaluate the impact of your website changes


  • Fuzzy Matching with Yhat

    by Greg | Jul 23 2014

    An use case of using Yhat and the python library "fuzzywuzzy" to build your own string matching service.


  • Yhat ScienceBox

    by Colin Ristig | Jun 17 2014

    A brief overview of our newest product: ScienceBox!