The Yhat Blog

machine learning, data science, engineering

  • Scraping and Analyzing Baseball Data with R

    by Greg | Dec 23 2014

    A quick how-to on scraping and analyzing MLB data using R.

  • Reducing your R memory footprint by 7000x

    by Greg | Dec 17 2014

    R can be a bit bloated sometimes. Learn how to make your R models more efficient.

  • Naive Bayes in Python

    by Greg | Dec 11 2014

    How to implement your own Naive Bayes classifier in Python, with a detailed explanation of how it all works.

  • Introducing db.r

    by Greg | Dec 04 2014

    A database library that makes working with SQL in R a little more enjoyable.

  • How Yhat Does Cloud Balancing: A Case Study

    by Ryan J. O'Neil | Nov 10 2014

    How we use optimization to minimize our server costs without impacting server up-time.

  • Introducing

    by Greg Lamp | Nov 05 2014

    Our latest contribution to the open source community: A database library for working with SQL in pandas/python.

  • Using data science to build better products

    by Colin Ristig | Sep 17 2014

    How data science and machine learning can be embedded into products to make them better.

  • Analysing your e-commerce funnel with R

    by Justin Marciszewski | Aug 05 2014

    A case study using R to evaluate the impact of your website changes.

  • Fuzzy Matching with Yhat

    by Greg | Jul 23 2014

    A use case combining Yhat and the Python library "fuzzywuzzy" to build your own string matching service.

  • Yhat ScienceBox

    by Colin Ristig | Jun 17 2014

    A brief overview of our newest product: ScienceBox!