The Yhat Blog


machine learning, data science, engineering




  • Rodeo 1.0: a Python IDE on your Desktop

    by Greg | Oct 12 2015

    A Python IDE for Data Science on your Desktop


  • ScienceCluster Meets Spark

    by Elise Breda | Oct 02 2015

    Integrating ScienceCluster with Spark


  • ScienceCluster: Same Tools, Better Workplace

    by Elise Breda | Sep 23 2015

    ScienceCluster: We don’t want to change the way you work. We just want to make it better.


  • Sharks, Landsharks, Geoplotting, and KDTrees!

    by Colin Ristig | Sep 14 2015

    Swimming with Sharks! An intro to Geoplotting with Python


  • Customer Segmentation in Python

    by Greg | Aug 25 2015

    Using K-Means clustering to analyze your customer base.


  • The Code Behind Building a FiveThirtyEight post

    by Greg | Jul 20 2015

    Dissecting a FiveThirtyEight post on baseball payrolls and wins.


  • Decision Making Under Uncertainty: An Introduction to Robust Optimization (Part 2)

    by Saba Neyshabouri | Jul 16 2015

    Part 2 of an introduction to robust optimization and how to deal with uncertainty.


  • Decision Making Under Uncertainty: An Introduction to Robust Optimization (Part 1)

    by Saba Neyshabouri | Jul 14 2015

    An introduction to robust optimization and how to deal with uncertainty.


  • Introducing Gobenchdb

    by Eric Cox | Jul 06 2015

    A command-line tool that stores Go benchmark data in a database.


  • 7 Datasets You've Likely Never Seen Before

    by Greg | Jun 22 2015

    Some datasets that may have fallen by the wayside.