The Yhat Blog


machine learning, data science, engineering




  • Summarizing Data in SQL

    by Matt DeLand | Feb 01 2016

    4 ways to summarize your dataset in SQL so you can run it directly on your database.


  • What is Model-Based Machine Learning?

    by Tom Diethe | Jan 14 2016

    An introduction to model-based machine learning, plus useful tools and a case study.


  • Unearthing Data to Unleash Impact: Using Unique Data Sources to Drive Change

    by Nick Eng and Neal Myrick | Jan 06 2016

    Three cases of DataKind and Tableau Foundation helping nonprofits use data for social good.


  • Rodeo 1.2: Python Paths, Interrupt, Stickers

    by Colin | Dec 29 2015

    The Rodeo 1.2 release: multiple python paths, interrupt & stickers!


  • Exploring U.S. Traffic Fatality Data

    by Ben Van Dyke | Dec 16 2015

    Python analysis and data visualization of traffic fatalities in the U.S.


  • Rodeo 1.1 - Markdown, Autoupdates, Feedback

    by Colin | Nov 20 2015

    The Rodeo 1.1 release, including Markdown pdf rendering


  • Write for the Yhat blog

    by Elise | Nov 05 2015

    A blogpost inviting you to write a blogpost


  • How we built Rodeo with Electron

    by Greg | Oct 28 2015

    How we built Rodeo, an IDE for data science.


  • Rodeo 1.0: a Python IDE on your Desktop

    by Greg | Oct 12 2015

    A Python IDE for Data Science on your Desktop


  • ScienceCluster Meets Spark

    by Elise Breda | Oct 02 2015

    Integrating ScienceCluster with Spark