The Yhat Blog
machine learning, data science, engineering

ML Pitfalls: Measuring Performance (Part 1)
Tweetby Eric  Mar 03 2015Common machine learning pitfalls and how to avoid them.

Base R Plots
Tweetby Greg  Feb 23 2015Introduction to plotting and graphics in R (without ggplot2)

What is Linear Regression? A Qualitative Exploration
Tweetby Greg  Feb 19 2015A high level introduction to what linear regression is and how it works.

11 Python Libraries You Might Not Know
Tweetby Greg  Jan 20 2015A highlight of 10 lesserknown Python libraries, that even you experienced Pythonistas may have not seen!

Running R in Parallel (the easy way)
Tweetby Greg  Jan 14 2015Running code in parallel is tricky. This post shows how to quickly (and easily) parallelize your R code.

Currency Portfolio Optimization Using ScienceOps
Tweetby Ryan J. O'Neil  Jan 05 2015Create a currency portfolio optimization algorithm and deploy it to ScienceOps

Scraping and Analyzing Baseball Data with R
Tweetby Greg  Dec 23 2014A quick howto on scraping and analyzing MLB data using R.

Reducing your R memory footprint by 7000x
Tweetby Greg  Dec 17 2014R can be a bit bloated someitmes. Learn how to make your R models more effecient.

Naive Bayes in Python
Tweetby Greg  Dec 11 2014How to implement your own naive bayes classifier in Python and a detailed explanation of how it all works.

Introducing db.r
Tweetby Greg  Dec 04 2014db.py but for R. A database library that makes working with SQL in R a little more enjoyable.