A couple of weeks ago I wrote a guest post on churn prediction for Kissmetrics, and they just published it.

Churn prediction is one of the most popular Big Data use cases in business. It consists in detecting which customers are likely to cancel a subscription to a service based on how they use the service. Being able to predict churn based on customer data has proven extremely valuable to big telecom companies. Now, thanks to prediction services and APIs, it’s accessible to businesses of all sizes — not only those who can afford to hire teams of data scientists.

The process is as follows:

  • Step 1: Gather historical customer data that you save to a CSV file.
  • Step 2: Upload that data to a prediction service that automatically creates a “predictive model.”
  • Step 3: Use the model on each current customer to predict whether they are at risk of leaving.

Check out my full post on the Kissmetrics blog for details on these steps and for an illustration of how to go through Steps 2 and 3 with BigML.

If you find this article useful, please vote for it on HackerNews!

If you want to dig deeper and see what to do next, check out my book, Bootstrapping Machine Learning.