Category Encoders now on conda-forge

My scikit-learn compatible library of categorical data encoders (category_encoders) is now published on conda-forge! Conda, if you didn’t know, is an open source package manager for Python (and other things) developed primarily by Continuum Analytics. Thanks to Continuum developer https://github.com/bollwyvl for doing pretty much all of the work to get it working. Check out the […]

Read More

Data Science Things Roundup #9

Things got a bit busy and I fell off the wagon on posting, but here we are, back for the ninth edition of the data science things roundup. If you haven’t seen previous editions, it’s basically just 3 data science or Python-related articles or packages that I’ve stumbled across recently and thought were interesting. This […]

Read More

Projects Update: July 2016

About a quarter ago (April), I posted my first regular update on all of the various projects I’m working on. As side projects tend to go, some fall into and out of favor, and occasionally new ones crop up. As I work on projects, I post regular updates, but it’s helpful to me (and hopefully […]

Read More

Twitter-Pandas, first release

Thanks to some great help from contributors, we’ve just pushed the first release of twitter-pandas, v0.0.1. The first release is aimed at replicating the data-providing functions (no create/update/delete) from the tweepy API with the git-pandas style pandas interface. To install twitterpandas, just use pip: pip install twitterpandas. And then you can use it right […]

Read More

Mixed-mode estimation in petersburg

A couple of months ago I posted an overview of simple estimation of hierarchical events using Python and petersburg. At the time it probably seemed a little trivial, just building a structured frequency model and drawing samples from it. But I have finally implemented the next step to complete the intended functionality. This post […]

Read More

Data Science Things Roundup #6

Time again for the weekly data science things roundup. If you haven’t seen this before, check out some of the previous ones to get a feel for it. Each Tuesday I run through 3 things I’ve found interesting and bookmarked recently, generally related to Python and data science (with some admitted diversions). This week is […]

Read More