The world of BAAS, or MBAAS, was shocked the past week with the announcement made by Parse on it’s site: Parse’s hosted services will be fully retired on January 28, 2017. We’re proud that we’ve been able to help so many of you build great mobile apps. Read more on this announcement and what this means […]
Git-pandas v1.0.0, or how to check for a stable release
In the process of making the v1.0.0 release of git-pandas, I had one primary goal: to simplify and solidify the interface to git-pandas objects (the ProjectDirectory and the Repository). At the end of the day, the usefulness of a project like git-pandas versus one off analysis or rolling your own interface is consistent and predictable […]
Shiny Developers Conference Review
by Aimee Gott Late in 2015 I was delighted to receive an invite to the inaugural Shiny Developers Conference to be held in Stanford, California. I didn’t have to think twice about wanting to be there and now that it is over I am delighted I got the invite and made the trip. Joe Cheng […]
Summarizing Data in SQL
About Matt: Matt DeLand is Co-Founder and Data Scientist at Wagon. His team is building a collaborative SQL editor for analysts and engineers. He studied algebraic geometry at Columbia University, taught at the University of Michigan, and now enjoys applied machine learning— his mom is very proud! Introduction How quickly can you understand data from […]
Github.com cumulative blame in 5 lines of python
Git-pandas has gotten to be pretty capable. Currently in the master branch and soon to be in the v1.0.0 release, we’ve included a github.com interface to git-pandas via the GitHubProfile class. With this, in just a few lines of code, you can see how your profile has grown over time: from gitpandas.utilities.plotting import plot_cumulative_blame from […]
FQDN (Fully Qualified Domain Names) in R
By Steph Locke This is just a quick post, to mention how you can get your computer name with the domain it is registered in i.e.  the fully qualified domain name (FQDN) by using R. Base R In Windows to get the computer name in it’s fully qualified form you need to do: paste(Sys.getenv(“COMPUTERNAME”), Sys.getenv(“USERDNSDOMAIN”), […]
Market while developing 101: ideas & tools for the early days
It is usually useful to try to see the whole picture. In the field of internet startups thinking about development without planning market fit, might not be a great idea. Let’s put it this way: you have an amazing idea and gathered enough capital to make it work; sure, go lean and start to build […]
Powering applications with a Notebook Microservice
Previously, we learned how to create a microservice from a notebook using the Jupyter kernel gateway. We learned both how to annotate existing cells, as well as how to generate a new notebook with code cells ready to be filled-in. Now let’s look at this from the other direction: breaking down a problem into the microservices needed to implement its solution. […]
Notebook Microservice And Swagger
In previous posts we learned how to create a microservice in a notebook using the Jupyter kernel gateway. This will be the foundation for today’s post where we will be creating a notebook microservice with Swagger, a set of tools for representing REST APIs. With this this approach, notebook authors can create and deploy APIs that are […]
Jupyter Notebooks as RESTful Microservices
“Data science enables the creation of data products.” – Mike Loukides in What is data science? Data products take on many forms. Web articles, dashboard applications, and cloud services are all common vehicles for delivering value from data. Tools that help produce artifacts such as these are a necessary part of any data mining methodology. In […]
Decision strategies: beyond expected value
Oftentimes when making some kind of uncertain decision, the decision maker will use a measure such as expected value to make that decision. Imagine the case of a single coin flip where the better pays 5 dollars to play, and gets 2 dollars for heads and 10 dollars for tails. The expected value of this […]
Our Functional Future or: How I Learned to Stop Worrying and Love Haskell
This fall, I had the chance to dive headfirst into functional programming languages. It’s not a journey for the dabbler or the faint of heart, but it’s one that’s worthwhile. I’d been aware of functional programming and Haskell in particular for some time, but my attempts to learn it had been less than successful. The […]
WordPress on IBM Containers Update
Back in May 2015, Miguel showed us all how to deploy WordPress on Bluemix using containers. Now you can use the official WordPress image from the Docker Hub instead of the custom image used in the previous tutorial! How? This tutorial assumes you have already installed docker, the cf cli, and the cf “ic” plugin. For […]
Introducing the pygeohash stats module
Pygeohash version 1.1.0 is now live on pypi! It include the first release of the stats module, intended to provide high level stats and manipulations for lists of geohashes. The initial functions are: mean: provides the mean position from a list of geohashes northern: the northernmost geohash in the list southern: the southernmost geohash in […]
Solving the two envelopes problem with python and petersburg
In a couple of previous posts (here, here, and here) we’ve explored how petersburg represents uncertain decisions as a directed acyclic graph with weighted random decisions for which edge to take out of a given node. It turns out this is very similar to Bayesian networks, which will be the subject of post in the […]