Data Science Things Roundup #10

This post was originally published here

Hey all, I haven't done one of these in quite a while, but thought I'd share a few more articles I've found interesting recently.

An analysis of twitter influencers in the field of data science & big data

This is a pretty in depth medium article that goes through some of the concepts in network analysis, through the lens of twitter data. It's not an area I know a ton about, but I found it approachable and really interesting. Check it out here.

StashPy

I am a pretty heavy user of the Elasticsearch ecosystem, and have found it to be a really powerful tool.  I also, as you probably know if you read this blog, work a lot in python.  StashPy is a python3 project that does more or less the same thing as a minimal logstash.  So it takes a config, runs listening on a TCP port, and pipes log data though a processing pipeline before indexing into Elasticsearch. Super cool. Check it out here.

Bayesian Survival Analysis with python and pymc3

Survival analysis is a really powerful branch of statistics concerned with predicting the time until some event happens.  It comes up a lot in the medical field in particular (predicting time to death for different cases, as an example).  I've used it lightly in a past post to try to predict time until a programmers code would be replaced or deleted, you can check that out here.  In this article, Austin walks through the math backing some of the more common algorithms, and then how to translate that into python. Check it out here.

Related Posts

Getting Started with Kaggle: House Prices Competition Founded in 2010, Kaggle is a Data Science platform where users can share, collaborate, and compete. One key feature of Kaggle is “Competitions&r...
Test Driven Development of a Django RESTful API This post walks through the process of developing a CRUD-based RESTful API with Django and Django REST Framework, which is used for rapidly building R...
How to become a data scientist Data science is one of the most buzzed about fields right now, and data scientists are in extreme demand. And with good reason – data scientists...
A Dramatic Tour through Python’s Data Visualization Landscape (including ggpy and Altair) by Dan Saber | April 19, 2017 This post originally appeared on Dan Saber's blog. We thought it was hilarious, so we asked him if we could repost ...

Leave a Reply

Be the First to Comment!

Notify of
avatar
wpDiscuz