I’ve written about using vagrant for 99.9% of my python work on here before (see here and here for examples). In addition to vagrant, I use jupyter notebooks on 99.9% of the work that I do, so I figured I’d spend a little time describing how I use jupyter with vagrant. First off, you’ll […]
Author: Python Data
Stockstats – Python module for various stock market indicators
I’m always working with stock market data and stock market indicators. During this work, there’s times that I need to calculate things like Relative Strength Index (RSI), Average True Range (ATR), Commodity Channel Index (CCI) and other various indicators and stats. My go-to for this type of work is TA-Lib and the python wrapper for […]
Vagrant on Windows
There are many different ways to install python and work with python on Windows. You can install Canopy or Anaconda to have an entire python ecosystem self-contained or you can install python directly onto your machine and configure all the bits and bytes yourself. My current recommendation is to use Vagrant on Windows combined with Virtualbox to […]
pandas Cheat Sheet (via yhat)
The folks over at yhat just released a cheat sheet for pandas. You can download the cheat sheet in PDF for here. There’s a couple important functions that I use all the time missing from their cheat sheet (actually….there are a lot of things missing, but its a great starter cheat sheet). A few things […]
Getting the ‘next’ row of data in a pandas dataframe
I’m currently working with stock market trade data that is output from a backtesting engine (I’m working with backtrader currently) in a pandas dataframe. The format of the ‘transcations’ data that is provided out of the backtesting engine is shown below. amount price value date 2016-01-07 00:00:00+00:00 79.017119 195.33 -15434.413883 2016-09-07 00:00:00+00:00 -79.017119 218.84 17292.106354 […]
Dask – A better way to work with large CSV files in Python
In a recent post titled Working with Large CSV files in Python, I shared an approach I use when I have very large CSV files (and other file types) that are too large to load into memory. While the approach I previously highlighted works well, it can be tedious to first load data into sqllite […]
Working with large CSV files in Python
I’m currently working on a project that has multiple very large CSV files (6 gigabytes+). Normally when working with CSV data, I read the data in using pandas and then start munging and analyzing the data. With files this large, reading the data into pandas directly can be difficult (or impossible) due to memory constrictions, […]
Installing Python on OSX (and the necessary modules)
If you need help installing python on OSX, read on. For the last three years, I’ve used a mac for all my development. I love the fact that everything ‘just works’ on the platform. That said, when you get into scientific computing and data analytics, especially with python, you can run into some issues. Just […]
Installing python on Windows
If you’ve done any work with python on Windows, you may be cringing right now at the thought of trying to do any type of python development work on the platform. Have no fear though…there is hope for python developers on Windows, especially if you are only going to be using python for data analysis, […]
Data Analytics & Python
So you want (or need) to analyze some data. You’ve got some data in an excel spreadsheet or database somewhere and you’ve been asked to take that data and do something useful with it. Maybe its time for data analytics & Python? Maybe you’ve been asked to build some models for predictive analytics. Maybe you’ve […]