There are three ways to do it. Sometimes package dependencies force analysts and developers to use older versions of Python. 1. Use conda to downgrade the Python version (if Anaconda is already installed): conda install python=3.5.0 Hat tip: http://chris35wills.github.io/conda_python_version/ and https://docs.anaconda.com/anaconda/faq#how-do-i-get-the-latest-anaconda-with-python-3-5 2. Download the latest version of Anaconda and then make a Python 3.5 environment. To create the new environment for Python 3.6, […]
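After a downgrade like the one in this excerpt, it helps to confirm which interpreter is actually running. Here is a minimal sketch, assuming the target is Python 3.5; the helper name python_matches is hypothetical and not part of conda or the original post.

```python
import sys


def python_matches(required, actual=None):
    """Return True when the major.minor part of `actual` equals `required`.

    `required` is a (major, minor) tuple such as (3, 5). `actual` defaults to
    the interpreter running this script.
    """
    if actual is None:
        actual = sys.version_info
    return tuple(actual[:2]) == tuple(required)


# Report the interpreter in use and whether it is the assumed 3.5 target.
print("Running Python %d.%d" % sys.version_info[:2])
print("Matches 3.5:", python_matches((3, 5)))
```

Run it inside the environment you just changed; if the second line prints False, the downgrade or environment switch probably did not take effect in that shell.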
Author: Ajay Ohri
Importing data from csv file using PySpark
There are two ways to import a CSV file: as an RDD, or as a Spark DataFrame (preferred).
!pip install pyspark
from pyspark import SparkContext, SparkConf
sc = SparkContext()
A SparkContext represents the connection to a Spark cluster, and can be used to create RDDs and broadcast variables on that cluster. https://spark.apache.org/docs/latest/rdd-programming-guide.html#overview To create a […]
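The excerpt is cut off before the actual import, so the following is only a sketch of what the two routes could look like, not the post's own code. It assumes you already have a SparkContext (sc) and a SparkSession (spark); the function names are hypothetical, and the RDD version uses a naive comma split that ignores quoted fields.

```python
def csv_as_rdd(sc, path):
    """Read a CSV file as an RDD of lists of strings.

    `sc` is assumed to be a live pyspark SparkContext. Each line is split on
    commas with no handling of quotes or embedded commas.
    """
    return sc.textFile(path).map(lambda line: line.split(","))


def csv_as_dataframe(spark, path):
    """Read a CSV file as a Spark DataFrame.

    `spark` is assumed to be a live pyspark.sql.SparkSession. The first row is
    treated as the header and column types are inferred from the data.
    """
    return spark.read.csv(path, header=True, inferSchema=True)
```

With a running session, usage would be along the lines of csv_as_rdd(sc, "data.csv").take(2) or csv_as_dataframe(spark, "data.csv").show(2), where data.csv is a placeholder file name.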
Data Analytics post Demonetization in India
The demonetisation of ₹500 and ₹1000 banknotes was a policy enacted by the Government of India on 8 November 2016. The announcement was made by Prime Minister Narendra Modi. PM Modi declared that all ₹500 and ₹1000 banknotes would be invalid from midnight and announced the issuance of new ₹500 and ₹2000 […]
Internet of Things: Trends and Challenges – A Data Science Perspective
Filed under: Internet
The Hack of the Century
Fake news propagated by social media just before the US general election; hacking of the Democratic Party by hackers of East European or Russian origin or affiliation, coordinated with the publication of leaked emails; leaked emails causing the FBI chief to make a fateful statement days before the election; the FBI chief recanting that statement, further undermining Clinton's credibility. I really […]
Which tool to learn for a better data science career
Some questions I get from new data scientists: I like R a lot, so should I work towards being better at just that, or should I learn Excel, Python, and SAS as well (like a jack of all trades, master of none)? I like R so much I wrote two books on it. Then I…
PYTHON FOR R USERS: Come September
I am writing a new book on a new language for me (Python) for a new publisher (Wiley). This book is the first of its kind to provide a reference that enables students and practitioners to easily learn to code in Python if they are familiar with R and vice versa, even if…
Coming to Bay Area in April
Despite my visa blues (see more at https://todayilearnedinamerica.wordpress.com/2016/02/15/night-13-make-epic-shit/ ), I am still hanging on and traveling on in the United States of America. I am also going to TWO of the best conferences I have never attended, despite being a blog partner for the past three years. Predictive Analytics World San Francisco – April 3-7, 2016…
Secured Communication for Hacker Activists and Liberals
Does the NSA track Git requests? I mean, can’t the terrorists just be talking to each other by visual cryptography of Arabic through Git repo requests? Basically, increase the cost of decryption. This is visual cryptography. Now imagine using a one-time pad codebook of just emojis and talking through mobile and Kik. Etherpad is…
Is Python going to be better than R for Big Data Analytics and Data Science? #rstats #python
Until now, the R ecosystem of package developers has mostly shrugged off the Big Data question. In a fascinating insight, Hadley Wickham said this in a recent interview; shockingly, it mimics the FUD you-know-who has been accused of (source: https://peadarcoyle.wordpress.com/2015/08/02/interview-with-a-data-scientist-hadley-wickham/ ). 5. How do you respond when you hear the phrase ‘big […]
Install Package in Python from Github
You can use pip install git+git://github.com/yhat/ggplot.git or pip install --upgrade https://github.com/yhat/ggplot/tarball/master
Filed under: Analytics Tagged: GitHub, python
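If you want to run the same kind of install from inside a script, one option is to build the pip command for the current interpreter. This is a rough sketch: the helper name is hypothetical, and it assumes the git+https:// form of the repository URL rather than the git+git:// form quoted in the post. It only builds the command; nothing is installed until you pass it to subprocess.

```python
import sys


def pip_install_from_github(owner, repo, ref=None):
    """Build a pip command that installs a package straight from GitHub.

    `ref` is an optional branch, tag, or commit to pin to. The command uses
    the interpreter running this script, so the package lands in the same
    environment.
    """
    url = "git+https://github.com/%s/%s.git" % (owner, repo)
    if ref:
        url += "@" + ref
    return [sys.executable, "-m", "pip", "install", url]


# Show the command for the repository mentioned in the post.
print(" ".join(pip_install_from_github("yhat", "ggplot")))
```

To actually run it, hand the list to subprocess.check_call. Using sys.executable avoids installing into a different Python than the one you are working in.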
Top 15 functions for Analytics in Python #python #rstats #analytics
Here is a list of the top fifteen functions for analysis in Python: import (imports a package or library); getcwd (from os): gets the current working directory; chdir (from os): changes directory; listdir (from os): lists files in the specified directory; read_csv (from pandas): reads in a CSV file; objectname.info (like proc contents […]
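The first few entries in that list can be tried out with nothing but the standard library. Below is a small sketch covering getcwd, chdir, and listdir only; the helper name and the sample.csv file are made up for illustration, and the pandas entries are left out so the example runs without extra installs.

```python
import os
import tempfile


def list_directory(path):
    """Change into `path`, list its contents, then return to where we started."""
    start = os.getcwd()       # remember the current working directory
    os.chdir(path)            # move into the target directory
    try:
        return sorted(os.listdir("."))  # names of files and folders in it
    finally:
        os.chdir(start)       # always restore the original directory


# Try it on a throwaway directory containing one empty file.
with tempfile.TemporaryDirectory() as tmp:
    open(os.path.join(tmp, "sample.csv"), "w").close()
    print(list_directory(tmp))  # prints ['sample.csv']
```

Restoring the working directory in a finally block is a design choice here, so a failed listing does not leave the session sitting in an unexpected folder.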