Using Jupyter Notebook, Pandas, and Dask to crunch medium sized data HD
Stuart Mitchell Using Jupyter (ipython) notebook, pandas, and dask to crunch medium sized data. In this talk Stu will give talk and give some examples from the last month when he has been using pandas at work. Topics will include: 1. Setting up Jupyter Notebook on EC2 2. Loading and processing data 3. Graphs 4. installing and using dask to crunch bigger data sets 5. running a small cluster with dask. distributed. Presented at the NZPUG Auckland Meetup (https://www.meetup.com/NZPUG-Auckland/) ere is the html version of the presentation https://s3.amazonaws.com/nzem-files/Installing%2BJupyter%2Band%2Bpandas%2Bon%2BEC2.html For those of you that want to install and try Jupyter and Pandas yourself here is the ipython notebook https://s3.amazonaws.com/nzem-files/Installing%2BJupyter%2Band%2Bpandas%2Bon%2BEC2.ipynb
Похожие видео
Показать еще