Date: 15 November 2021 @ 19:00 - 20:00

Timezone: Eastern Time (US & Canada)

Langue d'enseignement: Anglais

This course is a part of SHARCNET's ongoing "Introduction to Advanced Research Computing" series of online courses for 2021-2022. Compute Canada account is required to enroll.To register for any of the courses:

• Follow this link: https://training.sharcnet.ca

• Click the Log in link at the top right-hand side

• Log in with your Compute Canada login and password

• Click Site Home in the left-hand side menu

• Click 2021-2022 Introduction to Advanced Research Computing (ARC)

• Browse the list of (currently available) courses and enroll in the ones you are interested in

• To enroll in a course click on the course name and then click on that course’s enroll button

Course Syllabus:

Some common libraries for data analytics in Python, such as Numpy, Pandas, Scikit-Learn, etc. usually work well if the dataset fits into the existing RAM on a single machine. However, when dealing with large datasets, it can be a significant challenge to work around such memory constraints. This is where Dask can help. Dask provides a framework and libraries that can handle large datasets on a single multi-core machine or on a cluster.

This course provides an introduction to Dask.

Keywords: Python, Programming, Statistics, Data Analysis


Activity log