IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

Scalable Machine Learning with High Performance and Cloud Computing

26-27/09/2020

Abstract

Deep Learning is emerging as the leading AI technique owing to the current convergence of scalable computing capability (i.e., HCP and Cloud computing), easy access to large volumes of data, and the emergence of new algorithms enabling robust training of large-scale deep neural networks. The tutorial aims at providing a complete overview for an audience that is not familiar with these topics.

Lecture 1: Introduction

🏛Jülich Supercomputing Centre - Forschungszentrum Jülich

🌐Machine learning and Deep Learning in remote sensing

🎛Deep learning and Supercomputing

Get lecture

Lecture 2: Levels of Parallelism and High Performance Computing

🍴The Free Lunch is Over

⛓Hardware Levels of Parallelism

📱High Performance Computing (HPC)

⚙️Jupyter-JSC

Get lecture

Get notebook

Lecture 3: Distributed Deep Learning

🏄Distributed training

🪓Horovod

🪐DeepSpeed

Get lecture

Lecture 4: Hands-on Distributed Deep Learning

🍽🍛🍹

Become familiar with Horovod, a data distributed training framework

Understand how to modify existing code to enable parallelism

Understand the importance of distributing data beforehand

Understand what Horovod does looking at the lines of code to be added

Create a job script to execute Python code on the GPUs

Play around with model architecture, optimizer, learning rate

Get lecture

Get notebook

Lecture 5: Big Data Analytics using Apache Spark

💥Apache Spark Basics

🔧Developing on Spark and Clouds

☁️Machine Learning on Spark

Get lecture

Get codes

No items found.

Previous teaching