Parallel and Distributed Programming
Kenjiro Taura

1 What’s new (in the newest-first order)

2 Slides

  1. Introduction
  2. OpenMP
  3. CUDA
  4. SIMD
  5. How to get nearly peak FLOPS (with CPU)
  6. What You Must Know about Memory, Caches, and Shared Memory
  7. OpenMP for GPU
  8. Divide and Conquer
  9. Neural Network Basics
  10. Analyzing Data Access of Algorithms and How to Make Them Cache-Friendly?
  11. Understanding Task Scheduling Algorithms

3 Languages

4 Hands-on programming exercise

5 How to get the credit

  1. participate in in-class exercise (some may be group work)
  2. submit programming exercise assignments
  3. write and submit a final report (term paper).

6 Topics covered