Parallel and Distributed Programming
Kenjiro Taura

1 What’s new (in the newest-first order)

2 Slides

  1. Introduction
  2. OpenMP
  3. CUDA
  4. OpenMP for GPU
  5. SIMD
  6. How to get nearly peak FLOPS (with CPU)
  7. What You Must Know about Memory, Caches, and Shared Memory
  8. Analyzing Data Access of Algorithms and How to Make Them Cache-Friendly
  9. Divide and Conquer
  10. Neural Network Basics
  11. Understanding Task Scheduling Algorithms

3 Languages

4 Hands-on programming exercise

5 How to get the credit

6 Topics covered