Sumários

Class #7 – Performance Analysis (1)

17 Dezembro 2025, 18:00 António Raimundo


  • Profiling tools for GPU;

  • Kernel performance metrics;

  • Profiling workflow demonstration;

Class #6 – CUDA Compilation (2)

11 Dezembro 2025, 18:00 António Raimundo


  • Registers and memory hierarchy;

  • Global vs shared memory;

  • Memory access & optimisation strategies;

Class #5 – CUDA Compilation (1)

10 Dezembro 2025, 18:00 António Raimundo


  • CUDA kernel structure;

  • Launch configuration (blocks/threads);

  • Kernel execution workflow;

  • Quiz #1;

Class #4 – GPU Programming: Introduction to Numba

5 Dezembro 2025, 18:00 António Raimundo


  • JIT compilation concepts;

  • @cuda.njit for CPU;

  • Universal Functions (Vectorize)

  • Python performance optimisation basics;

Class #3 – GPU Programming: Introduction to CuPy

3 Dezembro 2025, 18:00 António Raimundo


  • Introduction to CUDA Architecture

  • Introduction to CuPy

  • Tensor operations and CPU <-> GPU array transfers.

  • NumPy vs CuPy for GPU acceleration;