Sumários
Class #7 – Performance Analysis (1)
17 Dezembro 2025, 18:00 • António Raimundo
Profiling tools for GPU;
-
Kernel performance metrics;
-
Profiling workflow demonstration;
Class #6 – CUDA Compilation (2)
11 Dezembro 2025, 18:00 • António Raimundo
Registers and memory hierarchy;
-
Global vs shared memory;
-
Memory access & optimisation strategies;
Class #5 – CUDA Compilation (1)
10 Dezembro 2025, 18:00 • António Raimundo
CUDA kernel structure;
-
Launch configuration (blocks/threads);
-
Kernel execution workflow;
-
Quiz #1;
Class #4 – GPU Programming: Introduction to Numba
5 Dezembro 2025, 18:00 • António Raimundo
JIT compilation concepts;
-
@cuda.njit for CPU;
Universal Functions (Vectorize)
-
Python performance optimisation basics;
Class #3 – GPU Programming: Introduction to CuPy
3 Dezembro 2025, 18:00 • António Raimundo
Introduction to CUDA Architecture
Introduction to CuPy
Tensor operations and CPU <-> GPU array transfers.
NumPy vs CuPy for GPU acceleration;