Paralelización de Aplicaciones de Alto Rendimiento Utilizando Arquitecturas Híbridas
1
OpenMP Fundamentals. Data Environment
2
Clase 02 (Versión Borrador)
3
Clase 03 (Versión Borrador)
4
Introduction to CUDA: The Basics. Memory Allocation and Data Movement API Functions. Introduction to the CUDA Toolkit. Thread Scheduling. Memory and Data Locality. Tiled Parallel Algorithms. Handling Arbitrary Matrix Sizes in Tiled Algorithms
5
Efficiency and Performance Considerations. Memory Access Performance. Reductions
6
Clase 06 (Versión Borrador)
7
Introduction to OpenACC. Profiling and Parallelizing
10
MPI: One-Sided Communication