Project 2: Understanding and Optimizing Energy Usage from HPC Centers

This project focuses on understanding and optimizing energy efficiency in High-Performance Computing (HPC) facilities, addressing the significant energy demands of modern computing systems. 

 

Project Description

This project deals with energy efficiency understanding and optimization in High-Performance Computing facilities. As there is growing demand for computing resources, HPC systems draw enormous amounts of energy and need to be optimally monitored and optimized. Students will be familiarized with patterns of energy consumption, monitoring, and performance modeling, emphasizing sustainability and cost-effectiveness in HPC facilities. Participants will explore energy consumption patterns, monitoring techniques, and performance modeling, with an emphasis on sustainability and cost-efficiency in HPC environments. This project will focus on  principles related to understanding power-aware computing, generating power usage data, and optimization techniques in accordance with real-world HPC operational challenges. Upon the project completion, students will possess practical competencies to measure, analyze, and optimize supercomputing power usage within centers.

Learning Objectives

  1. Understand the key energy consumers in an HPC system, including compute nodes, cooling, networking, and storage.
  2. Measure and analyze energy consumption using real-world metrics such as Power Usage Effectiveness (PUE) and FLOPS/Watt.
  3. Utilize monitoring tools to track power usage in HPC systems.

Materials

All Jupyter notebooks, data files, and other required materials for Project 2 can be found at the Github repository link below.