Podcast
Questions and Answers
What is the weightage of the CCEE Theory exam in the overall evaluation method?
What is the weightage of the CCEE Theory exam in the overall evaluation method?
Which book serves as the primary textbook for the HPC System Administration and Management course?
Which book serves as the primary textbook for the HPC System Administration and Management course?
What is the total duration of classroom and lab hours for the HPC System Administration and Management course?
What is the total duration of classroom and lab hours for the HPC System Administration and Management course?
Which of the following is NOT listed as a reference book for the course?
Which of the following is NOT listed as a reference book for the course?
Signup and view all the answers
What is the suggested prerequisite knowledge needed for enrolling in the HPC System Administration and Management course?
What is the suggested prerequisite knowledge needed for enrolling in the HPC System Administration and Management course?
Signup and view all the answers
What is a critical component of data center design that significantly influences energy efficiency?
What is a critical component of data center design that significantly influences energy efficiency?
Signup and view all the answers
Which lecture sessions focus on the design of HPC clusters?
Which lecture sessions focus on the design of HPC clusters?
Signup and view all the answers
What method is suggested for managing network usage in high-performance computing systems?
What method is suggested for managing network usage in high-performance computing systems?
Signup and view all the answers
Which of the following best describes the purpose of the latest trends and technologies in HPC as discussed?
Which of the following best describes the purpose of the latest trends and technologies in HPC as discussed?
Signup and view all the answers
What is a primary focus during the sessions that discuss liquid cooling in data centers?
What is a primary focus during the sessions that discuss liquid cooling in data centers?
Signup and view all the answers
Which session primarily addresses the requirements for building an HPC environment?
Which session primarily addresses the requirements for building an HPC environment?
Signup and view all the answers
What is the significance of system benchmarking in HPC management?
What is the significance of system benchmarking in HPC management?
Signup and view all the answers
Which tool is specifically mentioned for configuring monitoring in HPC systems?
Which tool is specifically mentioned for configuring monitoring in HPC systems?
Signup and view all the answers
What is a primary theme of the assignments related to HPC system management?
What is a primary theme of the assignments related to HPC system management?
Signup and view all the answers
What is the purpose of adapting standard Linux for the HPC environment?
What is the purpose of adapting standard Linux for the HPC environment?
Signup and view all the answers
Study Notes
HPC System Administration and Management - PG-DHPCSA
- Duration: 60 classroom hours + 60 lab hours
- Objective: Introduce HPC system administration and management
- Prerequisites: Knowledge of computer networks
- Evaluation: CCEE Theory Exam (40%), Lab Exam (Case Study Based - 40%), Internal Exam (20%)
- Textbooks: High Performance Cluster Computing: Architectures & Systems (Volume-1) by Rajkumar Buyya, Pearson
- References: Include various books and articles on parallel computing, distributed computing, and networking.
Data Center Design & Management (14 Hrs Theory)
- Session 1 & 2: Data center overview, design issues
- Session 3 & 4: HVAC, power sizing
- Session 5: Data center matrices, best practices, security & safety
- Session 6 & 7: Collection, rejection and reuse of heat, liquid cooling, energy use systems, cabinet & cable management
Ecosystem: Architecture of HPC Cluster (30 Hrs Theory + 44 Hrs Lab)
- Sessions 8 & 9: Requirement Analysis
- Sessions 10 & 11: Building blocks of HPC
- Sessions 12 & 13: Hardware and software selection process, Cluster Planning, Adapting Standard Linux for HPC environment
- Sessions 14 & 15: Design of HPC Cluster
- Sessions 16 & 17: Architecture and Cluster software
- Sessions 18 & 19: Cluster building tools
- Session 20 & 21: Multicore-architecture, Pascal, Accelerator cards, Configuring & setting environment for accelerator cards (CUDA Library)
- Session 22: Latest trends and technologies in HPC, Case study: Param Shavak and Use Cases of Param Shavak for HPC solutions, White Survey Paper on Multicore processor and latest advancement in this
HPC System Management and Monitoring (16 Hrs Theory + 36 Hrs Lab)
- Session 23: IPMI, HMC
- Sessions 24 & 25: Case study about Data Center and Visit of Data Center
- Sessions 26, 27, 28, 29 & 30: User management (LDAP/NIS), processor usage, memory usage, network monitoring, Gangila, Nagios, Node resources, System Benchmarking, Theoretical peak performance, HPL bench mark, Tuning HPL, Problem size, Block size, process grid PxQ
Assignments
- Data Center Visit: Building a manual HPC Cluster, HPC Cluster using different Cluster building and management tools, Monitoring tools installation & configuration, Network monitoring using Nagios, IPMI configuration, System benchmarking using HPL, Case study HPC Solution (PARAM Shavak)
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
This quiz covers key concepts in HPC system administration and data center design. It includes topics such as system architectures, performance metrics, and best practices for managing high-performance computing clusters. Ideal for students looking to enhance their understanding of HPC and data center management.