Parallel & Distributed Computing

Lecture No. 01
Introduction

Farhad M. Riaz
[email protected]

Department of Computer Science
NUML, Islamabad

Course Pre-requisites

 Programming experience (preferably Python, C++, or Java)
 Understanding of computer organization and architecture
 Understanding of operating systems
Requirements & Grading

 Roughly:
– 50% Final exam
– 25% Internal evaluation
 Quizzes: 8 marks
 Assignments: 8 marks
 Project: 9 marks
– 25% Midterm exam
Books

 Some good books are:
– Distributed Systems, Third Edition
– Principles of Parallel Programming
– Designing and Building Parallel Programs
– Distributed and Cloud Computing
Course Project

 At the end of the semester, students need to submit a semester project, for example:
– Distributed computing & smart-city services
– Large-scale convolutional neural networks
– Distributed computing with delay-tolerant networks
Course Overview

 This course covers the following main concepts:
– Concepts of parallel and distributed computing
– Analysis and profiling of applications
– Shared-memory concepts
– Distributed-memory concepts
– Parallel and distributed programming (OpenMP, MPI; see the sketch after this list)
– GPU-based computing and programming (CUDA)
– Virtualization
– Cloud computing, MapReduce
– Grid computing
– Peer-to-peer computing
– Future trends in computing
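As a first taste of the shared-memory programming covered later, here is a minimal OpenMP sketch. It is an illustrative example, not course material: a loop whose iterations OpenMP distributes across the machine's cores (compile with a flag such as g++ -fopenmp).

// Minimal OpenMP sketch (illustrative): loop iterations are
// distributed across cores; each prints the thread that ran it.
#include <cstdio>
#include <omp.h>

int main() {
    #pragma omp parallel for
    for (int i = 0; i < 8; ++i) {
        std::printf("iteration %d ran on thread %d\n", i, omp_get_thread_num());
    }
    return 0;
}

The iteration-to-thread pairings vary from run to run, which is itself a first lesson in parallel nondeterminism.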
Recommended Material
 Distributed Systems, Maarten van Steen & Andrew S. Tanenbaum, 3rd Edition (2020), Pearson.
 Parallel Programming: Concepts and Practice, Bertil Schmidt, Jorge Gonzalez-Dominguez, Christian Hundt, Moritz Schlarb, 1st Edition (2018), Elsevier.
 Parallel and High-Performance Computing, Robert Robey and Yuliana Zamora, 1st Edition (2021), Manning.
 Distributed and Cloud Computing: From Parallel Processing to the Internet of Things, Kai Hwang, Jack Dongarra, Geoffrey Fox, 1st Edition (2012), Elsevier.
 Multicore and GPU Programming: An Integrated Approach, Gerassimos Barlas, 2nd Edition (2015), Elsevier.
 Parallel Programming: For Multicore and Cluster Systems, Thomas Rauber and Gudula Rünger, Springer Science & Business Media, 2013.
[Figure-only slides: Recent Jobs; Research in Parallel & Distributed Computing; Single-Processor Architecture; Memory Hierarchy; 5 Years of Technology Advance; Productivity Gap; Pipelining; Multicore Trend; Application Partitioning]
High-Performance Computing (HPC)

 HPC is the use of parallel processing for running advanced application programs efficiently, reliably, and quickly.
 It applies especially to systems that function above a teraFLOPS (10^12 floating-point operations per second) processing speed.
 The term HPC is occasionally used as a synonym for supercomputing, although technically a supercomputer is a system that performs at or near the currently highest operational rate for computers.
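To make the teraFLOPS threshold concrete, a theoretical peak rate can be computed as sockets × cores per socket × clock rate × floating-point operations per cycle. The sketch below uses made-up machine parameters purely for illustration.

// Back-of-the-envelope peak-FLOPS arithmetic (all numbers assumed):
#include <cstdio>

int main() {
    double sockets = 2, cores_per_socket = 32;
    double clock_hz = 2.5e9, flops_per_cycle = 16;  // e.g., wide SIMD + FMA
    double peak = sockets * cores_per_socket * clock_hz * flops_per_cycle;
    std::printf("theoretical peak: %.2f teraFLOPS\n", peak / 1e12);  // 2.56
    return 0;
}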
GPU-accelerated Computing

 GPU-accelerated computing is the use of a graphics processing unit (GPU) together with a CPU to accelerate deep learning, analytics, and engineering applications.
 Pioneered in 2007 by NVIDIA, GPU accelerators now power energy-efficient data centers in government labs, universities, enterprises, and small-and-medium businesses around the world.
 They play a huge role in accelerating applications in platforms ranging from artificial intelligence to cars, drones, and robots.
What is a GPU?

 It is a processor optimized for 2D/3D graphics, video, visual computing, and display.
 It is a highly parallel, highly multithreaded multiprocessor optimized for visual computing.
 It provides real-time visual interaction with computed objects via graphics, images, and video.
 It serves as both a programmable graphics processor and a scalable parallel computing platform.
 Heterogeneous systems combine a GPU with a CPU.
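To show what "highly parallel, highly multithreaded" means in practice, here is a minimal CUDA C++ sketch (illustrative; assumes an NVIDIA GPU and the CUDA toolkit, which the course covers later). Each of roughly a million GPU threads adds one pair of vector elements.

// vecAdd: one GPU thread per element (CUDA C++, illustrative sketch).
#include <cstdio>
#include <cuda_runtime.h>

__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n) c[i] = a[i] + b[i];                  // one element per thread
}

int main() {
    const int n = 1 << 20;
    float *a, *b, *c;
    cudaMallocManaged(&a, n * sizeof(float));  // unified memory: CPU + GPU
    cudaMallocManaged(&b, n * sizeof(float));
    cudaMallocManaged(&c, n * sizeof(float));
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    vecAdd<<<(n + 255) / 256, 256>>>(a, b, c, n);  // ~4096 blocks x 256 threads
    cudaDeviceSynchronize();
    std::printf("c[0] = %.1f\n", c[0]);            // expect 3.0

    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}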
[Figure: SGI Altix supercomputer, 2300 processors]
[Figure-only slide: HPC System Composition]
Parallel Computers

 Virtually all stand-alone computers today are parallel from a hardware perspective:
– Multiple functional units (L1 cache, L2 cache, branch, pre-fetch, decode, floating-point, graphics processing (GPU), integer, etc.)
– Multiple execution units/cores
– Multiple hardware threads

[Figure: IBM BG/Q compute chip with 18 cores (PU) and 16 L2 cache units (L2)]
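A quick way to see this parallelism on your own machine is to query the number of hardware threads; a small standard C++ sketch:

// Report how many hardware threads (cores x SMT) the machine offers.
#include <cstdio>
#include <thread>

int main() {
    // May return 0 if the value cannot be determined on this platform.
    unsigned n = std::thread::hardware_concurrency();
    std::printf("hardware threads available: %u\n", n);
    return 0;
}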
Parallel Computers

 Networks connect multiple stand-alone computers (nodes) to form larger parallel computer clusters.
 Parallel computer cluster:
– Each compute node is a multi-processor parallel computer in itself
– Multiple compute nodes are networked together with an InfiniBand network
– Special-purpose nodes, also multi-processor, are used for other purposes
Types of Parallel and Distributed Computing

 Parallel Computing
– Shared memory
– Distributed memory

 Distributed Computing
– Cluster computing
– Grid computing
– Cloud computing
– Distributed pervasive systems
[Figure-only slide: Parallel Computing]
Distributed (Cluster) Computing

 Essentially a group of high-end systems connected through a LAN
 Homogeneous: same OS, near-identical hardware
 Single managing node
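On such a cluster, cooperating processes typically coordinate by message passing. A minimal MPI sketch (illustrative; assumes an MPI implementation such as Open MPI or MPICH, covered later in the course):

// Each MPI process has its own address space, possibly on its own node.
// Build: mpic++ hello_mpi.cpp   Run: mpirun -np 4 ./a.out
#include <cstdio>
#include <mpi.h>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);  // this process's id
    MPI_Comm_size(MPI_COMM_WORLD, &size);  // total number of processes
    std::printf("hello from rank %d of %d\n", rank, size);
    MPI_Finalize();
    return 0;
}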
Distributed (Grid) Computing

 Lots of nodes from everywhere:
– Heterogeneous
– Dispersed across several organizations
– Can easily span a wide-area network

 To allow for collaboration, grids generally use virtual organizations.
 In essence, a virtual organization is a grouping of users (or their IDs) that allows for authorization on resource allocation.
Distributed (Cloud) Computing
[Figure-only slide]
Distributed (Pervasive) Computing

 The emerging next generation of distributed systems, in which nodes are small, mobile, and often embedded in a larger system, characterized by the fact that the system naturally blends into the user's environment.
 Three subtypes:
– Ubiquitous computing systems: pervasive and continuously present, i.e., there is continuous interaction between system and user.
– Mobile computing systems: pervasive, but the emphasis is on the fact that the devices are inherently mobile.
– Sensor (and actuator) networks: pervasive, with emphasis on the actual (collaborative) sensing and actuation of the environment.
Why Use Parallel Computing?

The Real World is Massively Parallel
 In the natural world, many complex, interrelated events happen at the same time, yet within a temporal sequence.
 Compared to serial computing, parallel computing is much better suited for modeling, simulating, and understanding complex real-world phenomena.
 For example, imagine modeling such inherently concurrent phenomena serially.
SAVE TIME AND/OR MONEY (Main Reasons)

 In theory, throwing more resources at a task will shorten its time to completion, with potential cost savings (see the sketch below for the standard caveat).
 Parallel computers can be built from cheap, commodity components.
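The "in theory" caveat can be made precise with Amdahl's law (not on the slide, but standard): if only a fraction f of a program can be parallelized, the speedup on p processors is at most 1 / ((1 - f) + f/p). The sketch below evaluates this bound for an assumed f = 0.9.

// Amdahl's law: speedup is capped by the serial fraction (numbers assumed).
#include <cstdio>

int main() {
    double f = 0.90;  // assumed parallelizable fraction
    for (int p : {2, 4, 16, 1024}) {
        double s = 1.0 / ((1.0 - f) + f / p);
        std::printf("p = %4d  speedup = %5.2f\n", p, s);
    }
    return 0;
}

Even with 1024 processors the speedup stays below 10, which is why throwing more resources at a task only pays off when most of the task parallelizes.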
SOLVE LARGER / MORE COMPLEX PROBLEMS (Main Reasons)

 Many problems are so large and/or complex that it is impractical or impossible to solve them on a single computer, especially given limited computer memory.
 Example: web search engines/databases processing millions of transactions every second.
PROVIDE CONCURRENCY (Main Reasons)

 A single compute resource can only do one thing at a time; multiple compute resources can do many things simultaneously.
 Example: collaborative networks provide a global venue where people from around the world can meet and conduct work "virtually".
MAKE BETTER USE OF UNDERLYING PARALLEL HARDWARE (Main Reasons)

 Modern computers, even laptops, are parallel in architecture, with multiple processors/cores.
 Parallel software is specifically intended for parallel hardware with multiple cores, threads, etc.
 In most cases, serial programs run on modern computers "waste" potential computing power.

[Figure: Intel Xeon processor with 6 cores and 6 L3 cache units]
The Future (Main Reasons)

 During the past 20+ years, the trends indicated by ever-faster networks, distributed systems, and multi-processor computer architectures (even at the desktop level) clearly show that parallelism is the future of computing.
 In this same time period, there has been a greater than 500,000x increase in supercomputer performance, with no end currently in sight.
 The race is already on for exascale computing!
 Exaflop = 10^18 calculations per second
That’s all for today!!
