Professional Documents
Culture Documents
Kiến Trúc Máy Tính CS2009: Khoa Khoa học và Kỹ thuật Máy tính BM Kỹ thuật Máy tính Võ Tấn Phương
Kiến Trúc Máy Tính CS2009: Khoa Khoa học và Kỹ thuật Máy tính BM Kỹ thuật Máy tính Võ Tấn Phương
2009
Võ Tấn Phương
http://www.cse.hcmut.edu.vn/~vtphuong
dce
2009
Course Syllabus
• The Content
– Chapter1 (week1-2): Introduction to Computer Abstraction and
Technology
– Chapter2 (week3-5): Instructions – Language of the Computer
– Chapter3 (week6-7): Arithmetic for Computers
– Chapter4 (week10-12): The Processor
– Chapter5 (week13-14): Storage and Other IO topics
– Chapter6 (week15-16): Memory Systems
• References
– David A. Patterson and John L. Hennessy, Computer Organization &
Design – The Hardware/Software Interface, 4th Edition, Morgan
Kaufmann Publishers, 2008
– William Stallings, Computer Organization and Architecture –
Designing for Performance, 7th Edition, Pearson International Edition,
2006.
• Homepage:
http://www.cse.hcmut.edu.vn/~anhvu/teaching/2009/504002CS/
• Grading Policy:
– Homework: 20%
– Midterm examination: 30%
– Final examination: 50%
9/14/2009 ©2009, CE Department 2
dce
2009
Course Overview
• Principle and organization of digital
computers,
• Bus organization and memory design,
• Principle of computer’s instruction set and
programming in assembly language (some
popular processors are used such as
MIPS, Intel x86, ARM, …),
• Interface between the processor and
peripherals,
• Performance issues in computer
architecture.
9/14/2009 ©2009, CE Department 3
dce
2009
Why study Computer Architecture
• To be a professional in any field of computing today,
you should not regard the computer as just a back
box that executes program by magic.
• You should understand a computer system’s
functional components, their characteristics, their
performance, and their interactions.
• You need to understand computer architecture in
order to build a program so that it runs efficiently on
a machine.
• When selecting a system to use, you should be able
to understand the tradeoff among various
components, such as CPU clock speed vs. memory
size.
9/14/2009 ©2009, CE Department 4
dce
2009
Chapter 1
Components of a Computer
The BIG Picture • Same components for
all kinds of computer
– Desktop, server,
embedded
• Input/output includes
– User-interface devices
• Display, keyboard, mouse
– Storage devices
• Hard disk, CD/DVD, flash
– Network adapters
• For communicating with
other computers
Output
device
Network
cable
Input Input
device device
Anatomy of a Mouse
• Optical mouse
– LED illuminates
desktop
– Small low-res camera
– Basic image processor
• Looks for x, y
movement
– Buttons & wheel
• Supersedes roller-ball
mechanical mouse
BA C/Sud BA C/Sud
Concorde Concorde
Douglas Douglas DC-
DC-8-50 8-50
0 100 200 300 400 500 0 2000 4000 6000 8000 10000
BA C/Sud BA C/Sud
Concorde Concorde
Douglas Douglas DC-
DC-8-50 8-50
Clock (cycles)
Data transfer
and computation
Update state
Relative frequency
Sequence 1: IC = 5 Sequence 2: IC = 6
Clock Cycles Clock Cycles
= 2×1 + 1×2 + 2×3 = 4×1 + 1×2 + 1×3
= 10 =9
Avg. CPI = 10/5 = 2.0 Avg. CPI = 9/6 = 1.5
• Performance depends on
– Algorithm: affects IC, possibly CPI
– Programming language: affects IC, CPI
– Compiler: affects IC, CPI
– Instruction set architecture: affects IC, CPI, Tc
• In CMOS IC technology
Power = Capacitive load × Voltage 2 × Frequency
×30 5V → 1V ×1000
9/14/2009 ©2009, CE Department Chapter 1 — Computer Abstractions and Technology — 36
dce
2009
Reducing Power
• Suppose a new CPU has
– 85% of capacitive load of old CPU
– 15% voltage and 15% frequency reduction
Pnew Cold × 0.85 × (Vold × 0.85)2 × Fold × 0.85
= 2
= 0.85 4
= 0.52
Pold Cold × Vold × Fold
n
n
∏ Execution time ratio
i=1
i
Name Description IC×109 CPI Tc (ns) Exec time Ref time SPECratio
perl Interpreted string processing 2,118 0.75 0.40 637 9,777 15.3
bzip2 Block-sorting compression 2,389 0.85 0.40 817 9,650 11.8
gcc GNU C Compiler 1,050 1.72 0.47 24 8,050 11.1
mcf Combinatorial optimization 336 10.00 0.40 1,345 9,120 6.8
go Go game (AI) 1,658 1.09 0.40 721 10,490 14.6
hmmer Search gene sequence 2,783 0.80 0.40 890 9,330 10.5
sjeng Chess game (AI) 2,176 0.96 0.48 37 12,100 14.5
libquantum Quantum computer simulation 1,623 1.61 0.40 1,047 20,720 19.8
h264avc Video compression 3,102 0.80 0.40 993 22,130 22.3
omnetpp Discrete event simulation 587 2.94 0.40 690 6,250 9.1
astar Games/path finding 1,082 1.79 0.40 773 7,020 9.1
xalancbmk XML parsing 1,058 2.70 0.40 1,143 6,900 6.0
Geometric mean 11.7
⎛ 10 ⎞ ⎛ 10 ⎞
Overall ssj_ops per Watt = ⎜ ∑ ssj_ops i ⎟ ⎜ ∑ poweri ⎟
⎝ i=0 ⎠ ⎝ i=0 ⎠
Instruction count
MIPS =
Execution time × 10 6
Instruction count Clock rate
= =
Instruction count × CPI CPI × 10 6
× 10 6
Clock rate
CPI varies between programs on a given CPU