Welcome to Scribd!

EE204 Computer Architecture: Lecture 31-Cache Performance

Uploaded by

0% found this document useful (0 votes)

17 views11 pages

This document discusses cache performance and how to measure and improve it. It covers reducing cache miss rates by decreasing the probability of conflicts and adding additional cache levels. Cache performance is affected by memory stall clock cycles due to reads and writes. Read stall cycles depend on read miss rates and penalties while write stall cycles depend on write miss rates, penalties, and buffer stalls. The document provides an example to calculate performance improvements from a perfect cache versus one with misses.

Original Description:

cache + performance

Original Title

Lec31-Cache+Performance

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

0% found this document useful (0 votes)

17 views11 pages

EE204 Computer Architecture: Lecture 31-Cache Performance

Uploaded by

Anonymous tbEoUfv

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as pdf or txt

Jump to Page

You are on page 1of 11

Search inside document

EE204

Computer Architecture

Lecture 31- Cache

Performance
14th Apr, 2011

EE204 L31 Humaira, Spring 11 1

Measuring & Improving Cache
Performance
• Reducing Miss Rate
– Reducing the probability of two different memory
address for same Cache location
– Adding an Additional Level of Cache
• CPU time
– Clock cycles spent in instruction execution
– Clock cycles spent in waiting for memory system
(Memory-Stall Clock Cycles)

EE204 L31 Humaira, Spring 11 2

Cache Performance
• Memory-Stall Clock cycles
– Clocks spent in Cache Misses
– Read-Stall Cycles + Write-Stall Cycles
• Read-Stall Cycles
– Read accesses per program x
– Read Miss Rate x
– Read Miss Penalty (Clock Cycles)

EE204 L31 Humaira, Spring 11 3

Cache Performance
• Write-Stall Cycles
• Write-through Scheme
– Write Misses (requires fetching of block)
– Write Buffer Stalls (write buffer is full)
– Write Buffer stalls depends on timing of writes,
with sufficient write-buffer depth buffer stalls
are insignificant and can be ignored

EE204 L31 Humaira, Spring 11 4

Cache Performance
• Write-Stall Cycles
– Write accesses per program x
– Write Miss Rate x
– Write Miss Penalty (Clock Cycles)
• Write-Back scheme
– Write-stall when a Cache block is written back to
memory when block is replaced

EE204 L31 Humaira, Spring 11 5

Cache Performance
• Write-through Cache Scheme
– Read Miss Penalty = Write Miss Penalty
• Memory-Stall Clock cycles
– Memory Access per program x
– Miss Rate x
– Miss Penalty
• Memory-Stall Clock cycles
– Instructions per program x
– Misses per Instruction x
– Miss Penalty

EE204 L31 Humaira, Spring 11 6

Cache Performance Example
• Instruction Cache Miss Rate = 2%
• Data Cache Miss Rate = 4%
• Machine CPI = 2 without memory stalls
• Miss Penalty = 40 cycles
• 36% of Instructions are Data Access instructions
• How much faster will machine run with a perfect
Cache which never Misses?

EE204 L31 Humaira, Spring 11 7

Cache Performance Example
• Instruction Miss cycles = I x 2% x 40 = 0.80I
• Data Miss cycles = I x 36% x 4% x 40 = 0.57I
• Total Memory Stall cycles = 1.37I
• CPI with Memory stall = 2+1.37 = 3.37

EE204 L31 Humaira, Spring 11 8

Cache Performance Example
• Performance
= CPU time with stalls/CPU time with perfect Cache
= I x CPIstall x clock cycle/I x CPIperfect x clock cycle
= CPIstall /CPIperfect
= 3.37/2
= 1.68 faster
• Processor is made faster CPI = 1
• Performance = 2.37/1 = 2.37 faster

EE204 L31 Humaira, Spring 11 9

Cache Performance Example
• Clock Rate is doubled
• Total miss cycles/instruction
= (2% x 80) + 36% x (4% x 80) = 2.75
• CPI = 2 + 2.75 = 4.75
• Performance with fast clock/Performance with slow
clock
= exec. time with slow clock/exec. time with fast
clock
= IC x CPI x clock cycle/IC x CPI x clock cycle/2
= 3.36/4.75 x (1/2)
= 1.41
• m/c with faster clock is 1.41 times faster instead of
2.00
EE204 L31 Humaira, Spring 11 10
Cache Performance Example
• Relative Cache penalties increase as machine becomes
faster
• If Clock rate & CPI improve the performance suffers
– Lower the CPI, the more pronounced the effect of
stall cycles
– A higher CPU clock rate leads to a larger miss
penalty

EE204 L31 Humaira, Spring 11 11

Nabil Anwar Academic CV Vitae 2014
Document4 pages
Nabil Anwar Academic CV Vitae 2014
Gary Tom
No ratings yet
ch2 Appb
Document58 pages
ch2 Appb
Krupa Urankar
No ratings yet
Computer System Overview: 1 Spring 2015
Document48 pages
Computer System Overview: 1 Spring 2015
AsadKhan
No ratings yet
Lecture 16: Basic CPU Design
Document20 pages
Lecture 16: Basic CPU Design
Farid Mansur
No ratings yet
Computer Architecture and Organization: Lecture15: Cache Performance
Document17 pages
Computer Architecture and Organization: Lecture15: Cache Performance
Matthew R. Pon
No ratings yet
A Study On Hyper-Threading: Vimal Reddy Ambarish Sule Aravindh Anantaraman
Document29 pages
A Study On Hyper-Threading: Vimal Reddy Ambarish Sule Aravindh Anantaraman
Vetrivel Subramani
No ratings yet
Computer Architecture and Organization: Lecture16: Cache Performance
Document17 pages
Computer Architecture and Organization: Lecture16: Cache Performance
Matthew R. Pon
No ratings yet
Final Exam Topics: CSE 564 Computer Architecture Summer 2017
Document78 pages
Final Exam Topics: CSE 564 Computer Architecture Summer 2017
smart songs listen
No ratings yet
Lecture 12: Memory Hierarchy - Cache Optimizations: CSCE 513 Computer Architecture
Document69 pages
Lecture 12: Memory Hierarchy - Cache Optimizations: CSCE 513 Computer Architecture
Fahim Shaik
No ratings yet
Improving and Measuring Cache Performance
Document8 pages
Improving and Measuring Cache Performance
udhaya kumar
No ratings yet
ILP
Document47 pages
ILP
vengat.mailbox5566
No ratings yet
The Central Processing Unit:: What Goes On Inside The Computer
Document42 pages
The Central Processing Unit:: What Goes On Inside The Computer
Mag Creation
No ratings yet
Mod6 2 PDF
Document15 pages
Mod6 2 PDF
sourav giri
No ratings yet
Memory Management: Sadaqat Ali Khan Bangash
Document27 pages
Memory Management: Sadaqat Ali Khan Bangash
Fasihuddin Khan
No ratings yet
M3 Main PDF
Document63 pages
M3 Main PDF
SHAWN EZEKIEL ABIERA
No ratings yet
Cache Performance Average Memory Access Time
Document23 pages
Cache Performance Average Memory Access Time
Pulagam Lakshmi Sampath Reddy 21BME1298
No ratings yet
Operating System Reviw
Document24 pages
Operating System Reviw
Rafael D. Sanchez
No ratings yet
1.parallel Processing
Document20 pages
1.parallel Processing
dev chauhan
100% (7)
unit 5
Document44 pages
unit 5
Apurva Jarwal
No ratings yet
08 - Operating System Support
Document66 pages
08 - Operating System Support
ade_kuntil
No ratings yet
Chapter 1 (Parallel Computer Models)
Document20 pages
Chapter 1 (Parallel Computer Models)
Kushal Sh
No ratings yet
5.2 Eleven Advanced Optimizations of Cache Performance
Document13 pages
5.2 Eleven Advanced Optimizations of Cache Performance
Cieluu Panda
No ratings yet
MCS 041 (MCA 4th Sem Assignment) PDF
Document22 pages
MCS 041 (MCA 4th Sem Assignment) PDF
Johnvin Sunny
No ratings yet
Computer System Overview
Document51 pages
Computer System Overview
Tamanna Grewal
No ratings yet
Storage Management
Document11 pages
Storage Management
Simbiso
No ratings yet
Operting System Book
Document37 pages
Operting System Book
basit qamar
100% (3)
5 1
Document39 pages
5 1
tinni09112003
No ratings yet
Operating Systems CS240: Computer System Overview
Document26 pages
Operating Systems CS240: Computer System Overview
Michael Justine de Gracia
No ratings yet
Computer System Overview (Review) : Operating Systems
Document50 pages
Computer System Overview (Review) : Operating Systems
Lộc Khang Phúc
No ratings yet
Memory Management
Document27 pages
Memory Management
imrank39199
No ratings yet
Minmin 9
Document10 pages
Minmin 9
Mennah Tullah Sameh
No ratings yet
Computer Performance
Document27 pages
Computer Performance
hackstar742
No ratings yet
COMPX203 Computer Systems: Multitasking
Document50 pages
COMPX203 Computer Systems: Multitasking
Amiel Bougen
No ratings yet
M116C 1 M116C 1 Lect02-Performance
Document23 pages
M116C 1 M116C 1 Lect02-Performance
tinhtrilac
No ratings yet
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
Document18 pages
Parameters of Cache Memory: - Cache Hit - Cache Miss - Hit Ratio - Miss Penalty
Majety S Lskshmi
No ratings yet
Input Unit: Memory: in Processing Element (PE) or CPU: Output
Document24 pages
Input Unit: Memory: in Processing Element (PE) or CPU: Output
Hamzah Akhtar
No ratings yet
ILP - Appendix C PDF
Document52 pages
ILP - Appendix C PDF
Dhananjay Jahagirdar
No ratings yet
Intel 80586 (Pentium)
Document24 pages
Intel 80586 (Pentium)
Soumya Ranjan Panda
No ratings yet
Interrupt and Memory Hierarchy
Document32 pages
Interrupt and Memory Hierarchy
Jam Farhad Athar
No ratings yet
R RRRRRRRR Final
Document28 pages
R RRRRRRRR Final
Rachell Benemerito
No ratings yet
Lesson 01 Computersystemoverview
Document34 pages
Lesson 01 Computersystemoverview
Amir Amjad
No ratings yet
Ee4304 Fall2018 Lecture27
Document23 pages
Ee4304 Fall2018 Lecture27
Nathan Musial
No ratings yet
Test 6 PracticeQuestion Cachememory 1
Document21 pages
Test 6 PracticeQuestion Cachememory 1
anik.additional
No ratings yet
Main Memory
Document57 pages
Main Memory
Bilal Warraich
No ratings yet
Chapter 1 Lecture 2 & 3 - Performance
Document36 pages
Chapter 1 Lecture 2 & 3 - Performance
Seid Degu
No ratings yet
Chapter01-Computer System Overview
Document50 pages
Chapter01-Computer System Overview
James Lee
No ratings yet
Chapter4-Memory Management
Document35 pages
Chapter4-Memory Management
Prathamesh
No ratings yet
Os Chap2
Document69 pages
Os Chap2
amsalu alemu
No ratings yet
Lecture (2) .PPT-1
Document19 pages
Lecture (2) .PPT-1
nalahelmy
No ratings yet
EE (CE) 6304 Computer Architecture Lecture #2 (8/28/13)
Document35 pages
EE (CE) 6304 Computer Architecture Lecture #2 (8/28/13)
Vishal Mehta
No ratings yet
4 Performance
Document27 pages
4 Performance
1352 : NEEBESH PADHY
No ratings yet
Parallel Programming Platforms
Document109 pages
Parallel Programming Platforms
Karthik Laxmikanth
No ratings yet
Fundamentals of Computer Design - 1
Document32 pages
Fundamentals of Computer Design - 1
qwety300
No ratings yet
Chapter 1 Lecture 2 & 3 - Computer Performance
Document37 pages
Chapter 1 Lecture 2 & 3 - Computer Performance
Isiyak Solomon
No ratings yet
Lesson 7 The Central Processing Unit (CPU)
Document32 pages
Lesson 7 The Central Processing Unit (CPU)
Just Me
No ratings yet
CH 2 SymmShared Performance Issues
Document37 pages
CH 2 SymmShared Performance Issues
Aruna Shanmugakumar
No ratings yet
Memory Hierarchy Design-Aca
Document15 pages
Memory Hierarchy Design-Aca
GuruCharan Singh
No ratings yet
EEL 4768: Computer Architecture: Instruction Level Parallelism (ILP)
Document33 pages
EEL 4768: Computer Architecture: Instruction Level Parallelism (ILP)
miguel gonzalez
No ratings yet
2 Key Concepts: Assignments
Document18 pages
2 Key Concepts: Assignments
Anonymous Wu14iV9dq
No ratings yet
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
From Everand
Preliminary Specifications: Programmed Data Processor Model Three (PDP-3) October, 1960
Digital Equipment Corporation
No ratings yet
Advanced Backend Code Optimization
From Everand
Advanced Backend Code Optimization
Sid Touati
No ratings yet
S7-1200 1221, 1222, and 1223 Signal Boards
Document6 pages
S7-1200 1221, 1222, and 1223 Signal Boards
AlexanderViloriaMorante
No ratings yet
Operating Instructions/system Description Zener Barriers
Document24 pages
Operating Instructions/system Description Zener Barriers
Arith Krishnanandan
No ratings yet
Telephone-Cables FINOLEX PDF
Document2 pages
Telephone-Cables FINOLEX PDF
santosh kumar
No ratings yet
Project Report Water Logging Thane
Document79 pages
Project Report Water Logging Thane
avinash_mokashi7073
78% (9)
Liebherr LTM 11200 9.1 M
Document36 pages
Liebherr LTM 11200 9.1 M
tylerlhsmith
100% (1)
GustoMSC PRD12,000 Qdrill
Document2 pages
GustoMSC PRD12,000 Qdrill
jojojo
No ratings yet
At Destroyer
Document6 pages
At Destroyer
Xarly Bedlam
No ratings yet
Vishnukumar PH.D Resume
Document2 pages
Vishnukumar PH.D Resume
GCVishnuKumar
No ratings yet
Research On Operating Systems
Document6 pages
Research On Operating Systems
Patrick Ramos
No ratings yet
Penawaran Harga Mowa@ Astra Honda Motor Office Jakarta
Document3 pages
Penawaran Harga Mowa@ Astra Honda Motor Office Jakarta
Yulianto Eko
No ratings yet
JMF Ac - Base 2015 Coba
Document68 pages
JMF Ac - Base 2015 Coba
Cahyo 03
No ratings yet
Material Requirement Planning Bmhs
Document31 pages
Material Requirement Planning Bmhs
Bikesh Gautam
No ratings yet
Vane Type Single Pump VPF Series
Document3 pages
Vane Type Single Pump VPF Series
RAY
No ratings yet
GND Plane
Document4 pages
GND Plane
Patrick Sucre Mumo
No ratings yet
FMDS0796
Document26 pages
FMDS0796
Felipe Mees Faraco
No ratings yet
Swivel Joints Eng
Document24 pages
Swivel Joints Eng
hendry_hdw
No ratings yet
Semester - 4 - HT - Term Paper Topics - Even - 2020
Document3 pages
Semester - 4 - HT - Term Paper Topics - Even - 2020
Pradeep
No ratings yet
RX8200 Configuration Packs
Document7 pages
RX8200 Configuration Packs
evvn
No ratings yet
Physical Chem
Document12 pages
Physical Chem
Nicole Manju
No ratings yet
Lab Exercise
Document3 pages
Lab Exercise
Umie Nur Aisya
No ratings yet
Pulk / Sled / Ahkio: Huck Finland Outsidecamping
Document10 pages
Pulk / Sled / Ahkio: Huck Finland Outsidecamping
quae
No ratings yet
Project Brief Summary
Document19 pages
Project Brief Summary
Researcher
No ratings yet
BDC Mat
Document6 pages
BDC Mat
sathish11407144
No ratings yet
ME214 3 WI16 Syllabus
Document2 pages
ME214 3 WI16 Syllabus
Yakaj
No ratings yet
Outdoor Unit L: Ayout Guide
Document20 pages
Outdoor Unit L: Ayout Guide
aditarian .p
No ratings yet
Orthographic View of Bench Vise With Stopper Attachment
Document1 page
Orthographic View of Bench Vise With Stopper Attachment
ice
No ratings yet
SANDVIK Coromant Heavymachining Lathe Tools Inserts
Document64 pages
SANDVIK Coromant Heavymachining Lathe Tools Inserts
DesotoJoe
No ratings yet
Software Product Quality Metrics
Document48 pages
Software Product Quality Metrics
Devika Rankhambe
No ratings yet
50 Top Design of Masonry Structures Multiple Choice Questions
Document8 pages
50 Top Design of Masonry Structures Multiple Choice Questions
saikiran1493
No ratings yet