Professional Documents
Culture Documents
ARM Cortex-A57 Block Diagram
ARM Cortex-A57 Block Diagram
Coherency Port
Slave
L1 Instruction Cache Branch Prediction
Accelerator ITLB 48KB
(ACP)
5 Stages
Indirect Predictor
w/path history
Snoop
128 bits Global History Buffer
MicroBTB (64-entry)
Instruction Fetch Branch Target Buffer
Extensions (ACE)
(BTB)(2k-4k)
Master
AXI Coherency
Return Stack
12 Stage
7 Stages
Fill/Evict
Buffer
L2 Arbitration
(512-entry)
L2 TLB
32-entry Register Rename
Loop Buffer Virtual to Physical Register Pool
(16-way set-associative
Shared L2 Cache
512KB/1MB/2MB
CP14/CP15
1 Stage
Registers Dispatch States
Commit
TAG RAM TAG RAM
Snoop
Up to 8 micro-ops Issue
1 Stage
Load/Store
Load/Store
Divide Cluster
Multiply,MAC
Branch
Cluster 1
Simple
Cluster 0
Simple
L2
4 Stages
4 Stages
Load-Store Unit
Prefetch
Engine
Store Buffer
2-10 Stages
3-12 Stage
1 Load & ARM Integer ALU & Out-of-Order
1 Store multiply & Shifter Pipeline
Load TLB L1 Data Cache per cycle Integer (includes v6-
(32-entry)
32KB ECC divide, SIMD)
Store TLB (2-way set-associative MAC
(32-entry) /64-byte cache line)
1 Stage
WriteBack
128 micro-ops in-flight Retirement Buffer