Why Multiprocessors? The Structure of Multiprocessors. Elements of Multiprocessors
Why multiprocessing?
True parallelism:
Task level. Data level.
[Figure: block diagram with two CPUs, their memories (mem), and a print engine.]
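The two kinds of parallelism named above can be sketched in a few lines. This is only an illustration: the functions `scale` and `checksum` and the sample `frame` are hypothetical, and Python threads stand in for processing elements to show the structure of the work, not a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def scale(pixel):
    """Hypothetical per-item operation."""
    return pixel * 2


def checksum(block):
    """Hypothetical independent task over the whole block."""
    return sum(block) % 256


frame = [1, 2, 3, 4]

with ThreadPoolExecutor(max_workers=4) as pool:
    # Data level: the same operation applied to different data items.
    scaled = list(pool.map(scale, frame))

    # Task level: different operations running at the same time.
    checksum_job = pool.submit(checksum, frame)
    peak_job = pool.submit(max, frame)
    task_results = (checksum_job.result(), peak_job.result())

print(scaled, task_results)
```

In a real multiprocessor the data-level work would be split across identical PEs, while the task-level work could go to different, possibly specialized, PEs.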
Heterogeneous multiprocessors
Will often have a heterogeneous structure.
Different types of PEs. Specialized memory structure. Specialized interconnect.
Multiprocessor system-on-chip
Multiple processors.
CPUs, DSPs, etc. Hardwired blocks. Mixed-signal.
System-on-chip applications
Sophisticated markets:
High volume. Demanding performance, power requirements. Strict price restrictions.
Terminology
PE: processing element. Interconnection network: may require more than one clock cycle to transfer data. Message: address+data packet.
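The terminology can be made concrete with a small model. The `Message` and `Interconnect` classes and the fixed `latency_cycles` parameter are assumptions made for illustration; the slides only say that a message is an address+data packet and that a transfer may take more than one clock cycle.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    """An address+data packet."""
    address: int
    data: int


class Interconnect:
    """Hypothetical network that delivers each message after a fixed delay."""

    def __init__(self, latency_cycles):
        self.latency_cycles = latency_cycles
        self.cycle = 0
        self._in_flight = deque()  # (arrival_cycle, message), in send order

    def send(self, msg):
        self._in_flight.append((self.cycle + self.latency_cycles, msg))

    def tick(self):
        """Advance one clock cycle and return the messages that arrive."""
        self.cycle += 1
        arrived = []
        while self._in_flight and self._in_flight[0][0] <= self.cycle:
            arrived.append(self._in_flight.popleft()[1])
        return arrived


net = Interconnect(latency_cycles=3)
net.send(Message(address=0x1000, data=42))
deliveries = [net.tick() for _ in range(3)]  # nothing arrives until cycle 3
```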
Generic multiprocessor
Shared memory:
PE PE PE
Interconnect network
mem mem mem
Message passing:
mem PE mem PE mem PE
Interconnect network
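The difference between the two organizations can be sketched in software. This is a loose analogy, not a hardware model: threads stand in for PEs, a lock stands in for arbitration on the shared memory, and a queue stands in for the interconnect network. All names are hypothetical.

```python
import queue
import threading

# Shared memory: every PE reads and writes the same memory.
shared_mem = {"total": 0}
mem_lock = threading.Lock()  # stands in for arbitration on the shared memory


def shared_memory_pe(values):
    for v in values:
        with mem_lock:
            shared_mem["total"] += v


# Message passing: each PE owns its memory and communicates explicitly.
mailbox = queue.Queue()  # stands in for the interconnect network


def message_passing_pe(values):
    local_mem = sum(values)  # private memory, invisible to other PEs
    mailbox.put(local_mem)   # explicit message to whoever collects results


work = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]

for pe_body in (shared_memory_pe, message_passing_pe):
    pes = [threading.Thread(target=pe_body, args=(chunk,)) for chunk in work]
    for pe in pes:
        pe.start()
    for pe in pes:
        pe.join()

message_total = sum(mailbox.get() for _ in work)
print(shared_mem["total"], message_total)
```

Both versions compute the same sum. The shared-memory version must coordinate access to one location, while the message-passing version keeps data private and pays for communication only when it sends a result.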
Task allocation
Tasks may be created at:
Design time (video encoder). Run time (user interface).
Caches:
Memory consistency.
Overheads for Computers as Components 2e
Cache consistency
Problem: caches hide memory updates. Solution: have caches snoop changes.
[Figure: two PEs, each with a cache, connected through a network to two memories.]
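The problem and the snooping fix can be demonstrated with a toy model. The slides do not name a protocol, so this sketch assumes a write-through, write-invalidate scheme; the `Memory`, `Bus`, and `Cache` classes are hypothetical.

```python
class Memory:
    def __init__(self):
        self.cells = {}


class Bus:
    """Hypothetical shared network on which every cache can observe writes."""

    def __init__(self, memory):
        self.memory = memory
        self.caches = []

    def attach(self, cache):
        self.caches.append(cache)

    def write(self, source, address, value):
        self.memory.cells[address] = value
        for cache in self.caches:
            if cache is not source:
                cache.snoop_write(address)


class Cache:
    def __init__(self, bus, snooping):
        self.bus = bus
        self.snooping = snooping
        self.lines = {}
        bus.attach(self)

    def read(self, address):
        if address not in self.lines:  # miss: fetch from memory
            self.lines[address] = self.bus.memory.cells.get(address, 0)
        return self.lines[address]

    def write(self, address, value):
        self.lines[address] = value
        self.bus.write(self, address, value)  # write-through (assumed)

    def snoop_write(self, address):
        if self.snooping:
            self.lines.pop(address, None)  # invalidate the stale copy


def run(snooping):
    bus = Bus(Memory())
    a = Cache(bus, snooping)
    b = Cache(bus, snooping)
    b.read(0x10)          # b caches the initial value 0
    a.write(0x10, 7)      # a updates the location
    return b.read(0x10)   # what does b see now?


stale = run(snooping=False)  # b's cache hides the update
fresh = run(snooping=True)   # b's copy was invalidated, so it refetches
print(stale, fresh)
```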
Network topologies
Major choices.
Bus. Crossbar. Buffered crossbar. Mesh. Application-specific.
Bus network
Advantages:
Well-understood. Easy to program. Many standards.
Disadvantages:
Contention. Significant capacitive load.
2004 Wayne Wolf Overheads for Computers as Components 2e
Crossbar
Advantages:
No contention. Simple design.
Disadvantages:
Not feasible for large numbers of ports.
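A rough comparison shows why a crossbar avoids the contention of a bus and why it does not scale. The sketch assumes an idealized crossbar limited only by its busiest port and counts one switch per input/output pair; the function names and the sample transfers are hypothetical.

```python
from collections import Counter


def crosspoints(ports):
    """Switches in a full crossbar with `ports` inputs and `ports` outputs."""
    return ports * ports


def bus_cycles(transfers):
    """A shared bus carries one transfer per cycle, so transfers serialize."""
    return len(transfers)


def crossbar_cycles(transfers):
    """Cycles on an idealized crossbar: limited only by the busiest port."""
    if not transfers:
        return 0
    src_load = Counter(src for src, _ in transfers)
    dst_load = Counter(dst for _, dst in transfers)
    return max(max(src_load.values()), max(dst_load.values()))


# Four PEs each send to a different memory: (source, destination) pairs.
transfers = [(0, 0), (1, 1), (2, 2), (3, 3)]
on_bus = bus_cycles(transfers)
on_crossbar = crossbar_cycles(transfers)

# Crosspoint count grows with the square of the port count.
growth = [crosspoints(n) for n in (4, 16, 64)]
print(on_bus, on_crossbar, growth)
```

With these transfers the bus needs four cycles and the crossbar one, but going from 4 to 64 ports raises the switch count from 16 to 4096.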
Buffered crossbar
Advantages:
Smaller than crossbar. Can achieve high utilization.
Disadvantages:
Requires scheduling.
Mesh
Advantages:
Well-understood. Regular architecture.
Disadvantages:
Poor utilization.
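The regular structure of a mesh makes routing simple to express. The sketch below assumes a 2D mesh with dimension-order (x first, then y) routing, which the slides do not specify; `xy_route` is a hypothetical helper.

```python
def xy_route(src, dst):
    """Nodes visited from src to dst, moving along x first, then along y."""
    x, y = src
    path = [(x, y)]
    while x != dst[0]:
        x += 1 if dst[0] > x else -1
        path.append((x, y))
    while y != dst[1]:
        y += 1 if dst[1] > y else -1
        path.append((x, y))
    return path


route = xy_route((0, 0), (2, 1))
hops = len(route) - 1  # each hop occupies one link between neighbors
print(route, hops)
```

A transfer between distant nodes occupies several links in sequence, while links elsewhere in the mesh may carry nothing at the same time.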
Application-specific
Advantages:
Higher utilization. Lower power.
Disadvantages:
Must be designed. Must carefully allocate data.
TI OMAP
Targets communications, multimedia. Multiprocessor with DSP, RISC.
OMAP 5910:
[Figure: OMAP 5910 block diagram with C55x DSP, ARM9, MMU, memory controller, MPU interface, system DMA control, bridge, and I/O.]
[Figure: task graph with processes P1 to P5, tasks t1, t2, t3, and elements M1, M2, M3, alongside a schedule of the processes on CPU 1 and CPU 2.]
P2 delayed on CPU 1; data dependency delays P3; priority delays P4. Worst-case t3 delay is 80, not 50.