Professional Documents
Culture Documents
FPGA Co-Processor For The ALICE High Level Trigger: Gaute Grastveit University of Bergen Norway
FPGA Co-Processor For The ALICE High Level Trigger: Gaute Grastveit University of Bergen Norway
2Kirchhoff
5Institute
ALICE
TPC
- Time Projection Chamber
=> 15 Gbyte/s
Max data-rate to tape is 1.25 Gbyte/s Compression/selection is needed Conventional, lossless methods: factor 2
HLT functionality
Compress
Reduce the amount of data required to encode the event as far as possible without loosing physics information
Trigger
Accept/reject events on the basis of physics application
Select
Select regions of interest within an event
remove pile-up in p-p ...
Task: reconstruct the tracks of 20.000 charged particles (each producing 150 clusters) in the TPC Timebudget: 5 ms
PCI
RCU Readout Controller Unit DDL Data Detector Link RORC ReadOut Reciver Card
RCU ALTRO TPC FEE Buffer (8 Events) DDL RORC reveiver Buffer > 1000 Events
HLT farm
PCI
RcvBd NIC
PCI kernel in the FPGA FPGA will also be utilised for pattern recognition Reduces number of CPUs needed
FPGA: APEX 20K400 Next prototype: Altera Stratix FPGA Large internal memory DSP cores
Conventional approach with (2d) cluster finder and track follower High occupancy
(overlapping clusters):
Hough transform on raw data Cluster analysis for deconvolution (Kalman filter)
Cluster Finder
time
The numbers represent Charge (ADC values) A vertical uninterrupted stack of numbers is called a sequence. The square shows the geometric centre of the sequence.
Pad
RAM (lpm) T
Decoder
seq
FIFO (lpm)
seq
Merger
cluster
File: charges
C++ model
Relative Scales
As before the mean is calculated by:
+ Smaller numbers, only multiplies by <11 - Multiplication cant be done until merging takes place Alternative, (absolute): Decoder FIFO (lpm) Pre_Calc
(2 mult, 1 add)
Merger
Deconvolution
Simplified implementation, almost for free splits at minima in both directions (time and pad)
off
on
Merger Goals
spend few clock cycles per sequence use few logic elements
Clock cycles spent in the different states 6%
high clockspeed
&
30 % 22 %
new data
&
4% 0% 11 %
11 %
merge store W
idle
send all
&
11 %
send one
calc_dist insert_seq
Outlook
Implementation of Hough transformation
Back Linked List (ALTRO sequences) TPC coordinates (Padrow, Pad, Time) Local coordinates (X, Y, Z) (A,B,E) Parameter Space (k,phi,eta-index)
Histogram 1 Histogram 1
.. .. ..
Histogram N-1 Histogram N-1
Histogram N Histogram N
Conclusion
We have demonstrated the feasibility of a real time cluster finder implemented in an FPGA
ALICE
18 sectors on each side, each sector is readout in 6 subsectors Total is ca. 570.000 pads