1 Status of Online Neural Networks Bruce Denby Université de Versailles and Laboratoire des...

Post on 04-Jan-2016

213 views 1 download

Tags:

transcript

1

Status of Online Neural Networks

Bruce DenbyUniversité de Versailles and

Laboratoire des Instruments et Systèmes, Paris, France

Rapporteur’s Presentation

ACAT2000 Fermilab 16-20 October, 2000

2

I. The current situationII. Developments foreseenIII. Neural net hardwareIV. Conclusions

OUTLINE OF THE PRESENTATION

3

Acknowledgements

Most of my transparencies were borrowed from the talks of:

• Sotirios Vlachos• Erez Etzion• Jean-Christophe Prévotet• Christian Kiesling• Bertrand Granado

4

The Current Situation

Neural network triggers are being used to produce physics.

Examples: 1) Dirac Experiment at the CERN PS2) H1 Experiment at HERA

5

6

•34 GeV p on target•Measure lifetime of pionium•Hodoscope input to NN

7

-The network is trained to select low Q events

8

• Net architecture 55-2-1• Note that the multiply/accumulate and sigmoid evaluation are done using look-up table memories.

9

- i.e., it works….

10

The H1 Neural Network Trigger Project The H1 Neural Network Trigger Project

Christian Kiesling Max-Planck-Institut für Physik

München, Germany

11

p

920 GeV

27.6 GeV

e

The H1 Experiment at HERA

Mission:

u

u

dGluon

Hardware (MPI):Liquid Argon Calorimeter (forward barrel section)LAr front end electronicsLAr trigger (L1)Neural Network Trigger (L2)

Physics analysis :Measurements of the structure functions F2, FL, F3, F2

D

Jet Measurements (strong coupling constant)Charm/Beauty Production (gluon content of proton)Diffractive Vector Meson Production (gluon struct.) Search for Instanton Effects (QCD „exotics“)

study• the structure of the nucleon • the fundamental interactions of quarks and gluons : Quantum chromodynamics (QCD)• electroweak interference

search• for physics beyond the Standard Model

12

The H1 Trigger Scheme

hard

ware

soft

ware

L1 trigger: OR of individual subdetectortriggers, such asMWPC, CJC, LAr, SpaCal, system ...

Neural Network at Level 2:

Global Event Decision

L2 systems: have access to informationfrom all subdetectors(information prepared bysubtrigger processors)

13

“physics”

“background”

MWPC’s (2 sets)

z-VRTX (from MWPC)

Trigger towers,global energies(8 bit numbers)

Trigger towersabove threshold(single bits)

Hits (single bits)

Hits (single bits)

Nr. of tracks(8 bit numbers)

16 bin histogram(8bit numbers)

Detector Informationat level 2

(example of photoproduction)/J

and there is much more physics in H1 ...

Calorimeter (LAr)hadronic electromagnetic

Central Jet chamber

SpaCal (Pb scint.)

µ chambers

14

Output (only one neuron)

Three-Layer Feed Forward Neural Net

),(0 ijwxFy weights

Architecture of the H1 Neural Network Trigger

00 y background

One hidden layer

Inputs (from detector)

10 y physics

discriminate„physics“ from „background“ :

Central Problem:Inputs for the Neural Nets

Data Selection

Data Transformation

15

Organization and Processing of Data from L1

Subdetector information arrives in consecutive time slices ti („frames“, orbunch crossings BC)

(tmax = 32 BC’s at present)

1 BC = 96 ns = 10 MHz transfer rate0 2 4 6 8 ...t(BC)

01234567

Subdetector 1

Subdetector 2

Subdetector 3

DDBI

DDBI

to neural network

L2 crate

backplane:L2 Bus

The L2 Bus (8 subbusses, 16 bit wide)

The Data Distribution Board (preprocessing of neural input)

Cables from subdetectors (maximum of 40)

data input units:

Selection of input data

Processing (look-up, summing)

16

CN

AP

S 0

CN

AP

S 1

CN

AP

S 2

CN

AP

S 3

CN

AP

S 4

CN

AP

S 5

CN

AP

S 6

CN

AP

S 7

CN

AP

S 8

CN

AP

S 9

CN

AP

S 1

0

CN

AP

S 1

1

VM

E S

UN

/S

Bus

Int

erfa

ceM

onit

orin

g

DD

B 0

DD

B 1

DD

B 2

DD

B 3

DD

B 4

DD

B 5

DD

B 6

DD

B 7

DD

B 8

DD

B 9

DD

B 1

0

DD

B 1

1

SB

us I

nter

face

Data from DetectorTo Final Decider

X11Terminal

Loading and Control

The Neural Trigger System

Set of independentnetworks,each one trained for a specific physics reaction

Network processors

Data selection andData transformation

: Modular and Expandable

17

The complete System

12 independent networks

Pre-processing modules (one for eachneural network)

Cables carrying rawinput data from the detector

Total of 1024 processors

Integrated computingpower:over 20 Giga MAC/sec

18

(random day in early 1999)

Trigger rate Monitor (24h)The Neural Network Trigger in Operation:

1:

2:

4:

5:

6:

7:

8:

9:

10:

11:

diff)( XKKp

peJp ),(/

peeJp )(/

peeJp )(/

peeJp )(/

XKDepe )(*

XDp *

Xp

VM

XJp )(/

(Boxes 0 and 3 also active during 99/00)

Backgroundrejection factor > 100 !

19

Some Physics: Elastic Photoproduction of Mesons /J

22

2

2

222

~,~

),()()(

Vp

Vg

ggS

MQWM

x

QxgxQW

2

2

2

22),(

1)( W

Q

xQxF

QW

expected large in QCD

expected small in Regge theory

Due to highly selective NN trigger background is under control up to the highest HERA energies

QCD

xg

C. Adloff et al., Phys. Lett. B483 (2000) 23

20

Photoproduction of Mesons withProton Dissociation

Recent results on d/dt :

Measurement possible due toneural trigger

(publication in preparation)

21

Developments Foreseen

I. H1 upgradeII. Atlas

22

Why a new preprocessor?

Neural Network Trigger successfully in operationsince Summer 1996, promising physics results, but:

NOW: need to prepare for higher selectivity (luminosity upgrade: HERA 2000:

factor 5 more physics @ constant logging rate)

New Goal: separate “interesting” physics from “uninteresting”physics

Need more Intelligent Preprocessing

H1: New Network Preprocessing - The DDB II

So far no information from LAr trigger towers used, only global energy sums, no subdetector correlations(limitation was dictated by time schedule for the realization of the trigger)

23

Intelligent Preprocessing for Neural NetworksJean-Christophe Prévotet,

MPI MünchenLaboratoire des Instruments et Systèmes (Paris VI)

24

New Preprocessing : The DDB2

Principle- “intelligent” preprocessing”

extract physical values for the neural net (impulse, energy, particle type)

- Combination of information from different subdetectors (the,phi plane)- Executed in 4 steps

Clustering Matching OrderingPost

Processing

find regions of interest

within a given detector layer

combination of clustersbelonging to the same

object

sorting of objectsby parameter

generatesvariablesfor the

neural network

25

Description of a DDB2 boardL2 bus

Matching OrderingPost

Processing

Clustering BT/TT

Clustering MWPC

Clustering CJC

Clustering FTT

Clustering Muon

Clustering Spacal

Workabledata

givento the NN

MEM

MEM

MEM

MEM

MEM

MEM

Data

Addresses

Storageofparameters

Addresses

Matching

26

Hardware specifications

Each board works on thesame data but parameterized differently

Organization :5 DDB2 boards connected to 5 CNAPS

Re-configurable hardwareindependent of data format changes

Time : 8µs (Clustering, Matching, Ordering, Post Processing)

27

Hardware resources

Time : 8 µs

Parallel processingPipeline steps

FPGA :- Low cost (prototype board)- Speed

- Xilinx Virtex Family XCV200, XCV400

XCV200 236K 14 75K XCV400 468K 20 153K

Data format Luts Lot of small memories

Type N° gates Rams SelRam bits

Clustering Matching Ordering Post processing

6 to 8 XCV200 2 XCV400 1 to ? XCV200 1 to ? XCV200

Algorithm

NumberType

28

Gain about a factor of 2 in efficiency with the new DDB II algorithms for this case.Expect increased selectivity also for other physics ...

How does Physics profit from the DDB II ?

PhysicsBackgr.

DDB I

Backgr. Physics

„DDB II“

(DDB II simulated with DDB I)

photo- production

Test reaction: photo-production

29

Momentum Reconstruction and Triggering in the ATLAS Detector

FermiLab, October 2000

Erez Etzion1, Gideon Dror2, David Horn1, Halina Abramowicz1

1. Tel-Aviv University, Tel Aviv, Israel.

2. The Academic College of Tel-Aviv-Yaffo, Tel Aviv, Israel.

30

ATLASS.C

SolenoidHadronCalorimeter

Muon DetectorsEM Calorimeters

Inner Detector

S.C Air coreToroids

31

LowPt High Pt trigger

Complicated magnetic field map => difficult problem

32

Network architecture

PTQ

sigmoid hidden layers

linear output

input

parameters of straight track of muon (preprocessing LMS)

33

Testing network performance

Training set 2500 events.In one octant.

Test set of 1829 events.

Distribution of network errors - approximately gaussian.

compatible with stochasticity of the data.

charge is discrete!!! 95.8% correct sign.

34

Summary & discussion

• The network can successfully estimate the charge and transverse momentum of the muon.

• Classification (triggering) is most efficient by specially trained network.

• The data is intrinsically stochastic giving rise to approximately gaussian errors.

• The simplicity of the network enables very fast hardware realization. (See presentation this workshop)

35

Neural Network Hardware

• Off-the-shelf neural net hardware is scarce• Many standard products no longer exist• What should we do in HEP?

36

ETANN, 1991 (Electrically Trainable Artificial Neural Network by Intel)(64x64x4 in 5 s)

CNAPS 1993 (Adaptive Solutions, Oregon) 64 @20 MHz 8/16

„Silicon Brain“ (Irvine Sensors Inc.)3D analog FPGA array

NeuroClassifier, 1994 (by P. Masa, Univ. Twente, NL) (70x6x1 in 20 ns)

SAND1 1995 (KfK, Germany) 4 @50 MHz 16/16

recent development:

Maharadja, 1999 (Paris, France) details at this conference(see talk of B. Granado, AI, Sess.I)

back to analog (?)

Analog Devices:

Digital Devices:

MA16 1994 (Siemens, Germany) 16 @50 MHz 16/16

TOTEM 1994 (Trento, Italy) 32 @30 MHz 16/ 8

towards a complexity similarto the human brain ...

Blue color: chip no longer produced

37

- One interesting solution: use memories to evaluate NN’s

38

- Another solution: can we use a fast ‘general purpose’ NN processor implemented in FPGA’s?

39

40

41

42

43

44

45

- FPGA clock speed of 100 MHz will be available soon.- implying execution times of a few 100 ns.

46

Conclusions

• Fast preprocessing is a concern – FPGA’s are one way to go

• H1 NN trigger upgrade is in the works• There is some NN trigger Neural net triggers exist and

they work• activity in LHC experiments: ATLAS muon proposal

(this workshop), CMS (electron trigger, Varela et al.)• Finding NN hardware is a problem• Memory or FPGA implementations may be the answer• See also Neural Networks in High Energy Physics: A

Ten Year Perspective, B. Denby, Comp. Phys. Comm.119, August 1, 1999, p 219.