 Description and performance of track and primary-vertex reconstruction with the CMS tracker

2014 JINST 9 P10009

(http://iopscience.iop.org/1748-0221/9/10/P10009)

PUBLISHED BY IOP PUBLISHING FOR SISSA MEDIALAB

RECEIVED: May 26, 2014
REVISED: July 21, 2014

ACCEPTED: August 19, 2014
PUBLISHED: October 16, 2014

Description and performance of track and
primary-vertex reconstruction with the CMS tracker

The CMS collaboration

E-mail: cms-publication-committee-chair@cern.ch

ABSTRACT: A description is provided of the software algorithms developed for the CMS tracker
both for reconstructing charged-particle trajectories in proton-proton interactions and for using
the resulting tracks to estimate the positions of the LHC luminous region and individual primary-
interaction vertices. Despite the very hostile environment at the LHC, the performance obtained
with these algorithms is found to be excellent. For tt̄ events under typical 2011 pileup conditions,
the average track-reconstruction efficiency for promptly-produced charged particles with transverse
momenta of pT > 0.9 GeV is 94% for pseudorapidities of |η| < 0.9 and 85% for 0.9 < |η| < 2.5.
The inefficiency is caused mainly by hadrons that undergo nuclear interactions in the tracker
material. For isolated muons, the corresponding efficiencies are essentially 100%. For isolated
muons of pT = 100 GeV emitted at |η| < 1.4, the resolutions are approximately 2.8% in pT, and
10 µm and 30 µm, respectively, in the transverse and longitudinal impact parameters. The position
resolution achieved for reconstructed primary vertices that correspond to interesting pp collisions
is 10–12 µm in each of the three spatial dimensions. The tracking and vertexing software is fast and
flexible, and easily adaptable to other functions, such as fast tracking for the trigger, or dedicated
tracking for electrons that takes into account bremsstrahlung.

KEYWORDS: Pattern recognition, cluster finding, calibration and fitting methods; Large detector-
systems performance; Performance of High Energy Physics Detectors

ARXIV EPRINT: 1405.6569

© CERN 2014 for the benefit of the CMS collaboration, published under the terms
of the Creative Commons Attribution 3.0 License by IOP Publishing Ltd and Sissa
Medialab srl. Any further distribution of this work must maintain attribution to the author(s) and the
published article’s title, journal citation and DOI.

doi:10.1088/1748-0221/9/10/P10009




Contents

1 Introduction

2 The CMS tracker

3 Reconstruction of hits in the pixel and strip tracker
3.1 Hit reconstruction in the pixel detector
    3.1.1 First-pass hit reconstruction
    3.1.2 Template-based hit reconstruction
3.2 Hit reconstruction in the strip detector
3.3 Hit efficiency
3.4 Hit resolution

4 Track reconstruction
4.1 Seed generation
4.2 Track finding
4.3 Track fitting
4.4 Track selection
4.5 Specialized tracking
    4.5.1 Electron track reconstruction
    4.5.2 Track reconstruction in the high-level trigger

5 Track reconstruction performance
5.1 Tracking efficiency and fake rate
    5.1.1 Results from simulation of isolated particles
    5.1.2 Results from simulated pp collision events
    5.1.3 Efficiency estimated from data
5.2 Resolution in the track parameters
    5.2.1 Results from simulation of isolated particles
    5.2.2 Results from simulated pp collision events
5.3 CPU execution time

6 Beam spot and primary-vertex reconstruction and its performance
6.1 Primary-vertex reconstruction
    6.1.1 Primary-vertex resolution
    6.1.2 Efficiency of primary-vertex reconstruction
6.2 Track and vertex reconstruction with the pixel detector
    6.2.1 Tracking efficiency and fake rate for pixel tracks
    6.2.2 Resolution in the parameters of pixel tracks
    6.2.3 Position resolution for pixel based vertices
6.3 Reconstruction of the LHC beam spot
    6.3.1 Determination of the position of the centre of the beam spot
    6.3.2 Determining the size of the beam spot

7 Summary and conclusions

The CMS collaboration

1 Introduction

At an instantaneous luminosity of 10³⁴ cm⁻² s⁻¹, typical of that expected at the Large Hadron
Collider (LHC), with the proton bunches crossing at intervals of 25 ns, the Compact Muon Solenoid
(CMS) tracker is expected to be traversed by about 1000 charged particles at each bunch crossing,
produced by an average of more than twenty proton-proton (pp) interactions. These multiple
interactions are known as pileup, to which prior or later bunch crossings can also contribute
because of the finite time resolution of the detector. Reconstructing tracks in such a high-occupancy
environment is immensely challenging. It is difficult to attain high track-finding efficiency, while
keeping the fraction of fake tracks small. Fake tracks are falsely reconstructed tracks that may
be formed from a combination of unrelated hits or from a genuine particle trajectory that is badly
reconstructed through the inclusion of spurious hits. In addition, the tracking software must run
sufficiently fast to be used not only for offline event reconstruction (of ≈10⁹ events per year), but
also for the CMS High-Level Trigger (HLT), which processes events at rates of up to 100 kHz.

The scientific goals of CMS [1, 2] place demanding requirements on the performance of the
tracking system. Searches for high-mass dilepton resonances, for example, require good momentum
resolution for transverse momenta pT of up to 1 TeV. At the same time, efficient reconstruction
of tracks with very low pT of order 100 MeV is needed for studies of hadron production rates and to
obtain optimum jet energy resolution with particle-flow techniques [3]. In addition, it is essential to
resolve nearby tracks, such as those from 3-prong τ-lepton decays. Furthermore, excellent impact
parameter resolution is needed for a precise measurement of the positions of primary pp interaction
vertices as well as for identifying b-quark jets [4].

While the CMS tracker [5] was designed with the above requirements in mind, the track-
finding algorithms must fully exploit its capabilities, so as to deliver the desired performance. The
goal of this paper is to describe the algorithms used to achieve this and show the level of perfor-
mance attained. The focus here is purely on pp collisions, with heavy ion collisions being beyond
the scope of this document. Section 2 introduces the CMS tracker; and section 3 describes the
reconstruction of the hits created by charged particles crossing the tracker’s sensitive layers. The
algorithms used to reconstruct tracks from these hits are explained in section 4; and the perfor-
mance obtained in terms of track-finding efficiency, proportion of fake tracks and track parameter
resolution is presented in section 5. Primary vertices from pp collisions are distributed over a lu-
minous region known as the beam spot. Reconstruction of the beam spot and of the primary vertex
positions is described in section 6. This is intimately connected with tracking, since on the one
hand, the beam spot and primary vertices are found using reconstructed tracks, and on the other
hand, an approximate knowledge of their positions is needed before track finding can begin. All
results shown in this paper are based on pp collision data collected or events simulated at a
centre-of-mass energy of √s = 7 TeV in 2011. The simulated events include a full simulation of the
CMS detector response based on GEANT4 [6]. All events are reconstructed using software from the same
period. The track-reconstruction algorithms have been steadily evolving since then, but still have a
similar design now.

The CMS detector [5] was commissioned initially using cosmic ray muons and subsequently
using data from the first LHC running period. Results obtained using cosmic rays in 2008 [7] are ex-
tensively documented in several publications pertaining to the pixel detector [8], strip detector [9],
tracker alignment [10], and magnetic field [11], and are of particular relevance to the present paper.
Results from the commissioning of the tracker using pp collisions in 2010 are presented in [12].

2 The CMS tracker

The CMS collaboration uses a right-handed coordinate system, with the origin at the centre of the
detector, the x-axis pointing to the centre of the LHC ring, the y-axis pointing up (perpendicular to
the plane of the LHC ring), and with the z-axis along the anticlockwise-beam direction. The polar
angle θ is defined relative to the positive z-axis and the azimuthal angle φ is defined relative to the
x-axis in the x-y plane. Particle pseudorapidity η is defined as − ln[tan(θ/2)].
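
As a quick illustration of the pseudorapidity definition above, the following Python sketch converts between θ and η. The function names are illustrative only and are not taken from any CMS software.

```python
import math

def pseudorapidity(theta):
    """Return eta = -ln[tan(theta/2)] for a polar angle theta in radians (0 < theta < pi)."""
    return -math.log(math.tan(theta / 2.0))

def polar_angle(eta):
    """Invert the definition: theta = 2 * atan(exp(-eta)), in radians."""
    return 2.0 * math.atan(math.exp(-eta))

# A particle emitted perpendicular to the beam (theta = 90 degrees) has eta close to 0.
print(pseudorapidity(math.pi / 2.0))

# |eta| = 2.5 corresponds to a polar angle of roughly 9.4 degrees from the beam axis.
print(math.degrees(polar_angle(2.5)))
```

Because η grows logarithmically as θ approaches the beam axis, a fixed step in η covers an ever smaller angular range at large |η|.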

The CMS tracker [5] occupies a cylindrical volume 5.8 m in length and 2.5 m in diameter, with
its axis closely aligned to the LHC beam line. The tracker is immersed in a co-axial magnetic
field of 3.8 T provided by the CMS solenoid. A schematic drawing of the CMS tracker is shown
in figure 1. The tracker comprises a large silicon strip tracker with a small silicon pixel tracker
inside it. In the central pseudorapidity region, the pixel tracker consists of three co-axial barrel
layers at radii between 4.4 cm and 10.2 cm and the strip tracker consists of ten co-axial barrel layers
extending outwards to a radius of 110 cm. Both subdetectors are completed by endcaps on either
side of the barrel, each consisting of two disks in the pixel tracker, and three small plus nine large
disks in the strip tracker. The endcaps extend the acceptance of the tracker up to a pseudorapidity
of |η| < 2.5.

The pixel detector consists of cylindrical barrel layers at radii of 4.4, 7.3 and 10.2 cm, and two
pairs of endcap disks at z = ±34.5 and ±46.5 cm. It provides three-dimensional (3-D) position
measurements of the hits arising from the interaction of charged particles with its sensors. The
hit position resolution is approximately 10 µm in the transverse coordinate and 20–40 µm in the
longitudinal coordinate, while the third coordinate is given by the sensor plane position. In total,
its 1440 modules cover an area of about 1 m² and have 66 million pixels.

The strip tracker has 15 148 silicon modules, which in total cover an active area of about
198 m² and have 9.3 million strips. It is composed of four subsystems. The Tracker Inner Barrel
(TIB) and Disks (TID) cover r < 55 cm and |z| < 118 cm, and are composed of four barrel layers,
supplemented by three disks at each end. These provide position measurements in rφ with
a resolution of approximately 13–38 µm. The Tracker Outer Barrel (TOB) covers r > 55 cm and
|z| < 118 cm and consists of six barrel layers providing position measurements in rφ with a
resolution of approximately 18–47 µm. The Tracker EndCaps (TEC) cover the region 124 < |z| < 282 cm.

[Figure 1 graphic: r (cm), 0 to 110, versus z (cm), −300 to 300, with lines of constant η from −3.0 to 3.0; labelled subsystems: PIXEL, TIB, TID−, TID+, TOB, TEC−, TEC+.]

Figure 1. Schematic cross section through the CMS tracker in the r-z plane. In this view, the tracker
is symmetric about the horizontal line r = 0, so only the top half is shown here. The centre of the tracker,
corresponding to the approximate position of the pp collision point, is indicated by a star. Green dashed lines
help the reader understand which modules belong to each of the named tracker subsystems. Strip tracker
modules that provide 2-D hits are shown by thin, black lines, while those permitting the reconstruction of
hit positions in 3-D are shown by thick, blue lines. The latter actually each consist of two back-to-back strip
modules, in which one module is rotated through a ‘stereo’ angle. The pixel modules, shown by the red
lines, also provide 3-D hits. Within a given layer, each module is shifted slightly in r or z with respect to its
neighbouring modules, which allows them to overlap, thereby avoiding gaps in the acceptance.

Each TEC is composed of nine disks, each containing up to seven concentric rings of silicon strip
modules, yielding a range of resolutions similar to that of the TOB.

To refer to the individual layers/disks within a subsystem, we use a numbering convention
whereby the barrel layer number increases with its radius and the endcap disk number increases
with its |z|-coordinate. When referring to individual rings within an endcap disk, the ring number
increases with the radius of the ring.

The modules of the pixel detector use silicon of 285 µm thickness, and achieve resolutions
that are roughly the same in rφ as in z, because of the chosen pixel cell size of 100 × 150 µm² in
rφ × z. The modules in the TIB, TID and inner four TEC rings use silicon that is 320 µm thick,
while those in the TOB and the outer three TEC rings use silicon of 500 µm thickness. In the barrel,
the silicon strips usually run parallel to the beam axis and have a pitch (i.e., the distance between
neighbouring strips) that varies from 80 µm in the inner TIB layers to 183 µm in the inner TOB
layers. The endcap disks use wedge-shaped sensors with radial strips, whose pitch varies from
81 µm at small radii to 205 µm at large radii.

The modules in the innermost two layers of both the TIB and the TOB, as well as the modules
in rings 1 and 2 of the TID, and 1, 2 and 5 of the TEC, carry a second strip detector module, which
is mounted back-to-back to the first and rotated in the plane of the module by a ‘stereo’ angle of
100 mrad. The hits from these two modules, known as ‘rφ’ and ‘stereo’ hits, can be combined
into matched hits that provide a measurement of the second coordinate (z in the barrel and r on the

– 3 –


2
0
1
4
 
J
I
N
S
T
 
9
 
P
1
0
0
0
9

disks). The achieved single-point resolution of this measurement is an order of magnitude worse
than in rφ .

The principal characteristics of the tracker are summarized in table 1.

Figure 2 shows the material budget of the CMS tracker, both in units of radiation lengths and
nuclear interaction lengths, as estimated from simulation. The simulation describes the tracker
material budget with an accuracy better than 10% [13], as was established by measuring the distri-
bution of reconstructed nuclear interactions and photon conversions in the tracker.

Table 1. A summary of the principal characteristics of the various tracker subsystems. The number of disks
corresponds to that in a single endcap. The location specifies the region in r (z) occupied by each barrel
(endcap) subsystem.

Tracker subsystem                  Layers          Pitch            Location
Pixel tracker barrel               3 cylindrical   100 × 150 µm²    4.4 < r < 10.2 cm
Strip tracker inner barrel (TIB)   4 cylindrical   80–120 µm        20 < r < 55 cm
Strip tracker outer barrel (TOB)   6 cylindrical   122–183 µm       55 < r < 116 cm
Pixel tracker endcap               2 disks         100 × 150 µm²    34.5 < |z| < 46.5 cm
Strip tracker inner disks (TID)    3 disks         100–141 µm       58 < |z| < 124 cm
Strip tracker endcap (TEC)         9 disks         97–184 µm        124 < |z| < 282 cm

[Figure 2 graphic, CMS simulation: t/X0 (left, 0 to 2.5) and t/λI (right, 0 to 0.7) versus η from −4 to 4; stacked contributions from Beam Pipe, Pixel, TIB and TID, TOB, TEC and Support Tube.]

Figure 2. Total thickness t of the tracker material traversed by a particle produced at the nominal interaction
point, as a function of pseudorapidity η , expressed in units of radiation length X0 (left) and nuclear interac-
tion length λI (right). The contribution to the total material budget of each of the subsystems that comprise
the CMS tracker is shown, together with contributions from the beam pipe and from the support tube that
surrounds the tracker.

3 Reconstruction of hits in the pixel and strip tracker

The first step of the reconstruction process is referred to as local reconstruction. It consists of the
clustering of zero-suppressed signals above specified thresholds in pixel and strip channels into
hits, and then estimating the cluster positions and their uncertainties defined in a local orthogonal
coordinate system (u,v) in the plane of each sensor. A pixel sensor consists of 100 × 150 µm²
pixels with the u-axis oriented parallel to the shorter pixel edge. In the strip sensors, the u-axis
is chosen perpendicular to the central strip in each sensor (which in the TEC is not parallel to the
other strips in the same sensor).

3.1 Hit reconstruction in the pixel detector

In the data acquisition system of the pixel detector [14], zero-suppression is performed in the
readout chips of the sensors [15], with adjustable thresholds for each pixel. The single-pixel
readout threshold is set to an equivalent charge of 3200 electrons.
Offline, pixel clusters are formed from adjacent pixels, including both side-by-side and
corner-by-corner adjacent cells. Each cluster must have a minimum charge equivalent to 4000 electrons. For
comparison, a minimum ionizing particle usually deposits around 21 000 electrons. Residual charge
miscalibrations caused by pixel-to-pixel differences of the charge injection capacitors, which are
used to calibrate the pixel gain, are extracted from laboratory measurements and included in the
Monte Carlo (MC) simulation.
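
The offline clustering step described above can be sketched as follows. This is a minimal illustration rather than the CMS implementation: the input format (a dictionary mapping (row, column) to charge in electrons), the function name, and the use of "greater than or equal" at both thresholds are assumptions made here.

```python
READOUT_THRESHOLD = 3200   # electrons, single-pixel readout threshold quoted in the text
CLUSTER_THRESHOLD = 4000   # electrons, minimum total cluster charge quoted in the text

def cluster_pixels(charges):
    """Group pixels into clusters of side-by-side or corner-by-corner adjacent cells.

    charges: dict mapping (row, col) -> charge in electrons (hypothetical input format).
    Returns a list of clusters, each a dict with the same structure.
    """
    # Re-apply the readout threshold so the sketch also accepts unsuppressed input.
    pending = {p: q for p, q in charges.items() if q >= READOUT_THRESHOLD}
    clusters = []
    while pending:
        seed, charge = pending.popitem()
        cluster = {seed: charge}
        stack = [seed]
        while stack:
            row, col = stack.pop()
            # Visit all eight neighbours (sides and corners) of the current pixel.
            for d_row in (-1, 0, 1):
                for d_col in (-1, 0, 1):
                    neighbour = (row + d_row, col + d_col)
                    if neighbour in pending:
                        cluster[neighbour] = pending.pop(neighbour)
                        stack.append(neighbour)
        # Keep only clusters whose summed charge reaches the cluster threshold.
        if sum(cluster.values()) >= CLUSTER_THRESHOLD:
            clusters.append(cluster)
    return clusters
```

For example, two diagonally adjacent pixels of 5000 and 3500 electrons form one accepted cluster, whereas an isolated pixel of 3300 electrons passes the readout threshold but fails the cluster requirement.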

Two algorithms are used to determine the position of pixel clusters. A fast algorithm (described
in section 3.1.1) is used during track seeding and pattern recognition, and a more precise algorithm
(section 3.1.2), based on cluster shapes, is used in the final track fit.

3.1.1 First-pass hit reconstruction

The position of a pixel cluster along the transverse (u) and longitudinal (v) directions on the sensor
is obtained as follows. The procedure is described only for the case of the u coordinate, but is
identical for the v coordinate.

The cluster is projected onto the u-axis by summing the charge collected in pixels with the
same u-coordinate [16]. The result is referred to as a projected cluster. For projected clusters
that are only one pixel large, the u-position is given by the centre of that pixel, corrected for the
Lorentz drift of the collected charge in the CMS magnetic field. For larger projected clusters, the
hit position uhit is determined using the relative charge in the two pixels at each end of the projected
cluster:

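\[
u_\text{hit} = u_\text{geom}
  + \frac{Q^u_\text{last} - Q^u_\text{first}}{2\,(Q^u_\text{last} + Q^u_\text{first})}\,
    \bigl| W^u - W^u_\text{inner} \bigr|
  - \frac{L^u}{2} ,
\tag{3.1}
\]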

where $Q^u_\text{first}$ and $Q^u_\text{last}$ are the charges collected in the first and last pixel of the projected cluster,
respectively; $u_\text{geom}$ is the position of the geometrical centre of the projected cluster; and the parameter
$L^u/2 = D \tan\Theta^u_L / 2$ is the Lorentz shift along the u-axis, where $\Theta^u_L$ is the Lorentz angle in this
direction, and D is the sensor thickness. For the pixel barrel, the Lorentz shift is approximately
59 µm. The parameter $W^u_\text{inner}$ is the geometrical width of the projected cluster, excluding its first
and last pixels. It is zero if the width of the projected cluster is less than three pixels. The charge
width $W^u$ is defined as the width expected for the deposited charge, as estimated from the angle of
the track with respect to the sensor, and equals

\[
W^u = D \,\bigl| \tan(\alpha^u - \pi/2) + \tan\Theta^u_L \bigr| ,
\tag{3.2}
\]

where the angle $\alpha^u$ is the impact angle of the track relative to the plane of the sensor, measured
after projecting the track into the plane perpendicular to the v-axis. If no track is available, $\alpha^u$ is
calculated assuming that the particle producing the hit moved in a straight line from the centre of
the CMS detector.

The motivation for eq. (3.1) is that the charge deposited by the traversing particle is expected
to only partially cover the two pixels at each end of the projected cluster. The quantity
$W^u - W^u_\text{inner}$, which is expected to have a value between zero and twice the pixel pitch (a modified
version of eq. (3.1) is used for any hits that do not meet this expectation), provides an estimate of the total
extension of charge into these two outermost pixels, while the relative charge deposited in these
two pixels provides a way to deduce how this total distance is shared between them. The distance
that the charge extends into each of the two pixels can thereby be deduced. This gives the position
of the two edges of the charge distribution, and the mean value of these edges, corrected for the
Lorentz drift, equals the position of the cluster.
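
To make eqs. (3.1) and (3.2) concrete, the sketch below evaluates them directly. It is only an illustration of the formulas as written: the function and argument names are invented here, all lengths are taken to be in µm and angles in radians, and the modified treatment used when the charge-width expectation is not met is not covered.

```python
import math

def charge_width(thickness, alpha, tan_lorentz):
    """Eq. (3.2): expected charge width W^u for sensor thickness D and impact angle alpha^u."""
    return thickness * abs(math.tan(alpha - math.pi / 2.0) + tan_lorentz)

def first_pass_position(u_geom, q_first, q_last, w, w_inner, lorentz_shift):
    """Eq. (3.1): hit position from the charges in the two end pixels.

    lorentz_shift is L^u / 2, e.g. about 59 (µm) for the pixel barrel.
    """
    asymmetry = (q_last - q_first) / (2.0 * (q_last + q_first))
    return u_geom + asymmetry * abs(w - w_inner) - lorentz_shift

# Equal end-pixel charges leave only the Lorentz correction to the geometrical centre.
print(first_pass_position(0.0, 2000.0, 2000.0, 150.0, 0.0, 59.0))

# More charge in the last pixel pulls the estimate towards that end of the cluster.
print(first_pass_position(0.0, 1000.0, 3000.0, 150.0, 0.0, 59.0))
```

The charge asymmetry term lies between −1/2 and +1/2, so the correction to the geometrical centre never exceeds half of $|W^u - W^u_\text{inner}|$.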

3.1.2 Template-based hit reconstruction

The high level of radiation exposure of the pixel detector can significantly affect the collection of
charge by the pixels during the detector’s useful life. This particularly degrades the performance
of the standard hit reconstruction algorithm, sketched in the previous section, as this algorithm
only uses the end pixels of projected clusters when determining hit positions. The reconstructed
positions of hits can be biased by up to 50 µm in highly irradiated sensors, and the hit position
resolution can be severely degraded. In the template-based reconstruction algorithm, the observed
distribution of the cluster charge is compared to expected projected distributions, called templates,
to estimate the positions of hits [17].

The templates are generated based on a large number of simulated particles traversing pixel
modules, which are modelled using the detailed PIXELAV simulation [18–20]. Since the PIXELAV
program can describe the behaviour of irradiated sensors, new templates can be generated over the
life of the detector to maintain the performance of the hit reconstruction. To allow the template-
based algorithm to be applied to tracks crossing the silicon at various angles, different sets of
templates are generated for several ranges of the angle between the particle trajectory and the
sensor. Working in each dimension independently, each pixel is subdivided into nine bins along the
u (or v) axis, where each bin has a width of one-eighth of the size of a pixel and the end bins are
centred on the pixel boundaries. The u (or v) coordinate of the point of interception of the particle
trajectory and the pixel (defined as the position at which the track crosses the plane that lies halfway
between the front and back faces of the sensor) is used to assign the interception point to one of
the nine bins, j, indicating its location within the pixel. The charge profile of the cluster produced
by each particle is projected into an array that is 13 pixels long along the u axis (or 23 pixels long
along the v axis) and centred on the intercepted pixel. The resulting charge in each element i of this
array is recorded. Only clusters with a charge below some specified angle-dependent maximum,
determined from simulation, are used, as the charge distributions can be distorted by the significant
ionization caused by energetic delta rays. This procedure provides an accurate determination of the
projected cluster distributions, determined by effects of geometry, charge drift, trapping, and charge
induction. In each dimension, the mean charge $S_{i,j}$ in bin (i, j), averaged over all the particles, is
then determined. In addition, the RMS charge distributions for the two projected pixels at the two
ends of the cluster are extracted, as are the charge in the projected pixel that has the highest charge
within the cluster, and the cluster charge, both averaged over all tracks.

The charge distribution of a reconstructed cluster, projected onto either the u or v axis, can be
described in terms of a charge $P_i$ in each pixel i of the cluster. This can be compared to the expected
charge distributions $S_{i,j}$ stored in the templates, so as to determine the bin j where the particle is
likely to have crossed the sensor, and hence the best estimate of the reconstructed hit position. This
is accomplished by minimizing a $\chi^2$ function for several or all of the bins:

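\[
\chi^2(j) = \sum_i \left( \frac{P_i - N_j S_{i,j}}{\Delta P_i} \right)^2 ,
\tag{3.3}
\]

with

\[
N_j = \sum_i \frac{P_i}{(\Delta P_i)^2} \Bigg/ \sum_i \frac{S_{i,j}}{(\Delta P_i)^2} .
\tag{3.4}
\]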

In this expression, $\Delta P_i$ is the expected RMS of a charge $P_i$ from the PIXELAV simulation and $N_j$
represents a normalization factor between the observed cluster charge and the template. While
a sum over all the template bins yields an absolute minimum, different strategies can be used to
optimize the performance of the algorithm as a function of allowed CPU time. As described in
section 4.3, this $\chi^2$ is also used to reject outliers during track fitting, in particular pixel hits on a
track that are incompatible with the distribution expected for the reconstructed track angle.
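
The bin scan defined by eqs. (3.3) and (3.4) can be sketched as below. This is a simplified illustration, not the CMS template code: the templates are plain lists supplied by the caller, every bin is scanned, and the function names are hypothetical.

```python
def template_chi2(observed, template, sigma):
    """Return (chi2, N_j) of eqs. (3.3) and (3.4) for one template bin j.

    observed: projected cluster charges P_i
    template: expected charges S_{i,j} for this bin
    sigma:    expected RMS values Delta P_i
    """
    numerator = sum(p / s ** 2 for p, s in zip(observed, sigma))
    denominator = sum(t / s ** 2 for t, s in zip(template, sigma))
    norm = numerator / denominator
    chi2 = sum(((p - norm * t) / s) ** 2 for p, t, s in zip(observed, template, sigma))
    return chi2, norm

def best_bin(observed, templates, sigma):
    """Return the index j of the template with the smallest chi2."""
    scores = [template_chi2(observed, t, sigma)[0] for t in templates]
    return min(range(len(scores)), key=scores.__getitem__)
```

Scanning every bin is the most expensive strategy. Restricting the scan to a few bins around a first-pass estimate would be one way to trade accuracy against CPU time, as the text indicates.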

A simplified estimate of the position of a hit is performed for cluster projections consisting of
a single pixel by correcting the position of the hit for bias from Lorentz drift and possible radiation
damage. The bias is defined by the average residual of all single-pixel clusters, as detailed below.

For cluster projections consisting of multiple pixels, the estimate of the hit position is further
refined. The charge template expected for a track crossing the pixel at an arbitrary position r near
the best bin j is approximated by the expression $(1-r)\,S_{i,j-1} + r\,S_{i,j+1}$. Substituting this expression
in place of $S_{i,j}$ in eq. (3.3), and minimizing $\chi^2$ with respect to r, yields an improved estimate of the
hit position.

Finally, the above-mentioned hit reconstruction algorithm is applied to the same PIXELAV MC
samples originally used to generate the templates. Since the true hit position is known, any bias
in the reconstructed hit position can be determined and accounted for when the algorithm is run
on collision data. In addition, the RMS of the difference between the reconstructed and true hit
position is used to define the uncertainty in the position of a reconstructed hit.

3.2 Hit reconstruction in the strip detector

The data acquisition system of the strip detector [21] runs algorithms on off-detector electronics
(namely, on the modules of the front-end driver (FED) [22]) to subtract pedestals (the baseline
signal level when no particle is present) and common mode noise (event-by-event fluctuations in
the baseline within each tracker readout chip), and to perform zero-suppression. Zero-suppression
accepts a strip if its charge exceeds the expected channel noise by at least a factor of five, or if both
the strip and one of its neighbours have a charge exceeding twice the channel noise. As a result,
information for only a small fraction of the channels in any given event is retained for offline
storage.
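
The zero-suppression rule above can be written as a short sketch. In the detector this logic runs in the FED firmware; the Python below only illustrates the acceptance condition, with a hypothetical function name and with pedestal- and common-mode-subtracted charges assumed as input.

```python
def zero_suppress(charges, noise):
    """Return the indices of the strips accepted by the zero-suppression rule.

    charges: per-strip charge after pedestal and common-mode subtraction
    noise:   per-strip expected channel noise (same units, all positive)
    """
    n = len(charges)
    kept = []
    for i in range(n):
        ratio = charges[i] / noise[i]
        if ratio >= 5.0:
            # A single strip at least five times above its noise is always kept.
            kept.append(i)
        elif ratio > 2.0:
            # Otherwise the strip needs a neighbour that also exceeds twice its noise.
            for j in (i - 1, i + 1):
                if 0 <= j < n and charges[j] / noise[j] > 2.0:
                    kept.append(i)
                    break
    return kept
```

With a uniform noise of 4 units, charges of [1, 9, 10, 3, 25, 7, 1] would keep strips 1 and 2 (mutually supporting neighbours) and strip 4 (above five times the noise).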

Offline, clusters are seeded by any channel passing zero-suppression that has a charge at least
a factor of three greater than the corresponding channel noise [1]. Neighbouring strips are added to
each seed, if their strip charge is more than twice the strip noise. A cluster is kept if its total charge
is a factor of five larger than the cluster noise, defined as $\sigma_\text{cluster} = \sqrt{\sum_i \sigma_i^2}$, where $\sigma_i$ is the noise for
strip i, and the sum runs over all the strips in the cluster.
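
The three thresholds above (seed, neighbour, and cluster) can be sketched as follows. This is a simplified illustration under assumptions made here: strips removed by zero-suppression carry zero charge, clusters are contiguous with no allowance for gaps, and the function name is hypothetical.

```python
import math

def find_clusters(charges, noise):
    """Return clusters as lists of strip indices, using the 3/2/5 sigma thresholds."""
    n = len(charges)
    used = [False] * n
    clusters = []
    for seed in range(n):
        # A seed needs a charge of at least three times its own noise.
        if used[seed] or charges[seed] < 3.0 * noise[seed]:
            continue
        lo = hi = seed
        # Grow the cluster while adjacent strips exceed twice their noise.
        while lo > 0 and not used[lo - 1] and charges[lo - 1] > 2.0 * noise[lo - 1]:
            lo -= 1
        while hi < n - 1 and not used[hi + 1] and charges[hi + 1] > 2.0 * noise[hi + 1]:
            hi += 1
        strips = list(range(lo, hi + 1))
        for i in strips:
            used[i] = True
        total = sum(charges[i] for i in strips)
        cluster_noise = math.sqrt(sum(noise[i] ** 2 for i in strips))
        # Keep the cluster only if its charge is at least five times the cluster noise.
        if total >= 5.0 * cluster_noise:
            clusters.append(strips)
    return clusters
```

Because the cluster noise grows only as the square root of the number of strips, wide clusters with modest per-strip charge can still pass the final requirement.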

The position of the hit corresponding to each cluster is determined from the charge-weighted
average of its strip positions, corrected by approximately 10 µm (20 µm) in the TIB (TOB) to ac-
count for the Lorentz drift. One additional correction is made to compensate for the fact that charge
generated near the back-plane of the sensitive volume of the thicker silicon sensors is inefficiently
collected. This inefficiency shifts the cluster barycentre along the direction perpendicular to the
sensor plane by approximately 10 µm in the 500 µm thick silicon, while its effect is negligible in
the 320 µm thick silicon. The inefficient charge collection from the sensor backplane is caused by
the narrow time window during which the APV25 readout chip [23] integrates the collected charge,
and whose purpose is to reduce background from out-of-time hits.

The uncertainty in the hit position is usually parametrized as a function of the expected width
of the cluster obtained from the track angle (i.e., the ‘charge width’ defined in section 3.1.1).
However, in rare cases, when the observed width of a cluster exceeds the expected width by at least
a factor of 3.5, and is incompatible with it, the uncertainty in the position is then set to the ‘binary
resolution’, namely, the width of the cluster divided by √12. This broadening of the cluster is
caused by capacitive coupling between the strips or energetic delta rays.
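
The factor √12 is the standard deviation of a uniform distribution. As a short worked step, assuming only that the true position is equally likely anywhere within a cluster of width w (an assumption stated here for illustration), the variance about the cluster centre is

\[
\sigma^2 = \frac{1}{w} \int_{-w/2}^{w/2} x^2 \, dx = \frac{w^2}{12} ,
\qquad
\sigma = \frac{w}{\sqrt{12}} \approx 0.29\, w .
\]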

3.3 Hit efficiency

The hit efficiency is the probability to find a cluster in a given silicon sensor that has been traversed
by a charged particle.

In the pixel detector, the efficiency is measured using isolated tracks originating from the primary
vertex. The track pT is required to exceed 1 GeV, and the tracks are required to be reconstructed with
a minimum of 11 hits measured in the strip detector. Hits from the pixel layer under study are not
removed when the tracks are reconstructed. To minimize any ensuing bias, all tracks are required
to have hits in the other two pixel layers, ensuring thereby that they would be found even with-
out using the studied layer. A restrictive selection is set on the impact parameter to reduce false
tracks and tracks from secondary interactions. To avoid inactive regions and to allow for residual
misalignment, track trajectories passing near the edges of the sensors or their readout chips are
excluded. Specifically, they must not pass within 0.6 mm (1.0–1.5 mm) of a sensor edge in the
pixel endcap (barrel) or within 0.6 mm of the edge of a pixel readout chip. The efficiency is
determined from the fraction of tracks that have a hit either associated with them in the layer under
study or found within 500 µm of the predicted position of the track. Given the high track density,
only tracks that have no additional trajectories within 5 mm are considered so as to reduce false
track-to-cluster association. The average efficiency for reconstructing hits is >99%, as shown in
figure 3(left), when excluding the 2.4% of the pixel modules known to be defective. The hit effi-
ciency depends on the instantaneous luminosity and on the trigger rate, as shown in figure 3(right).
The systematic uncertainty in these measurements is estimated to be 0.2%. Several sources of
loss have been identified. First, the limited size of the internal buffer of the readout chips causes
a dynamic inefficiency that increases with the instantaneous luminosity and with the trigger rate.

[Figure 3 graphic, CMS √s = 7 TeV: efficiency (0.98 to 1) for Layer 1, Layer 2, Layer 3, Disk −2, Disk −1, Disk +1, Disk +2 (left); efficiency (0.98 to 1) versus instantaneous luminosity (nb⁻¹ s⁻¹), 0 to 3.5, for Forward disks, Layer 1, Layer 2, Layer 3 (right).]

Figure 3. The average hit efficiency for layers or disks in the pixel detector excluding defective modules
(left), and the average hit efficiency as a function of instantaneous luminosity (right). The peak luminosity
ranged from 1 to 4nb−1s−1 during the data taking.

Second, single-event upsets temporarily cause loss of information at a negligible rate of approximately two
readout chips per hour. Finally, readout errors signalled by the FED modules depend on the rate of
beam-induced background.

The efficiency in the strip tracker is measured using tracks that have a minimum of eight hits in
the pixel and strip detectors. Where two hits are found in one of the closely-spaced double layers,
which consist of rφ and stereo modules, both hits are counted separately. The efficiency in any
given layer is determined using only the subset of tracks that have at least one hit in subsequent
layers, further away from the beam spot. This requirement ensures that the particle traverses the
layer under study, but also means that the efficiency cannot be measured in the outermost layers
of the TOB (layer 6) and the TEC (layer 9). To avoid inactive regions and to take account of any
residual misalignment, tracks that cross a module within five standard deviations from the sensor’s
edges, based on the uncertainty in the extrapolated track trajectory, are excluded from considera-
tion. The efficiency is determined from the fraction of traversing tracks with a hit anywhere within
the non-excluded region of a traversed module. In the strip tracker, 2.3% of the modules are ex-
cluded because of short circuits of the high voltage, communication problems with the front-end
electronics, or other faults. Once the defective modules are excluded from the measurement, the
overall hit efficiency is 99.8%, as shown in figure 4. This number is compatible with the 0.2%
fraction of defective channels observed during the construction of the strip tracker.

All defective components of the tracker are taken into account, both in the MC simulation of
the detector and in the reconstruction of tracks.

3.4 Hit resolution

The hit resolution in the pixel and strip barrel sensors has been studied by measuring residuals,
defined by the difference between the measured and the expected hit position as predicted by the


[Figure 4 plot: hit efficiency (vertical axis, 0.9 to 1) for TIB 1–4, TOB 1–5, TID− 1–3, TID+ 1–3, TEC− 1–8, and TEC+ 1–8, shown for good modules and for all modules. CMS, √s = 7 TeV.]

Figure 4. Average hit efficiency for layers or disks in the strip tracker. The black squares show the hit
efficiency in all modules, and the red dots for modules included in the readout.

fitted track. Each trajectory is refitted excluding the hit under study in order to minimize biases of
the procedure.

The resolution of the pixel detector is measured from the RMS width of the hit residual dis-
tribution in the middle of the three barrel layers, using only tracks with pT > 12GeV, for which
multiple scattering between the layers does not affect the measurement. The expected hit position
in the middle layer, as determined from the track trajectory, has an uncertainty that is dominated
by the resolution of the hits assigned to the track in the first and third barrel layers. Assuming
that the three barrel layers all have the same hit resolution $\sigma_{\text{hit}}$, and given that they are approximately
equally spaced in radius from the z-axis of CMS, this uncertainty is $\sigma_{\text{hit}}/\sqrt{2}$. Adding
this in quadrature with the uncertainty $\sigma_{\text{hit}}$ in the measured position of the hit in the middle layer
shows that the RMS width of the residual distribution is
$\sqrt{\sigma_{\text{hit}}^2/2 + \sigma_{\text{hit}}^2} = \sigma_{\text{hit}}\sqrt{3/2}$. The measured
hit resolution $\sigma_{\text{hit}}$ in the rφ coordinate, as derived using this formula, is 9.4 µm. The resolution in
the longitudinal direction is shown in figure 5, and found to agree within 1 µm with MC simula-
tion. The longitudinal resolution depends on the angle of the track relative to the sensor. For longer
clusters, sharing of charge among pixels improves the resolution, with optimal resolution reached
for interception angles of ±30°.
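
The relation between the residual width and the rφ hit resolution described above can be checked numerically. The following Python lines are a minimal sketch under the stated assumptions (three equally spaced layers with a common resolution); the function names are illustrative and are not part of the CMS software.

```python
import math

def residual_rms_from_hit_resolution(sigma_hit_um: float) -> float:
    """RMS width of the middle-layer residual (micrometres) for three
    equally spaced layers sharing the same hit resolution."""
    sigma_pred = sigma_hit_um / math.sqrt(2.0)    # prediction from layers 1 and 3
    return math.hypot(sigma_pred, sigma_hit_um)   # quadrature sum: sigma_hit * sqrt(3/2)

def hit_resolution_from_residual_rms(rms_um: float) -> float:
    """Inverse relation: sigma_hit = RMS * sqrt(2/3)."""
    return rms_um * math.sqrt(2.0 / 3.0)

# The quoted 9.4 um resolution corresponds to a residual RMS of about 11.5 um.
rms = residual_rms_from_hit_resolution(9.4)
print(f"residual RMS: {rms:.1f} um")
print(f"recovered sigma_hit: {hit_resolution_from_residual_rms(rms):.1f} um")
```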

Because of multiple scattering, the uncertainty in track position in the strip detector is usually
much larger than the inherent resolution; consequently, individual residuals of hits are not sensitive
to the resolution. However, the difference in a track’s residuals for two closely spaced modules
can be measured with much greater precision. Any offset in a track’s position caused by multiple
scattering will be largely common to both modules. A technique based on tracks passing through
overlapping modules from the same tracker layer is employed to compare the difference in residuals
for the two measurements in the overlapping modules [24]. The difference in hit positions (∆xhit)
can be compared to the difference in predicted positions (∆xpred) derived from the track trajectory,
and their difference, fitted to a Gaussian function, provides a hit resolution convoluted with the
uncertainty from the trajectory propagation. The bias from translational misalignment between
modules affects only the mean of the Gaussian distribution, and not its RMS width. As the two


[Figure 5 plot: barrel pixel RMS z resolution (µm, 0 to 50) versus incident angle (°, −60 to 60), data and simulation, pT > 12 GeV. CMS, √s = 7 TeV, L = 42 pb−1.]

Figure 5. Resolution in the longitudinal (z) coordinate of hits in the barrel section of the pixel detector,
shown as a function of the incident angle of the track, which is defined as 90° − θ, and equals the angle of
the track relative to the normal to the plane of the sensor. Data are compared with MC simulation for tracks
with pT > 12GeV.

overlapping modules are expected to have the same resolution, the resolution of a single sensor is
determined by dividing this RMS width by $\sqrt{2}$.
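
The cancellation of the multiple-scattering offset in the overlap method can be illustrated with a toy Monte Carlo. This is only a sketch: the Gaussian model, the numerical values (20 µm hit resolution, 100 µm scattering offset), and the function name are assumptions chosen for illustration, and the uncertainty from the trajectory propagation between the two modules is neglected.

```python
import random
import statistics

def overlap_double_difference_width(sigma_hit, sigma_ms, n_tracks=20000, seed=1):
    """Toy model: each track has a multiple-scattering offset common to both
    overlapping modules, plus an independent Gaussian hit error per module.
    Returns the RMS width of the difference of the two residuals."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_tracks):
        ms_offset = rng.gauss(0.0, sigma_ms)            # shared by both modules
        res1 = ms_offset + rng.gauss(0.0, sigma_hit)    # residual in module 1
        res2 = ms_offset + rng.gauss(0.0, sigma_hit)    # residual in module 2
        diffs.append(res1 - res2)                       # common offset cancels
    return statistics.pstdev(diffs)

width = overlap_double_difference_width(sigma_hit=20.0, sigma_ms=100.0)
single_sensor = width / 2 ** 0.5   # both modules assumed to have equal resolution
print(f"double-difference width: {width:.1f} um, single sensor: {single_sensor:.1f} um")
```

Although each individual residual in this toy has a spread of about 100 µm, the recovered single-sensor value stays close to the 20 µm that was put in.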

Only tracks of high purity (defined in section 4.4) are used for the above-described study. To
reduce the uncertainty from multiple Coulomb scattering, the track momenta are required to be
>10GeV. The χ2 probability of the track fit is required to be >0.1%, and the tracks are required to
be reconstructed using a minimum of six hits in the strip detector. Tracks in the overlapping barrel
modules are analysed only when the residual rotational misalignment is less than 5 µm. Remaining
uncertainties from multiple scattering and rotational misalignment for the overlapping modules are
included as systematic uncertainties of the measurement.

Sensor resolution depends strongly on the size of the cluster and on the pitch of the sensor. The
resolutions for the strip detector are shown in table 2, where they are compared to the predictions
from MC simulation. The resolution varies not only as a function of the cluster width, but also as a
function of pseudorapidity, as the energy deposited by a charged particle in the silicon depends on
the angle at which it crosses the sensor plane. The resolution is worse in simulation than in data,
implying the need for additional tuning of the MC simulation. The results in the table are valid only
for tracks with momenta >10GeV. At lower momenta, the simulations indicate that the resolution
in hit position improves, but this is not important for tracking performance, as the resolution of the
track parameters for low-momentum tracks is dominated by the multiple scattering and not by the
hit resolution.


Table 2. A comparison of hit resolution in the barrel strip detector as measured in data with the correspond-
ing prediction from simulation, for track momenta >10GeV. The resolution is given as function of both the
barrel layer and the width of the cluster in strips. Since the resolution is observed to vary with φ and η , a
range of resolution values is quoted in each case.

Sensor Pitch Resolution [µm] vs. width of cluster [strips]
layer (µm) width=1 =2 =3 =4

TIB 1–2 80
Data 11.7–19.1 10.9–17.9 10.1–18.1
MC 14.5–20.5 15.0–19.8 14.0–20.6

TIB 3–4 120
Data 20.9–29.5 21.8–28.8 20.8–29.2
MC 26.8–30.4 27.6–30.8 27.9–32.5

TOB 1–4 183
Data 23.4–40.0 32.3–42.3 16.9–28.5
MC 42.5–50.5 43.0–48.6 18.8–35.2

TOB 5–6 122
Data 18.4–26.6 11.8–19.4
MC 26.1–29.5 17.8–21.6

4 Track reconstruction

Track reconstruction refers to the process of using the hits, obtained from the local reconstruction
described in section 3, to obtain estimates for the momentum and position parameters of the charged
particles responsible for the hits (tracks). As part of this process, a translation between the local
coordinate system of the hits and the global coordinate system of the track is necessary. This
translation takes into account discrepancies between the assumed and actual location and surface
deformation of detector elements as found through the alignment process [25]. In addition, the
uncertainty in the detector element location is added to the intrinsic uncertainty in the local hit
position.

Reconstructing the trajectories of charged particles is a computationally challenging task. An
overview of the difficulties and solutions can be found in review articles [26–28]. The tracking
software at CMS is commonly referred to as the Combinatorial Track Finder (CTF). It is
an adaptation of the combinatorial Kalman filter [29–31], which in turn extends the
Kalman filter [32] to allow pattern recognition and track fitting to occur in the same framework.
The collection of reconstructed tracks is produced by multiple passes (iterations) of the CTF track
reconstruction sequence, in a process called iterative tracking. The basic idea of iterative tracking
is that the initial iterations search for tracks that are easiest to find (e.g., of relatively large pT, and
produced near the interaction region). After each iteration, hits associated with tracks are removed,
thereby reducing the combinatorial complexity, and simplifying subsequent iterations in a search
for more difficult classes of tracks (e.g., low-pT, or greatly displaced tracks). The presented results
reflect the status of the software in use from May through August, 2011, which is applied in a series
of six iterations of the track reconstruction algorithm. Later versions of the software retain the
same basic structure but with different iterations and tuned values for the configurable parameters
to adapt to the higher pileup conditions. Iteration 0, the source of most reconstructed tracks, is
designed for prompt tracks (originating near the pp interaction point) with pT > 0.8GeV that have
three pixel hits. Iteration 1 is used to recover prompt tracks that have only two pixel hits. Iteration 2
is configured to find low-pT prompt tracks. Iterations 3–5 are intended to find tracks that originate
outside the beam spot (luminous region of the pp collisions) and to recover tracks not found in
the previous iterations. At the beginning of each iteration, hits associated with high-purity tracks
(defined in section 4.4) found in previous iterations are excluded from consideration (masked).

Each iteration proceeds in four steps:

• Seed generation provides initial track candidates found using only a few (2 or 3) hits. A seed
defines the initial estimate of the trajectory parameters and their uncertainties.

• Track finding is based on a Kalman filter. It extrapolates the seed trajectories along the
expected flight path of a charged particle, searching for additional hits that can be assigned
to the track candidate.

• The track-fitting module is used to provide the best possible estimate of the parameters of
each trajectory by means of a Kalman filter and smoother.

• Track selection sets quality flags, and discards tracks that fail certain specified criteria.

The main differences between the six iterations lie in the configuration of the seed generation
and the final track selection.
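
The control flow of iterative tracking, with masking of hits that belong to high-purity tracks, can be sketched as follows. This is a schematic illustration only: the `Track` class, the `iterative_tracking` function, and the two stub iterations are hypothetical, and each stub stands in for the full seeding, finding, fitting, and selection sequence.

```python
from dataclasses import dataclass
from typing import Callable, List, Set

@dataclass
class Track:
    hits: List[int]      # identifiers of the hits assigned to the track
    high_purity: bool    # quality flag set by the track selection step

def iterative_tracking(all_hits: Set[int],
                       iterations: List[Callable[[Set[int]], List[Track]]]) -> List[Track]:
    """Run the iterations in order, masking hits of high-purity tracks."""
    masked: Set[int] = set()
    collection: List[Track] = []
    for run_iteration in iterations:
        available = all_hits - masked        # hits still open to this iteration
        tracks = run_iteration(available)    # seeding, finding, fitting, selection
        collection.extend(tracks)
        for track in tracks:
            if track.high_purity:
                masked.update(track.hits)    # excluded from later iterations
    return collection

# Two stub iterations standing in for the real reconstruction sequence.
def prompt_iteration(hits: Set[int]) -> List[Track]:
    return [Track([1, 2, 3], True)] if {1, 2, 3} <= hits else []

def displaced_iteration(hits: Set[int]) -> List[Track]:
    return [Track(sorted(hits), True)] if hits else []

tracks = iterative_tracking({1, 2, 3, 4, 5}, [prompt_iteration, displaced_iteration])
print([t.hits for t in tracks])
```

In this toy, the second iteration only sees hits 4 and 5, because hits 1 to 3 were masked after the first iteration.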

4.1 Seed generation

The seeds define the starting trajectory parameters and associated uncertainties of potential tracks.
In the quasi-uniform magnetic field of the tracker, charged particles follow helical paths and there-
fore five parameters are needed to define a trajectory. Extraction of these five parameters requires
either three 3-D hits, or two 3-D hits and a constraint on the origin of the trajectory based on the
assumption that the particle originated near the beam spot. (A ‘3-D hit’ is defined to be any hit that
provides a 3-D position measurement.) To limit the number of hit combinations, seeds are required
to satisfy certain weak restrictions, for example, on their minimum pT and their consistency with
originating from the pp interaction region.
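
As an illustration of how three hits constrain the transverse part of a helix, the sketch below computes the radius of the circle through three points in the transverse plane and converts it to pT using the standard relation pT [GeV] ≈ 0.3 · B [T] · R [m] for a singly charged particle. The function names are illustrative, and the 3.8 T field value is an assumption made here (the nominal CMS solenoid field), not a number taken from this section.

```python
import math

def circle_radius(p1, p2, p3):
    """Radius (m) of the circle through three (x, y) points given in metres."""
    a = math.dist(p1, p2)
    b = math.dist(p2, p3)
    c = math.dist(p1, p3)
    # Twice the signed area of the triangle, from the 2-D cross product.
    cross = (p2[0] - p1[0]) * (p3[1] - p1[1]) - (p2[1] - p1[1]) * (p3[0] - p1[0])
    if cross == 0.0:
        return math.inf              # collinear points: straight track
    return a * b * c / (2.0 * abs(cross))

def seed_pt(p1, p2, p3, b_field_tesla=3.8):
    """Transverse momentum (GeV) of a unit-charge particle on this circle."""
    return 0.299792458 * b_field_tesla * circle_radius(p1, p2, p3)

# Three points generated on a circle of radius 1 m passing through the origin.
R_TRUE = 1.0
pts = [(R_TRUE * math.sin(phi), R_TRUE * (1.0 - math.cos(phi))) for phi in (0.0, 0.05, 0.1)]
pt = seed_pt(*pts)
print(f"radius = {circle_radius(*pts):.4f} m, pT = {pt:.3f} GeV")
```

A minimum-pT requirement on a seed is then simply a lower bound on this radius.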

In principle, it is possible to construct seeds in the outermost regions of the tracker, where the
track density is smallest, and then construct track candidates by searching inwards from the seeds
for additional hits at smaller distances from the beam-line. However, there are several reasons why
an alternative approach, of constructing seeds in the inner part of the tracker and building the track
candidates outwards, has been chosen instead.

First, although the track density is much higher in the inner region of the tracker, the high
granularity of the pixel detector ensures that the channel occupancy (fraction of channels that are
hit) of the inner pixel layer is much lower than that of the outer strip layer. This can be seen in
figure 6, which shows the mean channel occupancy in strip and pixel sensors in data collected with
a ‘zero-bias’ trigger (which took events from randomly selected non-empty LHC bunch crossings).
These data had a mean of about nine pp interactions per bunch crossing. The channel occupancy is
0.002–0.02% in the pixel detector and 0.1–0.8% in the strip detector. Second, the pixel layers pro-
duce 3-D spatial measurements, which provide more constraints and better estimates of trajectory
parameters. Finally, generating seeds in the inner tracker leads to a higher efficiency for reconstructing
tracks. Although most high-pT muons traverse the entire tracker, a significant fraction
of the produced pions interact inelastically in the tracker (figure 7). In addition, many electrons
lose a significant fraction of their energy to bremsstrahlung radiation in the tracker. Therefore,
to ensure high efficiency, track finding begins with trajectory seeds created in the inner region of
the tracker. This also facilitates reconstruction of low-momentum tracks that are deflected by the
strong magnetic field before reaching the outer part of the tracker.

[Figure 6 plot: r (cm, 0 to 140) versus z (cm, −300 to 300) view of the tracker, with η labels from −3.0 to 3.0 and a colour scale giving the channel occupancy from 10−4 to 10−2. CMS, √s = 7 TeV.]

Figure 6. Channel occupancy (labelled by the scale on the right) for CMS silicon detectors in events taken
with unbiased triggers with an average of nine pp interactions per beam crossing, displayed as a function of
η , r, and z.

[Figure 7 plot: survival probability (0.7 to 1) versus number of layers (0 to 14) for pions with pT = 1, 10, and 100 GeV. CMS simulation.]

Figure 7. Fraction of pions produced with |η |< 2.5 that do not undergo a nuclear interaction in the tracker
volume, as a function of the number of traversed layers.


Seed generation requires information on the position of the centre of the reconstructed beam
spot, obtained prior to track finding using the method described in section 6.3. It also requires the
locations of primary vertices in the event, including those from pileup events. This information is
obtained by running a very fast track and vertex reconstruction algorithm, described in section 6.2,
that uses only hits from the pixel detector. The tracks and primary vertices found with this algorithm
are known as pixel tracks and pixel vertices, respectively.

The seed generation algorithm is controlled by two main sets of parameters: seeding layers
and tracking regions. The seeding layers are pairs or triplets of detector layers in which hits are
searched for. The tracking regions specify the limits on the acceptable track parameters, includ-
ing the minimum pT, and the maximum transverse and longitudinal distances of closest approach
to the assumed production point of the particle, taken to be located either at the centre of the re-
constructed beam spot or at a pixel vertex. If the seeding layers correspond to pairs of detector
layers, then seeds are constructed using one hit in each layer. A hit pair is accepted as a seed if
the corresponding track parameters are consistent with the requirements of the tracking region. If
the seeding layers correspond to triplets of detector layers, then, after pairs of hits are found in
the two inner layers of each triplet, a search is performed in the outer detector layer for another
hit. If the track parameters derived from the three hits are compatible with the tracking region
requirements, the seed is accepted. It is also possible to check if the hits associated with the seed
have the expected charge distribution from the track parameters: a particle that enters the detector
at a grazing angle will have a larger cluster size than a particle that enters the detector at a normal
angle. Requiring the reconstructed charge distribution to match the expected charge distribution
can remove many fake seeds.

In simulated $t\bar{t}$ events at $\sqrt{s} = 7$ TeV, more than 85% of the charged particles produced within
the geometrical acceptance of the tracker (|η |< 2.5) cross three pixel layers and can therefore be
reconstructed starting from trajectory seeds obtained from triplets of pixel hits. Nevertheless, other
trajectory seeds are also needed, partially to compensate for inefficiencies in the pixel detector
(from gaps in coverage, non-functioning modules, and saturation of the readout), and partially
to reconstruct particles not produced directly at the pp collision point (decay products of strange
hadrons, electrons from photon conversions, and particles from nuclear interactions). To improve
the speed and quality of the seeding algorithm, only 3-D space points are used, either from a pixel
hit or a matched strip hit. Matched strip hits are obtained from the closely-spaced double strip
layers, which are composed of two sensors mounted back-to-back, one providing an rφ view and
one providing a stereo view (rotated by 100mrad relative to the other, in the plane of the sensor).
The rφ and stereo hits in such a layer are combined into a matched hit, which provides a 3-D
position measurement. Table 3 shows the seeding requirements for each of the six tracking
iterations. The seeding layers listed in this table are defined as follows:

• Pixel triplets are seeds produced from three pixel hits. These seeds are used to find most
of the tracks corresponding to promptly produced charged particles. The three precise 3-D
space points provide seeds of high quality and with well-measured starting trajectories. A
mild constraint on the compatibility of these trajectories with the centre of the beam spot is
employed, to remove seeds inconsistent with promptly produced particles. Also, the charge
distribution of each pixel hit is required to be compatible with that expected for the crossing
angle of the seed trajectory and the corresponding sensor.


• Mixed pairs with vertex constraint are seeds that use two hits and a third space-point given
by the location of a pixel vertex. If more than one pixel vertex is found in an event, which
often happens because of pileup, all are considered in turn. The pixel vertices are required
to pass quality criteria; the most important is that a vertex must contain at least four pixel
tracks. The two hits used for these seeds can be provided by the pixel tracker, or by the two
inner rings of the three inner TEC layers, where the TEC layers are used to increase coverage
in the very forward regions.

• Mixed triplets are seeds produced from three hits formed from a combination of pixel hits
and matched strip hits. Each triplet contains between one and three pixel hits and < 3 strip
hits. This iteration is implemented for finding displaced tracks and prompt tracks that do
not have three hits in the pixel detector. The beam spot related constraint is less restrictive,
providing higher efficiency for finding tracks arising from decays of hadrons containing s, c,
or b quarks, photon conversions, and nuclear interactions.

• Strip pairs are seeds constructed using two matched hits from the strip detector. Iteration 4
uses the two inner TIB layers and rings 1–2 of the TID/TEC, which are the same strip layers
used in Iteration 3. In Iteration 5, hits from the two inner TOB layers and ring 5 of the TEC
are used for seeds. These two iterations have even weaker constraints on the compatibility
of the seed trajectory with the centre of the beam spot than has Iteration 3, and they do not
require pixel hits. These iterations are therefore useful for finding tracks produced outside of
the pixel detector volume or tracks that do not leave hits in the pixel detector.

Table 3. The configuration of the track seeding for each of the six iterative tracking steps. Shown are the
layers used to seed the tracks, as well as the requirements on the minimum pT and the maximum transverse
(d0) and longitudinal (z0) impact parameters relative to the centre of the beam spot. The Gaussian standard
deviation corresponding to the length of the beam spot along the z-direction is σ . The asterisk symbol
indicates that the longitudinal impact parameter is calculated relative to a pixel vertex instead of to the
centre of the beam spot.

Iteration  Seeding layers                pT (GeV)  d0 (cm)  |z0|
0          Pixel triplets                >0.8      <0.2     <3σ
1          Mixed pairs with vertex       >0.6      <0.2     <0.2 cm*
2          Pixel triplets                >0.075    <0.2     <3.3σ
3          Mixed triplets                >0.35     <1.2     <10 cm
4          TIB 1+2 & TID/TEC ring 1+2    >0.5      <2.0     <10 cm
5          TOB 1+2 & TEC ring 5          >0.6      <5.0     <30 cm
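
The tracking-region requirements of table 3 can be written as a small configuration, which makes the mixed units of the |z0| requirement explicit. This is a sketch only: the `SeedRegion` type, the `seed_in_region` function, and the example value of 5 cm for the beam-spot length σ are hypothetical and do not reproduce the CMS configuration format. The caller is assumed to supply z0 relative to the reference named in `z0_ref`.

```python
from typing import NamedTuple

class SeedRegion(NamedTuple):
    layers: str
    min_pt: float     # GeV
    max_d0: float     # cm
    max_z0: float     # expressed in z0_unit
    z0_unit: str      # "sigma" (beam-spot length) or "cm"
    z0_ref: str       # "beam spot" or "pixel vertex"

TABLE3 = {
    0: SeedRegion("pixel triplets", 0.8, 0.2, 3.0, "sigma", "beam spot"),
    1: SeedRegion("mixed pairs with vertex", 0.6, 0.2, 0.2, "cm", "pixel vertex"),
    2: SeedRegion("pixel triplets", 0.075, 0.2, 3.3, "sigma", "beam spot"),
    3: SeedRegion("mixed triplets", 0.35, 1.2, 10.0, "cm", "beam spot"),
    4: SeedRegion("TIB 1+2 & TID/TEC ring 1+2", 0.5, 2.0, 10.0, "cm", "beam spot"),
    5: SeedRegion("TOB 1+2 & TEC ring 5", 0.6, 5.0, 30.0, "cm", "beam spot"),
}

def seed_in_region(iteration: int, pt: float, d0: float, z0: float, sigma_z: float) -> bool:
    """Check a seed (pt in GeV, d0 and z0 in cm) against the table 3 limits."""
    region = TABLE3[iteration]
    z0_limit = region.max_z0 * sigma_z if region.z0_unit == "sigma" else region.max_z0
    return pt > region.min_pt and abs(d0) < region.max_d0 and abs(z0) < z0_limit

# A 0.5 GeV seed fails Iteration 0 but passes the low-pT Iteration 2.
print(seed_in_region(0, pt=0.5, d0=0.1, z0=10.0, sigma_z=5.0))
print(seed_in_region(2, pt=0.5, d0=0.1, z0=10.0, sigma_z=5.0))
```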

4.2 Track finding

The track-finding module of the CTF algorithm is based on the Kalman filter method [29–32]. The
filter begins with a coarse estimate of the track parameters provided by the trajectory seed, and then
builds track candidates by adding hits from successive detector layers, updating the parameters at
each layer. The information needed at each layer includes the location and uncertainty of the
detected hits, as well as the amount of material crossed, which is used to estimate the effects of
multiple Coulomb scattering and energy loss. The track finding is implemented in the four steps
listed below.

The first step (navigation) uses the parameters of the track candidate, evaluated at the current
layer, to determine which adjacent layers of the detector can be intersected through an extrapola-
tion of the trajectory, taking into account the current uncertainty in that trajectory. The navigation
service can be configured to propagate along or opposite to the momentum vector, and uses a fast
analytical propagator to find the intercepted layers. The analytical propagator assumes a uniform
magnetic field, and does not include effects of multiple Coulomb scattering or energy loss. With
these assumptions, the track trajectory is a perfect helix, and the propagator can therefore extrapo-
late the trajectory from one layer to the next using rapid analytical calculations. In the barrel, the
cylindrical geometry makes navigation particularly easy, since the extrapolated trajectory can only
intercept the layer adjacent to the current one. In the endcap and barrel-endcap transition regions,
navigation is more complex, as the crossing from one layer does not uniquely define the next one.

The second step involves a search for compatible silicon modules in the layers returned by the
navigation step. A module is considered compatible with the trajectory if the position at which
the trajectory intercepts the module surface is no more than some given number (currently three)
of standard deviations outside the module boundary. The propagation of the trajectory parameters,
and of the corresponding uncertainties, to the sensor surface involves mathematical operations and
routines that are generally quite time-consuming [33]. Hence, the code responsible for searching
for compatible modules has been optimized to limit the number of sensors that are considered,
while preserving an efficiency of >99% in finding the relevant sensors. A complication is that the
design of the CMS tracker is such that sensors often slightly overlap their neighbours, meaning that
a particle can cross two sensors in the same layer. This possibility is accommodated by dividing the
compatible modules in each layer into groups of mutually exclusive modules, defined such that if
a particle passes through one member of a group, it is not physically possible for it to pass through
a second member of the same group. Any two modules that have some overlap are not mutually
exclusive, and are therefore assigned to different groups. This feature is used in the third and fourth
steps of the track finding, described next.

The third step forms groups of hits, each of which is defined by the collection of all the hits
from one of the module groups. A configurable parameter provides the possibility of adding a
ghost hit to represent the possibility that the particle failed to produce a hit in the module group, for
example, as a result of module inefficiency. The hit positions and uncertainties are refined using
the trajectory direction on the sensor surface, to calculate more accurately the Lorentz drift of the
ionization-charge carriers inside the silicon bulk. A χ2 test is used to check which of the hits are
compatible with the extrapolated trajectory. The current (configurable) requirement is χ2 < 30 for
one degree of freedom (dof). The χ2 calculation takes into account both the hit and trajectory
uncertainties. In the endcap regions and the barrel-endcap transition regions, the extrapolation
distances and the amount of material traversed are generally greater, with correspondingly larger
uncertainties in the trajectory, and the probability of finding spurious hits compatible with the track
tends therefore to be greater.
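
The χ2 compatibility test of the third step can be sketched for a single measured coordinate (one degree of freedom). The functions below are illustrative and not the CMS implementation; the threshold of 30 is the configurable value quoted above, and the example positions and uncertainties (in cm) are invented.

```python
def hit_chi2(x_hit, sigma_hit, x_pred, sigma_pred):
    """Chi-square (1 dof) between a hit and the extrapolated trajectory,
    using both the hit and the trajectory uncertainties."""
    residual = x_hit - x_pred
    return residual * residual / (sigma_hit ** 2 + sigma_pred ** 2)

def compatible_hits(hits, x_pred, sigma_pred, max_chi2=30.0):
    """Keep the (position, uncertainty) hits that pass the chi-square test."""
    return [h for h in hits if hit_chi2(h[0], h[1], x_pred, sigma_pred) < max_chi2]

# Two hits in a module: one 100 um from the prediction, one 1 mm away.
hits = [(0.010, 0.003), (0.100, 0.003)]
selected = compatible_hits(hits, x_pred=0.0, sigma_pred=0.004)
print(selected)
```

With a combined uncertainty of 50 µm, the first hit has χ2 = 4 and is kept, while the second has χ2 = 400 and is rejected. A larger trajectory uncertainty, as in the endcaps, widens the acceptance window and admits more spurious hits.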

The fourth and last step is to update the trajectories. From each of the original track candidates,
new track candidates are formed by adding exactly one of the compatible hits from each module
grouping (where this hit may be a ghost hit). As the modules in a given group are mutually exclusive,
it would not be expected that a track would have more than one hit contributing from each
group. The trajectory parameters for each new candidate are then updated at the location of the
module surface, by combining the information from the added hits with the extrapolated trajectory
of the original track candidate.
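
The update step combines the extrapolated trajectory with the added hit, weighting each by its uncertainty. A one-dimensional sketch of this Kalman update is given below; the real filter acts on the full set of five track parameters and their covariance matrix, and the function names here are illustrative. A ghost hit is represented by `None` and leaves the predicted state unchanged.

```python
def kalman_update(x_pred, var_pred, x_hit, var_hit):
    """Combine a predicted position and a measured hit (1-D Kalman update)."""
    gain = var_pred / (var_pred + var_hit)      # weight given to the hit
    x_new = x_pred + gain * (x_hit - x_pred)    # updated position
    var_new = (1.0 - gain) * var_pred           # reduced variance
    return x_new, var_new

def update_with_optional_hit(x_pred, var_pred, hit):
    """A ghost hit (None) carries no position information."""
    if hit is None:
        return x_pred, var_pred
    x_hit, var_hit = hit
    return kalman_update(x_pred, var_pred, x_hit, var_hit)

# Equal variances: the update is the average, and the variance is halved.
state = update_with_optional_hit(0.0, 4.0, (2.0, 4.0))
ghost = update_with_optional_hit(0.0, 4.0, None)
print(state, ghost)
```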

For the above second, third, and fourth steps of the procedure, a more accurate material prop-
agator is used when extrapolating the track trajectory, which includes the effect of the material in
the tracker. This differs from the method of the simple analytical propagator, in that it increases
the uncertainty in the trajectory parameters according to the predicted RMS scattering angle in the
tracker material. It also adjusts the momentum of the trajectory by the predicted mean energy loss
of the Bethe-Bloch equation. Since all detector material is assumed to be concentrated in the de-
tector layers, the track propagates along a simple helix between the layers, allowing the material
propagator to extrapolate the track analytically. The ghost hits include the effect of material without
providing position information to the propagator.
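
The growth of the trajectory uncertainty in the material propagator can be illustrated with the Highland parametrization of the RMS multiple-scattering angle, θ0 = (13.6 MeV / βp) · z · √(x/X0) · [1 + 0.038 ln(x/X0)]. Whether the CMS propagator uses exactly this form is an assumption made here; the function names, the pion mass default, and the layer thickness of 2% of a radiation length are likewise illustrative.

```python
import math

def highland_theta0(p_gev, x_over_x0, mass_gev=0.13957, charge=1.0):
    """RMS projected scattering angle (rad) after a thickness x/X0 of material."""
    energy = math.hypot(p_gev, mass_gev)
    beta = p_gev / energy
    return (0.0136 / (beta * p_gev)) * abs(charge) * math.sqrt(x_over_x0) \
        * (1.0 + 0.038 * math.log(x_over_x0))

def inflate_angle_variance(var_angle, p_gev, x_over_x0):
    """Add the scattering contribution of one layer to the angular variance."""
    return var_angle + highland_theta0(p_gev, x_over_x0) ** 2

theta_1gev = highland_theta0(1.0, 0.02)
theta_10gev = highland_theta0(10.0, 0.02)
print(f"theta0 at 1 GeV: {theta_1gev * 1e3:.2f} mrad, at 10 GeV: {theta_10gev * 1e3:.3f} mrad")
```

The roughly 1/p dependence shows why the track parameters of low-momentum tracks are dominated by multiple scattering.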

All resulting track candidates found at each layer are then propagated to the next compatible
layers, and the procedure is repeated until a termination condition is satisfied. However, to avoid a
rapid increase in the number of candidates, only a limited number (default is 5) of the candidates
are retained at each step, with the best candidates chosen based on the normalized χ2 and a bonus
given for each valid hit, and a penalty for each ghost hit. Building of a candidate normally terminates
when the track reaches the end of the tracker, when it contains too many missing hits (limit is Nlost), or when its pT
drops below a user-specified value. The number of missing hits on a track is equal to the number of
ghost hits, except that hits whose absence is attributable to known detector conditions (for example,
a detector module that is turned off) are not counted. The building of a trajectory can also be terminated
when the uncertainty in its parameters falls below a given threshold or the number of hits is above
a threshold; these kinds of termination conditions tend to be used only in the high-level trigger
(HLT), where the required accuracy on track parameters is often reached after 5 or 6 hits are added
to the track candidate, and the continuation of the track building would correspond to a waste of
CPU time.
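
The pruning of candidates at each step can be sketched as a ranking by a cost that combines the normalized χ2 with a bonus per valid hit and a penalty per ghost hit. The text does not give the weights, so the values of `hit_bonus` and `ghost_penalty` below are hypothetical, as are the `Candidate` class and the function names; only the default of five retained candidates is taken from the text.

```python
import heapq
from dataclasses import dataclass

@dataclass
class Candidate:
    norm_chi2: float   # normalized chi-square of the candidate
    n_valid: int       # number of valid hits
    n_ghost: int       # number of ghost hits

def cost(c, hit_bonus=5.0, ghost_penalty=20.0):
    """Lower is better: chi-square minus hit bonus plus ghost-hit penalty."""
    return c.norm_chi2 - hit_bonus * c.n_valid + ghost_penalty * c.n_ghost

def prune(candidates, max_candidates=5):
    """Retain only the best candidates, best first."""
    return heapq.nsmallest(max_candidates, candidates, key=cost)

candidates = [
    Candidate(1.0, 10, 0), Candidate(2.0, 10, 0), Candidate(1.0, 9, 1),
    Candidate(0.5, 8, 0), Candidate(3.0, 10, 1), Candidate(1.5, 7, 0),
    Candidate(1.0, 10, 2),
]
kept = prune(candidates)
print(len(kept), kept[0])
```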

When the search for hits in the outward direction reveals a minimum number of valid hits
(Nrebuild), an inwards search is initiated for additional hits. Otherwise, the track candidate remains
as formed. The inwards search starts by taking all of the hits assigned to the track, excluding those
belonging to the track seed, and using them to fit the track trajectory. In case this exclusion of
the seeding hits leaves fewer than Nrebuild hits to fit, some of the seeding hits are also used (taking
first the outer contributions) so as to obtain at least Nrebuild hits. Then, as in the outward track
building, the trajectory is propagated inwards through the seeding layers and then further, until the
inner edge of the tracker is reached or too many ghost hits are found. There are three reasons for
this inward search. First, additional hits can be found in the seeding layers (for example, from
overlapping sensors). Second, hits can be found in layers closer to the interaction region than the
seeding layers. Third, when strip layers are used in seeding, matched hits are used to increase
computational speed and reduce the combinations of hits available for seeding. However, some rφ
or stereo hits are not part of any matched hit. While these hits are not available during seeding,
they can be found during the inward track building process. The effect of the inward search is an
increase in the mean number of hits per track by 0.15, (i.e., a 1% increase relative to a total of ≈14
hits), which translates to a better signal-to-background ratio, impact parameter resolution, and pT
resolution, with maximum improvements of 2%, 1%, and 0.5%, respectively.

The track of a single charged particle can be reconstructed more than once, either starting from
different seeds, or when a given seed develops into more than one track candidate. To remedy this
feature, a trajectory cleaner is applied after all the track candidates in a given iteration have been
found. The trajectory cleaner calculates the fraction of shared hits between two track candidates:

\[ f_\mathrm{shared} = \frac{N^\mathrm{hits}_\mathrm{shared}}{\min(N^\mathrm{hits}_1, N^\mathrm{hits}_2)}, \]

where $N^\mathrm{hits}_1$ and $N^\mathrm{hits}_2$ are the numbers of hits used in forming the first and second track
candidates, respectively. If this fraction exceeds the (configurable) value of 19% (determined
empirically), the trajectory cleaner removes the track with the fewest hits; if both tracks have the
same number of hits, the track with the largest χ2 value is discarded. The procedure is repeated
iteratively on all pairs of track candidates. The same algorithm is applied when tracks from the six
iterations are combined into a single track collection.
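
The cleaning rule can be illustrated with a minimal Python sketch. It is not the CMS implementation: the `Candidate` class, the function names, and the example hit sets are hypothetical, and hits are represented by plain integer identifiers.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Candidate:
    """Hypothetical track candidate: a set of hit identifiers and a fit chi2."""
    name: str
    hits: frozenset
    chi2: float


def shared_fraction(a, b):
    """f_shared = N_shared / min(N_1, N_2)."""
    return len(a.hits & b.hits) / min(len(a.hits), len(b.hits))


def clean(candidates, max_shared=0.19):
    """Drop duplicates until no surviving pair shares more than max_shared."""
    survivors = list(candidates)
    changed = True
    while changed:
        changed = False
        for a, b in combinations(survivors, 2):
            if shared_fraction(a, b) <= max_shared:
                continue
            # Remove the candidate with fewer hits; on a tie, the larger chi2.
            if len(a.hits) != len(b.hits):
                loser = a if len(a.hits) < len(b.hits) else b
            else:
                loser = a if a.chi2 > b.chi2 else b
            survivors.remove(loser)
            changed = True
            break  # the pair list is stale, so restart the scan
    return survivors


tracks = [
    Candidate("A", frozenset(range(0, 12)), chi2=10.0),   # 12 hits
    Candidate("B", frozenset(range(6, 14)), chi2=9.0),    # 8 hits, 6 shared with A
    Candidate("C", frozenset(range(20, 30)), chi2=15.0),  # 10 hits, no overlap with A
    Candidate("D", frozenset(range(20, 30)), chi2=11.0),  # same hits as C, lower chi2
]
kept = [t.name for t in clean(tracks)]
```

In this example, B is removed because it shares 75% of its hits with the longer candidate A, and C is removed in favour of D because the two have the same number of hits and C has the larger χ2.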

The requirements applied during the track-finding stage are shown in table 4 for each tracking
iteration. In addition to the requirement on Nlost, the completed track candidates must also pass
requirements on the minimum number of hits (Nhits) and minimum track pT. The minimum pT
requirements have very little effect, as they are weaker than those applied to the seeds, given in
table 3. Since the later iterations do not have strong requirements that the tracks originate close to
the centre of the beam spot, the probability of random hits forming tracks increases, which leads to
more fake tracks and greater usage of CPU time. To compensate for this tendency, the criteria for
the minimum number of hits, and maximum number of lost hits, are tightened in the later iterations.

Table 4. Selection requirements applied to track candidates during the six iterative steps of track finding:
the minimum pT, the minimum number of hits Nhits, and the maximum number of missing hits Nlost. Also
shown is the minimum number of hits needed to be found in the outward track building step to trigger the
inward track building step Nrebuild, although candidates failing this requirement are not rejected.

Iteration   pT (GeV)   Nhits   Nlost   Nrebuild
0           0.3        3       1       5
1           0.3        3       1       5
2           0.1        3       1       5
3           0.1        4       0       5
4           0.1        7       0       5
5           0.1        7       0       4

4.3 Track fitting

For each trajectory, the track-finding stage yields a collection of hits and an estimate of the track
parameters. However, the full information about the trajectory is only available at the final hit of
the trajectory (when all hits are known). Furthermore, the estimate can be biased by constraints,
such as a beam spot constraint applied to the trajectory during the seeding stage. The trajectory is
therefore refitted using a Kalman filter and smoother.

The Kalman filter is initialized at the location of the innermost hit, with the trajectory estimate
obtained by performing a Kalman filter fit to the innermost hits (typically four) on the track. The
corresponding covariance matrix is scaled up by a large factor (10 for the last iteration and 100 for
the other iterations) in order to limit the bias. The fit then proceeds in an iterative way through the
full list of hits, from the inside outwards, updating the track trajectory estimate sequentially with
each hit. For each valid hit, the estimated hit position uncertainty is reevaluated using the current
values of the track parameters. In the case of pixel hits, the estimated hit position is also reevaluated.
This first filter is followed by the smoothing stage, whereby a second filter is initialized with the
result of the first one (except for the covariance matrix, which is scaled by a large factor), and is
run backward towards the beam-line. The track parameters at the surface associated with any hit
on the track can then be obtained from the weighted average of the track parameters of these two filters,
evaluated on this same surface, since one filter uses information from all the hits found before the surface and
the other uses information from all the hits found after it. This provides the optimal track
parameters at any point, including the innermost and outermost hits on the track, which are used
to extrapolate the trajectory to the interaction region and to the calorimeter and muon detectors,
respectively. A configurable parameter determines whether the silicon strip matched hits are used
as is or split into their component rφ and stereo hits. For the standard offline reconstruction, the
split hits are used to improve the track resolution, while for the HLT, the matched hits are used to
improve speed.
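
The filter-plus-smoother scheme can be sketched in Python for a deliberately simplified case. The sketch assumes a single scalar track parameter that stays constant between surfaces (no material effects, no magnetic-field propagation). All function names and numerical values are hypothetical and are not taken from the CMS software.

```python
def kalman_update(x, var, meas, meas_var):
    """Combine a prior (x, var) with one measurement; returns the posterior."""
    gain = var / (var + meas_var)
    return x + gain * (meas - x), (1.0 - gain) * var


def filter_pass(hits, x0, var0):
    """Run one filter over hits in the given order.

    Returns, for each hit, the (predicted, updated) states, i.e. the estimate
    before and after that hit is included. With a constant state and no
    material effects, the prediction is simply the previous updated state.
    """
    states = []
    x, var = x0, var0
    for meas, meas_var in hits:
        predicted = (x, var)
        x, var = kalman_update(x, var, meas, meas_var)
        states.append((predicted, (x, var)))
    return states


def smooth(hits, seed, seed_var, inflate=1.0e6):
    """Forward filter, backward filter, then a weighted average per surface."""
    forward = filter_pass(hits, seed, seed_var * inflate)
    x_last, var_last = forward[-1][1]
    # The second filter starts from the first filter's result, with the
    # variance scaled up so that the hits are not effectively counted twice.
    backward = filter_pass(hits[::-1], x_last, var_last * inflate)[::-1]
    smoothed = []
    for (_, (xf, vf)), ((xb, vb), _) in zip(forward, backward):
        # Forward state includes this hit; backward prediction excludes it.
        w_f, w_b = 1.0 / vf, 1.0 / vb
        smoothed.append(((w_f * xf + w_b * xb) / (w_f + w_b), 1.0 / (w_f + w_b)))
    return smoothed


# (measurement, variance) on successive surfaces, innermost first
hits = [(1.02, 0.01), (0.97, 0.01), (1.10, 0.04), (0.99, 0.01)]
smoothed = smooth(hits, seed=1.0, seed_var=0.01)
```

Because the toy state does not change between surfaces, the smoothed estimate on every surface coincides (up to the negligible weight of the inflated starting values) with the weighted mean of all hits, which is a convenient check that each hit enters exactly once.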

To obtain the best precision, this filtering and smoothing procedure uses a Runge-Kutta propagator
to extrapolate the trajectory from one hit to the next. This not only takes into account the
effect of material, but also accommodates an inhomogeneous magnetic field. The latter means
that the particle may not move along a perfect helix, and its equations of motion in the magnetic
field must therefore be solved numerically. To do so, the Runge-Kutta propagator divides the
distance to be extrapolated into many small steps. It extrapolates the track trajectory over each of
these steps in turn, using a well-known technique for solving first-order differential
equations, the fourth-order Runge-Kutta method, so named because it is accurate to fourth
order in the step size. The optimal step size is chosen automatically, according to how non-linear
the problem is. This automatic determination of the step size employs the method of ref. [34], which is based
on how well the fourth- and fifth-order Runge-Kutta predictions agree with each other. Use of the
Runge-Kutta propagator is most important in the region |η| > 1, where the magnetic field
inhomogeneities are greatest. For example, in this region, tracks fitted using the simple material propagator
are biased by up to 1% for particles with pT = 10 GeV. This bias is almost completely eliminated
when using the Runge-Kutta propagator. To ensure an accurate extrapolation of the track trajectory,
the Runge-Kutta propagator uses a detailed map of the magnetic field, which was measured before
LHC collisions to a precision of <0.01%.
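
A minimal Python sketch of stepwise Runge-Kutta propagation with automatic step-size control is given below. It makes several simplifying assumptions that are not from the text: motion is restricted to the transverse plane, material effects are ignored, and the step size is controlled by step doubling (comparing one full step with two half steps) rather than by the comparison of fourth- and fifth-order predictions used in ref. [34]. The function names and the field and momentum values are illustrative only.

```python
import math


def derivative(state, curvature):
    """d(state)/ds for a particle moving in the transverse plane.

    state = (x, y, tx, ty), with (tx, ty) the unit direction and s the path
    length. curvature(x, y) returns the local curvature in 1/m.
    """
    x, y, tx, ty = state
    k = curvature(x, y)
    return (tx, ty, k * ty, -k * tx)


def rk4_step(state, h, curvature):
    """One classical fourth-order Runge-Kutta step of length h."""
    def shifted(base, slope, scale):
        return tuple(b + scale * s for b, s in zip(base, slope))

    k1 = derivative(state, curvature)
    k2 = derivative(shifted(state, k1, h / 2), curvature)
    k3 = derivative(shifted(state, k2, h / 2), curvature)
    k4 = derivative(shifted(state, k3, h), curvature)
    return tuple(
        s + h / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )


def propagate(state, length, curvature, tol=1e-9, h=0.1):
    """Propagate over a path length, adapting the step by step doubling."""
    travelled = 0.0
    while travelled < length:
        h = min(h, length - travelled)
        full = rk4_step(state, h, curvature)
        half = rk4_step(rk4_step(state, h / 2, curvature), h / 2, curvature)
        error = max(abs(a - b) for a, b in zip(full, half))
        if error > tol:
            h /= 2  # too inaccurate: retry with a smaller step
            continue
        state, travelled = half, travelled + h
        if error < tol / 32:
            h *= 2  # comfortably accurate: try a larger step next time
    return state


# Illustrative uniform field: curvature 0.3 * B / pT for B = 3.8 T, pT = 1 GeV
R_INV = 0.3 * 3.8 / 1.0
end = propagate((0.0, 0.0, 1.0, 0.0), 1.0, lambda x, y: R_INV)
```

In a uniform field the result can be compared with the exact circular arc. Passing a position-dependent `curvature` function instead mimics an inhomogeneous field, for which no closed-form trajectory is available.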

Estimates of the track trajectory at any other points, such as the point of closest approach to
the beam-line, can be obtained by extrapolating the trajectory evaluated at the nearest hit to that
very point. This extrapolation also uses the Runge-Kutta propagator.

After filtering and smoothing, a search is made for spurious hits (outliers) incorrectly associated
with the track. Such hits can be related to an otherwise well-defined track, e.g., from δ-rays,
or unrelated, such as hits from nearby tracks or electronic noise. Two methods are used to find
outliers. One uses the measured residual between a hit and the track to reject hits whose χ2
compatibility with the track exceeds a configurable threshold (20 for Iterations 0–4 and 30 for Iteration
5). While a χ2 requirement of 30 on each hit is already applied during track finding, the outlier
rejection criterion provides a more powerful restriction as it uses information from the full fit [32].
The other method calculates a probability that a pixel hit is consistent with the track, taking into
account the charge distribution of the pixel hit, which generally comprises several pixel channels.
This probability corresponds to the χ2 defined in eq. (3.3). After removing an outlier, the track is
again filtered and smoothed and another check for outliers is made. This continues until no more
outliers are found. In cases where removing an outlier results in two consecutive ghost hits, the
track is terminated and the remaining outer hits are discarded (a configurable parameter, although
not used, is available to allow the track fitting to continue). If a track is found to have fewer than three
hits after outlier rejection, or if the track fit fails, the track is discarded (a configurable
parameter, although not used, is available to return the original track).

The default value of 20 for the χ2 requirement is chosen to reject a significant fraction of
outliers, while removing few genuine hits. With this value, approximately 20% of the spurious
outliers are removed from tracks reconstructed in high-density dijet events, whereas <0.2% of the
good hits are removed.
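
The iterative character of the residual-based outlier rejection can be sketched as follows. This is a toy model under stated assumptions, not the CMS code: the "track fit" is reduced to a weighted mean of one-dimensional hit positions, the χ2 of a hit is taken as its squared residual divided by its variance (ignoring the uncertainty of the fit itself), and the function names and hit values are hypothetical.

```python
def fit(hits):
    """Toy 'track fit': weighted mean of (position, sigma) measurements."""
    weights = [1.0 / sigma**2 for _, sigma in hits]
    return sum(w * pos for w, (pos, _) in zip(weights, hits)) / sum(weights)


def reject_outliers(hits, max_chi2=20.0, min_hits=3):
    """Refit and remove the worst outlier until none exceeds max_chi2."""
    hits = list(hits)
    while len(hits) >= min_hits:
        estimate = fit(hits)
        chi2 = [((pos - estimate) / sigma) ** 2 for pos, sigma in hits]
        worst = max(range(len(hits)), key=chi2.__getitem__)
        if chi2[worst] <= max_chi2:
            return estimate, hits
        del hits[worst]  # drop one outlier, then fit again
    return None  # too few hits left: the track is discarded


# Four compatible hits and one spurious hit (positions and sigmas in cm)
hits = [(0.00, 0.01), (0.01, 0.01), (-0.01, 0.01), (0.02, 0.01), (0.30, 0.01)]
result = reject_outliers(hits)
```

Only one hit is removed per pass. In the first pass of this example several genuine hits also exceed the threshold, because the spurious hit pulls the fit, but they become compatible again once the worst hit has been removed and the fit repeated.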

4.4 Track selection

In a typical LHC event containing jets, the track-finding procedure described above yields a signif-
icant fraction of fake tracks, where a fake track is defined as a reconstructed track not associated
with a charged particle, as defined in section 5. The fake rate (fraction of reconstructed tracks
that are fake) can be reduced substantially through quality requirements. Tracks are selected on
the basis of the number of layers that have hits, whether their fit yielded a good χ2/dof, and how
compatible they are with originating from a primary interaction vertex. If several primary vertices
are present in the event, as often happens due to pileup, all are considered. To optimize the perfor-
mance, several requirements are imposed as a function of the track η and pT, and on the number of
layers (Nlayers) with an assigned hit (where a layer with both rφ and stereo strip modules is counted
as a single layer). The selection criteria are as follows.

• A requirement on the minimum number of layers in which the track has at least one asso-
ciated hit. This differs from selections based on the number of hits on the track, because
more than one hit in a given layer can be assigned to a track, as in the case of layers with
overlapping sensors or double-sided layers in which two sensors are mounted back-to-back.

• A requirement on the minimum number of layers in which the track has an associated 3-D
hit (i.e., in the pixel tracker or matched hits in the strip tracker).

• A requirement on the maximum number of layers intercepted by the track containing no as-
signed hits, not counting those layers inside its innermost hit or outside its outermost hit, nor
those layers where no hit was expected because the module was known to be malfunctioning.

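• $\chi^2/\mathrm{dof} < \alpha_0 N_\mathrm{layers}$.

• $|d_0^\mathrm{BS}|/\delta d_0 < (\alpha_3 N_\mathrm{layers})^\beta$.

• $|z_0^\mathrm{PV}|/\delta z_0 < (\alpha_4 N_\mathrm{layers})^\beta$.

• $|d_0^\mathrm{BS}|/\sigma_{d_0}(p_\mathrm{T}) < (\alpha_1 N_\mathrm{layers})^\beta$.

• $|z_0^\mathrm{PV}|/\sigma_{z_0}(p_\mathrm{T}, \eta) < (\alpha_2 N_\mathrm{layers})^\beta$.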

The parameters αi and β are configurable constants. The track’s impact parameters are $d_0^\mathrm{BS}$
and $z_0^\mathrm{PV}$, where $d_0^\mathrm{BS}$ is the distance from the centre of the beam spot in the plane transverse to
the beam-line and $z_0^\mathrm{PV}$ is the distance along the beam-line from the closest pixel vertex. These
pixel vertices, described in section 6.2, are required to have at least three pixel tracks and if no
pixel vertices meet this requirement, then $z_0^\mathrm{PV}$ is required to be within 3σ of the z-position of
the centre of the beam spot, where σ is the Gaussian standard deviation corresponding to the
length of the beam spot in the z-direction. The above selection criteria include requirements on the
transverse $|d_0^\mathrm{BS}|/\delta d_0$ and longitudinal $|z_0^\mathrm{PV}|/\delta z_0$ impact parameter significances of the track, where
the impact parameter uncertainties, $\delta d_0$ and $\delta z_0$, are calculated from the covariance matrix of the
fitted track trajectory. A second pair of requirements is also imposed on these significances, but
calculated differently, with the uncertainties in the impact parameters being parametrized in terms
of pT and polar angle of the track: $\sigma(d_0) = \sigma(z_0 \sin\theta) = a \oplus b/p_\mathrm{T}$, where ⊕ represents the sum in
quadrature and a and b are parameters. Their nominal values are a = 30 µm and b = 10 µm GeV,
but b increases to 100 µm GeV for the loose and tight selection criteria used (and defined below) in
Iterations 0 and 1.
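
As an illustration of how these requirements combine, the following Python sketch applies them to a track described by a plain dictionary. The dictionary keys and function names are hypothetical. The parameter values are the high-purity entries for Iterations 0 and 1 in table 5, and the longitudinal uncertainty is taken as σ(z0) = σ(d0)/sinθ = σ(d0) cosh η, which is one reading of the parametrization above and should be treated as an assumption.

```python
import math


def sigma_d0(pt, a=30.0, b=10.0):
    """Parametrized d0 uncertainty in micrometres: a (+) b/pT in quadrature."""
    return math.hypot(a, b / pt)


def passes_selection(trk, par):
    """Apply the layer, chi2/dof and impact-parameter requirements."""
    n = trk["n_layers"]

    def bound(alpha):
        return (alpha * n) ** par["beta"]

    s_d0 = sigma_d0(trk["pt"], par["a"], par["b"])
    s_z0 = s_d0 * math.cosh(trk["eta"])  # assumes sigma(z0) = sigma(d0)/sin(theta)
    return (
        n >= par["min_layers"]
        and trk["n_3d_layers"] >= par["min_3d_layers"]
        and trk["n_lost_layers"] <= par["max_lost_layers"]
        and trk["chi2_dof"] < par["alpha0"] * n
        and abs(trk["d0"]) / s_d0 < bound(par["alpha1"])
        and abs(trk["z0"]) / s_z0 < bound(par["alpha2"])
        and abs(trk["d0"]) / trk["d0_err"] < bound(par["alpha3"])
        and abs(trk["z0"]) / trk["z0_err"] < bound(par["alpha4"])
    )


# High-purity values for Iterations 0 and 1 (table 5); a, b in um and um GeV
HIGH_PURITY_ITER01 = dict(
    min_layers=4, min_3d_layers=4, max_lost_layers=2, alpha0=0.9, beta=4,
    alpha1=0.30, alpha2=0.35, alpha3=0.40, alpha4=0.40, a=30.0, b=10.0,
)

# Hypothetical track: d0, z0 and their uncertainties in micrometres, pT in GeV
few_layers = dict(n_layers=4, n_3d_layers=4, n_lost_layers=0, chi2_dof=1.0,
                  pt=1.0, eta=0.0, d0=100.0, z0=50.0, d0_err=40.0, z0_err=40.0)
many_layers = dict(few_layers, n_layers=10)

ok_few = passes_selection(few_layers, HIGH_PURITY_ITER01)
ok_many = passes_selection(many_layers, HIGH_PURITY_ITER01)
```

With these inputs the four-layer track fails the parametrized transverse requirement (significance of about 3.2 against a bound of 1.2^4 ≈ 2.1), whereas the same track with hits in ten layers passes, since the bound grows to 3^4 = 81.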

The fraction of fake tracks decreases roughly exponentially as a function of the number of
layers in which the track has associated hits: dNfake/dNlayers ∼ exp(−ωNlayers), with ω in the range
0.9–1.3 depending on the pT of the track. As a consequence, weaker selection criteria can be
applied for tracks having many hit layers, which is the reason for the chosen selection criteria. For
tracks with hits in at least 10 layers, the selection requirements on χ2 and impact parameters are
found to reject no tracks. However, the criteria become far more stringent for tracks with relatively
few hit layers.

The above quality criteria were initially optimized as a function of track pT and Nlayers, so as
to maximize the quality $Q(\rho) = s/\sqrt{s + \rho b}$, where s is the number of selected genuine (non-fake)
tracks, b is the number of selected fake tracks and ρ ≈ 10 inflates the importance of the fake tracks
to achieve low fake rates (below 1% for PYTHIA QCD events with p̂T of the two outgoing partons
in the range 170–230GeV). As data taking conditions have evolved, the parameters have been
adjusted to maintain high efficiency and low fake rate.
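
The effect of ρ on the optimization can be seen in a short Python sketch. The three working points and their track counts are invented purely for illustration.

```python
import math


def quality(s, b, rho=10.0):
    """Q(rho) = s / sqrt(s + rho * b)."""
    return s / math.sqrt(s + rho * b)


# Hypothetical working points: (label, genuine tracks kept, fake tracks kept)
working_points = [("loose cut", 1000, 60), ("medium cut", 950, 20), ("hard cut", 700, 2)]

best_rho10 = max(working_points, key=lambda wp: quality(wp[1], wp[2], rho=10.0))
best_rho1 = max(working_points, key=lambda wp: quality(wp[1], wp[2], rho=1.0))
```

With ρ = 1 the loosest working point has the highest Q in this example, while with ρ = 10 the penalty on fake tracks moves the optimum to the medium working point.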

The track selection criteria for each iteration are given in table 5. The loose criteria denote
the minimum requirements for a track to be kept in the general track collection. The tight and
high-purity criteria provide progressively more stringent requirements, which reduce the efficiency
and fake rate. In general, high-purity tracks are used for scientific analysis, although in cases where
efficiency is essential and purity is not a major concern, the loose tracks can be used. The criteria
for the initial tracking iterations emphasise compatibility with originating from a primary vertex as
a means of assuring quality, while the criteria used for the later iterations rely on other measures
of track quality such as fit χ2 and the number of hits, ensuring thereby that they are still useful
for selecting displaced tracks. This matches the seeding and track-finding requirements shown in
tables 3–4, and is aligned with the goals for the six iterations.

After the track selection is complete, the tracks found by each of the six iterations are merged
into a single collection.

Table 5. Parameter values used in selecting tracks reconstructed by each of the six iterative tracking steps.
The first table shows the three requirements on the number of layers that contain hits assigned to tracks and
the parameter α0 that controls selection criteria based on χ2/dof. The second table shows the parameters αi
and β that define compatibility of impact parameters with the interaction point. Each parameter has three
entries, corresponding to the loose (L), tight (T), and high-purity (H) selection requirements. Iterations 2 and
3 use two paths that emphasise track quality (Trk) or primary-vertex compatibility (Vtx). A track produced
by these iterations is retained if it passes either of these criteria.

            Min layers    Min 3-D layers   Max lost layers   α0
Iteration   L   T   H     L   T   H        L   T   H         L     T     H
0 & 1       0   3   4     0   3   4        ∞   2   2         2.0   0.9   0.9
2 Trk       4   5   5     0   3   3        ∞   1   1         0.9   0.7   0.5
2 Vtx       3   3   3     0   3   3        ∞   1   1         2.0   0.9   0.9
3 Trk       4   5   5     2   3   4        1   1   1         0.9   0.7   0.5
3 Vtx       3   3   3     2   3   3        1   1   1         2.0   0.9   0.9
4           5   5   6     3   3   3        1   0   0         0.6   0.4   0.3
5           6   6   6     2   2   2        1   0   0         0.6   0.35  0.25

                 α1                 α2                 α3                 α4
Iteration   β    L     T     H      L     T     H      L     T     H      L     T     H
0 & 1       4    0.55  0.30  0.30   0.65  0.35  0.35   0.55  0.40  0.40   0.45  0.40  0.40
2 Trk       4    1.50  1.00  0.90   1.50  1.00  0.90   1.50  1.00  0.90   1.50  1.00  0.90
2 Vtx       3    1.20  0.95  0.85   1.20  0.90  0.80   1.30  1.00  0.90   1.30  1.00  0.90
3 Trk       4    1.80  1.10  1.00   1.80  1.10  1.00   1.80  1.10  1.00   1.80  1.10  1.00
3 Vtx       3    1.20  1.00  0.90   1.20  1.00  0.90   1.30  1.10  1.00   1.30  1.10  1.00
4           3    1.50  1.20  1.00   1.50  1.20  1.00   1.50  1.20  1.00   1.50  1.20  1.00
5           3    1.80  1.30  1.20   1.50  1.20  1.10   1.80  1.30  1.20   1.50  1.20  1.10

4.5 Specialized tracking

The track reconstruction described above produces the main track collection used by the CMS
collaboration. However, variants of this software are also used for more specialized purposes, as
described in this section.

4.5.1 Electron track reconstruction

Electrons, being charged particles, can be reconstructed through the standard track reconstruction.
However, as electrons lose energy primarily through bremsstrahlung, rather than ionization, large
energy losses are common. For example, about 35% of electrons radiate more than 70% of their ini-
tial energy before reaching the electromagnetic calorimeter (ECAL) that surrounds the tracker. The
energy loss distribution is highly non-Gaussian, and therefore the standard Kalman filter, which is
optimal when all variables have Gaussian uncertainties, is not appropriate. As a result, the
efficiency and resolution of the standard tracking are not particularly good for electrons. Electron
candidates are therefore reconstructed using a combination of two techniques that make use of
information not only from the tracker, but also from the ECAL. As this subject is beyond the scope
of this paper, only a brief description of these methods is given.

The first method [35] starts by searching for clusters of energy in the ECAL. The curvature
of electrons in the strong CMS magnetic field means that bremsstrahlung photons emitted by the
electrons will, in general, strike the ECAL at η values similar to that of the electron, but at different
azimuthal coordinates (φ). To recover this radiated energy, ECAL superclusters are formed, by
merging clusters of similar η over some range of φ . The knowledge of the energy and position
of each supercluster, and the assumption that the electron originated near the centre of the beam
spot, constrains the trajectory of the electron through the tracker (aside from a two-fold ambiguity
introduced by its unknown charge). Tracker seeds compatible with this trajectory are sought in the
pixel tracker (and also in the TEC to improve efficiency in the forward region).

The second method [36] takes the standard track collection (excluding tracks found by Itera-
tion 5, as described in table 3) and attempts to identify a subset of these tracks that are compatible
with being electrons. Electrons that suffer only little bremsstrahlung loss can be identified by
searching for tracks extrapolated to the ECAL that pass close to an ECAL cluster. Electrons that
suffer large bremsstrahlung loss can be identified by the fact that the fitted track will often have
poor χ2 or few associated hits. The track seeds originally used to generate these electron-like
tracks are retained.

The seed collections obtained by using these two methods are merged, and used to initiate
electron track finding. This procedure is similar to that used in standard tracking, except that the
χ2 threshold, used by the Kalman filter to decide whether a hit is compatible with a trajectory,
is weakened from 30 to 2000. This is to accommodate tracks that deviate from their expected
trajectory because of bremsstrahlung. In addition, the penalties assigned to track candidates for
passing through a tracker layer without being assigned a hit are adjusted. This is necessary because
bremsstrahlung photons can convert into e+e− pairs with the track-finding algorithm incorrectly
forming a track by combining hits from the primary electron with one of the conversion electrons.

To obtain the best parameter estimates, the final track fit is performed using a modified version
of the Kalman filter, called the Gaussian Sum Filter (GSF) [37]. In essence, the fractional energy
loss of an electron, as it traverses material of a given thickness, is expected to have a distribution
described by the Bethe-Heitler formula. This distribution is non-Gaussian, making it unsuitable for
use in a conventional Kalman filter algorithm. The GSF technique solves this by approximating
the Bethe-Heitler energy-loss distribution as the sum of several Gaussian functions, whose means,
widths, and relative amplitudes are chosen so as to optimize this approximation. The parame-
ters of these Gaussian energy-loss functions are determined only once. Each track trajectory is
also represented by a mixture of several ‘trajectory components’, where each trajectory component
has helix parameters with Gaussian uncertainties, and a ‘weight’ corresponding to the probability
that it correctly describes the true path of the particle. Initially, a track trajectory is described by
only a single such trajectory component, derived from the track seed. When propagating a tra-
jectory component through a layer of material in the tracker, the estimated mean energy of the
trajectory component is reduced and its uncertainty increased, according to the mean and width
of each Gaussian component of the energy-loss distribution applied independently, in turn, to the
original trajectory component. Thus, after passing through a layer of material, each original
trajectory component gives rise to several new trajectory components, each one obtained using one
of the Gaussian energy-loss functions. The weight of each new trajectory component is given by
the product of the weight of the original trajectory component and the weight of the corresponding
Gaussian component of the energy-loss distribution. To avoid an exponential explosion in the
number of trajectory components being followed, as the track candidate is propagated through suc-
cessive tracker layers, the less probable trajectory components are dropped or merged (by grouping
together similar trajectory components), so as to limit their number to 12. Each trajectory compo-
nent will also be updated by the Kalman filter if an additional hit is assigned to it when passing
through a layer. When this happens, the weight of the trajectory component is further adjusted
according to its compatibility with the hit.
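
The bookkeeping of trajectory components in the GSF can be sketched in Python as follows. The sketch tracks only the energy of each component (not the full set of helix parameters), uses an invented three-Gaussian mixture in place of a fitted approximation to the Bethe-Heitler distribution, and limits the number of components by dropping the least probable ones rather than merging similar ones. All names and numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Component:
    weight: float    # probability that this component describes the particle
    energy: float    # mean energy estimate (GeV)
    variance: float  # variance of the energy estimate (GeV^2)


# Hypothetical three-Gaussian approximation of the distribution of the
# fraction z of energy retained after a layer: (weight, mean z, sigma of z).
ENERGY_LOSS_MIXTURE = [(0.80, 0.98, 0.02), (0.15, 0.80, 0.10), (0.05, 0.40, 0.20)]


def cross_material(components, mixture=ENERGY_LOSS_MIXTURE, max_components=12):
    """Apply every energy-loss Gaussian to every trajectory component."""
    new = []
    for c in components:
        for w, z_mean, z_sigma in mixture:
            new.append(Component(
                weight=c.weight * w,
                energy=c.energy * z_mean,
                # Scale the old uncertainty and add the energy-loss spread
                # (first-order approximation for the product E * z).
                variance=c.variance * z_mean**2 + (c.energy * z_sigma) ** 2,
            ))
    # Keep the most probable components (dropping only; merging is omitted).
    new.sort(key=lambda c: c.weight, reverse=True)
    kept = new[:max_components]
    total = sum(c.weight for c in kept)
    for c in kept:
        c.weight /= total  # renormalize so that the weights sum to one
    return kept


state = [Component(weight=1.0, energy=30.0, variance=0.25)]
for _ in range(4):  # four layers of material
    state = cross_material(state)
```

Starting from a single component, the number of components grows as 3, 9, 27 and is then capped at 12, while the most probable component is the one that took the small-loss Gaussian at every layer.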

The GSF fit provides estimates of the track parameters, whose uncertainties are described not
by a single Gaussian distribution, but instead by the sum of several Gaussian distributions, each
corresponding to the uncertainty on one of the trajectory components that make up the track. For
each parameter, the mode of this distribution is used as it is found to provide the best estimates of
the parameters.

The performance of the GSF electron tracking has been studied both with simulations [37] and
with data [38], with good agreement observed between the two.

4.5.2 Track reconstruction in the high-level trigger

The CMS high-level trigger (HLT) [39] uses a processor farm running C++ software to achieve
large reductions in data rate. The HLT filters events that are selected by the Level-1 (hardware)
trigger at rates of up to 100 kHz. Whereas Level-1 uses information only from the CMS calorimeters
and muon detectors, the HLT is also able to capture information from the tracker, thereby adding
the powerful tool of track reconstruction to the HLT. Some examples of how this improves the
HLT performance are listed below.

• Requiring muon candidates that are reconstructed in the muon detectors to be confirmed
through the presence of a corresponding track in the tracker greatly reduces the false recon-
struction rate and substantially improves momentum resolution.

• Energy clusters found in the electromagnetic calorimeters can be identified as electrons or
photons through the presence of a track of appropriate momentum pointing to the cluster.

• The background rejection rate for lepton triggers can be enhanced by requiring leptons to be
isolated. One method of doing this is to use a veto on the presence of (too many) tracks in a
cone around the lepton direction.

• Triggering on jets produced by b quarks can be done by counting the number of tracks in a jet
that have transverse impact parameters statistically incompatible with the track originating
from the beam-line.

• Triggers on τ decays τ → ℓ ν_ℓ ν_τ, where ℓ = e or µ, can be extended to τ → h ν_τ decays,
where h represents one or more charged hadrons, by reconstructing a narrow, isolated jet
using tracks in combination with calorimeter information.

The HLT uses track reconstruction software that is identical to that used for offline reconstruc-
tion, but it must run much faster. This is achieved by using a modified configuration of the track
reconstruction.

Tracks can be reconstructed from triplets of hits found using only the pixel tracker, as doc-
umented in sections 4.1 and 6.2. This is extremely fast, and can be used with great effect in the
reconstruction of the primary-vertex position in the HLT, described in section 6.2.

Tracks can also be reconstructed in the HLT using hits from both the pixel and strip detectors.
Such tracks have superior momentum resolution and a lower probability of being fake. However,
this requires much more CPU time than just reconstructing pixel tracks, since the strip tracker
does not provide the precise 3-D hits of the pixel tracker, and suffers from a higher hit occupancy.
This can be mitigated using some or all of the following techniques (the details vary significantly,
depending on the type of trigger).

• Rather than trying to reconstruct all tracks in the event, regional track reconstruction can be
performed instead, where the software is used to reconstruct tracks lying within a specified
η-φ region around some object of interest (which might be a muon, electron, or jet candidate
reconstructed using the calorimeters or muon detectors). This saves CPU time, and is ac-
complished by using regional seeding. This method differs from the track seeding described
in section 4.1, in that it only forms seeds from combinations of hits that are consistent with
a track heading into the desired η-φ region. Another important ingredient of regional track-
ing concerns the extraction of hits. As discussed in section 3.2, hits are reconstructed after
unpacking the original data blocks produced by the FED readout boards. Significant time
is saved by unpacking only the data from those FED units that read out tracker modules
within the region of interest [40]. This is not used in the offline reconstruction as the track
reconstruction searches the entire η-φ region and therefore needs all hits.

• Further gains in speed can be made by performing just a single iteration in the iterative track-
ing, such that only seeds made from pairs of pixel hits are considered, where these hits are
compatible with a track originating within a few millimetres of a primary pixel vertex. Fur-
thermore, the HLT uses a higher pT requirement when forming the seeds (usually >1GeV)
than is used for offline reconstruction. These stringent requirements on track impact param-
eter and pT reduce the number of seeds, and thereby the amount of time spent building track
candidates.

• Track finding can differ from that described in section 4.2, in that it can rely on partial track
reconstruction. With this technique, the building of each track candidate is stopped once a
specific condition is met, for example, a given minimum number of hits (typically eight), or
a certain precision requirement on the track parameters. As a consequence, the hits in the
outermost layers of the tracker tend not to be used. While such partially reconstructed tracks
will have slightly poorer momentum resolution and higher fake rates than fully reconstructed
tracks, they also take less CPU time to construct.

• Other changes in the tracking configuration can further enhance the speed of reconstruction.
For example, when building track candidates from a given seed, the offline track recon-
struction retains at most the best five partially reconstructed candidates for extrapolating to
the next layer. Changing this configurable parameter to retain fewer candidates can save
CPU time.
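
The regional selection described in the first item of the list above amounts to an η-φ window test in which the periodicity of φ must be respected. A minimal Python sketch, with hypothetical function names, window half-widths, and seed directions, is:

```python
import math


def delta_phi(phi1, phi2):
    """Signed azimuthal difference wrapped into [-pi, pi)."""
    return (phi1 - phi2 + math.pi) % (2.0 * math.pi) - math.pi


def in_region(seed_eta, seed_phi, roi_eta, roi_phi, half_eta=0.5, half_phi=0.5):
    """True if a seed direction lies inside a rectangular eta-phi region."""
    return (abs(seed_eta - roi_eta) <= half_eta
            and abs(delta_phi(seed_phi, roi_phi)) <= half_phi)


# Hypothetical seeds as (eta, phi) directions, and a jet candidate near phi = pi
seeds = [(1.1, 3.10), (1.3, -3.05), (1.2, 0.20), (-0.4, 3.00)]
jet_eta, jet_phi = 1.2, 3.10
selected = [s for s in seeds if in_region(*s, jet_eta, jet_phi)]
```

The second seed is kept even though its φ value has the opposite sign to that of the jet, because the two directions are close once the wrap-around at ±π is taken into account.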

Pixel tracking and other aspects of track reconstruction absorb about 20% of the total HLT
CPU time. This is kept low by performing track reconstruction only when necessary, and only
after other requirements have been satisfied, so as to reduce the rate at which tracking must be
performed. Track reconstruction is employed in a variety of ways to satisfy different needs in the
HLT. Examples of track reconstruction at the HLT include seeds originating in the muon detector,
tracking in a specific η-φ region defined by a jet, and searching for tracks over the full detector.
Even the most comprehensive (and slowest) track reconstruction configuration at the HLT is more
than ten times faster than the offline reconstruction of tracks in events representative of data taken
in 2011 (tt̄ + 10 pileup events).

5 Track reconstruction performance

In this section, the performance of the CTF tracking algorithm is evaluated in terms of tracking
efficiency and fake rate, track parameter resolutions, and the CPU time required for processing
collision events. Two different categories of simulated samples are used: isolated particles and pp
collision events. Comparing the results helps one understand both the performance of the tracking
for isolated particles and to what extent it is degraded in a high hit occupancy environment.

Simulated events offer the possibility of detailed studies of track reconstruction, such as the
way characteristics of the tracker and the design of the track reconstruction algorithms influence
its performance over a wide range of particle momenta and rapidities, and how much its perfor-
mance depends on the type of charged particle being reconstructed, and on whether this particle
is isolated or not. The performance in simulation can be compared with that in data in certain re-
gions of phase space to verify that the results from simulation are realistic. The CMS collaboration
demonstrated previously that its simulation describes the momentum resolution of muons from J/ψ
decay to an accuracy of better than 5% [41]; and does similarly well in describing the dimuon mass
resolution of muons from Z boson decay [42]. The transverse and longitudinal impact parame-
ters of tracks reconstructed in typical multijet events agree in data and simulation to better than
10% [43]. The CMS collaboration also showed that the tracking efficiency for particles from J/ψ
and charmed hadron decays is simulated with a precision better than 5% [44]. A similar compari-
son for the higher-momentum muons from Z boson decay will be presented in section 5.1.3 of the
present work.

The isolated particle samples that are used here consist of simple events with just a single gen-
erated muon, pion or electron, although secondary particles may also be present due to interactions
with the detector material. The single particles are generated with a flat distribution in pseudora-
pidity inside the tracker acceptance |η |< 2.5. Their transverse momenta are either fixed to 1, 10 or
100GeV, or are generated according to a flat distribution in ln(pT). The former set of particles with
fixed momenta is used for studying the tracking performance as a function of η , while the latter is
used to quantify the performance as a function of pT.
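
Generating transverse momenta with a flat distribution in ln(pT) can be done by sampling ln(pT) uniformly and exponentiating. The following Python sketch illustrates this; the pT range, the sample size, and the random seed are assumptions chosen for the example.

```python
import math
import random


def sample_pt(rng, pt_min, pt_max):
    """Draw pT (GeV) from a distribution that is flat in ln(pT)."""
    return math.exp(rng.uniform(math.log(pt_min), math.log(pt_max)))


rng = random.Random(12345)
pts = [sample_pt(rng, 0.1, 100.0) for _ in range(30000)]
# Each decade of pT should receive roughly one third of the particles.
per_decade = [sum(lo <= pt < 10 * lo for pt in pts) / len(pts) for lo in (0.1, 1.0, 10.0)]
```

Such a distribution populates each decade of pT equally, which is convenient when the performance is to be displayed on a logarithmic pT axis.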

For pp collisions, simulated inclusive tt̄ events are used, either with or without superimposed
pileup events. The average number of pileup collisions per LHC bunch crossing depends on the
instantaneous luminosity of the machine and on the period of data-taking over which the luminosity
is averaged. For the sake of simplicity, the number of pileup interactions superimposed on each
simulated tt̄ event is randomly generated from a Poisson distribution with mean equal to 8. This
amount of pileup corresponds roughly to what was delivered by the LHC, when averaged over the
whole 2011 running period. The tt̄ events, and also the minimum-bias events used for the pileup,
are generated with the PYTHIA 6 program [45].

Simulated particles are paired to reconstructed tracks for evaluating tracking efficiency, fake
rate, and other quantities discussed in this section. A simulated particle is associated with a re-
constructed track if at least 75% of the hits assigned to the reconstructed track originate from the
simulated particle. The association of simulated hits with reconstructed hits is possible because the
simulation software records the particles responsible for the signal in each channel of the tracker.
Strip and pixel response to electronic noise is also recorded. Reconstructed tracks that are not
associated with a simulated particle are referred to as fake tracks.
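
The association criterion can be illustrated with a short sketch. It assumes, for simplicity, that each hit on a reconstructed track is attributed to at most one simulated particle, with `None` marking a hit that comes from noise; the data layout and the function name are hypothetical and do not reflect the CMS software interface.

```python
from collections import Counter

MIN_SHARED_FRACTION = 0.75  # at least 75% of the track's hits, as stated in the text


def associate_track(hit_particle_ids):
    """Return the id of the associated simulated particle, or None for a fake track.

    hit_particle_ids: one entry per hit on the reconstructed track, giving the id
    of the simulated particle responsible for the hit, or None for a noise hit.
    """
    if not hit_particle_ids:
        return None
    counts = Counter(pid for pid in hit_particle_ids if pid is not None)
    if not counts:
        return None  # every hit comes from noise
    best_id, n_shared = counts.most_common(1)[0]
    # The fraction is taken with respect to all hits assigned to the track.
    if n_shared / len(hit_particle_ids) >= MIN_SHARED_FRACTION:
        return best_id
    return None


if __name__ == "__main__":
    print(associate_track([7, 7, 7, 3]))     # 3 of 4 hits from particle 7
    print(associate_track([7, 7, 3, None]))  # only 2 of 4 hits from particle 7
```

Because the threshold exceeds 50%, at most one simulated particle can satisfy the criterion for a given track, so taking the most frequent particle id is sufficient.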

Results for tracking efficiencies and for fake rates are presented in section 5.1. While the latter
is evaluated using only simulated samples, the former is also measured in data, as described in
section 5.1.3. The resolution obtained for track parameters is discussed in section 5.2. Unless indicated
otherwise, all results pertaining to the performance are obtained using the set of ‘high-purity’
tracks defined in section 4.4. Finally, section 5.3 provides estimates of the CPU time required for
different components of track reconstruction.

5.1 Tracking efficiency and fake rate

For simulated samples, the tracking efficiency is defined as the fraction of simulated charged
particles that can be associated with corresponding reconstructed tracks, where the association criterion
is the one described at the beginning of this section. This definition of efficiency depends not only
on the quality of the track-finding algorithm, but also on the intrinsic properties of the tracker,
such as its geometrical acceptance and material content. Using the same association criterion as
for the efficiency, the fake rate is defined as the fraction of reconstructed tracks that are not
associated with any simulated particle. This quantity represents the probability that a reconstructed
track is either a combination of unrelated hits or a genuine particle trajectory that is badly
reconstructed through the inclusion of spurious hits. The efficiency is given as a function of the pT
and η of the simulated particle, and the fake rate as a function of the pT and η of the reconstructed track.
The efficiency is obtained for simulated particles generated within |η| < 2.5, with a production
point less than 3 cm in r and less than 30 cm in |z| from the centre of the beam spot. These
criteria select fairly prompt particles. We also require pT > 0.9 GeV for the study of efficiency as
a function of η, or pT > 0.1 GeV for studying efficiency over the entire pT spectrum. Since the
‘high-purity’ requirement described in section 4.4 is the default track selection for the majority of
analyses in CMS, efficiency and fake rate are, unless otherwise stated, measured and presented here
using only the subset of reconstructed tracks that are identified as ‘high-purity’.
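
The two definitions can be summarised in a minimal sketch, given under stated assumptions: simulated particles and reconstructed tracks are represented as plain dictionaries with hypothetical field names, and each track already carries the id of its associated simulated particle (`sim_id`, or `None` if there is none). The selection thresholds are those quoted in the text.

```python
def is_selected(particle, pt_min=0.9):
    """Selection of fairly prompt charged particles inside the acceptance.

    r_cm and z_cm are the production-point coordinates relative to the centre
    of the beam spot (hypothetical field names).
    """
    return (
        particle["charge"] != 0
        and abs(particle["eta"]) < 2.5
        and particle["r_cm"] < 3.0
        and abs(particle["z_cm"]) < 30.0
        and particle["pt"] > pt_min
    )


def efficiency(sim_particles, tracks, pt_min=0.9):
    """Fraction of selected simulated particles matched to a high-purity track."""
    matched_ids = {
        t["sim_id"] for t in tracks if t["high_purity"] and t["sim_id"] is not None
    }
    selected = [p for p in sim_particles if is_selected(p, pt_min)]
    if not selected:
        return None
    return sum(p["id"] in matched_ids for p in selected) / len(selected)


def fake_rate(tracks):
    """Fraction of high-purity tracks not associated with any simulated particle."""
    high_purity = [t for t in tracks if t["high_purity"]]
    if not high_purity:
        return None
    return sum(t["sim_id"] is None for t in high_purity) / len(high_purity)


if __name__ == "__main__":
    sims = [
        {"id": 1, "charge": 1, "eta": 0.5, "pt": 2.0, "r_cm": 0.1, "z_cm": 1.0},
        {"id": 2, "charge": -1, "eta": 3.0, "pt": 2.0, "r_cm": 0.1, "z_cm": 1.0},
    ]
    trks = [
        {"sim_id": 1, "high_purity": True},
        {"sim_id": None, "high_purity": True},
    ]
    print(efficiency(sims, trks), fake_rate(trks))
```

In practice both quantities would be evaluated in bins of pT or η; the sketch computes only the integrated values.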

5.1.1 Results from simulation of isolated particles

This section presents the performance of the CTF tracking software in reconstructing trajectories
of particles in events containing just a single muon, a pion or an electron.

Muons are reconstructed better than any other charged particle in the tracker, as they mainly
interact with the silicon detector through ionization of the medium and, unlike electrons, their
energy loss through bremsstrahlung is negligible. Muons therefore tend to cross the entire volume
of the tracking system, producing detectable hits in several sensitive layers of the apparatus. Finally,
muon trajectories are altered almost exclusively by Coulomb scattering and energy loss, whose
effects are straightforward to include within the formalism of the Kalman filter. For isolated muons
with 1 < pT < 100 GeV, the tracking efficiency is >99% over the full η-range of the tracker acceptance,
and does not depend on pT (figure 8, top). The fake rate is completely negligible.

Charged pions, like muons, undergo multiple scattering and energy loss through ionization as
they cross the tracker volume. However, like all hadrons, pions are al