This content has been downloaded from IOPscience. Please scroll down to see the full text. Download details: IP Address: 186.217.236.119 This content was downloaded on 25/02/2015 at 16:54 Please note that terms and conditions apply. Description and performance of track and primary-vertex reconstruction with the CMS tracker View the table of contents for this issue, or go to the journal homepage for more 2014 JINST 9 P10009 (http://iopscience.iop.org/1748-0221/9/10/P10009) Home Search Collections Journals About Contact us My IOPscience iopscience.iop.org/page/terms http://iopscience.iop.org/1748-0221/9/10 http://iopscience.iop.org/1748-0221 http://iopscience.iop.org/ http://iopscience.iop.org/search http://iopscience.iop.org/collections http://iopscience.iop.org/journals http://iopscience.iop.org/page/aboutioppublishing http://iopscience.iop.org/contact http://iopscience.iop.org/myiopscience 2 0 1 4 J I N S T 9 P 1 0 0 0 9 PUBLISHED BY IOP PUBLISHING FOR SISSA MEDIALAB RECEIVED: May 26, 2014 REVISED: July 21, 2014 ACCEPTED: August 19, 2014 PUBLISHED: October 16, 2014 Description and performance of track and primary-vertex reconstruction with the CMS tracker The CMS collaboration E-mail: cms-publication-committee-chair@cern.ch ABSTRACT: A description is provided of the software algorithms developed for the CMS tracker both for reconstructing charged-particle trajectories in proton-proton interactions and for using the resulting tracks to estimate the positions of the LHC luminous region and individual primary- interaction vertices. Despite the very hostile environment at the LHC, the performance obtained with these algorithms is found to be excellent. For tt events under typical 2011 pileup conditions, the average track-reconstruction efficiency for promptly-produced charged particles with transverse momenta of pT > 0.9GeV is 94% for pseudorapidities of |η | < 0.9 and 85% for 0.9 < |η | < 2.5. The inefficiency is caused mainly by hadrons that undergo nuclear interactions in the tracker material. For isolated muons, the corresponding efficiencies are essentially 100%. For isolated muons of pT = 100GeV emitted at |η | < 1.4, the resolutions are approximately 2.8% in pT, and respectively, 10 µm and 30 µm in the transverse and longitudinal impact parameters. The position resolution achieved for reconstructed primary vertices that correspond to interesting pp collisions is 10–12 µm in each of the three spatial dimensions. The tracking and vertexing software is fast and flexible, and easily adaptable to other functions, such as fast tracking for the trigger, or dedicated tracking for electrons that takes into account bremsstrahlung. KEYWORDS: Pattern recognition, cluster finding, calibration and fitting methods; Large detector- systems performance; Performance of High Energy Physics Detectors ARXIV EPRINT: 1405.6569 c© CERN 2014 for the benefit of the CMS collaboration, published under the terms of the Creative Commons Attribution 3.0 License by IOP Publishing Ltd and Sissa Medialab srl. Any further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation and DOI. doi:10.1088/1748-0221/9/10/P10009 mailto:cms-publication-committee-chair@cern.ch http://arxiv.org/abs/1405.6569 http://creativecommons.org/licenses/by/3.0/ http://creativecommons.org/licenses/by/3.0/ http://dx.doi.org/10.1088/1748-0221/9/10/P10009 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Contents 1 Introduction 1 2 The CMS tracker 2 3 Reconstruction of hits in the pixel and strip tracker 4 3.1 Hit reconstruction in the pixel detector 5 3.1.1 First-pass hit reconstruction 5 3.1.2 Template-based hit reconstruction 6 3.2 Hit reconstruction in the strip detector 7 3.3 Hit efficiency 8 3.4 Hit resolution 9 4 Track reconstruction 12 4.1 Seed generation 13 4.2 Track finding 16 4.3 Track fitting 19 4.4 Track selection 21 4.5 Specialized tracking 23 4.5.1 Electron track reconstruction 23 4.5.2 Track reconstruction in the high-level trigger 25 5 Track reconstruction performance 27 5.1 Tracking efficiency and fake rate 28 5.1.1 Results from simulation of isolated particles 28 5.1.2 Results from simulated pp collision events 32 5.1.3 Efficiency estimated from data 34 5.2 Resolution in the track parameters 37 5.2.1 Results from simulation of isolated particles 37 5.2.2 Results from simulated pp collision events 42 5.3 CPU execution time 42 6 Beam spot and primary-vertex reconstruction and its performance 42 6.1 Primary-vertex reconstruction 42 6.1.1 Primary-vertex resolution 49 6.1.2 Efficiency of primary-vertex reconstruction 50 6.2 Track and vertex reconstruction with the pixel detector 51 6.2.1 Tracking efficiency and fake rate for pixel tracks 52 6.2.2 Resolution in the parameters of pixel tracks 52 6.2.3 Position resolution for pixel based vertices 52 6.3 Reconstruction of the LHC beam spot 53 – i – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 6.3.1 Determination of the position of the centre of the beam spot 55 6.3.2 Determining the size of the beam spot 56 7 Summary and conclusions 59 The CMS collaboration 64 1 Introduction At an instantaneous luminosity of 1034 cm−2 s−1, typical of that expected at the Large Hadron Col- lider (LHC), with the proton bunches crossing at intervals of 25 ns, the Compact Muon Solenoid (CMS) tracker is expected to be traversed by about 1000 charged particles at each bunch cross- ing, produced by an average of more than twenty proton-proton (pp) interactions. These multiple interactions are known as pileup, to which prior or later bunch crossings can also contribute be- cause of the finite time resolution of the detector. Reconstructing tracks in such a high-occupancy environment is immensely challenging. It is difficult to attain high track-finding efficiency, while keeping the fraction of fake tracks small. Fake tracks are falsely reconstructed tracks that may be formed from a combination of unrelated hits or from a genuine particle trajectory that is badly reconstructed through the inclusion of spurious hits. In addition, the tracking software must run sufficiently fast to be used not only for offline event reconstruction (of ≈109 events per year), but also for the CMS High-Level Trigger (HLT), which processes events at rates of up to 100 kHz. The scientific goals of CMS [1, 2] place demanding requirements on the performance of the tracking system. Searches for high-mass dilepton resonances, for example, require good momen- tum resolution for transverse momenta pT of up to 1TeV. At the same time, efficient reconstruction of tracks with very low pT of order 100MeV is needed for studies of hadron production rates and to obtain optimum jet energy resolution with particle-flow techniques [3]. In addition, it is essential to resolve nearby tracks, such as those from 3-prong τ-lepton decays. Furthermore, excellent impact parameter resolution is needed for a precise measurement of the positions of primary pp interaction vertices as well as for identifying b-quark jets [4]. While the CMS tracker [5] was designed with the above requirements in mind, the track- finding algorithms must fully exploit its capabilities, so as to deliver the desired performance. The goal of this paper is to describe the algorithms used to achieve this and show the level of perfor- mance attained. The focus here is purely on pp collisions, with heavy ion collisions being beyond the scope of this document. Section 2 introduces the CMS tracker; and section 3 describes the reconstruction of the hits created by charged particles crossing the tracker’s sensitive layers. The algorithms used to reconstruct tracks from these hits are explained in section 4; and the perfor- mance obtained in terms of track-finding efficiency, proportion of fake tracks and track parameter resolution is presented in section 5. Primary vertices from pp collisions are distributed over a lu- minous region known as the beam spot. Reconstruction of the beam spot and of the primary vertex positions is described in section 6. This is intimately connected with tracking, since on the one hand, the beam spot and primary vertices are found using reconstructed tracks, and on the other – 1 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 hand, an approximate knowledge of their positions is needed before track finding can begin. All results shown in this paper are based on pp collision data collected or events simulated at a centre- of-mass energy of √ s = 7TeV in 2011. The simulated events include a full simulation of the CMS detector response based on GEANT4 [6]. All events are reconstructed using software from the same period. The track-reconstruction algorithms have been steadily evolving since then, but still have a similar design now. The CMS detector [5] was commissioned initially using cosmic ray muons and subsequently using data from the first LHC running period. Results obtained using cosmic rays in 2008 [7] are ex- tensively documented in several publications pertaining to the pixel detector [8], strip detector [9], tracker alignment [10], and magnetic field [11], and are of particular relevance to the present paper. Results from the commissioning of the tracker using pp collisions in 2010 are presented in [12]. 2 The CMS tracker The CMS collaboration uses a right-handed coordinate system, with the origin at the centre of the detector, the x-axis pointing to the centre of the LHC ring, the y-axis pointing up (perpendicular to the plane of the LHC ring), and with the z-axis along the anticlockwise-beam direction. The polar angle θ is defined relative to the positive z-axis and the azimuthal angle φ is defined relative to the x-axis in the x-y plane. Particle pseudorapidity η is defined as − ln[tan(θ/2)]. The CMS tracker [5] occupies a cylindrical volume 5.8m in length and 2.5m in diameter, with its axis closely aligned to the LHC beam line. The tracker is immersed in a co-axial magnetic field of 3.8 T provided by the CMS solenoid. A schematic drawing of the CMS tracker is shown in figure 1. The tracker comprises a large silicon strip tracker with a small silicon pixel tracker inside it. In the central pseudorapidity region, the pixel tracker consists of three co-axial barrel layers at radii between 4.4cm and 10.2cm and the strip tracker consists of ten co-axial barrel layers extending outwards to a radius of 110cm. Both subdetectors are completed by endcaps on either side of the barrel, each consisting of two disks in the pixel tracker, and three small plus nine large disks in the strip tracker. The endcaps extend the acceptance of the tracker up to a pseudorapidity of |η |< 2.5. The pixel detector consists of cylindrical barrel layers at radii of 4.4, 7.3 and 10.2cm, and two pairs of endcap disks at z = ±34.5 and ±46.5cm. It provides three-dimensional (3-D) position measurements of the hits arising from the interaction of charged particles with its sensors. The hit position resolution is approximately 10 µm in the transverse coordinate and 20–40 µm in the longitudinal coordinate, while the third coordinate is given by the sensor plane position. In total, its 1440 modules cover an area of about 1m2 and have 66 million pixels. The strip tracker has 15 148 silicon modules, which in total cover an active area of about 198m2 and have 9.3 million strips. It is composed of four subsystems. The Tracker Inner Barrel (TIB) and Disks (TID) cover r < 55cm and |z| < 118cm, and are composed of four barrel lay- ers, supplemented by three disks at each end. These provide position measurements in rφ with a resolution of approximately 13–38 µm. The Tracker Outer Barrel (TOB) covers r > 55cm and |z|< 118cm and consists of six barrel layers providing position measurements in rφ with a resolu- tion of approximately 18–47 µm. The Tracker EndCaps (TEC) cover the region 124 < |z|< 282cm. – 2 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 r (c m ) 0 10 20 30 40 50 60 70 80 90 100 110 z (cm) -300 -200 -100 0 100 200 300 3.0 2.8 2.6 2.4 2.2 2.0 1.8 1.6 -3.0 -2.8 -2.6 -2.4 -2.2 -2.0 -1.8 -1.6 -1.4 -1.2 -1.0 -0.8 -0.6 -0.4 -0.2 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 → η → −TEC TEC+ TOB TIB−TID TID+ PIXEL Figure 1. Schematic cross section through the CMS tracker in the r-z plane. In this view, the tracker is symmetric about the horizontal line r = 0, so only the top half is shown here. The centre of the tracker, corresponding to the approximate position of the pp collision point, is indicated by a star. Green dashed lines help the reader understand which modules belong to each of the named tracker subsystems. Strip tracker modules that provide 2-D hits are shown by thin, black lines, while those permitting the reconstruction of hit positions in 3-D are shown by thick, blue lines. The latter actually each consist of two back-to-back strip modules, in which one module is rotated through a ‘stereo’ angle. The pixel modules, shown by the red lines, also provide 3-D hits. Within a given layer, each module is shifted slightly in r or z with respect to its neighbouring modules, which allows them to overlap, thereby avoiding gaps in the acceptance. Each TEC is composed of nine disks, each containing up to seven concentric rings of silicon strip modules, yielding a range of resolutions similar to that of the TOB. To refer to the individual layers/disks within a subsystem, we use a numbering convention whereby the barrel layer number increases with its radius and the endcap disk number increases with its |z|-coordinate. When referring to individual rings within an endcap disk, the ring number increases with the radius of the ring. The modules of the pixel detector use silicon of 285 µm thickness, and achieve resolutions that are roughly the same in rφ as in z, because of the chosen pixel cell size of 100× 150 µm2 in rφ × z. The modules in the TIB, TID and inner four TEC rings use silicon that is 320 µm thick, while those in the TOB and the outer three TEC rings use silicon of 500 µm thickness. In the barrel, the silicon strips usually run parallel to the beam axis and have a pitch (i.e., the distance between neighbouring strips) that varies from 80 µm in the inner TIB layers to 183 µm in the inner TOB layers. The endcap disks use wedge-shaped sensors with radial strips, whose pitch varies from 81 µm at small radii to 205 µm at large radii. The modules in the innermost two layers of both the TIB and the TOB, as well as the modules in rings 1 and 2 of the TID, and 1, 2 and 5 of the TEC, carry a second strip detector module, which is mounted back-to-back to the first and rotated in the plane of the module by a ‘stereo’ angle of 100mrad. The hits from these two modules, known as ‘rφ ’ and ‘stereo hits’, can be combined into matched hits that provide a measurement of the second coordinate (z in the barrel and r on the – 3 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 disks). The achieved single-point resolution of this measurement is an order of magnitude worse than in rφ . The principal characteristics of the tracker are summarized in table 1. Figure 2 shows the material budget of the CMS tracker, both in units of radiation lengths and nuclear interaction lengths, as estimated from simulation. The simulation describes the tracker material budget with an accuracy better than 10% [13], as was established by measuring the distri- bution of reconstructed nuclear interactions and photon conversions in the tracker. Table 1. A summary of the principal characteristics of the various tracker subsystems. The number of disks corresponds to that in a single endcap. The location specifies the region in r (z) occupied by each barrel (endcap) subsystem. Tracker subsystem Layers Pitch Location Pixel tracker barrel 3 cylindrical 100×150 µm2 4.4 < r < 10.2cm Strip tracker inner barrel (TIB) 4 cylindrical 80–120 µm 20 < r < 55cm Strip tracker outer barrel (TOB) 6 cylindrical 122–183 µm 55 < r < 116cm Pixel tracker endcap 2 disks 100×150 µm2 34.5 < |z|< 46.5cm Strip tracker inner disks (TID) 3 disks 100–141 µm 58 < |z|< 124cm Strip tracker endcap (TEC) 9 disks 97–184 µm 124 < |z|< 282cm η -4 -3 -2 -1 0 1 2 3 4 0 t/X 0 0.5 1 1.5 2 2.5 Support Tube TOB Pixel TEC TIB and TID Beam Pipe CMS simulation η -4 -3 -2 -1 0 1 2 3 4 Iλt/ 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 Support Tube TOB Pixel TEC TIB and TID Beam Pipe CMS simulation Figure 2. Total thickness t of the tracker material traversed by a particle produced at the nominal interaction point, as a function of pseudorapidity η , expressed in units of radiation length X0 (left) and nuclear interac- tion length λI (right). The contribution to the total material budget of each of the subsystems that comprise the CMS tracker is shown, together with contributions from the beam pipe and from the support tube that surrounds the tracker. 3 Reconstruction of hits in the pixel and strip tracker The first step of the reconstruction process is referred to as local reconstruction. It consists of the clustering of zero-suppressed signals above specified thresholds in pixel and strip channels into – 4 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 hits, and then estimating the cluster positions and their uncertainties defined in a local orthogonal coordinate system (u,v) in the plane of each sensor. A pixel sensor consists of 100× 150 µm2 pixels with the u-axis oriented parallel to the shorter pixel edge. In the strip sensors, the u-axis is chosen perpendicular to the central strip in each sensor (which in the TEC is not parallel to the other strips in the same sensor). 3.1 Hit reconstruction in the pixel detector In the data acquisition system of the pixel detector [14], zero-suppression is performed in the readout chips of the sensors [15], with adjustable thresholds for each pixel. This pixel readout threshold is set to a single-pixel threshold corresponding to an equivalent charge of 3200 electrons. Offline, pixel clusters are formed from adjacent pixels, including both side-by-side and corner-by- corner adjacent cells. Each cluster must have a minimum charge equivalent to 4000 electrons. For comparison, a minimum ionizing particle deposits usually around 21000 electrons. Miscalibration of residual charge caused by pixel-to-pixel differences of the charge injection capacitors, which are used to calibrate the pixel gain, are extracted from laboratory measurements and included in the Monte Carlo (MC) simulation. Two algorithms are used to determine the position of pixel clusters. A fast algorithm (described in section 3.1.1) is used during track seeding and pattern recognition, and a more precise algorithm (section 3.1.2), based on cluster shapes, is used in the final track fit. 3.1.1 First-pass hit reconstruction The position of a pixel cluster along the transverse (u) and longitudinal (v) directions on the sensor is obtained as follows. The procedure is described only for the case of the u coordinate, but is identical for the v coordinate. The cluster is projected onto the u-axis by summing the charge collected in pixels with the same u-coordinate [16]. The result is referred to as a projected cluster. For projected clusters that are only one pixel large, the u-position is given by the centre of that pixel, corrected for the Lorentz drift of the collected charge in the CMS magnetic field. For larger projected clusters, the hit position uhit is determined using the relative charge in the two pixels at each end of the projected cluster: uhit = ugeom + Qu last−Qu first 2(Qu last +Qu first) |W u−W u inner|− Lu 2 , (3.1) where Qfirst and Qlast are the charges collected in the first and last pixel of the projected cluster, respectively; ugeom is the position of the geometrical centre of the projected cluster; and the param- eter Lu/2 = D tanΘu L/2 is the Lorentz shift along the u-axis, where Θu L is the Lorentz angle in this direction, and D is the sensor thickness. For the pixel barrel, the Lorentz shift is approximately 59 µm. The parameter W u inner is the geometrical width of the projected cluster, excluding its first and last pixels. It is zero if the width of the projected cluster is less than three pixels. The charge width W u is defined as the width expected for the deposited charge, as estimated from the angle of the track with respect to the sensor, and equals W u = D |tan(αu−π/2)+ tanΘ u L| , (3.2) – 5 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 where the angle αu is the impact angle of the track relative to the plane of the sensor, measured after projecting the track into the plane perpendicular to the v-axis. If no track is available, αu is calculated assuming that the particle producing the hit moved in a straight line from the centre of the CMS detector. The motivation for eq. (3.1) is that the charge deposited by the traversing particle is expected to only partially cover the two pixels at each end of the projected cluster. The quantity W u−W u inner, which is expected to have a value between zero and twice the pixel pitch, (a modified version of eq. (3.1) is used for any hits that do not meet this expectation), provides an estimate of the total extension of charge into these two outermost pixels, while the relative charge deposited in these two pixels provides a way to deduce how this total distance is shared between them. The distance that the charge extends into each of the two pixels can thereby be deduced. This gives the position of the two edges of the charge distribution, and the mean value of these edges, corrected for the Lorentz drift, equals the position of the cluster. 3.1.2 Template-based hit reconstruction The high level of radiation exposure of the pixel detector can affect significantly the collection of charge by the pixels during the detector’s useful life. This degrades particularly the performance of the standard hit reconstruction algorithm, sketched in the previous section, as this algorithm only uses the end pixels of projected clusters when determining hit positions. The reconstructed positions of hits can be biased by up to 50 µm in highly irradiated sensors, and the hit position resolution can be severely degraded. In the template-based reconstruction algorithm, the observed distribution of the cluster charge is compared to expected projected distributions, called templates, to estimate the positions of hits [17]. The templates are generated based on a large number of simulated particles traversing pixel modules, which are modelled using the detailed PIXELAV simulation [18–20]. Since the PIXELAV program can describe the behaviour of irradiated sensors, new templates can be generated over the life of the detector to maintain the performance of the hit reconstruction. To allow the template- based algorithm to be applied to tracks crossing the silicon at various angles, different sets of templates are generated for several ranges of the angle between the particle trajectory and the sensor. Working in each dimension independently, each pixel is subdivided into nine bins along the u (or v) axis, where each bin has a width of one-eighth of the size of a pixel and the end bins are centred on the pixel boundaries. The u (or v) coordinate of the point of interception of the particle trajectory and the pixel (defined as the position at which the track crosses the plane that lies halfway between the front and back faces of the sensor) is used to assign the interception point to one of the nine bins, j, indicating its location within the pixel. The charge profile of the cluster produced by each particle is projected into an array that is 13 pixels long along the u axis (or 23 pixels long along the v axis) and centred on the intercepted pixel. The resulting charge in each element i of this array is recorded. Only clusters with a charge below some specified angle-dependent maximum, determined from simulation, are used, as the charge distributions can be distorted by the significant ionization caused by energetic delta rays. This procedure provides an accurate determination of the projected cluster distributions, determined by effects of geometry, charge drift, trapping, and charge induction. In each dimension, the mean charge Si, j in bin (i, j), averaged over all the particles, is then determined. In addition, the RMS charge distributions for the two projected pixels at the two – 6 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 ends of the cluster are extracted, as are the charge in the projected pixel that has the highest charge within the cluster, and the cluster charge, both averaged over all tracks. The charge distribution of a reconstructed cluster, projected onto either the u or v axis, can be described in terms of a charge Pi in each pixel i of the cluster. This can be compared to the expected charge distributions Si, j stored in the templates, so as to determine the bin j where the particle is likely to have crossed the sensor, and hence the best estimate of the reconstructed hit position. This is accomplished by minimizing a χ2 function for several or all of the bins: χ 2( j) = ∑ i ( Pi−N jSi, j ∆Pi )2 , (3.3) with N j = ∑ i Pi (∆Pi) 2 / ∑ i Si, j (∆Pi) 2 . (3.4) In this expression, ∆Pi is the expected RMS of a charge Pi from the PIXELAV simulation and N j represents a normalization factor between the observed cluster charge and the template. While a sum over all the template bins yields an absolute minimum, different strategies can be used to optimize the performance of the algorithm as a function of allowed CPU time. As described in section 4.3, this χ2 is also used to reject outliers during track fitting, in particular pixel hits on a track that are incompatible with the distribution expected for the reconstructed track angle. A simplified estimate of the position of a hit is performed for cluster projections consisting of a single pixel by correcting the position of the hit for bias from Lorentz drift and possible radiation damage. The bias is defined by the average residual of all single-pixel clusters, as detailed below. For cluster projections consisting of multiple pixels, the estimate of the hit position is further refined. The charge template expected for a track crossing the pixel at an arbitrary position r, near the best j bin is approximated by the expression (1−r)Si, j−1 +rSi, j+1. Substituting this expression in place of Si, j in eq. (3.3), and minimizing χ2 with respect to r, yields an improved estimate of the hit position. Finally, the above-mentioned hit reconstruction algorithm is applied to the same PIXELAV MC samples originally used to generate the templates. Since the true hit position is known, any bias in the reconstructed hit position can be determined and accounted for when the algorithm is run on collision data. In addition, the RMS of the difference between the reconstructed and true hit position is used to define the uncertainty in the position of a reconstructed hit. 3.2 Hit reconstruction in the strip detector The data acquisition system of the strip detector [21] runs algorithms on off-detector electronics (namely, on the modules of the front-end driver (FED) [22]) to subtract pedestals (the baseline signal level when no particle is present) and common mode noise (event-by-event fluctuations in the baseline within each tracker readout chip), and to perform zero-suppression. Zero-suppression accepts a strip if its charge exceeds the expected channel noise by at least a factor of five, or if both the strip and one of its neighbours have a charge exceeding twice the channel noise. As a result, information for only a small fraction of the channels in any given event is retained for offline storage. – 7 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Offline, clusters are seeded by any channel passing zero-suppression that has a charge at least a factor of three greater than the corresponding channel noise [1]. Neighbouring strips are added to each seed, if their strip charge is more than twice the strip noise. A cluster is kept if its total charge is a factor five larger than the cluster noise, defined as σcluster = √ ∑i σ2 i , where σi is the noise for strip i, and the sum runs over all the strips in the cluster. The position of the hit corresponding to each cluster is determined from the charge-weighted average of its strip positions, corrected by approximately 10 µm (20 µm) in the TIB (TOB) to ac- count for the Lorentz drift. One additional correction is made to compensate for the fact that charge generated near the back-plane of the sensitive volume of the thicker silicon sensors is inefficiently collected. This inefficiency shifts the cluster barycentre along the direction perpendicular to the sensor plane by approximately 10 µm in the 500 µm thick silicon, while its effect is negligible in the 320 µm thick silicon. The inefficient charge collection from the sensor backplane is caused by the narrow time window during which the APV25 readout chip [23] integrates the collected charge, and whose purpose is to reduce background from out-of-time hits. The uncertainty in the hit position is usually parametrized as a function of the expected width of the cluster obtained from the track angle (i.e., the ‘charge width’ defined in section 3.1.1). However, in rare cases, when the observed width of a cluster exceeds the expected width by at least a factor of 3.5, and is incompatible with it, the uncertainty in the position is then set to the ‘binary resolution’, namely, the width of the cluster divided by √ 12. This broadening of the cluster is caused by capacitive coupling between the strips or energetic delta rays. 3.3 Hit efficiency The hit efficiency is the probability to find a cluster in a given silicon sensor that has been traversed by a charged particle. In the pixel detector, the efficiency is measured using isolated tracks originating from the pri- mary vertex. The pT is required to be >1GeV, and the tracks are required to be reconstructed with a minimum of 11 hits measured in the strip detector. Hits from the pixel layer under study are not removed when the tracks are reconstructed. To minimize any ensuing bias, all tracks are required to have hits in the other two pixel layers, ensuring thereby that they would be found even with- out using the studied layer. A restrictive selection is set on the impact parameter to reduce false tracks and tracks from secondary interactions. To avoid inactive regions and to allow for residual misalignment, track trajectories passing near the edges of the sensors or their readout chips are excluded. Specifically, they must not pass within 0.6mm (1.0–1.5mm) of a sensor edge in the pixel endcap (barrel) or within 0.6mm of the edge of a pixel readout chip. The efficiency is de- termined from the fraction of tracks to which either a hit is associated in the layer under study, or if it is found within 500 µm of the predicted position of the track. Given the high track density, only tracks that have no additional trajectories within 5mm are considered so as to reduce false track-to-cluster association. The average efficiency for reconstructing hits is >99%, as shown in figure 3(left), when excluding the 2.4% of the pixel modules known to be defective. The hit effi- ciency depends on the instantaneous luminosity and on the trigger rate, as shown in figure 3(right). The systematic uncertainty in these measurements is estimated to be 0.2%. Several sources of loss have been identified. First, the limited size of the internal buffer of the readout chips cause a dynamic inefficiency that increases with the instantaneous luminosity and with the trigger rate. – 8 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Layer 1 Layer 2 Layer 3 Disk -2 Disk -1 Disk +1 Disk +2 E ffi ci en cy 0.98 0.982 0.984 0.986 0.988 0.99 0.992 0.994 0.996 0.998 1 = 7 TeVsCMS )-1s-1Instantaneous luminosity (nb 0 0.5 1 1.5 2 2.5 3 3.5 E ffi ci en cy 0.98 0.982 0.984 0.986 0.988 0.99 0.992 0.994 0.996 0.998 1 Forward disks Layer 1 Layer 2 Layer 3 = 7 TeVsCMS Figure 3. The average hit efficiency for layers or disks in the pixel detector excluding defective modules (left), and the average hit efficiency as a function of instantaneous luminosity (right). The peak luminosity ranged from 1 to 4nb−1s−1 during the data taking. Single-event upsets temporarily cause loss of information at a negligible rate of approximately two readout chips per hour. Finally, readout errors signalled by the FED modules depend on the rate of beam induced background. The efficiency in the strip tracker is measured using tracks that have a minimum of eight hits in the pixel and strip detectors. Where two hits are found in one of the closely-spaced double layers, which consist of rφ and stereo modules, both hits are counted separately. The efficiency in any given layer is determined using only the subset of tracks that have at least one hit in subsequent layers, further away from the beam spot. This requirement ensures that the particle traverses the layer under study, but also means that the efficiency cannot be measured in the outermost layers of the TOB (layer 6) and the TEC (layer 9). To avoid inactive regions and to take account of any residual misalignment, tracks that cross a module within five standard deviations from the sensor’s edges, based on the uncertainty in the extrapolated track trajectory, are excluded from considera- tion. The efficiency is determined from the fraction of traversing tracks with a hit anywhere within the non-excluded region of a traversed module. In the strip tracker, 2.3% of the modules are ex- cluded because of short circuits of the high voltage, communication problems with the front-end electronics, or other faults. Once the defective modules are excluded from the measurement, the overall hit efficiency is 99.8%, as shown in figure 4. This number is compatible with the 0.2% fraction of defective channels observed during the construction of the strip tracker. All defective components of the tracker are taken into account, both in the MC simulation of the detector and in the reconstruction of tracks. 3.4 Hit resolution The hit resolution in the pixel and strip barrel sensors has been studied by measuring residuals, defined by the difference between the measured and the expected hit position as predicted by the – 9 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 TI B 1 TI B 2 TI B 3 TI B 4 TO B 1 TO B 2 TO B 3 TO B 4 TO B 5 TI D - 1 TI D - 2 TI D - 3 TI D + 1 TI D + 2 TI D + 3 TE C - 1 TE C - 2 TE C - 3 TE C - 4 TE C - 5 TE C - 6 TE C - 7 TE C - 8 TE C + 1 TE C + 2 TE C + 3 TE C + 4 TE C + 5 TE C + 6 TE C + 7 TE C + 80.9 0.92 0.94 0.96 0.98 1 TI B 1 TI B 2 TI B 3 TI B 4 TO B 1 TO B 2 TO B 3 TO B 4 TO B 5 TI D - 1 TI D - 2 TI D - 3 TI D + 1 TI D + 2 TI D + 3 TE C - 1 TE C - 2 TE C - 3 TE C - 4 TE C - 5 TE C - 6 TE C - 7 TE C - 8 TE C + 1 TE C + 2 TE C + 3 TE C + 4 TE C + 5 TE C + 6 TE C + 7 TE C + 8 Ef fic ie nc y 0.9 0.92 0.94 0.96 0.98 1 Good modules All modules = 7 TeVsCMS Figure 4. Average hit efficiency for layers or disks in the strip tracker. The black squares show the hit efficiency in all modules, and the red dots for modules included in the readout. fitted track. Each trajectory is refitted excluding the hit under study in order to minimize biases of the procedure. The resolution of the pixel detector is measured from the RMS width of the hit residual dis- tribution in the middle of the three barrel layers, using only tracks with pT > 12GeV, for which multiple scattering between the layers does not affect the measurement. The expected hit position in the middle layer, as determined from the track trajectory, has an uncertainty that is dominated by the resolution of the hits assigned to the track in the first and third barrel layers. Assuming that the three barrel layers all have the same hit resolution σhit and because they are approximately equally spaced in radius from the z-axis of CMS, then this uncertainty is given by σhit/ √ 2. Adding this in quadrature with the uncertainty σhit in the measured position of the hit in the middle layer, demonstrates that the RMS width of the residual distribution is given by σhit √ 3/2. The measured hit resolution σhit in the rφ coordinate, as derived using this formula, is 9.4 µm. The resolution in the longitudinal direction is shown in figure 5, and found to agree within 1 µm with MC simula- tion. The longitudinal resolution depends on the angle of the track relative to the sensor. For longer clusters, sharing of charge among pixels improves the resolution, with optimal resolution reached for interception angles of ±30◦. Because of multiple scattering, the uncertainty in track position in the strip detector is usually much larger than the inherent resolution; consequently, individual residuals of hits are not sensitive to the resolution. However, the difference in a track’s residuals for two closely spaced modules can be measured with much greater precision. Any offset in a track’s position caused by multiple scattering will be largely common to both modules. A technique based on tracks passing through overlapping modules from the same tracker layer is employed to compare the difference in residuals for the two measurements in the overlapping modules [24]. The difference in hit positions (∆xhit) can be compared to the difference in predicted positions (∆xpred) derived from the track trajectory, and their difference, fitted to a Gaussian function, provides a hit resolution convoluted with the uncertainty from the trajectory propagation. The bias from translational misalignment between modules affects only the mean of the Gaussian distribution, and not its RMS width. As the two – 10 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 )°Incident angle ( -60 -40 -20 0 20 40 60 m ) µ Ba rre l p ix el R M S z re so lu tio n ( 0 5 10 15 20 25 30 35 40 45 50 CMS > 12 GeV T p = 7 TeVs -1 L = 42pb Data Simulation Figure 5. Resolution in the longitudinal (z) coordinate of hits in the barrel section of the pixel detector, shown as a function of the incident angle of the track, which is defined as 90◦−θ , and equals the angle of the track relative to the normal to the plane of the sensor. Data are compared with MC simulation for tracks with pT > 12GeV. overlapping modules are expected to have the same resolution, the resolution of a single sensor is determined by dividing this RMS width by √ 2. Only tracks of high purity (defined in section 4.4) are used for the above-described study. To reduce the uncertainty from multiple Coulomb scattering, the track momenta are required to be >10GeV. The χ2 probability of the track fit is required to be >0.1%, and the tracks are required to be reconstructed using a minimum of six hits in the strip detector. Tracks in the overlapping barrel modules are analysed only when the residual rotational misalignment is less than 5 µm. Remaining uncertainties from multiple scattering and rotational misalignment for the overlapping modules are included as systematic uncertainties of the measurement. Sensor resolution depends strongly on the size of the cluster and on the pitch of the sensor. The resolutions for the strip detector are shown in table 2, where they are compared to the predictions from MC simulation. The resolution varies not only as a function of the cluster width, but also as a function of pseudorapidity, as the energy deposited by a charged particle in the silicon depends on the angle at which it crosses the sensor plane. The resolution is worse in simulation than in data, implying the need for additional tuning of the MC simulation. The results in the table are valid only for tracks with momenta >10GeV. At lower momenta, the simulations indicate that the resolution in hit position improves, but this is not important for tracking performance, as the resolution of the track parameters for low-momentum tracks is dominated by the multiple scattering and by not the hit resolution. – 11 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Table 2. A comparison of hit resolution in the barrel strip detector as measured in data with the correspond- ing prediction from simulation, for track momenta >10GeV. The resolution is given as function of both the barrel layer and the width of the cluster in strips. Since the resolution is observed to vary with φ and η , a range of resolution values is quoted in each case. Sensor Pitch Resolution [µm] vs. width of cluster [strips] layer (µm) width=1 =2 =3 =4 TIB 1–2 80 Data 11.7–19.1 10.9–17.9 10.1–18.1 MC 14.5–20.5 15.0–19.8 14.0–20.6 TIB 3–4 120 Data 20.9–29.5 21.8–28.8 20.8–29.2 MC 26.8–30.4 27.6–30.8 27.9–32.5 TOB 1–4 183 Data 23.4–40.0 32.3–42.3 16.9–28.5 MC 42.5–50.5 43.0–48.6 18.8–35.2 TOB 5–6 122 Data 18.4–26.6 11.8–19.4 MC 26.1–29.5 17.8–21.6 4 Track reconstruction Track reconstruction refers to the process of using the hits, obtained from the local reconstruction described in section 3, to obtain estimates for the momentum and position parameters of the charged particles responsible for the hits (tracks). As part of this process, a translation between the local coordinate system of the hits and the global coordinate system of the track is necessary. This translation takes into account discrepancies between the assumed and actual location and surface deformation of detector elements as found through the alignment process [25]. In addition, the uncertainty in the detector element location is added to the intrinsic uncertainty in the local hit position. Reconstructing the trajectories of charged particles is a computationally challenging task. An overview of the difficulties and solutions can be found in review articles [26–28]. The tracking software at CMS is commonly referred to as the Combinatorial Track Finder (CTF), which is an adaptation of the combinatorial Kalman filter [29–31], which in turn is an extension of the Kalman filter [32] to allow pattern recognition and track fitting to occur in the same framework. The collection of reconstructed tracks is produced by multiple passes (iterations) of the CTF track reconstruction sequence, in a process called iterative tracking. The basic idea of iterative tracking is that the initial iterations search for tracks that are easiest to find (e.g., of relatively large pT, and produced near the interaction region). After each iteration, hits associated with tracks are removed, thereby reducing the combinatorial complexity, and simplifying subsequent iterations in a search for more difficult classes of tracks (e.g., low-pT, or greatly displaced tracks). The presented results reflect the status of the software in use from May through August, 2011, which is applied in a series of six iterations of the track reconstruction algorithm. Later versions of the software retain the same basic structure but with different iterations and tuned values for the configurable parameters to adapt to the higher pileup conditions. Iteration 0, the source of most reconstructed tracks, is designed for prompt tracks (originating near the pp interaction point) with pT > 0.8GeV that have three pixel hits. Iteration 1 is used to recover prompt tracks that have only two pixel hits. Iteration 2 – 12 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 is configured to find low-pT prompt tracks. Iterations 3–5 are intended to find tracks that originate outside the beam spot (luminous region of the pp collisions) and to recover tracks not found in the previous iterations. At the beginning of each iteration, hits associated with high-purity tracks (defined in section 4.4) found in previous iterations are excluded from consideration (masked). Each iteration proceeds in four steps: • Seed generation provides initial track candidates found using only a few (2 or 3) hits. A seed defines the initial estimate of the trajectory parameters and their uncertainties. • Track finding is based on a Kalman filter. It extrapolates the seed trajectories along the expected flight path of a charged particle, searching for additional hits that can be assigned to the track candidate. • The track-fitting module is used to provide the best possible estimate of the parameters of each trajectory by means of a Kalman filter and smoother. • Track selection sets quality flags, and discards tracks that fail certain specified criteria. The main differences between the six iterations lie in the configuration of the seed generation and the final track selection. 4.1 Seed generation The seeds define the starting trajectory parameters and associated uncertainties of potential tracks. In the quasi-uniform magnetic field of the tracker, charged particles follow helical paths and there- fore five parameters are needed to define a trajectory. Extraction of these five parameters requires either three 3-D hits, or two 3-D hits and a constraint on the origin of the trajectory based on the assumption that the particle originated near the beam spot. (A ‘3-D hit’ is defined to be any hit that provides a 3-D position measurement). To limit the number of hit combinations, seeds are required to satisfy certain weak restrictions, for example, on their minimum pT and their consistency with originating from the pp interaction region. In principle, it is possible to construct seeds in the outermost regions of the tracker, where the track density is smallest, and then construct track candidates by searching inwards from the seeds for additional hits at smaller distances from the beam-line. However, there are several reasons why an alternative approach, of constructing seeds in the inner part of the tracker and building the track candidates outwards, has been chosen instead. First, although the track density is much higher in the inner region of the tracker, the high granularity of the pixel detector ensures that the channel occupancy (fraction of channels that are hit) of the inner pixel layer is much lower than that of the outer strip layer. This can be seen in figure 6, which shows the mean channel occupancy in strip and pixel sensors in data collected with a ‘zero-bias’ trigger, (which took events from randomly selected non-empty LHC bunch crossings). This data had a mean of about nine pp interactions per bunch crossing. The channel occupancy is 0.002–0.02% in the pixel detector and 0.1–0.8% in the strip detector. Second, the pixel layers pro- duce 3-D spatial measurements, which provide more constraints and better estimates of trajectory parameters. Finally, generating seeds in the inner tracker leads to a higher efficiency for recon- structing tracks. Although most high-pT muons traverse the entire tracker, a significant fraction – 13 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 of the produced pions interact inelastically in the tracker (figure 7). In addition, many electrons lose a significant fraction of their energy to bremsstrahlung radiation in the tracker. Therefore, to ensure high efficiency, track finding begins with trajectory seeds created in the inner region of the tracker. This also facilitates reconstruction of low-momentum tracks that are deflected by the strong magnetic field before reaching the outer part of the tracker. r ( cm ) 0 20 40 60 80 100 120 140 z (cm) -300 -200 -100 0 100 200 300 3.0 2.8 2.6 2.4 2.2 2.0 1.8 1.6 -3.0 -2.8 -2.6 -2.4 -2.2 -2.0 -1.8 -1.6 -1.4 -1.2 -1.0 -0.8 -0.6 -0.4 -0.2 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 η CMS = 7 TeVs C ha nn el o cc up an cy -410 -310 -210 Figure 6. Channel occupancy (labelled by the scale on the right) for CMS silicon detectors in events taken with unbiased triggers with an average of nine pp interactions per beam crossing, displayed as a function of η , r, and z. Layers 0 2 4 6 8 10 12 14 S ur vi va l p ro ba bi lit y 0.7 0.75 0.8 0.85 0.9 0.95 1 = 1 GeV T pπ = 10 GeV T pπ = 100 GeV T pπ CMS simulation Figure 7. Fraction of pions produced with |η |< 2.5 that do not undergo a nuclear interaction in the tracker volume, as a function of the number of traversed layers. – 14 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Seed generation requires information on the position of the centre of the reconstructed beam spot, obtained prior to track finding using the method described in section 6.3. It also requires the locations of primary vertices in the event, including those from pileup events. This information is obtained by running a very fast track and vertex reconstruction algorithm, described in section 6.2, that uses only hits from the pixel detector. The tracks and primary vertices found with this algorithm are known as pixel tracks and pixel vertices, respectively. The seed generation algorithm is controlled by two main sets of parameters: seeding layers and tracking regions. The seeding layers are pairs or triplets of detector layers in which hits are searched for. The tracking regions specify the limits on the acceptable track parameters, includ- ing the minimum pT, and the maximum transverse and longitudinal distances of closest approach to the assumed production point of the particle, taken to be located either at the centre of the re- constructed beam spot or at a pixel vertex. If the seeding layers correspond to pairs of detector layers, then seeds are constructed using one hit in each layer. A hit pair is accepted as a seed if the corresponding track parameters are consistent with the requirements of the tracking region. If the seeding layers correspond to triplets of detector layers, then, after pairs of hits are found in the two inner layers of each triplet, a search is performed in the outer detector layer for another hit. If the track parameters derived from the three hits are compatible with the tracking region requirements, the seed is accepted. It is also possible to check if the hits associated with the seed have the expected charge distribution from the track parameters: a particle that enters the detector at a grazing angle will have a larger cluster size than a particle that enters the detector at a normal angle. Requiring the reconstructed charge distribution to match the expected charge distribution can remove many fake seeds. In simulated tt events at √ s = 7TeV, more than 85% of the charged particles produced within the geometrical acceptance of the tracker (|η |< 2.5) cross three pixel layers and can therefore be reconstructed starting from trajectory seeds obtained from triplets of pixel hits. Nevertheless, other trajectory seeds are also needed, partially to compensate for inefficiencies in the pixel detector (from gaps in coverage, non-functioning modules, and saturation of the readout), and partially to reconstruct particles not produced directly at the pp collision point (decay products of strange hadrons, electrons from photon conversions, and particles from nuclear interactions). To improve the speed and quality of the seeding algorithm, only 3-D space points are used, either from a pixel hit or a matched strip hit. Matched strip hits are obtained from the closely-spaced double strip layers, which are composed of two sensors mounted back-to-back, one providing an rφ view and one providing a stereo view (rotated by 100mrad relative to the other, in the plane of the sensor). The ‘rφ ’ and ‘stereo hits’ in such a layer are combined into a matched hit, which provides a 3- D position measurement. Table 3 shows the seeding requirements for each of the six tracking iterations. The seeding layers listed in this table are defined as follows: • Pixel triplets are seeds produced from three pixel hits. These seeds are used to find most of the tracks corresponding to promptly produced charged particles. The three precise 3-D space points provide seeds of high quality and with well-measured starting trajectories. A mild constraint on the compatibility of these trajectories with the centre of the beam spot is employed, to remove seeds inconsistent with promptly produced particles. Also, the charge distribution of each pixel hit is required to be compatible with that expected for the crossing angle of the seed trajectory and the corresponding sensor. – 15 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 • Mixed pairs with vertex constraint are seeds that use two hits and a third space-point given by the location of a pixel vertex. If more than one pixel vertex is found in an event, which often happens because of pileup, all are considered in turn. The pixel vertices are required to pass quality criteria; the most important is that a vertex must contain at least four pixel tracks. The two hits used for these seeds can be provided by the pixel tracker, or by the two inner rings of the three inner TEC layers, where the TEC layers are used to increase coverage in the very forward regions. • Mixed triplets are seeds produced from three hits formed from a combination of pixel hits and matched strip hits. Each triplet contains between one and three pixel hits and < 3 strip hits. This iteration is implemented for finding displaced tracks and prompt tracks that do not have three hits in the pixel detector. The beam spot related constraint is less restrictive, providing higher efficiency for finding tracks arising from decays of hadrons containing s, c, or b quarks, photon conversions, and nuclear interactions. • Strip pairs are seeds constructed using two matched hits from the strip detector. Iteration 4 uses the two inner TIB layers and rings 1–2 of the TID/TEC, which are the same strip layers used in Iteration 3. In Iteration 5, hits from the two inner TOB layers and ring 5 of the TEC are used for seeds. These two iterations have even weaker constraints on the compatibility of the seed trajectory with the centre of the beam spot than has Iteration 3, and they do not require pixel hits. These iterations are therefore useful for finding tracks produced outside of the pixel detector volume or tracks that do not leave hits in the pixel detector. Table 3. The configuration of the track seeding for each of the six iterative tracking steps. Shown are the layers used to seed the tracks, as well as the requirements on the minimum pT and the maximum transverse (d0) and longitudinal (z0) impact parameters relative to the centre of the beam spot. The Gaussian standard deviation corresponding to the length of the beam spot along the z-direction is σ . The asterisk symbol indicates that the longitudinal impact parameter is calculated relative to a pixel vertex instead of to the centre of the beam spot. Iteration Seeding layers pT (GeV) d0 (cm) |z0| 0 Pixel triplets >0.8 <0.2 <3σ 1 Mixed pairs with vertex >0.6 <0.2 <0.2cm∗ 2 Pixel triplets >0.075 <0.2 <3.3σ 3 Mixed triplets >0.35 <1.2 <10cm 4 TIB 1+2 & TID/TEC ring 1+2 >0.5 <2.0 <10cm 5 TOB 1+2 & TEC ring 5 >0.6 <5.0 <30cm 4.2 Track finding The track-finding module of the CTF algorithm is based on the Kalman filter method [29–32]. The filter begins with a coarse estimate of the track parameters provided by the trajectory seed, and then builds track candidates by adding hits from successive detector layers, updating the parameters at each layer. The information needed at each layer includes the location and uncertainty of the – 16 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 detected hits, as well as the amount of material crossed, which is used to estimate the effects of multiple Coulomb scattering and energy loss. The track finding is implemented in the four steps listed below. The first step (navigation) uses the parameters of the track candidate, evaluated at the current layer, to determine which adjacent layers of the detector can be intersected through an extrapola- tion of the trajectory, taking into account the current uncertainty in that trajectory. The navigation service can be configured to propagate along or opposite to the momentum vector, and uses a fast analytical propagator to find the intercepted layers. The analytical propagator assumes a uniform magnetic field, and does not include effects of multiple Coulomb scattering or energy loss. With these assumptions, the track trajectory is a perfect helix, and the propagator can therefore extrapo- late the trajectory from one layer to the next using rapid analytical calculations. In the barrel, the cylindrical geometry makes navigation particularly easy, since the extrapolated trajectory can only intercept the layer adjacent to the current one. In the endcap and barrel-endcap transition regions, navigation is more complex, as the crossing from one layer does not uniquely define the next one. The second step involves a search for compatible silicon modules in the layers returned by the navigation step. A module is considered compatible with the trajectory if the position at which the trajectory intercepts the module surface is no more than some given number (currently three) of standard deviations outside the module boundary. The propagation of the trajectory parameters, and of the corresponding uncertainties, to the sensor surface involves mathematical operations and routines that are generally quite time-consuming [33]. Hence, the code responsible for searching for compatible modules has been optimized to limit the number of sensors that are considered, while preserving an efficiency of >99% in finding the relevant sensors. A complication is that the design of the CMS tracker is such that sensors often slightly overlap their neighbours, meaning that a particle can cross two sensors in the same layer. This possibility is accommodated by dividing the compatible modules in each layer into groups of mutually exclusive modules, defined such that if a particle passes through one member of a group, it is not physically possible for it to pass through a second member of the same group. Any two modules that have some overlap are not mutually exclusive, and are therefore assigned to different groups. This feature is used in the third and fourth steps of the track finding, described next. The third step forms groups of hits, each of which is defined by the collection of all the hits from one of the module groups. A configurable parameter provides the possibility of adding a ghost hit to represent the possibility that the particle failed to produce a hit in the module group, for example, as a result of module inefficiency. The hit positions and uncertainties are refined using the trajectory direction on the sensor surface, to calculate more accurately the Lorentz drift of the ionization-charge carriers inside the silicon bulk. A χ2 test is used to check which of the hits are compatible with the extrapolated trajectory. The current (configurable) requirement is χ2 < 30 for one degree of freedom (dof). The χ2 calculation takes into account both the hit and trajectory uncertainties. In the endcap regions and the barrel-endcap transition regions, the extrapolation distances and the amount of material traversed are generally greater, with correspondingly larger uncertainties in the trajectory, and the probability of finding spurious hits compatible with the track tends therefore to be greater. The fourth and last step is to update the trajectories. From each of the original track candidates, new track candidates are formed by adding exactly one of the compatible hits from each module – 17 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 grouping (where this hit may be a ghost hit). As the modules in a given group are mutually exclu- sive, it would not be expected that a track would have more than one hit contributing from each group. The trajectory parameters for each new candidate are then updated at the location of the module surface, by combining the information from the added hits with the extrapolated trajectory of the original track candidate. For the above second, third, and fourth steps of the procedure, a more accurate material prop- agator is used when extrapolating the track trajectory, which includes the effect of the material in the tracker. This differs from the method of the simple analytical propagator, in that it increases the uncertainty in the trajectory parameters according to the predicted RMS scattering angle in the tracker material. It also adjusts the momentum of the trajectory by the predicted mean energy loss of the Bethe-Bloch equation. Since all detector material is assumed to be concentrated in the de- tector layers, the track propagates along a simple helix between the layers, allowing the material propagator to extrapolate the track analytically. The ghost hits include the effect of material without providing position information to the propagator. All resulting track candidates found at each layer are then propagated to the next compatible layers, and the procedure is repeated until a termination condition is satisfied. However, to avoid a rapid increase in the number of candidates, only a limited number (default is 5) of the candidates are retained at each step, with the best candidates chosen based on the normalized χ2 and a bonus given for each valid hit, and a penalty for each ghost hit. The standard termination conditions are if a track reaches the end of the tracker or contains too many missing hits (limit is Nlost), or if its pT drops below a user specified value. The number of missing hits on a track is equal to the number of ghost hits, except that hits not found due to attributable known detector conditions, for example, if a detector module is turned off, are not counted. The building of a trajectory can also be terminated when the uncertainty in its parameters falls below a given threshold or the number of hits is above a threshold; these kinds of termination conditions tend to be used only in the high-level trigger (HLT), where the required accuracy on track parameters is often reached after 5 or 6 hits are added to the track candidate, and the continuation of the track building would correspond to a waste of CPU time. When the search for hits in the outward direction reveals a minimum number of valid hits (Nrebuild), an inwards search is initiated for additional hits. Otherwise, the track candidate remains as formed. The inwards search starts by taking all of the hits assigned to the track, excluding those belonging to the track seed, and using them to fit the track trajectory. In case this exclusion of the seeding hits leaves fewer than Nrebuild hits to fit, some of the seeding hits are also used (taking first the outer contributions) so as to obtain at least Nrebuild hits. Then, as in the outward track building, the trajectory is propagated inwards through the seeding layers and then further, until the inner edge of the tracker is reached or too many ghost hits are found. There are three reasons for this inward search. First, additional hits can be found in the seeding layers (for example, from overlapping sensors). Second, hits can be found in layers closer to the interaction region than the seeding layers. Third, when strip layers are used in seeding, matched hits are used to increase computational speed and reduce the combinations of hits available for seeding. However, some rφ or stereo hits are not part of any matched hit. While these hits are not available during seeding, they can be found during the inward track building process. The effect of the inward search is an increase in the mean number of hits per track by 0.15, (i.e., a 1% increase relative to a total of ≈14 – 18 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 hits), which translates to a better signal-to-background ratio, impact parameter resolution, and pT resolution, with maximum improvements of 2%, 1%, and 0.5%, respectively. The track of a single charged particle can be reconstructed more than once, either starting from different seeds, or when a given seed develops into more than one track candidate. To remedy this feature, a trajectory cleaner is applied after all the track candidates in a given iteration have been found. The trajectory cleaner calculates the fraction of shared hits between two track candidates: fshared = Nhits shared min(Nhits 1 ,Nhits 2 ) where Nhits 1 and Nhits 2 are, respectively, the number of hits used in forming the first (second) track candidate. If this fraction exceeds the (configurable) value of 19% (determined empirically), the trajectory cleaner removes the track with the fewest hits; if both tracks have the same number of hits, the track with the largest χ2 value is discarded. The procedure is repeated iteratively on all pairs of track candidates. The same algorithm is applied when tracks from the six iterations are combined into a single track collection. The requirements applied during the track-finding stage are shown in table 4 for each tracking iteration. In addition to the requirement on Nlost, the completed track candidates must also pass requirements on the minimum number of hits (Nhits) and minimum track pT. The minimum pT requirements have very little effect, as they are weaker than those applied to the seeds, given in table 3. Since the later iterations do not have strong requirements that the tracks originate close to the centre of the beam spot, the probability of random hits forming tracks increases, which leads to more fake tracks and greater usage of CPU time. To compensate for this tendency, the criteria for the minimum number of hits, and maximum number of lost hits, are tightened in the later iterations. Table 4. Selection requirements applied to track candidates during the six iterative steps of track finding, the minimum pT, the minimum number of hits Nhits, and the maximum number of missing hits Nlost. Also shown is the minimum number of hits needed to be found in the outward track building step to trigger the inward track building step Nrebuild, although candidates failing this requirement are not rejected. Iteration pT (GeV) Nhits Nlost Nrebuild 0 0.3 3 1 5 1 0.3 3 1 5 2 0.1 3 1 5 3 0.1 4 0 5 4 0.1 7 0 5 5 0.1 7 0 4 4.3 Track fitting For each trajectory, the track-finding stage yields a collection of hits and an estimate of the track parameters. However, the full information about the trajectory is only available at the final hit of the trajectory (when all hits are known). Furthermore, the estimate can be biased by constraints, such as a beam spot constraint applied to the trajectory during the seeding stage. The trajectory is therefore refitted using a Kalman filter and smoother. The Kalman filter is initialized at the location of the innermost hit, with the trajectory estimate obtained by performing a Kalman filter fit to the innermost hits (typically four) on the track. The corresponding covariance matrix is scaled up by a large factor (10 for the last iteration and 100 for – 19 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 the other iterations) in order to limit the bias. The fit then proceeds in an iterative way through the full list of hits, from the inside outwards, updating the track trajectory estimate sequentially with each hit. For each valid hit, the estimated hit position uncertainty is reevaluated using the current values of the track parameters. In the case of pixel hits, the estimated hit position is also reevaluated. This first filter is followed by the smoothing stage, whereby a second filter is initialized with the result of the first one (except for the covariance matrix, which is scaled by a large factor), and is run backward towards the beam-line. The track parameters at the surface associated with any of its hits, can then be obtained from the weighted average of the track parameters of these two filters, evaluated on this same surface, as one filter uses information from all the hits found before, and the other uses information from all the hits found after the surface. This provides the optimal track parameters at any point, including the innermost and outermost hit on the track, which are used to extrapolate the trajectory to the interaction region and to the calorimeter and muon detectors, respectively. A configurable parameter determines whether the silicon strip matched hits are used as is or split into their component rφ and stereo hits. For the standard offline reconstruction, the split hits are used to improve the track resolution, while for the HLT, the matched hits are used to improve speed. To obtain the best precision, this filtering and smoothing procedure uses a Runge-Kutta prop- agator to extrapolate the trajectory from one hit to the next. This not only takes into account the effect of material, but it also accommodates an inhomogeneous magnetic field. The latter means that the particle may not move along a perfect helix, and its equations of motion in the magnetic field must therefore be solved numerically. To do so, the Runge-Kutta propagator divides the dis- tance to be extrapolated into many small steps. It extrapolates the track trajectory over each of these steps in turn, using a well-known mathematical technique for solving first-order differential equations, called the fourth-order Runge-Kutta method, so called because it is accurate to fourth order in the step size. The optimal step size is chosen automatically, according to how non-linear the problem is. This automatic determination of step size employs the method [34], which is based on how well the fourth and fifth order Runge-Kutta predictions agree with each other. Use of the Runge-Kutta propagator is most important in the region |η |> 1, where the magnetic field inhomo- geneities are greatest. For example, in this region, tracks fitted using the simple material propagator are biased by up to 1% for particles with pT = 10GeV. This bias is almost completely eliminated when using the Runge-Kutta propagator. To assure an accurate extrapolation of the track trajectory, the Runge-Kutta propagator uses a detailed map of the magnetic field, which was measured before LHC collisions to a precision of < 0.01%. Estimates of the track trajectory at any other points, such as the point of closest approach to the beam-line, can be obtained by extrapolating the trajectory evaluated at the nearest hit to that very point. This extrapolation also uses the Runge-Kutta propagator. After filtering and smoothing, a search is made for spurious hits (outliers), incorrectly asso- ciated to the track. Such hits can be related to an otherwise well-defined track, e.g., from δ -rays, or unrelated, such as hits from nearby tracks or electronic noise. Two methods are used to find outliers. One uses the measured residual between a hit and the track to reject hits whose χ2 com- patibility with the track exceeds a configurable threshold (20 for Iterations 0–4 and 30 for Iteration 5). While a χ2 requirement of 30 on each hit is already applied during track finding, the outlier rejection criterion provides a more powerful restriction as it uses information from the full fit [32]. – 20 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 The other method calculates a probability that a pixel hit is consistent with the track, taking into account the charge distribution of the pixel hit, which generally comprises several pixel channels. This probability corresponds to the χ2 defined in eq. (3.3). After removing the outlier, the track is again filtered and smoothed and another check for outliers is made. This continues until no more outliers are found. In cases where removing an outlier results in two consecutive ghost hits, the track is terminated and the remaining outer hits discarded (although not used, a configurable pa- rameter is available to allow the track fitting to continue). If a track is found to have less than three hits after outlier rejection or for the track fitting to fail, the track is discarded (although not used, a configurable parameter is available to return the original track). The default value of 20 for the χ2 requirement is chosen to reject a significant fraction of outliers, while removing few genuine hits. With this value, approximately 20% of the spurious outliers are removed from tracks reconstructed in high-density dijet events, whereas <0.2% of the good hits are removed. 4.4 Track selection In a typical LHC event containing jets, the track-finding procedure described above yields a signif- icant fraction of fake tracks, where a fake track is defined as a reconstructed track not associated with a charged particle, as defined in section 5. The fake rate (fraction of reconstructed tracks that are fake) can be reduced substantially through quality requirements. Tracks are selected on the basis of the number of layers that have hits, whether their fit yielded a good χ2/dof, and how compatible they are with originating from a primary interaction vertex. If several primary vertices are present in the event, as often happens due to pileup, all are considered. To optimize the perfor- mance, several requirements are imposed as a function of the track η and pT, and on the number of layers (Nlayers) with an assigned hit (where a layer with both rφ and stereo strip modules is counted as a single layer). The selection criteria are as follows. • A requirement on the minimum number of layers in which the track has at least one asso- ciated hit. This differs from selections based on the number of hits on the track, because more than one hit in a given layer can be assigned to a track, as in the case of layers with overlapping sensors or double-sided layers in which two sensors are mounted back-to-back. • A requirement on the minimum number of layers in which the track has an associated 3-D hit (i.e., in the pixel tracker or matched hits in the strip tracker). • A requirement on the maximum number of layers intercepted by the track containing no as- signed hits, not counting those layers inside its innermost hit or outside its outermost hit, nor those layers where no hit was expected because the module was known to be malfunctioning. • χ2/dof < α0Nlayers. • |dBS 0 |/δd0 < ( α3Nlayers )β . • |zPV 0 |/δ z0 < ( α4Nlayers )β . • |dBS 0 |/σd0(pT) < ( α1Nlayers )β . • |zPV 0 |/σz0(pT,η) < ( α2Nlayers )β . – 21 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 The parameters αi and β are configurable constants. The track’s impact parameters are dBS 0 and zPV 0 , where dBS 0 is the distance from the centre of the beam spot in the plane transverse to the beam-line and zPV 0 is the distance along the beam-line from the closest pixel vertex. These pixel vertices, described in section 6.2, are required to have at least three pixel tracks and if no pixel vertices meet this requirement, then zPV 0 is required to be within 3σ of the z-position of the centre of the beam spot, where σ is the Gaussian standard deviation corresponding to the length of the beam spot in the z-direction. The above selection criteria include requirements on the transverse |dBS 0 |/δd0 and longitudinal |zPV 0 |/δ z0 impact parameter significances of the track, where the impact parameter uncertainties, δd0 and δ z0, are calculated from the covariance matrix of the fitted track trajectory. A second pair of requirements is also imposed on these significances, but calculated differently, with the uncertainties in the impact parameters being parametrized in terms of pT and polar angle of the track: σ(d0) = σ(z0 sinθ) = a⊕ b pT , where ⊕ represents the sum in quadrature and a and b are parameters. Their nominal values are a = 30 µm and b = 10 µmGeV, but b increases to 100 µmGeV for the loose and tight selection criteria used (and defined below) in Iterations 0 and 1. The fraction of fake tracks decreases roughly exponentially as a function of the number of layers in which the track has associated hits: dNfake/dNlayers ∼ exp(−ωNlayers), with ω in the range 0.9–1.3 depending on the pT of the track. As a consequence, weaker selection criteria can be applied for tracks having many hit layers, which is the reason for the chosen selection criteria. For tracks with hits in at least 10 layers, the selection requirements on χ2 and impact parameters are found to reject no tracks. However, the criteria become far more stringent for tracks with relatively few hit layers. The above quality criteria were initially optimized as a function of track pT and Nlayers, so as to maximize the quality Q(ρ) = s/ √ s+ρb, where s is the number of selected genuine (non-fake) tracks, b is the number of selected fake tracks and ρ ' 10 inflates the importance of the fake tracks to achieve low fake rates (below 1% for PYTHIA QCD events with p̂T of the two outgoing partons in the range 170–230GeV). As data taking conditions have evolved, the parameters have been adjusted to maintain high efficiency and low fake rate. The track selection criteria for each iteration are given in table 5. The loose criteria denote the minimum requirements for a track to be kept in the general track collection. The tight and high-purity criteria provide progressively more stringent requirements, which reduce the efficiency and fake rate. In general, high-purity tracks are used for scientific analysis, although in cases where efficiency is essential and purity is not a major concern, the loose tracks can be used. The criteria for the initial tracking iterations emphasise compatibility with originating from a primary vertex as a means of assuring quality, while the criteria used for the later iterations rely on other measures of track quality such as fit χ2 and the number of hits, ensuring thereby that they are still useful for selecting displaced tracks. This matches the seeding and track-finding requirements shown in tables 3–4, and is aligned with the goals for the six iterations. After the track selection is complete, the tracks found by each of the six iterations are merged into a single collection. – 22 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Table 5. Parameter values used in selecting tracks reconstructed by each of the six iterative tracking steps. The first table shows the three requirements on the number of layers that contain hits assigned to tracks and the parameter α0 that controls selection criteria based on χ2/dof. The second table shows the parameters αi and β that define compatibility of impact parameters with the interaction point. Each parameter has three entries, corresponding to the loose (L), tight (T), and high-purity (H) selection requirements. Iterations 2 and 3 use two paths that emphasise track quality (Trk) or primary-vertex compatibility (Vtx). A track produced by these iterations is retained if it passes either of these criteria. Iteration Min layers Min 3-D layers Max lost layers α0 L T H L T H L T H L T H 0 & 1 0 3 4 0 3 4 ∞ 2 2 2.0 0.9 0.9 2 Trk 4 5 5 0 3 3 ∞ 1 1 0.9 0.7 0.5 2 Vtx 3 3 3 0 3 3 ∞ 1 1 2.0 0.9 0.9 3 Trk 4 5 5 2 3 4 1 1 1 0.9 0.7 0.5 3 Vtx 3 3 3 2 3 3 1 1 1 2.0 0.9 0.9 4 5 5 6 3 3 3 1 0 0 0.6 0.4 0.3 5 6 6 6 2 2 2 1 0 0 0.6 0.35 0.25 Iteration β α1 α2 α3 α4 L T H L T H L T H L T H 0 & 1 4 0.55 0.30 0.30 0.65 0.35 0.35 0.55 0.40 0.40 0.45 0.40 0.40 2 Trk 4 1.50 1.00 0.90 1.50 1.00 0.90 1.50 1.00 0.90 1.50 1.00 0.90 2 Vtx 3 1.20 0.95 0.85 1.20 0.90 0.80 1.30 1.00 0.90 1.30 1.00 0.90 3 Trk 4 1.80 1.10 1.00 1.80 1.10 1.00 1.80 1.10 1.00 1.80 1.10 1.00 3 Vtx 3 1.20 1.00 0.90 1.20 1.00 0.90 1.30 1.10 1.00 1.30 1.10 1.00 4 3 1.50 1.20 1.00 1.50 1.20 1.00 1.50 1.20 1.00 1.50 1.20 1.00 5 3 1.80 1.30 1.20 1.50 1.20 1.10 1.80 1.30 1.20 1.50 1.20 1.10 4.5 Specialized tracking The track reconstruction described above produces the main track collection used by the CMS collaboration. However, variants of this software are also used for more specialized purposes, as described in this section. 4.5.1 Electron track reconstruction Electrons, being charged particles, can be reconstructed through the standard track reconstruction. However, as electrons lose energy primarily through bremsstrahlung, rather than ionization, large energy losses are common. For example, about 35% of electrons radiate more than 70% of their ini- tial energy before reaching the electromagnetic calorimeter (ECAL) that surrounds the tracker. The energy loss distribution is highly non-Gaussian, and therefore the standard Kalman filter, which is optimal when all variables have Gaussian uncertainties, is not appropriate. As a result, the effi- ciency and resolution of the standard tracking are not particularly good for electrons and therefore electron candidates are reconstructed using a combination of two techniques that make use of in- formation, not only from the tracker, but also from the ECAL. As this is a subject beyond the scope of this paper, only a brief description of these methods is given. – 23 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 The first method [35] starts by searching for clusters of energy in the ECAL. The curvature of electrons in the strong CMS magnetic field means that bremsstrahlung photons emitted by the electrons will, in general, strike the ECAL at η values similar to that of the electron, but at different azimuthal coordinates (φ). To recover this radiated energy, ECAL superclusters are formed, by merging clusters of similar η over some range of φ . The knowledge of the energy and position of each supercluster, and the assumption that the electron originated near the centre of the beam spot, constrains the trajectory of the electron through the tracker (aside from a two-fold ambiguity introduced by its unknown charge). Tracker seeds compatible with this trajectory are sought in the pixel tracker (and also in the TEC to improve efficiency in the forward region). The second method [36] takes the standard track collection (excluding tracks found by Itera- tion 5, as described in table 3) and attempts to identify a subset of these tracks that are compatible with being electrons. Electrons that suffer only little bremsstrahlung loss can be identified by searching for tracks extrapolated to the ECAL that pass close to an ECAL cluster. Electrons that suffer large bremsstrahlung loss can be identified by the fact that the fitted track will often have poor χ2 or few associated hits. The track seeds originally used to generate these electron-like tracks are retained. The seed collections obtained by using these two methods are merged, and used to initiate electron track finding. This procedure is similar to that used in standard tracking, except that the χ2 threshold, used by the Kalman filter to decide whether a hit is compatible with a trajectory, is weakened from 30 to 2000. This is to accommodate tracks that deviate from their expected trajectory because of bremsstrahlung. In addition, the penalties assigned to track candidates for passing through a tracker layer without being assigned a hit are adjusted. This is necessary because bremsstrahlung photons can convert into e+e− pairs with the track-finding algorithm incorrectly forming a track by combining hits from the primary electron with one of the conversion electrons. To obtain the best parameter estimates, the final track fit is performed using a modified version of the Kalman filter, called the Gaussian Sum Filter (GSF) [37]. In essence, the fractional energy loss of an electron, as it traverses material of a given thickness, is expected to have a distribution described by the Bethe-Heitler formula. This distribution is non-Gaussian, making it unsuitable for use in a conventional Kalman filter algorithm. The GSF technique solves this by approximating the Bethe-Heitler energy-loss distribution as the sum of several Gaussian functions, whose means, widths, and relative amplitudes are chosen so as to optimize this approximation. The parame- ters of these Gaussian energy-loss functions are determined only once. Each track trajectory is also represented by a mixture of several ‘trajectory components’, where each trajectory component has helix parameters with Gaussian uncertainties, and a ‘weight’ corresponding to the probability that it correctly describes the true path of the particle. Initially, a track trajectory is described by only a single such trajectory component, derived from the track seed. When propagating a tra- jectory component through a layer of material in the tracker, the estimated mean energy of the trajectory component is reduced and its uncertainty increased, according to the mean and width of each Gaussian component of the energy-loss distribution applied independently, in turn, to the original trajectory component. Thus after passing through the a layer of material, each original trajectory component gives rise to several new trajectory components, each one obtained using one of the Gaussian energy-loss functions. The weight of each new trajectory component is given by the product of the weight of the original trajectory component and the weight of the correspond- – 24 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 ing Gaussian component of the energy-loss distribution. To avoid an exponential explosion in the number of trajectory components being followed, as the track candidate is propagated through suc- cessive tracker layers, the less probable trajectory components are dropped or merged (by grouping together similar trajectory components), so as to limit their number to 12. Each trajectory compo- nent will also be updated by the Kalman filter if an additional hit is assigned to it when passing through a layer. When this happens, the weight of the trajectory component is further adjusted according to its compatibility with the hit. The GSF fit provides estimates of the track parameters, whose uncertainties are described not by a single Gaussian distribution, but instead by the sum of several Gaussian distributions, each corresponding to the uncertainty on one of the trajectory components that make up the track. For each parameter, the mode of this distribution is used as it is found to provide the best estimates of the parameters. The performance of the GSF electron tracking has been studied both with simulations [37] and with data [38], with good agreement observed between the two. 4.5.2 Track reconstruction in the high-level trigger The CMS high-level trigger (HLT) [39] uses a processor farm running C++ software to achieve large reductions in data rate. The HLT filters events selected at rates of up to 100 kHz using the Level-1 (hardware) trigger. Whereas Level-1 uses information only from the CMS calorimeters and muon detectors, the HLT is also able to capture information from the tracker, thereby adding the powerful tool of track reconstruction to the HLT. Some examples of how this improves the HLT performance are listed below. • Requiring muon candidates that are reconstructed in the muon detectors to be confirmed through the presence of a corresponding track in the tracker greatly reduces the false recon- struction rate and substantially improves momentum resolution. • Energy clusters found in the electromagnetic calorimeters can be identified as electrons or photons through the presence of a track of appropriate momentum pointing to the cluster. • The background rejection rate for lepton triggers can be enhanced by requiring leptons to be isolated. One method of doing this is to use a veto on the presence of (too many) tracks in a cone around the lepton direction. • Triggering on jets produced by b quarks can be done by counting the number of tracks in a jet that have transverse impact parameters statistically incompatible with the track originating from the beam-line. • Triggers on τ decays τ → `ν`ντ , where ` = e or µ , can be extended to τ → hντ decays, where h represents one or more charged hadrons, by reconstructing a narrow, isolated jet using tracks in combination with calorimeter information. The HLT uses track reconstruction software that is identical to that used for offline reconstruc- tion, but it must run much faster. This is achieved by using a modified configuration of the track reconstruction. – 25 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Tracks can be reconstructed from triplets of hits found using only the pixel tracker, as doc- umented in sections 4.1 and 6.2. This is extremely fast, and can be used with great effect in the reconstruction of the primary-vertex position in the HLT, described in section 6.2. Tracks can also be reconstructed in the HLT using hits from both the pixel and strip detectors. Such tracks have superior momentum resolution and a lower probability of being fake. However, this requires much more CPU time than just reconstructing pixel tracks, since the strip tracker does not provide the precise 3-D hits of the pixel tracker, and suffers from a higher hit occupancy. This can be mitigated using some or all of the following techniques (the details vary significantly, depending on the type of trigger). • Rather than trying to reconstruct all tracks in the event, regional track reconstruction can be performed instead, where the software is used to reconstruct tracks lying within a specified η-φ region around some object of interest (which might be a muon, electron, or jet candidate reconstructed using the calorimeters or muon detectors). This saves CPU time, and is ac- complished by using regional seeding. This method differs from the track seeding described in section 4.1, in that it only forms seeds from combinations of hits that are consistent with a track heading into the desired η-φ region. Another important ingredient of regional track- ing concerns the extraction of hits. As discussed in section 3.2, hits are reconstructed after unpacking the original data blocks produced by the FED readout boards. Significant time is saved by unpacking only the data from those FED units that read out tracker modules within the region of interest [40]. This is not used in the offline reconstruction as the track reconstruction searches the entire η-φ region and therefore needs all hits. • Further gains in speed can be made by performing just a single iteration in the iterative track- ing, such that only seeds made from pairs of pixel hits are considered, where these hits are compatible with a track originating within a few millimetres of a primary pixel vertex. Fur- thermore, the HLT uses a higher pT requirement when forming the seeds (usually >1GeV) than is used for offline reconstruction. These stringent requirements on track impact param- eter and pT reduce the number of seeds, and thereby the amount of time spent building track candidates. • Track finding can differ from that described in section 4.2, in that it can rely on partial track reconstruction. With this technique, the building of each track candidate is stopped once a specific condition is met, for example, a given minimum number of hits (typically eight), or a certain precision requirement on the track parameters. As a consequence, the hits in the outermost layers of the tracker tend not to be used. While such partially reconstructed tracks will have slightly poorer momentum resolution and higher fake rates than fully reconstructed tracks, they also take less CPU time to construct. • Other changes in the tracking configuration can further enhance the speed of reconstruction. For example, when building track candidates from a given seed, the offline track recon- struction retains at most the best five partially reconstructed candidates for extrapolating to the next layer. Changing this configurable parameter to retain fewer candidates can save CPU time. – 26 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 Pixel tracking and other aspects of track reconstruction absorb about 20% of the total HLT CPU time. This is kept low by performing track reconstruction only when necessary, and only after other requirements have been satisfied, so as to reduce the rate at which tracking must be performed. Track reconstruction is employed in a variety of ways to satisfy different needs in the HLT. Examples of track reconstruction at the HLT include seeds originating in the muon detector, tracking in a specific η-φ region defined by a jet, and searching for tracks over the full detector. Even the most comprehensive (and slowest) track reconstruction configuration at the HLT is more than ten times faster than the offline reconstruction of tracks in events representative of data taken in 2011 (tt + 10 pileup events). 5 Track reconstruction performance In this section, the performance of the CTF tracking algorithm is evaluated in terms of tracking efficiency and fake rate, track parameter resolutions, and the CPU time required for processing collision events. Two different categories of simulated samples are used: isolated particles and pp collision events. Comparing the results helps one understand both the performance of the tracking for isolated particles and to what extent it is degraded in a high hit occupancy environment. Simulated events offer the possibility of detailed studies of track reconstruction, such as the way characteristics of the tracker and the design of the track reconstruction algorithms influence its performance over a wide range of particle momenta and rapidities, and how much its perfor- mance depends on the type of charged particle being reconstructed, and on whether this particle is isolated or not. The performance in simulation can be compared with that in data in certain re- gions of phase space to verify that the results from simulation are realistic. The CMS collaboration demonstrated previously that its simulation describes the momentum resolution of muons from J/ψ decay to an accuracy of better than 5% [41]; and does similarly well in describing the dimuon mass resolution of muons from Z boson decay [42]. The transverse and longitudinal impact parame- ters of tracks reconstructed in typical multijet events agree in data and simulation to better than 10% [43]. The CMS collaboration also showed that the tracking efficiency for particles from J/ψ and charmed hadron decays is simulated with a precision better than 5% [44]. A similar compari- son for the higher-momentum muons from Z boson decay will be presented in section 5.1.3 of the present work. The isolated particle samples that are used here consist of simple events with just a single gen- erated muon, pion or electron, although secondary particles may also be present due to interactions with the detector material. The single particles are generated with a flat distribution in pseudora- pidity inside the tracker acceptance |η |< 2.5. Their transverse momenta are either fixed to 1, 10 or 100GeV, or are generated according to a flat distribution in ln(pT). The former set of particles with fixed momenta is used for studying the tracking performance as a function of η , while the latter is used to quantify the performance as a function of pT. For pp collisions, simulated inclusive tt events are used, either with or without superimposed pileup events. The average number of pileup collisions per LHC bunch crossing depends on the instantaneous luminosity of the machine and on the period of data-taking over which the luminosity is averaged. For the sake of simplicity, the number of pileup interactions superimposed on each simulated tt event is randomly generated from a Poisson distribution with mean equal to 8. This – 27 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 amount of pileup corresponds roughly to what was delivered by the LHC, when averaged over the whole 2011 running period. The tt events, and also the minimum-bias events used for the pileup, are generated with the PYTHIA 6 program [45]. Simulated particles are paired to reconstructed tracks for evaluating tracking efficiency, fake rate, and other quantities discussed in this section. A simulated particle is associated with a re- constructed track if at least 75% of the hits assigned to the reconstructed track originate from the simulated particle. The association of simulated hits with reconstructed hits is possible because the simulation software records the particles responsible for the signal in each channel of the tracker. Strip and pixel response to electronic noise is also recorded. Reconstructed tracks that are not associated with a simulated particle are referred to as fake tracks. Results for tracking efficiencies and for fake rates are presented in section 5.1. While the lat- ter is evaluated only using simulated samples, the former is also measured in data as described in section 5.1.3. The resolution obtained for track parameters is discussed in section 5.2. Unless indi- cated otherwise, all results pertaining to the performance are obtained using the set of ‘high-purity’ tracks defined in section 4.4. Finally, section 5.3 provides estimates of the CPU time required for different components of track reconstruction. 5.1 Tracking efficiency and fake rate For simulated samples, the tracking efficiency is defined as the fraction of simulated charged parti- cles that can be associated with corresponding reconstructed tracks, where the association criterion is the one described at the beginning of this section. This definition of efficiency depends not only on the quality of the track-finding algorithm, but also upon the intrinsic properties of the tracker, such as its geometrical acceptance and material content. Using the same association criterion as used for the efficiency, the fake rate is defined as the fraction of reconstructed tracks that are not associated with any simulated particle. This quantity represents the probability that a reconstructed track is either a combination of unrelated hits or a genuine particle trajectory that is badly recon- structed through the inclusion of spurious hits. The efficiency and fake rate presented in this section are given as a function of pT and η of the simulated particle and reconstructed track, respectively. The efficiency is obtained for simulated particles generated within |η | < 2.5, with a production point <3cm and <30cm from the centre of the beam spot for r and |z|, respectively. These cri- teria select fairly prompt particles. We also require pT > 0.9GeV, for the study of efficiency as a function of η , or pT > 0.1GeV for studying efficiency over the entire pT spectrum. Since the ‘high-purity’ requirement described in section 4.4 is the default track selection for the majority of analyses in CMS, unless otherwise stated efficiency and fake rate are measured and presented here using only the subset of reconstructed tracks that are identified as ‘high-purity’. 5.1.1 Results from simulation of isolated particles This section presents the performance of the CTF tracking software in reconstructing trajectories of particles in events containing just a single muon, a pion or an electron. Muons are reconstructed better than any other charged particle in the tracker, as they mainly interact with the silicon detector through ionization of the medium and, unlike electrons, their energy loss through bremsstrahlung is negligible. Muons therefore tend to cross the entire volume of the tracking system, producing detectable hits in several sensitive layers of the apparatus. Finally, – 28 – 2 0 1 4 J I N S T 9 P 1 0 0 0 9 muon trajectories are altered almost exclusively by Coulomb scattering and energy loss, whose effects are straightforward to include within the formalism of Kalman filter. For isolated muons with 1 < pT < 100GeV, the tracking efficiency is >99% over the full η-range of tracker acceptance, and does not depend on pT (figure 8, top). The fake rate is completely negligible. Charged pions, as muons, undergo multiple scattering and energy loss through ionization as they cross the tracker volume. However, like all hadrons, pions are al