Publication:
Effective HTCondor-based monitoring system for CMS

dc.contributor.author: Balcas, J.
dc.contributor.author: Bockelman, B. P.
dc.contributor.author: Da Silva, J. M. [UNESP]
dc.contributor.author: Hernandez, J.
dc.contributor.author: Khan, F. A.
dc.contributor.author: Letts, J.
dc.contributor.author: Mascheroni, M.
dc.contributor.author: Mason, D. A.
dc.contributor.author: Perez-Calero Yzquierdo, A.
dc.contributor.author: Vlimant, J. R.
dc.contributor.institution: California Institute of Technology
dc.contributor.institution: University of Nebraska-Lincoln
dc.contributor.institution: Universidade Estadual Paulista (UNESP)
dc.contributor.institution: Centro de Investigaciones Energeticas Medioambientales y Tecnologicas
dc.contributor.institution: National Center for Physics
dc.contributor.institution: University of California
dc.contributor.institution: Fermi National Accelerator Laboratory
dc.contributor.institution: Port d'Informacio Cientifica
dc.date.accessioned: 2022-04-28T19:07:13Z
dc.date.available: 2022-04-28T19:07:13Z
dc.date.issued: 2017-11-23
dc.description.abstract (en): The CMS experiment at the LHC relies on HTCondor and glideinWMS as its primary batch and pilot-based Grid provisioning systems, respectively. Given the scale of the global queue in CMS, the operators found it increasingly difficult to monitor the pool to find problems and fix them. The operators had to rely on several different web pages, with several different levels of information, and sift tirelessly through log files in order to monitor the pool completely. Therefore, coming up with a suitable monitoring system was one of the crucial items before the beginning of the LHC Run 2 in order to ensure early detection of issues and to give a good overview of the whole pool. Our new monitoring page utilizes the HTCondor ClassAd information to provide a complete picture of the whole submission infrastructure in CMS. The monitoring page includes useful information from HTCondor schedulers, central managers, the glideinWMS frontend, and factories. It also incorporates information about users and tasks making it easy for operators to provide support and debug issues.
dc.description.affiliation: California Institute of Technology
dc.description.affiliation: University of Nebraska-Lincoln
dc.description.affiliation: Universidade Estadual Paulista
dc.description.affiliation: Centro de Investigaciones Energeticas Medioambientales y Tecnologicas
dc.description.affiliation: National Center for Physics
dc.description.affiliation: University of California
dc.description.affiliation: Fermi National Accelerator Laboratory
dc.description.affiliation: Port d'Informacio Cientifica
dc.description.affiliationUnesp: Universidade Estadual Paulista
dc.identifier: http://dx.doi.org/10.1088/1742-6596/898/9/092039
dc.identifier.citation: Journal of Physics: Conference Series, v. 898, n. 9, 2017.
dc.identifier.doi: 10.1088/1742-6596/898/9/092039
dc.identifier.issn: 1742-6596
dc.identifier.issn: 1742-6588
dc.identifier.scopus: 2-s2.0-85039413164
dc.identifier.uri: http://hdl.handle.net/11449/220991
dc.language.iso: eng
dc.relation.ispartof: Journal of Physics: Conference Series
dc.source: Scopus
dc.title (en): Effective HTCondor-based monitoring system for CMS
dc.type: Conference paper
dspace.entity.type: Publication
