If you are interested in step-by-step instructions to start working with CMS Open Data, please consult these pages:
However, if you are interested in finding hints, tips and guidance for conducting a research-oriented analysis using CMS Open Data, please see our notes on this page. Note that possible solutions to frequently encountered issues can be found on our page of known errors.
I want to get a general introduction into HEP and CMS software and terminology, with a simplified event format.
I want to learn about the terms under which I can access and use the CMS Open Data, and publish results obtained from them.
I want to get inspiration for some potential physics topics.
I want to learn about the nature of the CMS physics objects and the corresponding variables and terminology.
I want to find out whether I should go for data from 2010 or 2011 (both are pp data at 7 TeV) or from 2012 (pp data at 8 TeV).
I want to install the CMS software environment needed for access to and analysis of CMS Research level data.
Note: The 2010 (SL5) virtual machine will only work on 2010 data with CMSSW 4-2-8 (and other SLC5-based CMSSW releases). The 2011 (SL6) virtual machine will only work on 2011/2012 data and MC with CMSSW 5-3-32 (and other SLC6-based CMSSW releases).
I want to produce some example physics distributions.
I want to find out which 2010 datasets exist, and how to get a feel for their content.
I also want to view some corresponding event displays.
Event display file derived from
… + the name of the CMS primary dataset you want (ZeroBias is known to be essentially empty).or, alternatively:
Open File
→ Open Files from Web
→ 2010
, andI want to find out which 2010 dataset and/or analysis/validation example is most useful for my purpose.
I want to find out which 2011-2012 data and MC sets exist, and how to get a feel for their content.
I also want to to view some corresponding event displays.
Event display file derived from
… + the name of the CMS primary dataset you want (ZeroBias is known to be essentially empty).or, alternatively:
Open File
→ Open Files from Web
→ 2011
or 2012
, andI want to find out which 2011 or 2012 dataset and/or analysis/validation example is most useful for my purpose.
Dedicated examples beyond those available in "Getting Started" can be found in software, including Higgs-to-four-lepton analysis, jet tuple production and top cross-sections. Alternatively, start from a 2010 example and adjust to run on 2011 or 2012 data (see the CMS troubleshooting guide for instructions).
For more information on Monte Carlo, see below.
I want to find out how to use the trigger and trigger prescale information in the dataset I am interested in.
I want to find out how to access the luminosity information for the dataset I am interested in and how to select "good data" only.
I want to find out whether I need condition data base information, and if so, how to access it.
I want to find the luminosity of my dataset, possibly constrained by using specific triggers.
I want to find more CMS software and data format documentation from public sources.
I want to learn about the published jet-analysis papers by the MIT group.
I want to run the examples used for validation of the 2010 datasets within my setup…
I want to backup my code, or import some external code.
scp
from and to your host from within the VM.How do I interpret the MC set names?
I want to find the effective luminosity of my MC set.
I want to find the generator cross section of a particular MC set.
Runs
→ GenRunInfoProduct_generator__SIM.
→ GenRunInfoProduct_generator__SIM.obj
→ InternalXSec
→ value_
.I want information that is not documented here and elsewhere on the CERN Open Data portal.
I ran into a problem and need help!