REU2019: Validation of Higgs to bb¯ tagging techniques with Zbb¯

Goal

Use Zbb¯ process to validate performance of techniques used to identify Hbb¯.

mass-peak

Data

Monte-Carlo samples

Event selection

Some of these cuts are already applied at ntuple level.

Ntuples

How to start

  1. A code skeleton that reads ntuples and creates histograms of relevant variables is here:
/data/users/miochoa/REU2019/reu-2019-skeleton
  1. Copy the entire folder above into your user area in /data/users/<your-username>. If your user area does not yet exist, create it and cd into it, then:
    cp -r /data/users/miochoa/REU2019/reu-2019-skeleton .

  2. Everytime you start a session, you need to setup the required tools by running:

cd reu-2019-skeleton
source setup.sh
  1. The event and object selection as well as histogram definition takes place here:
ZbbAnalysisCode/src/MyZbbAnalysis.C
Follow the existing examples to add new histograms.
  1. How to run a test after any modification:

    cd run/
    root -l -b
    .L runAnalysis.C
    runAnalysis(dataset name, "12", "local")
    

    The “dataset name” should be replaced by one of the datasets listed in:

    run/inputs/data.txt or inputs/mc.txt
    

    This step will produce an output.root file. Inspect it to make sure your histograms are properly filled.

  2. How to run on the full list of data and MC samples:

    cd run/
    ./localRun.sh inputs/data.txt
    ./localRun.sh inputs/mc.txt
    

    These two steps will produce output root files for each data or MC sample (around 250). You can combine the root files from data all in a single file, with the following command:

    hadd output_data.root output_data_*root
    

    The MC files can’t be combined, because they will have to be individually scaled by their corresponding cross-sections, which is performed in the next step.

  3. There is an example python script for making nice plots with the histograms produced in the earlier steps:

    cd Plotting/
    python new_plotting_example.py -b
    

Questions

  1. How are photons identified and reconstructed in ATLAS?
    • E.g. on the paper cited above, a loose photon requirement is mentioned. What does it consist of?
  2. This work uses two ‘types’ of jets: large-R calorimeter jets and variable-R track-jets.
    • How are jets defined and built in ATLAS?
  3. We also use b-tagging techniques to identify jets that contain b-hadrons: what properties of the b-quark are useful for this tagging?
  4. What goes into the scaling of the MC samples before you make your plots?

Studies

  1. Study data/MC agreement in different variables and selections:
    • e.g. before and after requiring two b-tagged jets associated to the large-R jet.
  2. What is the signal efficiency for different b-tagging selections?
  3. Are there distributions that provide discriminant power between γ+jets and Zbb¯?
  4. How does the efficiency to find a Zbb¯ object in data compare to the efficiency in MC?