Reference Material
- Combine Manual
- Combine Tutorial at LPC
- Practical Statistics for LHC Physicists - Three CERN Academic Lectures by Harrison Prosper
- Statistics in Theory - A lecture by Bob Cousins
- RooFit - Slides by Wouter Verkerke, one of the RooFit developers
- RooFit Tutorials - A set of macros that showcase all major features of RooFit
- RooStats Manual - A concise, clear, summary of statistics concepts and definitions
- RooStats Tutorial - Tutorial by Kyle Cranmer, one of the RooStats developers
- RooStats Tutorials - A set of macros that showcase all major features of RooStats
- CMS DAS 2014 Statistics Exercise - A tutorial on statistics as used in CMS
- Procedure for the LHC Higgs boson search combination in Summer 2011 - Paper describing LHC statistical procedures
- Combine Github - Github repository for combine
- LPC statistics course - Lectures by Harrison Prosper and Ulrich Heintz, fall 2017
Terminology and Conventions
Here we give pragmatic definitions for a few basic concepts that we will use.
- observable - something you measure in an experiment, for example, a particle’s momentum. Often, a function of measured quantities, for example, an invariant mass of several particles.
- global observable or auxiliary observable - an observable from another measurement, for example, the integrated luminosity.
- model - a set of probability functions (PFs) describing the distributions of observables or functions of observables. The probability functions are called probability density functions (PDFs) if the observables are continuous and probability mass functions (PMF) if the observables are discrete. In the Bayesian approach, the model also includes the prior density.
- model parameter - any variable in your model that is not an observable.
- parameter of interest (POI) - a model parameter of current interest, for example, a cross section.
- nuisance parameter - every model parameter other than your parameter (or parameters) of interest.
- data or data set - a set of values of observables, either measured in an experiment or simulated.
- likelihood - a model computed for a particular data set.
- hypothesis - a model in which all quantities are specified: observables, model parameters, and prior PDFs (in case of Bayesian inference).
- prior - a probability or probability density for an observable or a model parameter that is independent of the data set. Priors are a key feature of Bayesian inference. However, priors can be used in frequentist inference only if they can be interpreted as relative frequencies.
- Bayesian - a school of statistical inference based on the likelihood and a prior.
- frequentist - a school of statistical inference based on the likelihood only.