Reference Material

Combine Manual

Combine Tutorial at LPC

Practical Statistics for LHC Physicists - Three CERN Academic Lectures by Harrison Prosper

Statistics in Theory - A lecture by Bob Cousins

RooFit - Slides by Wouter Verkerke, one of the RooFit developers

RooFit Tutorials - A set of macros that showcase all major features of RooFit

RooStats Manual - A concise, clear summary of statistics concepts and definitions

RooStats Tutorial - Tutorial by Kyle Cranmer, one of the RooStats developers

RooStats Tutorials - A set of macros that showcase all major features of RooStats

CMS DAS 2014 Statistics Exercise - A tutorial on statistics as used in CMS

Procedure for the LHC Higgs boson search combination in Summer 2011 - Paper describing LHC statistical procedures

Combine GitHub - GitHub repository for combine

LPC statistics course - Lectures by Harrison Prosper and Ulrich Heintz, fall 2017

Terminology and Conventions

Here we give pragmatic definitions for a few basic concepts that we will use.

observable - something you measure in an experiment, for example, a particle’s momentum. Often it is a function of measured quantities, for example, the invariant mass of several particles.
global observable or auxiliary observable - an observable from another measurement, for example, the integrated luminosity.
model - a set of probability functions (PFs) describing the distributions of observables or functions of observables. The probability functions are called probability density functions (PDFs) if the observables are continuous and probability mass functions (PMFs) if the observables are discrete. In the Bayesian approach, the model also includes the prior density.
model parameter - any variable in your model that is not an observable.
parameter of interest (POI) - a model parameter of current interest, for example, a cross section.
nuisance parameter - every model parameter other than your parameter (or parameters) of interest.
data or data set - a set of values of observables, either measured in an experiment or simulated.
likelihood - a model evaluated for a particular data set and viewed as a function of the model parameters.
hypothesis - a model in which all quantities are specified: observables, model parameters, and prior PDFs (in case of Bayesian inference).
prior - a probability or probability density for an observable or a model parameter that is independent of the data set. Priors are a key feature of Bayesian inference. In frequentist inference, priors can be used only if they can be interpreted as relative frequencies.
Bayesian - a school of statistical inference based on the likelihood and a prior.
frequentist - a school of statistical inference based on the likelihood only.
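
The definitions above can be made concrete with a small counting experiment. The following is a minimal sketch in plain Python, not RooFit, RooStats, or combine code. It assumes a single observed count `n` (the observable) described by a Poisson PMF with mean `s + b`, where the signal yield `s` plays the role of the parameter of interest and the background yield `b` would normally be a nuisance parameter (here it is fixed to a known value for simplicity). The numbers `n_obs = 12` and `b = 5.0`, the grid in `s`, and the flat prior are all hypothetical choices made for illustration.

```python
import math


def poisson_pmf(n, mean):
    """Probability mass function for a discrete observable n."""
    return math.exp(-mean) * mean**n / math.factorial(n)


def model(n, s, b):
    """Model: PMF of the observable n given the POI s and the parameter b."""
    return poisson_pmf(n, s + b)


def likelihood(s, b, n_obs):
    """Likelihood: the model evaluated at the observed data set,
    viewed as a function of the model parameters."""
    return model(n_obs, s, b)


# Hypothetical data set and background yield (assumptions for illustration).
n_obs = 12
b = 5.0

# Scan the parameter of interest on a grid from 0 to 30.
step = 0.05
s_grid = [step * i for i in range(601)]
like = [likelihood(s, b, n_obs) for s in s_grid]

# Likelihood-only (frequentist-style) summary: the maximum likelihood estimate.
i_max = max(range(len(s_grid)), key=lambda i: like[i])
s_hat = s_grid[i_max]

# Bayesian summary: posterior density proportional to likelihood times prior.
# A flat prior in s is assumed here purely for simplicity.
prior = [1.0 for _ in s_grid]
unnormalized = [l * p for l, p in zip(like, prior)]
norm = sum(unnormalized) * step
posterior = [u / norm for u in unnormalized]
posterior_mean = sum(s * p for s, p in zip(s_grid, posterior)) * step

print(f"maximum likelihood estimate of s: {s_hat:.2f}")
print(f"posterior mean of s (flat prior): {posterior_mean:.2f}")
```

For a Poisson model with fixed `b`, the likelihood is maximal where `s + b` equals the observed count, so the scan should return a value close to `n_obs - b`. The posterior mean depends on the assumed prior and on the range of the grid, which is one practical difference between the two schools of inference defined above.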

Schedule

	Setup	Download files required for the lesson
00:00	1. Introduction	What are the goals of a statistical analysis?
00:10	2. Exercise 0
00:10	3. Exercise 1
00:10	4. Exercise 2
00:10	5. Exercise 3
00:10	6. Exercise 4
00:10	7. Exercise 5
00:10	8. Exercise 6
00:10	Finish

CMS DAS Short Exercise - Statistics: An Introduction to the Statistics Tools RooFit, RooStats, and combine
