Every coefficient directly actionable — scenarios, monitoring, communication
Full OLS diagnostic toolkit
Returns to scale estimated, not assumed — the data tells you the regime
Trend: time-varying intercept and elasticities — test with indicators
Seasonality: periodic (calendar) and non-periodic (Easter, promotions) — both modeled explicitly
Model
Index approach
flowchart LR
classDef largeNode fill:#1f2a2c,stroke:#8fe0d6,stroke-width:2px,color:#e9fffb,font-size:34px;
A("Macroeconomy GDP"):::largeNode -->
B("Industry demand Retail index"):::largeNode
B --> C("Operational driver Volume"):::largeNode
C --> D("Financial outcome Revenue"):::largeNode
Model
Index approach
flowchart LR
classDef largeNode fill:#1f2a2c,stroke:#8fe0d6,stroke-width:2px,color:#e9fffb,font-size:34px;
A("Economic Growth"):::largeNode -->
B("E-Commerce"):::largeNode
B --> C("Industry Volume"):::largeNode
C --> D("Segment Revenue"):::largeNode
flowchart LR
classDef largeNode fill:#1f2a2c,stroke:#8fe0d6,stroke-width:2px,color:#e9fffb,font-size:34px;
A("Economic Growth"):::largeNode -->
B("Full Service Restaurant"):::largeNode
B --> C("Industry Volume"):::largeNode
C --> D("Segment Revenue"):::largeNode
flowchart LR
classDef largeNode fill:#1f2a2c,stroke:#8fe0d6,stroke-width:2px,color:#e9fffb,font-size:34px;
A("Economic Growth"):::largeNode -->
B("Grocery"):::largeNode
B --> C("Industry Transactions"):::largeNode
C --> D("Segment Revenue"):::largeNode
Industry Index
Pricing Mix
Weighting factors:
sankey-beta
FSR,Revenue,40
QSR,Revenue,30
General Retail,Revenue,15
Grocery,Revenue,10
Other,Revenue,5
Building It Right
Regression Modeling — key concepts
Identification — can feel like an art form, how to know it's right
Specification — Functional Form drives everything downstream
Omitted Variable Bias — dropping a variable biases what remains
(e.g., dropping a collinear predictor that carries unique signal)
Collinearity (VIF) — individual t-tests become unreliable;
doesn't mean the model is wrong — check joint significance
Heteroscedasticity — tells you about the DGP, not just the errors
Autocorrelation — often the first signal of a missing variable
Go/No-Go
F-Test — joint significance holds even when individual t-tests fail
Out-of-sample validation — the only test that matters for forecasting
Regression Modeling — key concepts
Interpretation and Validation
Ceteris paribus — isolate the effect of one driver, holding others constant
The structural decomposition makes this operational, not just theoretical
Accuracy metrics — match the metric to the audience:
MAPE for stakeholders — "the forecast was off by 0.8%"
MAE for robustness — resistant to outlier distortion
RMSE for risk — penalizes for the misses that hurt most
Out-of-sample accuracy is the credibility test — in-sample fit is necessary, not sufficient
Regression Modeling — Key Concepts
Leveraged Outliers — What's driving your model?
Leverage: — voting power of each observation
near 0 → negligible influence; near 1 → model anchored to this point
Flag:
Influence:
Large shift → this observation is driving your estimates