5 Change of Slope Models

1 Change of Slope Model


The change of slope model (also called broken-stick regression) is given by

$$y_t = \beta_0 + \beta_1 t + \beta_2\,\mathrm{ReLU}(t-c) + \varepsilon_t, \qquad \varepsilon_t \overset{\text{i.i.d.}}{\sim} N(0,\sigma^2), \qquad \mathrm{ReLU}(t-c) = \max\{t-c,\,0\}. \tag{1.1}$$[1]

For $t \le c$, the slope of the regression line is $\beta_1$, while for $t > c$ it is $\beta_1 + \beta_2$. The parameters here are $c, \beta_0, \beta_1, \beta_2, \sigma$.

If $c$ were known, then (1.1) would be a linear model: $y = X_c\beta + \varepsilon$, where
$$X_c = \begin{pmatrix} 1 & 1 & \mathrm{ReLU}(1-c) \\ \vdots & \vdots & \vdots \\ 1 & n & \mathrm{ReLU}(n-c) \end{pmatrix}.$$
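To make the design matrix concrete, here is a minimal sketch in numpy (the function name `design_matrix` is my own, for illustration):

```python
import numpy as np

def design_matrix(n, c):
    """Design matrix X_c for the broken-stick model with known break c.

    Columns: intercept, linear trend t, and the hinge ReLU(t - c),
    for t = 1, ..., n.
    """
    t = np.arange(1, n + 1)
    relu = np.maximum(t - c, 0)
    return np.column_stack([np.ones(n), t, relu])
```

For example, `design_matrix(5, 3)` has hinge column $(0, 0, 0, 1, 2)^T$.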

2 Parameter Estimation

2.1 MLE

We estimate by maximum likelihood. Define the profile residual sum of squares
$$\mathrm{RSS}(c) = \min_{\beta_0,\beta_1,\beta_2} \sum_{t=1}^{n} \bigl(y_t - \beta_0 - \beta_1 t - \beta_2\,\mathrm{ReLU}(t-c)\bigr)^2,$$
and let $\hat c$ minimize $\mathrm{RSS}(c)$, found numerically by searching over the candidate break points. Given $\hat c$, the model is linear as noted above, so $\hat\beta = (\hat\beta_0, \hat\beta_1, \hat\beta_2)^T = (X_{\hat c}^T X_{\hat c})^{-1} X_{\hat c}^T y$. Then $\hat\sigma = \sqrt{\mathrm{RSS}(\hat c)/(n-3)}$.
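The profile search above can be sketched as a grid search over $c \in \{2, \dots, n-1\}$, solving the inner least-squares problem at each candidate (the helper name `fit_broken_stick` is hypothetical):

```python
import numpy as np

def fit_broken_stick(y):
    """Profile MLE: for each candidate break c in {2, ..., n-1},
    solve the linear least-squares problem and record RSS(c);
    return the minimizing c together with beta-hat and sigma-hat."""
    n = len(y)
    t = np.arange(1, n + 1)
    best = None
    for c in range(2, n):  # c = 2, ..., n-1
        X = np.column_stack([np.ones(n), t, np.maximum(t - c, 0)])
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        rss = float(np.sum((y - X @ beta) ** 2))
        if best is None or rss < best[1]:
            best = (c, rss, beta)
    c_hat, rss_hat, beta_hat = best
    sigma_hat = np.sqrt(rss_hat / (n - 3))  # RSS(c-hat) / (n - 3)
    return c_hat, beta_hat, sigma_hat
```

On noiseless data generated with a break at $c = 20$, this recovers the break point and slopes exactly.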

2.2 Uncertainty Quantification

Now switch to the Bayesian approach. Suppose the prior is $\beta_0, \beta_1, \beta_2, \log\sigma \overset{\text{i.i.d.}}{\sim} \mathrm{Unif}(-C, C)$ for a large $C$. For $c$, the prior is also uniform, with range $\{1, \dots, n\}$. However, $c$ cannot be $1$ or $n$: when $c = 1$, $\beta_0 + \beta_1 t + \beta_2\,\mathrm{ReLU}(t-1) = \beta_0 + \beta_1 t + \beta_2(t-1)$ is linear in $t$, and the same holds for $c = n$. Therefore $c \sim \mathrm{Unif}\{2, \dots, n-1\}$.
Then
$$\pi(c \mid \text{data}) \propto \left(\frac{1}{\mathrm{RSS}(c)}\right)^{\frac{n-3}{2}} \left|X_c^T X_c\right|^{-\frac12} \mathbf{1}\{c \in \{2, \dots, n-1\}\}.$$
Given $c$, the model is linear. By Homework 1 Problem 4, $\frac{\mathrm{RSS}(c)}{\sigma^2} \,\big|\, \text{data}, c \sim \chi^2_{n-3}$. Finally, $\beta \mid \text{data}, c, \sigma \sim N_3\bigl(\hat\beta_c,\ \sigma^2 (X_c^T X_c)^{-1}\bigr)$, where $\hat\beta_c = (X_c^T X_c)^{-1} X_c^T y$.
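The posterior over $c$ is a finite discrete distribution, so it can be evaluated exactly. A minimal sketch (the name `c_posterior` is my own; log weights are used for numerical stability):

```python
import numpy as np

def c_posterior(y):
    """Normalized posterior over c from the formula above:
    log pi(c|data) = -((n-3)/2) log RSS(c) - (1/2) log|X_c^T X_c|,
    for c = 2, ..., n-1."""
    n = len(y)
    t = np.arange(1, n + 1)
    cs = np.arange(2, n)
    logw = np.empty(len(cs))
    for i, c in enumerate(cs):
        X = np.column_stack([np.ones(n), t, np.maximum(t - c, 0)])
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        rss = np.sum((y - X @ beta) ** 2)
        _sign, logdet = np.linalg.slogdet(X.T @ X)
        logw[i] = -(n - 3) / 2 * np.log(rss) - 0.5 * logdet
    w = np.exp(logw - logw.max())  # subtract the max before exponentiating
    return cs, w / w.sum()
```

On data with a clear slope change, the posterior concentrates sharply around the true break point.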

2.3 Posterior Sampling for Uncertainty Quantification

Drawing posterior samples of the unknown parameters is a useful way to visualize the uncertainty.

Algorithm (Posterior Sampling)

  1. Obtain samples $c^{(1)}, \dots, c^{(N)}$ by sampling with replacement from $\{2, \dots, n-1\}$ with probability weights given by $\pi(c \mid \text{data})$ above.
  2. For each $j = 1, \dots, N$:
    1. Fix $c = c^{(j)}$.
    2. Calculate $\mathrm{RSS}(c)$ and $\hat\beta_c$ by running the linear regression with $c$ fixed.
    3. Generate $\chi^2 \sim \chi^2_{n-3}$, then take $\sigma^{(j)} = \sqrt{\mathrm{RSS}(c)/\chi^2}$.
    4. Take $\beta^{(j)} \sim N_3\bigl(\hat\beta_c,\ (\sigma^{(j)})^2 (X_c^T X_c)^{-1}\bigr)$ to be a generated random vector.

So now we have $(c^{(j)}, \beta_0^{(j)}, \beta_1^{(j)}, \beta_2^{(j)}, \sigma^{(j)})$ for $j = 1, \dots, N$, and we can plot the fitted curves $t \mapsto \beta_0^{(j)} + \beta_1^{(j)} t + \beta_2^{(j)}\,\mathrm{ReLU}(t - c^{(j)})$, $t = 1, \dots, n$.
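The full algorithm can be sketched as follows (a sketch under the formulas above; the name `posterior_samples` is my own):

```python
import numpy as np

def posterior_samples(y, N=1000, rng=None):
    """Steps 1-2 of the sampling algorithm: draw c from its discrete
    posterior, then sigma via the chi-square identity, then beta from
    its conditional normal."""
    rng = np.random.default_rng(rng)
    n = len(y)
    t = np.arange(1, n + 1)
    cs = np.arange(2, n)
    # Precompute beta-hat, RSS, and (X^T X)^{-1} for every candidate c.
    stats, logw = [], np.empty(len(cs))
    for i, c in enumerate(cs):
        X = np.column_stack([np.ones(n), t, np.maximum(t - c, 0)])
        XtX = X.T @ X
        beta_hat = np.linalg.solve(XtX, X.T @ y)
        rss = np.sum((y - X @ beta_hat) ** 2)
        _sign, logdet = np.linalg.slogdet(XtX)
        logw[i] = -(n - 3) / 2 * np.log(rss) - 0.5 * logdet
        stats.append((beta_hat, rss, np.linalg.inv(XtX)))
    w = np.exp(logw - logw.max())
    w /= w.sum()
    # Step 1: sample c with replacement using the posterior weights.
    idx = rng.choice(len(cs), size=N, p=w)
    out_c, out_beta, out_sigma = [], [], []
    for j in idx:
        beta_hat, rss, XtX_inv = stats[j]
        chi2 = rng.chisquare(n - 3)                  # step 2.3
        sigma = np.sqrt(rss / chi2)
        beta = rng.multivariate_normal(beta_hat, sigma**2 * XtX_inv)  # step 2.4
        out_c.append(cs[j]); out_beta.append(beta); out_sigma.append(sigma)
    return np.array(out_c), np.array(out_beta), np.array(out_sigma)
```

Each returned triple $(c^{(j)}, \beta^{(j)}, \sigma^{(j)})$ yields one fitted curve; overlaying the $N$ curves visualizes the uncertainty.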

If we want to obtain posterior samples of $y_t$ for a future $t$, then based on the algorithm, add a step 2.5 inside step 2: generate $y_t^{(j)} \sim N\bigl(\beta_0^{(j)} + \beta_1^{(j)} t + \beta_2^{(j)}\,\mathrm{ReLU}(t - c^{(j)}),\ (\sigma^{(j)})^2\bigr)$.
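Step 2.5 can be sketched on top of existing posterior samples (the name `predictive_samples` is my own; the arrays are assumed to come from the algorithm above):

```python
import numpy as np

def predictive_samples(c_s, beta_s, sigma_s, t_new, rng=None):
    """Step 2.5: one posterior predictive draw of y at time t_new
    per posterior sample (c, beta, sigma)."""
    rng = np.random.default_rng(rng)
    relu = np.maximum(t_new - c_s, 0)
    mean = beta_s[:, 0] + beta_s[:, 1] * t_new + beta_s[:, 2] * relu
    return rng.normal(mean, sigma_s)  # N(mean, sigma^2), vectorized over j
```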

3 More Change of Slope

If we want to introduce one more break point:
$$y_t = \beta_0 + \beta_1 t + \beta_2\,\mathrm{ReLU}(t - c_1) + \beta_3\,\mathrm{ReLU}(t - c_2) + \varepsilon_t, \qquad \varepsilon_t \overset{\text{i.i.d.}}{\sim} N(0, \sigma^2),$$
we can also write it as $y = X_c\beta + \varepsilon$ with
$$X_c = \begin{pmatrix} 1 & 1 & \mathrm{ReLU}(1-c_1) & \mathrm{ReLU}(1-c_2) \\ \vdots & \vdots & \vdots & \vdots \\ 1 & n & \mathrm{ReLU}(n-c_1) & \mathrm{ReLU}(n-c_2) \end{pmatrix}.$$
Then we also have the posterior
$$\pi(c \mid \text{data}) \propto \left(\frac{1}{\mathrm{RSS}(c)}\right)^{\frac{n-4}{2}} \left|X_c^T X_c\right|^{-\frac12}.$$
For more break points, we can consider $y_t = \beta_0 + \beta_1 t + \sum_{j=1}^{k} \beta_{j+1}\,\mathrm{ReLU}(t - c_j) + \varepsilon_t$.
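The design matrix extends directly to $k$ break points, one hinge column per break (the name `design_matrix_multi` is my own, for illustration):

```python
import numpy as np

def design_matrix_multi(n, breaks):
    """Design matrix for break points c_1 < ... < c_k: columns are
    1, t, ReLU(t - c_1), ..., ReLU(t - c_k), for t = 1, ..., n."""
    t = np.arange(1, n + 1)
    cols = [np.ones(n), t] + [np.maximum(t - c, 0) for c in breaks]
    return np.column_stack(cols)
```

With $k$ breaks the matrix has $k + 2$ columns, matching the exponent $(n - k - 2)/2$ in the posterior (which is $(n-4)/2$ for $k = 2$).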


  1. We can also denote this as $(\cdot)_+$, i.e. $\mathrm{ReLU}(t-c) = (t-c)_+$. ↩︎