3 Sinusoidal Model

#SinusoidalModel #MLE #PythagoreanIdentity #DFT #FourierFrequency

1 Sinusoidal Model

Goal: to fit a simple sinusoidal^[1] model to the sunspots data.

Let's consider the expression of sinusoidal model: $s (t) = A \cos (2 π f t) + B \sin (2 π f t), A = R \cos ϕ, B = - R \sin ϕ .$
For us, usually $t = 1, 2, \dots, n$ . In this case, we can restrict the frequency $f \in [0, \frac{1}{2}]$ .^[2]

Based on the remark,

Fact

For every $f, ϕ$ , there exists $f_{0} \in [0, \frac{1}{2}], ϕ_{0}$ , s.t. $s (t; f, ϕ) = s (t, f_{0}, ϕ_{0})$ , $\forall t \in N^{*}$ .

When $f = 0$ , $s (t)$ is constant; when $f = \frac{1}{2}$ , $s (t) = R \cos (π t + ϕ) = (- 1)^{t} R \cos (ϕ)$ .

Sinusoidal Model

$y_{t} = β_{0} + β_{1} \cos 2 π f t + β_{2} \sin 2 π f t + ε_{t}, ε_{t} \overset{i . i . d}{\sim} N (0, σ^{2})$ .
The parameters here are $β_{0}, β_{1}, β_{2}, σ, f$ .

If $f$ is known, it's a linear model. Otherwise it's a nonlinear regression model.

2 Frequentist Inference

Like in linear models, frequentist calculate the MLE: $\begin{aligned} \prod_{i = 1}^{n} \frac{1}{\sqrt{2 π} σ} \exp [- \frac{(y_{t} - β_{0} - β_{1} \cos 2 π f t - β_{2} \sin 2 π f t)^{2}}{2 σ^{2}}] \\ \propto & σ^{- n} \exp [- \frac{1}{2 σ^{2}} \sum_{t = 1}^{n} (y_{t} - β_{0} - β_{1} \cos 2 π f t - β_{2} \sin 2 π f t)^{2}] \\ = & σ^{- n} \exp [- \frac{1}{2 σ^{2}} S (β_{0}, β_{1}, β_{2}, f)] . \end{aligned}$
So the problem is to find $({\hat{β}}_{0}, {\hat{β}}_{1}, {\hat{β}}_{2}, \hat{f})$ that minimizes $\begin{aligned} S (β_{0}, β_{1}, β_{2}, f) & = \sum_{t = 1}^{n} (y_{t} - β_{0} - β_{1} \cos 2 π f t - β_{2} \sin 2 π f t)^{2} \\ (2.1) & = | | y - X_{f} β | |^{2} . \end{aligned}$ We first fix $f$ , so this is just a linear regression. We have $\hat{β} (f) = (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} y, X_{f} = [\begin{matrix} 1 & \cos 2 π f (1) & \sin 2 π f (1) \\ ⋮ & ⋮ & ⋮ \\ 1 & \cos 2 π f (n) & \sin 2 π f (n) \end{matrix}] .$ Then $\hat{f} = \arg min_{f} S (\hat{β} (f), f), β = (β_{0}, β_{1}, β_{2})^{T} .$

3 Bayesian Inference

The posterior is $σ^{- n} \exp [- \frac{S (β, f)}{2 σ^{2}}] 1 {0 \leq f \leq \frac{1}{2}},$ and assume prior $β_{0}, β_{1}, β_{2}, \log σ \overset{i . i . d}{\sim} Uniform [- C, C]$ , $f \sim Uniform [0, \frac{1}{2}]$ .
Then^[3] $f_{β, σ, f | d a t a} = σ^{- n - 1} \exp [- \frac{S (β, f)}{2 σ^{2}}] 1 {0 \leq f \leq \frac{1}{2}} 1 {- C < β_{0}, β_{1}, β_{2}, \log σ < C},$ then the posterior density of $f$ $\begin{array}{r} \propto \iint σ^{- n - 1} \exp [- \frac{S (β, f)}{2 σ^{2}}] d β d σ . \end{array}$

By Pythagorean identity, $S (β, f) = | | y - X_{f} β | |^{2} = S ({\hat{β}}_{f}, f) + (β - {\hat{β}}_{f})^{T} X_{f}^{T} X_{f} (β - {\hat{β}}_{f}) .$
So further on $\begin{aligned} \propto & \iint σ^{- n - 1} \exp (- \frac{S ({\hat{β}}_{f}, f)}{2 σ^{2}}) [- \frac{(β - {\hat{β}}_{f})^{T} X_{f}^{T} X_{f} (β - {\hat{β}}_{f})}{2 σ^{2}}] d β d σ \\ = & \int σ^{- n - 1} \exp (- \frac{S ({\hat{β}}_{f}, f)}{2 σ^{2}}) (2 π)^{\frac{p}{2}} \sqrt{det (σ^{2} (X_{f}^{T} X_{f})^{- 1})} \\ \propto & \int σ^{- n - 1} \exp (- \frac{S ({\hat{β}}_{f}, f)}{2 σ^{2}}) σ^{p} | X_{f}^{T} X_{f} |^{- \frac{1}{2}} d σ \\ = & | X_{f}^{T} X_{f} |^{- \frac{1}{2}} \int σ^{- n + p - 1} \exp (- \frac{S ({\hat{β}}_{f}, f)}{2 σ^{2}}) d σ \\ = & \int_{0}^{\infty} t^{- n + p - 1} (S ({\hat{β}}_{f}, f))^{- \frac{n - p}{2}} \exp (- \frac{1}{2 t^{2}}) d t \\ \propto & (S ({\hat{β}}_{f}, f))^{- \frac{n - p}{2}} | X_{f}^{T} X_{f} |^{- \frac{1}{2}} . \end{aligned}$
Compare with linear regression: $Posterior \propto (S (β))^{- \frac{n}{2}} .$

4 Fourier Frequency

$RSS (f)$ ( $S$ above measures how well the sinusoid at frequency $f$ fits the data). So we should take a grid of values for $f$ , and then compute $RSS (f)$ for each grid point.

Common Choice of Grid for $f$ : (notate as $G$ )

$n$ is even, $0, \frac{1}{n}, \frac{2}{n}, \dots, \frac{1}{2}$ ,
$n$ is odd, $0, \frac{1}{n}, \frac{2}{n}, \dots, \frac{n - 1}{2 n}$ .

Note that $n$ is the size of data.

Fourier Frequency

Fourier frequency is frequency $f$ s.t. $n f$ is an integer.

So our common choice inside $G$ are all Fourier frequencies lying in $[0, \frac{1}{2}]$ .

When $f \in G$ , plug in ${\hat{β}}_{f} = (X_{f}^{T} X_{f})^{- 1} X_{t}^{T} y$ , $\begin{aligned} RSS (f) = & | | y - X_{f} {\hat{β}}_{f} | |^{2} = (y - X_{f} {\hat{β}}_{f})^{T} (y - X_{f} {\hat{β}}_{f}) \\ = & y^{T} y - {\hat{β}}_{f}^{T} X_{f}^{T} y - y^{T} X_{f} {\hat{β}}_{f} + {\hat{β}}_{f}^{T} X_{f}^{T} X_{f} {\hat{β}}_{f} \\ = & y^{T} y - y^{T} (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} y - y^{T} X_{f} (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} y \\ + y^{T} X_{f} (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} X_{f} (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} y \\ = & y^{T} y - y^{T} X_{f} (X_{f}^{T} X_{f})^{- 1} X_{f}^{T} y . \end{aligned}$
And $\begin{aligned} X_{f}^{T} X_{f} & = [\begin{array}{c} 1 & \dots & 1 \\ \cos 2 π f \cdot 1 & \dots & \cos 2 π f \cdot n \\ \sin 2 π f \cdot 1 & \dots & \sin 2 π f \cdot n \end{array}] [\begin{array}{c} 1 & \cos 2 π f \cdot 1 & \sin 2 π f \cdot 1 \\ ⋮ & ⋮ & ⋮ \\ 1 & \cos 2 π f \cdot n & \sin 2 π f \cdot n \end{array}] \\ = [\begin{array}{c} n & \sum_{t = 1}^{n} \cos (2 π f t) & \sum_{t = 1}^{n} \sin (2 π f t) \\ * & \sum_{t = 1}^{n} \cos^{2} (2 π f t) & \sum_{t = 1}^{n} \cos (2 π f t) \sin (2 π f t) \\ * & * & \sum_{t = 1}^{n} \sin^{2} 2 π f t \end{array}] . \end{aligned}$ Next let $r = e^{2 π i f t}$ , ^[4] $\begin{aligned} \sum_{t = 1}^{n} \cos (2 π f t) = \sum_{t = 1}^{n} \frac{e^{2 π i f t} + e^{- 2 π i f t}}{2} \\ \overset{r = e^{2 π i f}}{=} & \frac{1}{2} \sum_{t = 1}^{n} r^{t} + \frac{1}{2} \sum_{t = 1}^{n} r^{- t} \\ = & \frac{1}{2} \frac{e^{2 π i f}}{e^{2 π i f} - 1} (e^{2 π i f} - 1) + \frac{1}{2} \frac{e^{- 2 π i f}}{e^{- 2 π i f} - 1} (e^{- 2 π i f} - 1) = 0. \end{aligned}$

Next similarly $\begin{aligned} \sum_{t = 1}^{n} \cos^{2} (2 π f t) = \sum_{t = 1}^{n} \frac{1 + \cos (4 π f t)}{2} = \frac{n}{2}, \\ \sum_{t = 1}^{n} \cos (2 π f t) \sin (2 π f t) = \frac{1}{2} \sum_{t = 1}^{n} \sin (4 π f t) = 0. \end{aligned}$

One more fact

$f_{1}, f_{2}$ are two distinct Fourier frequencies. Then $\begin{aligned} \sum_{t = 0}^{n - 1} \cos (2 π f_{1} t) \sin (2 π f_{2} t) = 0, \\ \sum_{t = 0}^{n - 1} \sin (2 π f_{1} t) \sin (2 π f_{2} t) = 0, \\ \sum_{t = 0}^{n - 1} \sin (2 π f_{1} t) \cos (2 π f_{2} t) = 0. \end{aligned}$

So $X_{f}^{T} X_{f} = [\begin{matrix} n & 0 & 0 \\ 0 & \frac{n}{2} & 0 \\ 0 & 0 & \frac{n}{2} \end{matrix}], (X_{f}^{T} X_{f})^{- 1} = [\begin{matrix} \frac{1}{n} & 0 & 0 \\ 0 & \frac{2}{n} & 0 \\ 0 & 0 & \frac{2}{n} \end{matrix}] .$ Then $\begin{aligned} RSS (f) & = y^{T} y - (\sum_{t = 1}^{n} y_{t} \sum_{t = 1}^{n} y_{t} \cos 2 π f t \sum_{t = 1}^{n} y_{t} \sin 2 π f t) [\begin{array}{c} \frac{1}{n} & 0 & 0 \\ 0 & \frac{2}{n} & 0 \\ 0 & 0 & \frac{2}{n} \end{array}] (\begin{array}{c} \sum_{t = 1}^{n} y_{t} \\ \sum_{t = 1}^{n} y_{t} \cos 2 π f t \\ \sum_{t = 1}^{n} y_{t} \sin 2 π f t \end{array}) \\ = y^{T} y - \frac{1}{n} {(\sum_{t = 1}^{n} y_{t})}^{2} - \frac{2}{n} {(\sum_{t = 1}^{n} y_{t} \cos 2 π f t)}^{2} - \frac{2}{n} {(\sum_{t = 1}^{n} y_{t} \sin 2 π f t)}^{2} \\ = \sum_{t = 1}^{n} (y_{t} - \overset{―}{y})^{2} - \frac{2}{n} {(\sum_{t = 1}^{n} y_{t} \cos 2 π f t)}^{2} - \frac{2}{n} {(\sum_{t = 1}^{n} y_{t} \sin 2 π f t)}^{2} . \end{aligned}$ Note again that this is only true when $f \in [0, \frac{1}{2}]$ and $f$ is Fourier frequency.

Periodogram

Define periodogram as $I (f) = \frac{1}{n} [{(\sum_{t = 1}^{n} y_{t} \cos 2 π f t)}^{2} + {(\sum_{t = 1}^{n} y_{t} \sin 2 π f t)}^{2}] .$

So $RSS (f) = \sum_{t = 1}^{n} (y_{t} - \overset{―}{y})^{2} - 2 I (f) .$

We can also rewrite $I (f) = \frac{1}{n} {| \sum_{t = 1}^{n} y_{t} (\cos 2 π f t + i \sin 2 π f t) |}^{2} = \frac{1}{n} {| \sum_{t = 1}^{n} y_{t} e^{- 2 π i f t} |}^{2} .$ This is exactly a Fourier transformation.

5 Some Other Nonlinear Regression Models

$y_{t} = β_{0} + β_{1} t + β_{2} \cos (2 π f t) + β_{3} \sin (2 π f t) + ε_{t}$ . $RSS (f) = \sum_{t = 1}^{n} (y_{t} - β_{0} - β_{1} t - β_{2} \cos (2 π f t) - β_{3} \sin (2 π f t))^{2} .$
(Broken stick / change of slope) $y_{t} = β_{0} + β_{1} t + β_{2} (t - s)_{+} + ε_{t}$ , here $(t - s)_{+} = max {t - s, 0}$ .

(正弦曲线的) $s (t) = R \cos (2 π f t + ϕ)$ . Here $R$ is called the amplitude (振幅), $ϕ$ is called phase (相位), $f$ is the frequency (频率). The period here is $\frac{1}{f}$ , and $2 π f$ is angular frequencing ↩︎
Because $t$ is an integer, and we are looking at $s (t) = R \cos (2 π f t + ϕ)$ . If say $f = - 3.5$ , then $\begin{aligned} s (t) & = R \cos (2 π (3.5) t - ϕ) = R \cos (6 π t + 2 π (0.5) t - ϕ) \\ = R \cos (2 π (0.5) t - ϕ) . \end{aligned}$ ↩︎
Here the first $f$ means the likelihood function, and the subscripted $_{f}$ means frequency ↩︎
Last equation $0$ is because $f$ is a Fourier frequency, then $n f \in Z$ . ↩︎