8 Spectrum Model

High-dimensional Regression with Sinusoids

Traditional sinusoidal models cannot capture important characteristics of the sunspots dataset. We can fix this by including sinusoid terms for all possible f. We stick to Fourier frequencies. Say n is odd, then consider 1n,2n,,n12n. Denote (1.1)yt=β0+j=1m(β1jcos(2πjnt)+β2jsin(2πjnt))+εt, where εti.i.dN(0,σ2). We can write this model in matrix form as y=Xβ+ε, where X=(1cos(2π(1/n)1)sin(2π(1/n)1)cos(2π(m/n)1)sin(2π(m/n)1)1cos(2π(1/n)n)sin(2π(1/n)n)cos(2π(m/n)n)sin(2π(m/n)n)), where m=n12, and β=(β0,β11,β21,,β1m,β2m)TRn×1.

Without regularization, if we only want to minimize ||yXβ||2, we can get a perfect fit β^0=y,β^1k=2nt=1nytcos(2πknt),β^2k=2nt=1nytsin(2πknt).
To prevent overfitting, we can use ridge regularization: t=1n(ytβ0j=1m(β1jcos(2πjnt)+β2jsin(2πjnt)))2+λj=1m(β1j2+β2j2).
From discuss here, we can also understand it as (1.2)β11,β21,,β1m,β2mi.i.dN(0,τ2).

The Spectrum Model

The spectrum model is based on (1.1) and (1.2), and has two changes:

Therefore we have (2.1)yt=β0+j=1m(β1jcos(2πjnt)+β2jsin(2πjnt)), with β11,β21i.i.dN(0,τ12),,β1m,β2mi.i.dN(0,τm2).
The parameter (τ12,,τm2) is known as the spectrum of the model. Then Var(yt)=j=1mVar(β1jcos(2πjnt)+β2jsin(2πjnt))=j=1m[Var(β1j)cos2(2πjnt)+Var(β2j)sin2(2πjnt)]=j=1mτj2.
(τj2 represents the contribution of frequency j/n to the overall Var(yt)).

For β0, average both sides of (2.1), we have β0=y. I.e. yty=j=1m(β1jcos(2πjnt)+β2jsin(2πjnt)),β1j,β2ji.i.dN(0,τj2).

So data generated from this model look differently depending on τ12,,τm2.