In modern financial markets, volatility measures the degree of dispersion for assets and plays a crucial role in portfolio allocation, performance evaluation, and risk management. Low-frequency and high-frequency stock data are widely adopted to model the dynamic evolution of daily volatilities, while efforts made for volatility estimation and prediction in the past often employ these two types of data independently. Recent attempts to bridge the gap between these two include the realized GARCH model, the heterogeneous autoregressive (HAR) model, as well as the high-frequency based volatility (HEAVY) model. See Shephard and Sheppard (2010), Hansen, Huang, and Shek (2012), Corsi (2009) for more details.
In addition, Kim and Wang (2016) introduced the unified GARCH-Ito model by embedding the standard GARCH volatility structure in the instantaneous volatilities of an Ito diffusion process. The unified GARCH-Ito model is a continuous-time process at the high-frequency timescale and when restricted to the low-frequency timescale, retains the standard GARCH structure. Moreover, Song et al. (2020) introduced the realized GARCH-Ito model by embedding the realized GARCH model structure in the instantaneous volatilities of a jump-diffusion process. Comparing to the unified GARCH-Ito model, its conditional volatility has integrated volatility and jump variation as innovations, which are high-frequency data-based innovations that are more informative.
The RealizedGARCHIto package aims to provide methods for modeling the high-frequency data with unified GARCH-Ito model and realized GARCH-Ito model. It provides methods to estimate model parameters and allows one to estimate and predict conditional volatilities with the two proposed models. It also includes one sample data set that has low-frequency log returns (return) and realized measures such as realized volatility (RV), bi-power realized volatility (BPV) and jump variation (JV) computed and estimated using the CSI 300 index minute data from 2018-01-01 to 2020-06-30.
Definition The log price Xt, t∈R+ obeys the unified GARCH-Ito model if it satisfies dXt=μtdt+σtdBtσ2t=σ2[t]+(t−[t]){ω+(γ−1)σ2[t]}+β(∫t[t]σsdBs)2 where μt is a drift, [t] denotes the integer part of t and when t itself is an integer, [t]=t−1, Bt is a Brownian motion with respect to a filtration Ft, σ2t is a volatility process adapted to Ft, θ=(ω,β,γ) are the model parameters.
Proposition The conditional integrated volatility over the low-frequency period n retains the following iterative structure E[∫nn−1σ2tdt|Fn−1]=hn(θ)=ωg+γhn−1(θ)+βgZ2n−1 where τ(θ)=(ωg,βg,γ) are model parameters in the above low-frequency structure and are functions of the original θ, Zn−1=Xn−1−Xn−2 is the low-frequency log return. Moreover, E[hn(θ)]=ωg1−βg−γ. The model parameters can be estimated by maximizing the following quasi-maximum likelihood function: ˆLU(θ)=−n∑i=1[log(hi(θ))+RVihi(θ)] where RVi’s are the daily realized volatility estimates. Well-performing realized volatility estimators that can handle the market microstructure noise when frequency of the intraday data is ultra-high include the two-time scale realized volatility estimator, the multi-scale realized volatility estimator, the pre-averaging realized volatility estimator, and the kernel realized volatility estimator. See Ait-Sahalia and Yu (2009), Zhang (2006), Barndorff-Nielsen et al. (2008), Jacod et al. (2009), Christensen, Kinnebrock, and Podolskij (2010) for more details.
To maximize ˆLU(θ), we need to specify h1(θ) and we adopt its unconditional expectation such that h1(θ)=ωg1−βg−γ. We estimate the model parameter θ by maximizing ˆLU(θ) such that ˆθ=argmaxθ∈ΘˆLU(θ) and estimate the model parameter τ(θ) by τ(ˆθ).
require(GARCHIto)
## Loading required package: GARCHIto
data("sample_data")
UnifiedEst(sample_data$BPV, sample_data$return) model_unified=
##
## Iter: 1 fn: -5311.7549 Pars: 0.000002193 0.055580454 0.856415391
## Iter: 2 fn: -5311.7549 Pars: 0.000002193 0.055580454 0.856415391
## solnp--> Completed in 2 iterations
$coefficients # estimated model parameters model_unified
## omega_g beta_g gamma
## 2.192970e-06 5.558045e-02 8.564154e-01
Definition The log price Xt, t∈R+ obeys the realized GARCH-Ito model if it satisfies dXt=μdt+σtdBt+LtdΛtσ2t=σ2[t]+γ(t−[t])2{ω1+σ2[t]}−(t−[t]){ω2+σ2[t]} +α∫t[t]σ2sds+β∫t[t]L2sdΛs+ν([t]+1−t)Z2t where μt is a drift, [t] denotes the integer part of t and when t itself is an integer, [t]=t−1, Zt=∫t[t]dWt, Bt and Wt are standard Brownian motions with respect to filtration Ft with dWtdBt=ρdt, and σ2t is a volatility process adapted to Ft. For the jump part, Λt is the standard Poisson process with constant intensity λ and Lt denotes the i.i.d. jump sizes which are independent of the Poisson and continuous diffusion processes. The i.i.d. assumption on jump sizes can be further rewritten as L2t=ωL+Mt where Mt’s are i.i.d. random variables with mean zero and constant variance.
Proposition The conditional integrated volatility over the low-frequency period n retains the following iterative structure E[∫nn−1σ2tdt|Fn−1]=hn(θ)=ωg+γhn−1(θ)+αg∫n−1n−2σ2sds+βg∫n−1n−2L2tdΛt where τ(θ)=(ωg,αg,βg,γ) are model parameters in the above low-frequency structure and are functions of the original θ. Moreover, E[hn(θ)]=ωg+βgλωL1−αg−γ. The model parameters can be estimated by maximizing the following quasi-maximum likelihood function: ˆLR(θ)=−n∑i=1[log(ˆhi(θ))+RViˆhi(θ)] where ˆh1(θ)=ωg+βgλωL1−αg−γ and ˆhi(θ)=ωg+γhi−1(θ)+αgRVi−1+βgJVi−1,i=2,…,n, here RVi’s are the daily realized volatility estimates and JVi’s are the jump variation estimates. Note: in this pacakge, we approximate λωL in ˆh1(θ) by the median of all JVi’s.
We estimate the model parameter θ by maximizing ˆLR(θ) such that ˆθ=argmaxθ∈ΘˆLR(θ) and estimate the model parameter τ(θ) by τ(ˆθ).
# without the consideration of price jumps
RealizedEst(sample_data$BPV) model_realized_NJ=
##
## Iter: 1 fn: -5340.1082 Pars: 0.000004049 0.554107013 0.376603733
## Iter: 2 fn: -5340.1082 Pars: 0.000004049 0.554107013 0.376603733
## solnp--> Completed in 2 iterations
$coefficients model_realized_NJ
## omega_g alpha_g gamma
## 4.048833e-06 5.541070e-01 3.766037e-01
# with the consideration of price jumps
RealizedEst(sample_data$BPV, sample_data$JV) model_realized=
##
## Iter: 1 fn: -5340.1053 Pars: 0.000004144 0.557140538 0.000919110 0.371861977
## Iter: 2 fn: -5340.1053 Pars: 0.000004144 0.557140538 0.000919110 0.371861977
## solnp--> Completed in 2 iterations
$coefficients model_realized
## omega_g alpha_g beta_g gamma
## 4.143592e-06 5.571405e-01 9.191100e-04 3.718620e-01
plot(model_unified$sigma, cex=0.5, type="o", ylim=c(0,0.00035),
main="estimated conditional volatilities", ylab="", xlab="")
lines(model_realized_NJ$sigma,cex=0.5,type="o",col="blue",lty=2)
lines(model_realized$sigma,cex=0.5,type="o",col="red",lty=3, lwd=0.5)
legend("topleft", cex=0.8,
legend=c("Unified GARCH-Ito", "Realized GARCH-Ito No Jump","Realized GARCH-Ito with Jump"),
col = c("black", "blue", "red"),
lty=c(1,2,3))
The dynamic structure imposed in the unified GARCH-Ito and realized GARCH-Ito model allow us to predict future volatility by estimating the expected conditional integrated volatility, i.e. E[hn+1(θ)|Fn], with ˆhn+1(ˆθ).
To obtain one-step-ahead estimated value of the conditional volatility, use $pred.
c(model_unified$pred, model_realized_NJ$pred, model_realized$pred)
## [1] 3.537236e-05 2.841266e-05 2.843882e-05
To carry out rolling forecast with expanding window, we update the sample size for model construction at each rolling. To evaluate the model performance in volatility forecasting task, we compare ˆhn+1(ˆθ) with RVn+1 and compute the squared prediction error (ˆhn+1(ˆθ)−RVn+1)2.
# conduct out of sample volatility forecasting and compute the mean squared prediction error
NULL
error_unified=NULL
error_realized_NJ=NULL
error_realized=for (i in 560:603){
sink("file")
UnifiedEst(sample_data$BPV[1:i], sample_data$return[1:i])
model1=c(error_unified, (model1$pred-sample_data$BPV[i+1])^2)
error_unified=RealizedEst(sample_data$BPV[1:i])
model2=c(error_realized_NJ, (model2$pred-sample_data$BPV[i+1])^2)
error_realized_NJ=RealizedEst(sample_data$BPV[1:i], sample_data$JV[1:i])
model3=c(error_realized, (model3$pred-sample_data$BPV[i+1])^2)
error_realized=sink()
}
c(mean(error_unified), mean(error_realized_NJ), mean(error_realized))
error=names(error)=c("Unified GARCH-Ito", "Realized GARCH-Ito No Jump", "Realized GARCH-Ito with Jump")
error
## Unified GARCH-Ito Realized GARCH-Ito No Jump
## 4.439021e-10 2.760417e-10
## Realized GARCH-Ito with Jump
## 2.752327e-10
Besides high- and low-frequency stock data, option data provide one more natural source for the more precise forecast of volatilities and have been investigated thoroughly since the seminal work of Black and Scholes (1973). Thus, Song et al. (2020) also discussed how to incorporate additional option data information in parameter estimation. Let NVi’s be the estimated volatility values using option data, assume that NVi and the conditional integrated volatility hi(θ) have the following linear relationship: NVi=b+ahi(θ)+ei,i=1,…,n where b and a are the intercept and slope coefficients, respectively. Moreover, ei’s are martingale differences with mean zero and variance σ2e, and they are independent of the price process and the microstructure component.
Let ϕ=(ωg,αg,βg,γ,a,b,σ2e), the model parameters can be estimated by maximizing the following quasi-likelihood function, ˆLO(ϕ)=−n∑i=1[log(ˆhi(θ)+RViˆhi(θ))]−n∑i=1[log(σ2e)+(NVi−b−aˆhi(θ))2σ2e]. ˆϕ=argmaxϕ∈ΦˆLO(ϕ).
The homogeneous variance in the linear model can be generalized to heterogeneous variance such as replacing σ2e by σ2ehζi(θ), where ζ>0 is to adjust the level of heteroscedasticity with ζ=0 corresponding to the homogeneous case. One may replace σ2e by σ2eˆhζi(θ) in the quasi-likelihood function ˆLO(ϕ) and then estimate ζ jointly with the other parameters.
# without the consideration of price jumps
RealizedEst_Option(RV, NV) # homogeneous error
RealizedEst_Option(RV, NV, homogeneous=FALSE ) # heterogeneous error
# with the consideration of price jumps
RealizedEst_Option(RV, JV, NV) # homogeneous error
RealizedEst_Option(RV, JV, NV, homogeneous=FALSE) # heterogeneous error
Ait-Sahalia, Yacine, and Jialin Yu. 2009. “High Frequency Market Microstructure Noise Estimates and Liquidity Measures.” Annals of Applied Statistics 3 (1): 422–57.
Barndorff-Nielsen, Ole E, Peter Reinhard Hansen, Asger Lunde, and Neil Shephard. 2008. “Designing Realized Kernels to Measure the Ex Post Variation of Equity Prices in the Presence of Noise.” Econometrica 76 (6): 1481–1536.
Black, Fischer, and Myron Scholes. 1973. “The Pricing of Options and Corporate Liabilities.” Journal of Political Economy 81 (3): 637–54.
Christensen, Kim, Silja Kinnebrock, and Mark Podolskij. 2010. “Pre-Averaging Estimators of the Ex-Post Covariance Matrix in Noisy Diffusion Models with Non-Synchronous Data.” Journal of Econometrics 159 (1): 116–33.
Corsi, Fulvio. 2009. “A Simple Approximate Long-Memory Model of Realized Volatility.” Journal of Financial Econometrics 7 (2): 174–96.
Hansen, Peter Reinhard, Zhuo Huang, and Howard Howan Shek. 2012. “Realized GARCH: A Joint Model for Returns and Realized Measures of Volatility.” Journal of Applied Econometrics 27 (6): 877–906.
Jacod, Jean, Yingying Li, Per A Mykland, Mark Podolskij, and Mathias Vetter. 2009. “Microstructure Noise in the Continuous Case: The Pre-Averaging Approach.” Stochastic Processes and Their Applications 119 (7): 2249–76.
Kim, Donggyu, and Yazhen Wang. 2016. “Unified Discrete-Time and Continuous-Time Models and Statistical Inferences for Merged Low-Frequency and High-Frequency Financial Data.” Journal of Econometrics 194: 220–30.
Shephard, Neil, and Kevin Sheppard. 2010. “Realising the Future: Forecasting with High-Frequency-Based Volatility (Heavy) Models.” Journal of Applied Econometrics 25 (2): 197–231.
Song, Xinyu, Donggyu Kim, Huiling Yuan, Xiangyu Cui, Zhiping Lu, Yong Zhou, and Yazhen Wang. 2020. “Volatility Analysis with Realized GARCH-Itô Models.” Journal of Econometrics in press.
Zhang, Lan. 2006. “Efficient Estimation of Stochastic Volatility Using Noisy Observations: A Multi-Scale Approach.” Bernoulli 12 (6): 1019–43.