Differences in the Formulations of Kalman Filter and Kalman-Bucy Filter
The discrete-time filtering formulation differs from its continuous-time counterpart. The difference is revealed by looking at the observations.
Discrete-time filtering: Kalman filter
Consider a discrete-time dynamical system of the following form,
where and are unobserved states and observations at discrete time , respectively. and are standard Gaussian random noises. , , , and are matrices (assumed to be constant-in-time here for brevity). (1a) is often called the “model” and is the corresponding model error covariance. Similarly, is the observation error covariance.
The discrete-time filtering problem aims for the posterior distribution . The Kalman filter gives the following posterior (analysis) state estimate and covariance for (1), where The prior (forecast) covariance matrix is often obtained by
Continuous-time filtering: Kalman-Bucy filter
Consider a continuous stochastic differential equation of the following form, where and are unobserved states and observations at time , respectively. and are Gaussian white noises. , , , and are constant matrices. is the model error covariance, and is the observation error covariance.
The continuous-time filtering problem aims for the posterior . The posterior mean and covariance for (3) are given by the Kalman-Bucy filter (Bishop & Del Moral, 2023) as where (4b) is known as the Riccati equation.
Kalman Filter and Kalman-Bucy filter are not translatable
The Kalman filter system (1) and Kalman-Bucy filter system (3), although similar in form, are not translatable with each other. The key difference that prohibits the translation is not in the model equations (1a), (3a), but in the observation equations (1b), (3b)1. We will try the translation while focusing on the observation equations to reveal the differences.
When taking in the Kalman filter, one will quickly realize that the observation noise does not have a continuous path, while the observation noise in the Kalman-Bucy filter gives a continuous path for . This already indicates the differences between the two formulations. To look into the details, we write down the time increments of observations based on the observation equation (1b) in the Kalman filter:
Compared to the discretized observation equation (3b) in the Kalman-Bucy filter:
(5) is different in the following ways:
- The observation noise does not go to zero as , unlike in the Kalman-Bucy filter.
- The influences of the observation noises on the dynamics of do not accumulate over time.
The first point results in the discontinuity of . The second point says that previous observation noises are abandoned in time marching, which is inherited from the assumption that the observation noises of the Kalman filter are independent in time, and the observation procedure does not interfere with the underlying dynamics. In contrast, the observation noises in the Kalman-Bucy filter are involved in the dynamics of 2. Therefore, the Kalman-Bucy filter is not the continuous-time version of the Kalman filter by taking the limit of . The Kalman filter is not the discrete-time version of the Kalman-Bucy filter by simply discretizing the continuous equations.
Historically, the observations in the Kalman filter are replaced by the time derivative (or increments ) in the Kalman-Bucy filter (Kalman & Bucy, 1961; Jazwinski, 1970; Bergemann & Reich, 2012). Although this replacement leads to a form of the Kalman-Bucy filter similar to the Kalman filter, it introduces an inconsistency in the observation noises — consider as the “observations,” the observation noise has an infinite magnitude, or consider as the “observations,” the observation noise has a zero magnitude, both of which are unrealistic for the Kalman filter when taking the limit . The Kalman-Bucy filter only looks like the Kalman filter in the integral sense: taking a that is not too small, then integrating (3b) over , we have: The time increments are considered as the “observation” with a noise . By solely looking at , the in the Kalman-Bucy filter is very different from the observations defined in the Kalman filter. This should be noted with caution to avoid potential confusion.
-
(1a) is the discrete formulation of an SDE, which is usually translatable to a time discretization of an SDE.
-
"Measurement error" is a better term for the Kalman-filter-type observation noise, while the Kalman-Bucy-filter-type observation noise is closer to "observation model error" that characterizes the unresolved dynamics of observations.
- Bishop, A. N., & Del Moral, P. (2023). On the mathematical theory of ensemble (linear-Gaussian) Kalman–Bucy filtering. Mathematics of Control, Signals, and Systems, 35(4), 835–903. https://doi.org/10.1007/s00498-023-00357-2
- Kalman, R. E., & Bucy, R. S. (1961). New results in linear filtering and prediction theory. Journal of Fluids Engineering, Transactions of the ASME, 83(1), 95–108. https://doi.org/10.1115/1.3658902
- Jazwinski, A. (1970). Stochastic Processes And Filtering Theory. 64. https://doi.org/10.1016/S0076-5392(09)60368-4
- Bergemann, K., & Reich, S. (2012). An ensemble Kalman-Bucy filter for continuous data assimilation. Meteorologische Zeitschrift, 21(3), 213–219. https://doi.org/10.1127/0941-2948/2012/0307