Towards improved objective analysis of lake surface water temperature in a NWP model: preliminary assessment of statistical properties

Information about the statistical structure of the lake surface water temperature (LSWT) field is needed for assimilation of lake observations into Numerical Weather Prediction (NWP) models, to describe the lake surface state at each grid-point containing lakes. In this study, we obtain the autocorrelation function for LSWT from two types of observations, in situ and satellite-based. We use summer time measurements during 2010–2014 over selected Fennoscandian lakes and Northern European domain. The estimated autocorrelations decrease exponentially (from 0.99 to 0.73 for in situ and from 0.97 to 0.61 for satellite observations), when the distance between observations increases from zero to one thousand kilometres .A large difference in lake depth leads to a decrease of the correlation. Typical error standard deviation of LSWT observations was found to be 0.9 C for in situ observations and 1.2C for satellite observations. The exponential approximation for the LSWT autocorrelation functions is proposed, which depends on both the distance and the difference in lake depth. These results are directly applicable for the LSWT objective analysis in NWP. New autocorrelation functions, which allow interpolation of observations within and between lakes, were used in numerical experiments with the High-Resolution Limited Area Model (HIRLAM). In this preliminary assessment, we suggest adaptation of the presently used functions by increasing the influence radius and taking into account the lake depth difference. Generalization of the results to cover the melting and freezing seasons, their assessment for different geographical areas as well as their application to other prognostic lake variables within NWP are foreseen.


Introduction
The importance of energy exchanges between the earth's surface and the atmosphere is well recognized in weather forecasting. The surface heat, moisture and momentum fluxes depend not only on atmospheric conditions but also on the properties of the land cover, which in lake-rich areas are largely determined by inland water bodies. The importance of a correct description of inland water (lake) surface state is well known in climate modelling (Duguay et al., 2006;Brown and Duguay, 2010;Krinner and Boike, 2010;Samuelsson et al., 2010;Ngai et al., 2013) and in numerical weather prediction (Niziol, 1987;Niziol et al., 1995;Zhao et al., 2012;Eerola et al., 2014). For example, during freezing and melting the lake surface radiative and conductive * Corresponding author. e-mail: h2kheyro@uwaterloo.ca properties as well as the latent and sensible heat released from lakes to the atmosphere change dramatically, leading to a completely different surface energy balance. By affecting the surface fluxes, lakes modify the structure of the atmospheric boundary layer.
Lake surface water temperature (LSWT) is directly related to the heat fluxes, thus it is a critical variable to measure, assimilate and predict in NWP. The quality of observation-based lake surface state description depends crucially on the availability and selection of observations. Obtaining reliable observations on lakes in real-time, especially at high latitudes, is challenging. Satellite-based data offer the only realistic way of obtaining frequent observations with a sufficient spatial resolution over large areas. Various types of satellite observations collected over lakes represent different scales and have different accuracies, depending on the observing system. Measurements based on optical wavelength signals are limited by clouds and thus irregularly distributed in space and time. To assimilate remotely sensed observations and available in situ observations into a NWP model, knowledge of the statistical properties of the observed LSWT as well as error characteristics of observations and a model background are needed.
A correct description of the lake surface state is relevant for NWP models with high horizontal resolution, enough to resolve even small lakes. We use High-Resolution Limited Area Model HIRLAM (Unden et al., 2002;Eerola, 2013) as a framework of our study. This NWP system has been applied since 1990 for the numerical short-range weather forecast over the Northern Europe. Presently, it includes the prognostic lake temperature parameterization and objective analysis of observed LSWT.
In HIRLAM, the monthly climatological water surface temperature for both sea and lakes was used in early 1990s. The LSWT climatology was estimated by extrapolating the Sea Surface Temperature (SST) climatology to lakes, as no suitable data for lakes were available. Both SST and LSWT were kept constant during the forecast. The first step to improve the SST analysis was to apply the European Centre for Medium Range Weather Forecast (ECMWF) SST analysis fields (Chelton and Wentz, 2005) in the form of pseudo observations, with the method of successive corrections (Cressman, 1959) and later with Optimal Interpolation (OI) (Gandin, 1965). The previous analysis was used as a background for the following analysis and the background was relaxed towards climatology, to avoid drifting far from reality. LSWT climatology over 20 Finnish lakes was obtained based on the measurements of the Finnish Environment Institute (Suomen Ympäristökeskus = SYKE) and applied in the analysis, also in the form of pseudo observations (Eerola, 1995). The fractional ice cover over the sea and lakes was provided in consistency with SST and LSWT. The freshwater lake model (FLake, Mironov, 2008;Mironov et al., 2010) was implemented in the HIRLAM forecasting system in 2012 (Kourzeneva et al., 2008;Eerola et al., 2010). The model uses external data-sets on the lake depth (Kourzeneva et al., 2012a) and the lake climatology to initialize the prognostic variables of FLake at the very first model start (Kourzeneva et al., 2012b). At the same time, real-time in situ LSWT observations by SYKE for 27 Finnish lakes became available for the operational HIRLAM analysis (Eerola et al., 2010;Rontu et al., 2012). The OI method with the same assumed statistical properties for SST and LSWT fields was applied to these observations. Currently, FLake provides the background for the OI analysis of LSWT in the operational HIRLAM. However, the prognostic FLake variables are not updated using the analysed LSWT since this part of the data assimilation algorithm still remains to be implemented (according to the suggestion by Kourzeneva (2014)). 1 Developments related to the analysis of LSWT took place also at U.K. Met. Office within the Operational Sea Surface Temperature and Ice Analysis (OSTIA) system (Donlon et al., 2012;Fiedler et al., 2014). The OSTIA system produces surface water temperature analysis fields for NWP purposes. For the background, OSTIA uses the previous analysis field and treats remote-sensing and in situ observations of the surface water temperature by the OI method. The LSWT observations used in OSTIA are part of Sea Surface Temperature (SST) products from AATSR and MetOp-AVHRR (Infrared Atmospheric Sounding Interferometer, IASI). This product is based on the method of SST retrievals and does not include lake-specific processing. In OSTIA, the LSWT analysis is produced only for the large lakes (ca. 248 globally), without cross-lake interpolation. The background LSWT is backed up by lake climatology (Fiedler et al., 2014). In the OI method of OSTIA, the same statistical properties are used both for SST and LSWT.
Statistical properties of fields and estimates of the background and observational errors are essential in the OI method. The OI (also called statistical interpolation) method was introduced by Eliassen (1954) and Gandin (1965) and was applied for the upper air analysis in operational NWP systems (e.g. Lorenc, 1981;Hollingsworth and Lönnberg, 1986;Lönnberg and Hollingsworth, 1986;Daley, 1991). OI is one of the methods of objective analysis, which provide initial conditions to NWP. Such methods were developed for atmospheric models by several groups of meteorologists (Panofsky, 1949;Gilchrist and Cressman, 1954;Bergthórsson and Döös, 1955;Cressman, 1959). In general, objective analysis methods are based on the idea of minimizing errors of the analysis. The OI is still applied for the analysis of the near-surface variables such as surface water temperature, screen-level temperature and relative humidity and snow water equivalent (Thiebaux, 1975;Julian and Thiebaux, 1975;Sattler and Huang, 2002;Donlon et al., 2012). An advantage of OI is that it can be used also for the quality control of observations. The OI method accounts for the statistical properties of the analysed field via the autocorrelation function. The autocorrelation function is an internal property of the field itself, it can be derived from it by statistical methods. Usually the OI method applies analytical approximations of the autocorrelation function, with the dependency on distance (between points). Often (and in HIRLAM as well), an exponential representation is used. The influence radius in the exponential representation sometimes becomes a tuning value, dependent on the density of observations, although it should be connected with the real statistical properties of the fields. For the LSWT fields, the autocorrelation function has never been studied and no approximation proposed.
Thus, for historical reasons, currently in the operational analysis of LSWT the autocorrelation function borrowed from the SST analysis is used. However, there is no reason why the statistical properties of LSWT and SST should be generally similar. LSWT of different lakes may correlate due to the similar atmospheric conditions and air-water interactions; however, the evolution of LSWT is also dependent on lake morphology, especially on the lake depth, because of vertical mixing (Walsh et al., 1998;Toffolon et al., 2014). It is natural to expect that the correlation coefficient of LSWT for lakes with the same depth would be larger than for the case when the lake depth differ much. Moreover, the autocorrelation function of the LSWT field may be time-dependent. Therefore, it is of high interest to study the statistical properties of the LSWT field, to obtain the autocorrelation function and to find an analytical approximation for it. We assume the dependency of the autocorrelation function not only on horizontal distance, but also on difference in the lake depth. In this study, we concentrate on the summer period. It is also of interest to study the impact of the new autocorrelation function formulation for LSWT analysis in NWP.
This approach is in line with studies of coherence among lakes, where correlations of different properties between lakes are investigated (e.g. Magnuson et al., 1990). Compared with these studies, the NWP objective analysis is simpler as only LSWT is considered. However, a NWP system assumes applying a physically-based prognostic lake parameterization, which accounts for many important thermodynamic processes in lakes. Correction of the temperature profiles inside lakes, for example changes in the mixing regime, falls out of the scope of the present study. It could be treated by the Extended Kalman Filter algorithm described by Kourzeneva (2014).
Our aim is to study the LSWT autocorrelation function as an internal property of the LSWT field and to obtain its improved formulation for use in the objective analysis of NWP models. For this, we calculated observation statistics depending on the distance between the observation points as well as on the lake depth differences. For the statistics, we used two data-sets with two types of LSWT observations: local and satellite-based. Local observations are provided by SYKE for different lakes in Finland. Satellite-based observations consist of the moderate-resolution imaging spectroradiometer (MODIS) LSWT data over Fennoscandia and Northwestern Russia. We estimate also the observation error for these two types of measurements. To examine the sensitivity of different analytical formulations of the empirical autocorrelation functions in HIRLAM objective analysis of LSWT, we compare its results to independent observations. The present study builds on the work reported by Kheyrollah Pour et al. (2014b) on the use of satellite measurements for LSWT data assimilation in HIRLAM and on the influence of the lake surface state in weather forecast by Eerola et al. (2014). To our knowledge, this is the first study of its kind which could lead to an improved definition of the LSWT autocorrelation function and thus to a better objective analysis of LSWT in NWP models.
The study is organized as follows. Satellite-based and in situ observations and general statistics are described in Section 2 The algorithm to obtain the LSWT autocorrelation function and the results follows in Section 3, and the theoretical explanations are provided in Appendix 1. The sensitivity of the HIRLAM LSWT analysis to the formulation of the autocorrelation function, which was tested in numerical experiments, is presented in Section 4 Section 5 contains the summary and outlook.

Satellite-based observations
The LSWT observations were derived from the archives of NASA Land Processes Distributed Active Archive Center (LP DAAC) for the thermal remote sensing sensor MODIS, Land Surface Temperature and Emissivity (MOD/MYD11-L2, collection 5, 1 km). This data-set was developed from both Terra and Aqua satellites (Kheyrollah Pour et al., 2012, 2014a. Both day-time and night-time LSWT observations from Terra and Aqua satellites and estimated errors for each observation were retrieved. Seventy-one MODIS pixels were extracted representing 44 Fennoscandian lakes of a size larger than 6 km 2 and of various depths (Fig. 1). Lakes of dissimilar sizes and shapes over Northern Europe were chosen. The MODIS LSWT observations are represented on a grid with a resolution of 1 km× 1 km. For most lakes, one pixel was selected to represent the observation points; however, for large lakes, several pixels were chosen (e.g. 9 pixels for Lake Onega and 15 pixels for Lake Ladoga).
For this study, three months of daily averages were calculated from all available observations at the selected locations for five summers (June-July-August, JJA) of 2010-2014. We applied moving averages in a window of ±24h to remove suspicious observations which deviated for more than ±3 degrees from the average. All observations indicating a temperature below the freezing point, which could appear in MODIS data during JJA either due to undetected clouds or the presence of ice cover, were removed, since only water temperature was considered. The maximum possible number of daily average temperature values from both Terra and Aqua satellites was 32660 (71 lakes x 5 years x 92 days x 1 daily average per day). Due to cloudiness, the number of available observations was reduced to 20694 (63.4% of the possible maximum).

SYKE lake water temperature observations
Regular in situ lake water temperature measurements are collected by the Finnish Environment Institute (SYKE). SYKE operates 32 regular lake and river water temperature measurement sites in Finland. The temperature of the lake water is measured every morning at 8.00 local time, close to shore, at 20 cm below the water surface. The measurements are either recorded automatically (13 stations) or manually and are performed only during the ice-free season (Rontu et al., 2012). With respect to the diurnal cycle, SYKE temperatures represent most closely the daily minimum surface water temperature because in the morning the solar heating has a small impact.
In this study, we used measurements from 27 lakes (Fig. 1). The time period was the same as for MODIS data, namely five summers (JJA) of 2010-2014. From the maximum amount of Fig. 1. Location of the selected MODIS pixels (the main fugure) and SYKE measurement points (in the frame) over the lakes used to calculate the autocorrelation function and in HIRLAM data assimilation experiments over the Northern Europe. Locations of three passive observations, which are central in the discussion in Section 4 are shown in red. The list of all passive observations is presented in Table 3. 12400 observations (27 lakes x 5 years x 92 days x 1 measurement per day), 12227 (98.6%) were available. No preprocessing was applied to the SYKE data.

Comparison and statistics of MODIS and SYKE data
MODIS observations cover 24 SYKE lakes out of 27 with two additional Finnish northern lakes that are not included in the SYKE data-set (Orajärvi and Lokka). The main differences between the SYKE and MODIS data-sets used in this study are: (i) MODIS observations represent the temperature of the thin uppermost layer of water (skin temperature). SYKE measurements represent the water temperature at a depth of 20 cm below the surface. (ii) SYKE measurements are produced once a day in the morning (at 8:00 local time), whereas MODIS data represent daily averages of the observations made during the satellite overpasses. The Terra satellite platform orbits from north to south (descending node) passing over Finland between 10:00-11:00 and 20:00-22:00, and the Aqua travels from south to north (ascending node), flying over Finland between 1:00-3:00 and 11:00-13:00 (local time).
(iii) SYKE measurements are taken from the lake shore, while MODIS pixels were chosen close to the location of SYKE measurements but far enough to avoid land contamination. (iv) SYKE measurements were available at all locations practically every day. MODIS observations were available only in clear-sky conditions during the satellites'overpass times. (v) SYKE measurements cover the area of Finland only, with a maximum distance between the lakes of 1100 km and maximum depth difference of ca. 20 m. MODIS observations were selected over a larger Scandinavian domain, Baltic countries and North-Western Russia, and covered 18 additional lakes in comparison to SYKE, with maximum distance of 1600 km and maximum depth difference of ca. 100 m (Fig. 2). The distribution of lake pairs according to distance both for SYKE and MODIS is quite uniform, although 12-20% of points are located within a distance of 100-200 km. The distribution in depth difference is not uniform: the majority of lake pairs (40-60%) has depth differences of 0-10 m. (Information about the lake depth is described in Section 2.4) Both for the distance and difference in depth, MODIS data are more uniformly distributed than SYKE. The statistics from the two LSWT data-sets are calculated using data for 5 years, combining different days of different years into one sample. It is important to ensure that the sample for each data-set is homogeneous and not much influenced by the seasonal cycle as inhomogeneity might cause large errors in calculation of the autocorrelation function (see explanation in Gandin, 1965). We assumed that if the empirical distribution of the variable (in our case, LSWT) is close to normal, a sample is homogeneous (not much affected by the seasonal cycle). The general statistics of the two LSWT data-sets are given in Fig. 3 and Table 1.
According to Table 1 and Fig. 3, the empirical distribution of data is close to normal for both MODIS and SYKE datasets. For both data-sets, the mean value and median are close to each other, the skewness is -0.55 and -0.52 for MODIS and SYKE, respectively. In addition, the normal probability plot (Fig. 4) shows the validity of the distribution assumption. On   strongly affected by the annual cycle. The variance of MODIS data is larger than for SYKE. This reflects the fact that MODIS observations cover a larger area, a larger range of differences in depth and probably contain a larger observation error. Another possible reason is that the MODIS observations represent highly variable radiative (skin) temperature at the very surface of water, while SYKE observations represent a 20 cm deep water layer where the temperature changes are smaller.

Lake depth data
Lake depth values for the current study were obtained from the Global Lake Data Base (Choulga et al., 2014;Kourzeneva et al., 2012a), which has a resolution of 30 arc seconds. It contains the mean lake depth, and for several large lakes it provides the bathymetry. In our study area, the bathymetry was available for Lake Ladoga, Lake Onega, Lake Vänern, Lake Vättern and Lake Peipsi. In HIRLAM experiments (Section 4), the fraction of lake in each grid-square is based on the Global Land Cover Characteristics data-set (GLCC, Loveland et al., 2000).

Optimal interpolation
In meteorology, the task of data assimilation is to provide the best possible initial value of a progonstic variable at each grid point by using all available information (objective analysis). Thus, the information contains both observations (provided by MODIS and SYKE here) and a model state (from the previous analysisforecast cycle) as a background field. The background field is updated using new observations. The analysed value remains the same as in the background if no observations are available. The objective analysis by OI uses weighting factors based on statistical properties of the analysed field. The error of each observation type as well as the background error are taken into account. In the basic (univariate) set-up of OI, the weight of a certain observation depends on the distance between the observation and the grid point and the distance between this and the other observations. The method was introduced into the field of meteorology by Gandin (1965) and has been applied to treat continuous atmospheric variables such as temperature, wind and pressure. Strictly speaking, the LSWT field is discontinous because it is defined only over lakes, not over the surrounding land. However, this type of discontinuity can be taken into account by masking land. In addition, the discontinuity problem is alleviated when the LSWT field is represented on the NWP model grid so that each grid box contains a fraction (from 0 to 1) of lake water.
Following Gandin (1965), we calculate the LSWT deviations from the norm and consider this field to be isotropic and homogeneous. In OI, information about the statistical structure of the field of the considered variable is incorporated into the procedure through the use of autocorrelation functions. In this study, we are concerned with the statistical structure of the LSWT field represented by the autocorrelation function. Due to the assumption that the field is homogeneous and isotropic, we may consider the autocorrelation function of LWST to be continuous and defined within and across lakes. During the objective analysis, the OI method gives the state of LSWT at a particular time at all model grid points where the lake fraction > 0. The basic algorithm of OI in its application to the LSWT analysis is described in Appendix 1, also information about the autocorrelation and structure functions is included.

Obtaining the autocorrelation function from observations
Determination of the autocorrelation function for LSWT with dependency on the horizontal distance and the depth difference between lakes requires a reliable and homogeneous observational network.
In this study, the LSWT MODIS and SYKE observations (see Section 2) were used to calculate the autocorrelations. In both cases, the same procedure was applied as suggested by Gandin (1965). First, for each observation-point, the mean LSWT values (the norms) together with deviations from these norms (Equation (A5)), were calculated. Then, all observation points were organized in pairs: a point in question with all other points. If the number of observation points equals to P, the number of pairs equals to (P 2 − P)/2. For all pairs, the distances between point locations were calculated, and distance categories were defined (0 to 100 km, 100 to 200 km, etc., till the maximum distance of 1600 km). Similarly, the differences in depth for all pairs and the related categories (0 to 5 m, 5 to 10 m or 0 to 10 m, 10 to 20 m, etc.) were defined. The structure (Equation (A4)) and autocorrelation (Equation (A6)) function values for each category were calculated. For that, the averaging was performed through all pairs and all observations (in time) within each category. Pairs with a missing observation were excluded at the times concerned. The results were plotted as functions of the distance and depth differences.
An estimate of the observation error variance σ 2 for each data-set was obtained by extrapolating the structure function (Equation (A4)) towards the zero distance (to y-axis), and estimating the corresponding value on the y-axis. This gives the double observation error variance, 2σ 2 = b(0) (Chapter 2 of Gandin, 1965). The total variance of the LWST observations within each category was calculated and the normalized autocorrelation function μ (Equation (A9)) was obtained using this total variance minus σ 2 in denominator of Equation (A9) instead of f 2 . This allows to take into account the influence of the observation errors on the normalized autocorrelation function. The same method of calculation was applied independently for both the distance and depth difference categories but the estimate of the observation error variance was based only on the distance.

Estimation of the autocorrelation function based on SYKE and MODIS observations
First, structure functions and autocorrelation functions of LSWT depending only on distance were obtained from SYKE and MODIS datasets. They are shown in Fig. 5. The MODIS-based structure and autocorrelation functions in comparison with SYKE-based are less smooth and show much larger values. This is due to the larger variability of MODIS data as MODIS observations cover the whole Fennoscandia, whereas SYKE observations cover only Finnish lakes and probably due to the larger observational error of MODIS data. The SYKE-based structure function increases monotonically, until the distance of approximately 800 km, and has a slight maximum after that. It may be interpreted to reach saturation at this distance (for distances larger than 1100 km, we can only extrapolate its behaviour). The corresponding autocorrelation function decreases monotonically. The MODIS-based structure function also increases with distance, but only in general. Still, it may be interpreted to reach saturation at the distance of 600-800 km. The corresponding autocorrelation function decreases in general.
Observation error for both SYKE and MODIS observations was obtained from the corresponding structure functions by the method described in Section 3.2 According to our estimates, the observation error for SYKE data is 0.9 • C, and for MODIS data is 1.2 • C. Note that the further results are sensitive to these somewhat subjective error estimates.
The normalized autocorrelation functions were calculated from both data-sets taking into account the total variance and observation error variance as explained in Section 3.2 The results obtained for the distance dependency are shown in the lower panel of Fig. 5 and in Table 2. In Table 2, the normalized autocorrelation function values μ(ρ) depending on distance ρ, together with the number of available observations (observation pairs at all times) are presented. The SYKE-based values decrease more monotonically but slower than the MODIS-based. The normalized autocorrelation functions were approximated using an exponential function (see Section A.3). An iterational algorithm for least-squares estimation of nonlinear parameters (Marquardt, 1963) was applied for fitting. In general, the best fit was obtained using the influence radius of approximately L H = 1010 km for the SYKE-based function and L H = 1000 km for the MODIS-based. However, for the exponential approximation with the best general fit, large errors appear at the short distances and small errors at the large distances. Thus, the tail of the function is described better (not shown).
One way to achieve a better approximation for the short distances is to choose another function (not exponential). Another way is to keep the exponential function, but to concentrate on the small and medium distances, considering the large distances as less important. This means cutting the distances and recalculating the approximation. Based on preliminary testing, it was decided to cut the distances to 900 km for the SYKE data-set and 800 km for the MODIS data-set (with these distances, the SYKE and MODIS data-sets still contain 89 and 76% of available observation pairs). In this case, the best fit for the normalized autocorrelation function was obtained with the horizontal influence radius of L H = 1050 km for the SYKE-based function, and of 630 km for the MODIS-based. These functions are depicted in Fig. 5, lower panel.
The same procedure was applied to obtain a dependency of the normalized autocorrelation function from both distance and difference in depth. Results are displayed in Fig. 6. To keep the exponential approximation (see Equation (A11)) and provide the best fit for the central part of the plot, for the SYKE-based function the tail parts were cut at the distance of 900 km (see Fig. 6a). For this exponential function, the influence radius is L H = 1100 km for the distance and L V = 20 m for the difference in depth. For the MODIS-based function, the tail parts were cut off at the distance of 800 km and at the difference in depth of 40 m (see Fig. 6b). The influence radiuses for the appropriate exponential approximation are L H = 740 km and L V = 50 m.
For comparison, the MODIS-based normalized autocorrelation function, including also the tails, is shown in Fig. 6c. In this case, the influence radiuses in the exponential approximation for the distance and difference in depth are L H = 1100 km and L V = 140 m. However, in the central part of the plot the approximation errors are very large and the fit is quite poor. The estimated parameter values, especially the depth scale, seem to lose their physical relevance, falling outside of the input data range. This is a possible situation, discussed already by Gandin (1965), when the observational data are too inhomogeneous (e.g. in different parts of the domain, or with respect to other characteristics). Table 2 shows also the confidence interval and confidence index C i of the calculated autocorrelation (see Section A.4 for the definitions) in each distance category of SYKE and MODIS observations. The confidence index values are everywhere >> 3, showing that the autocorrelation function values are reliable and statistically significant at the chosen confidence level (0.05). The confidence interval increases and the confidence index decreases with increasing distance because of the decreasing amount of lake pairs available for calculation. Also the two-dimensional autocorrelation function values are reliable (not shown).

The set-up of the HIRLAM experiments
The HIRLAM NWP system, version 7.4, (www.hirlam.org) was used to study the sensitivity of the LSWT analysis to the modifications in the autocorrelation function. We limit our study on validating the objective analysis against independent observations. Hence, we run only short (+6h) HIRLAM forecasts to provide background for the next analysis-forecast cycle. For the meteorological impact of the initial LSWT analysis on the forecast we refer to the detailed discussion in a case study over the freezing Lake Ladoga by Eerola et al. (2014).
The model domain, resolution and basic set-up of the system in our experiments were the same as in Kheyrollah Pour et al. (2014b). The surface data assimilation methods and parametrisations of atmospheric and surface processes relevant for this study were described in Eerola et al. (2014). In our experiments, only the surface data assimilation was applied, while the upper air analysis was replaced by the interpolated fields from the ECMWF analyses. It is important that the lake model FLake, which is fully integrated in HIRLAM and provides the background for the LSWT analysis (Rontu et al., 2012), was not used in our experiments. Instead, the previous six-hour LSWT analysis was used as the background for the next analysis. This made it possible to see the influence of observations on the analysis (and forecast) more clearly, since FLake may dominate the result via the background LSWT. However, relaxation to the climatology was necessary in order to prevent the drift of the analysis with time in case when no influencing observations are available. 2 In our experiments, the observation error standard deviation in the LSWT analysis was kept at 1.5 • C, based on earlier work with MODIS observations by Kheyrollah Pour et al. (2014b) and results of this study (Section 3.3). The background error standard deviation of 1.0 • C was retained. Data quality control rejected the LSWT values which deviated from the background for more than ca. 5 • C (see Equation (3) in Kheyrollah Pour et al. 2014b). Preliminary tests showed that this tolerance value worked satisfactorily for our dataset. The preliminary tests also revealed the importance of strict separation between the lake and ocean observations, so that ocean observations would not affect  the analysis over lakes. Note that most of the lakes in our study were small and shallow when compared to the neighbouring ocean. Both SYKE (daily) and MODIS (max. 4 overpasses per day) observations, shown in the map of Fig. 1, were used in our experiments. Timing of MODIS observations is not the same as the times of HIRLAM analysis. Thus all available MODIS observations within a ±2-hour window from the analysis time were used. For an independent validation, 10 observations over 8 lakes (8 MODIS and 2 SYKE, Table 3) were treated as passive (not affecting the analysis).
Four HIRLAM experiments (LH80LVNO, LH800LVNO, LH80LV20 and LH800LV20) were defined (see Table 4). In the first two experiments LH80LVNO and LH800LVNO, the autocorrelation function depended only on the distance (Equation (A10)), with length scales of L H = 80 km and L H = 800 km, respectively. The value of L H = 80 km is historically used in the reference HIRLAM system both for the SST and LSWT analysis. This is a tuning value in the SST analysis, to give a larger weight to the nearby than the (possibly abundant) distant observations. The same value is currently applied for lakes but without any justification. The ten-fold value of L H = 800 km was chosen to correspond to the suggestions of this study (Section 3.3). In the last two experiments LH80LV20 and LH800LV20, the depth difference between observations or an observation and a grid point was also taken into account using the depth scale of L V = 20 m (Equation (A11)).
The length of all four experiments was four months, from 1 May to 31 August 2011. The first background values for 1 May were provided by earlier experiments, where FLake was used (Kheyrollah Pour et al., 2014b). We focus on the period when most of the lakes were free of ice and discuss the results between 10 May and 31 August 2011.

Results of the sensitivity experiments
Three examples of the response of the analysed LSWT values for changing autocorrelation function are discussed. In all cases, the independent (passive) observations were compared with the objective analysis. The passive observations pass through the quality control but do not affect the analysis. Locations of passive observations, on which we focus here, are shown in red in Fig. 1.
The first example is from Lake Valday, located in Russia (58.0 • N 33.3 • E). Its mean depth is 14 m and surface area 97.2 km 2 . Here, independent MODIS observations were used for validation. In the experiments LH80LVNO and LH80LV20, with the 80 km length scale (see Fig. 7a and c), the analysis was only marginally influenced by the observations because the lake is located far from other observation sites. Therefore, the analysed LSWT was almost totally controlled by the background, which was relaxed towards climatology. In the experiments LH800LVNO and LH800LV20, when using the length scale of 800 km (see Fig. 7b and d), the analysis followed more closely the observations, because now even distant observations influenced the analysis. In spring (May and early June), the results showed improvements also in experiment LH800LV20 compared to LH800LVNO, when the depth difference was taken into account in addition to the distance ( Fig. 7b and d). This is because less weight was given to the observations representing deep lake conditions, here especially the deep parts of Lake Ladoga which warm up slowly in spring.
The second example comes from Lake Rehja-Nuasjärvi, situated in the North-Eastern part of Finland (64.2 • N 28.0 • E, Fig. 1). This is a shallow medium-sized lake (96.4 km 2 , the mean depth is ca. 10 m). At this location, both SYKE and MODIS observations were available and treated as passive. We compare the analysis against SYKE observations because they are very consistent in time.
There were many LSWT observations within a radius of less than 300 km around the lake. The experiments LH80LVNO and LH80LV20 with the 80 km horizontal length scale ( Fig. 8a and 8c) gave a better fit to the independent observations compared to experiments LH800LVNO and LH800LV20 with the 800 km horizontal length scale ( Fig. 8b and d), especially in spring. In this case of the large length scale, the analysis corresponded well to the observations after midsummer, while in spring and early summer it was for several degrees too cold, due to the colder observations from North and East. The depth difference did not play a significant role here, as Lake Rehja-Nuasjärvi has approximately the same mean depth as the neighbouring lakes (around 10 metres). Thus accounting for the lake depth difference did not improve the situation.
The third example is a MODIS observation point in the central part of Lake Ladoga (Fig. 1). Here, the independent MODIS observations from this pixel were used for validation. The depth at this point was 39 m, and the nearest observation point represented even slightly deeper conditions (40 -50 m). Over Lake Ladoga, a dense network of 15 MODIS pixels provided the LSWT measurements, available in clear-sky conditions. Therefore, the experiments LH80LVNO and LH80LV20 with the 80 km length scale produced a reasonable analysis (Fig. 9a and c). In this case, including the depth difference in the autocorrelation function did not change the result significantly because the neighbouring observations represented similar depths.
However, an interesting feature was the following. In experiment LH800LVNO using a 800 km length scale and ignoring the depth difference, the analysis led to too warm LSWT in late May and early June (Fig. 9b). This was because in this experiment, the measurements from the shallow and warm lakes in Finland affected the analysis of the cool and slowly warming Lake Ladoga. Around the 10th of June, when the positive analysis error was the largest, very warm temperatures were observed in the Finnish lakes, see for instance the observations at Lake Rehja-Nuasjärvi (Fig. 8). Taking into account the depth  difference in the experiment LH800LV20 improved the analysis by several degrees (Fig. 9d)  (ii) Accounting for the depth difference seemed to be useful in cases where there were lakes of very different depths close to each other or the same large lake contained deep and shallow parts (although in this case the temperature gradients may be decreased by the horizontal motions).
As an example, the benefit could be seen for Lake Ladoga. The depth differences seem to be less influential over the Finnish lakes, which are predominantly shallow, close to each other and represent similar conditions. (iii) The differences between the resulting analyses were the largest in spring and early summer. During these periods, the lakes are warming up differently depending on their location, size and depth. Similar conditions may be expected to prevail to some extent in autumn, when the lakes are cooling. (iv) The in situ LSWT measurements from SYKE on the selected Finnish lakes showed up as a very consistent and reliable set of observations. Over Finland, these observations played a stabilizing role for the objective analysis of LSWT, when the more extensive but more variable MODIS observations were also included.
(v) When the background is provided by the previous analysis (not by a prognostic lake model), the relaxation towards the climatological LSWT is recommended, to avoid the analysis to drift too far from reality. (vi) The quality control within the HIRLAM system seemed to work well, removing the obviously erroneous observations during the analysis. The quality control was effective when checking the observations against the background. The OI check (comparison to the neighbouring observations) played a minor role, presumably because the values of the observation and background errors used in this study were not optimized yet. (vii) It was evident already in the first tests that it is very important to prevent the ocean observations from influencing the LSWT analysis on lakes.

Summary and outlook
In this study, we investigated for the first time an approach of calculating the LSWT autocorrelation function from in situ and satellite-based observations. The data-sets of the LSWT Fig. 9. Notation as for Fig. 7, but for Lake Ladoga. The normalized autocorrelation functions depending on distance and depth difference were obtained and their analytic approximations were proposed. The MODIS observations suggested the horizontal length scale of L H = 740 km and the scale for the difference in the lake depth L V equal to ca. 50 m. The corresponding estimates based on SYKE observations were L H = 1100 km and L V = 20 m. The empirical autocorrelation function values were shown to be statistically significant. Observation error standard deviations of 0.9 and 1.2 • C for SYKE and for MODIS data were obtained by extrapolation of the appropriate structure functions.
Both the autocorrelation function formulations and observation error estimates are directly applicable to the optimal interpolation of LSWT in the surface data assimilation of NWP models. The length scales of the LSWT autocorrelation function suggested by the observation statistics differ from those applied in the present reference HIRLAM in two respects. First, the horizontal length scale is an order of magnitude larger than the current value. Second, the vertical (depth) scale had not been applied earlier. To introduce it, the one-dimensional analytic exponential function, depending only on distance, should be replaced with the two-dimensional function depending on both the distance and depth difference.
Three-dimensional HIRLAM experiments were performed for the summer (May-August) of 2011 to test the sensitivity of the LSWT analysis to the formulation of the autocorrelation function. All possible data -both SYKE and MODIS observations -were used as input. In order to reveal better the impact of the new formulation of the LSWT structure function on the analysis, the background (first guess) was taken from the previous analysis, not from the lake model FLake forecast (which is used by default in HIRLAM). The resulting analyses were compared to independent observations, which were put aside during the objective analysis. Use of a 10-fold value for the influence radius, i.e. L H = 800 km, allowed to introduce more observations and to obtain the meaningful analysis also for remote lakes. Including the depth-dependency by using the depth scale L V = 20 m allowed to improve the analysis in cases where shallow-lake observations could lead to the wrong LSWT values over deep waters (as for Lake Ladoga), or vice versa. However, when in addition to the neighbouring observation, there are also distant observations representing very different conditions. If such observations prevail, using a large length scale may deteriorate the result. When there were sufficiently observations, the values of the length and depth scales played a minor role.
Several practical issues were noticed when performing the HIRLAM experiments. It is important to keep ocean and lake observations well separated in the analysis in order to avoid spurious outcome of the analysis. Due to the variability and uncertainty of the satellite observations, it is advisable to combine them with available in situ instrumental LSWT measurements, such as those from SYKE. Making such local observations available for the operational NWP data assimilation, together with the real-time remote sensing data, is a demanding but necessary practical task.
LSWT statistics obtained in this study and the appropriate conclusions are based on summer time observational data over a limited Northern European domain, where lake depth variations are moderate. The results are valid for the summer season and for such a uniform environment, and everything that breaks the uniform conditions may influence the conclusions. In spring, during and after the ice break-up, even close observations may represent quite different conditions. To a lesser extent, the same happens in early winter when lakes are freezing. We suggest studying the autocorrelation function for other seasons (spring, autumn) for the future.
LSWT may have quite a large diurnal cycle (Woolway et al., 2015), which should be addressed in NWP. A prognostic lake parameterization scheme is usually able to reproduce it (see e.g. Mironov, 2008). However, in data assimilation of LSWT, it has not yet been considered. This aspect should be studied specially, which will require observations with high temporal resolution.
Lakes in areas of complex orography would present another example of a non-uniform environment, where accounting for the depth differences may not be sufficient to obtain a reasonable objective analysis of LSWT. Another parameter that could influence the LSWT evolution is the light attenuation coefficient of lake water. This is known both from observations (Williamson et al., 2015) and modelling results (Kourzeneva et al., 2012b;Kirillin, 2010). Satellite methods to measure the transparency of inland water are being developed (Potes et al., 2011). In the future, these observations might be used in NWP. Finally, for the full data assimilation over lakes in NWP, the algorithm (such as the Extended Kalman Filter method described in Kourzeneva (2014)), should be combined with the LSWT objective analysis in order to allow updating the lake model prognostic variables.