Modification and validation of a new method to improve the accuracy of MODIS-derived dew point temperature over mainland China

ABSTRACT MODIS atmospheric profile products (MOD07_L2 and MYD07_L2) have been widely used for near-surface dew point temperature ( ) estimation. However, their accuracy over large scale has seldom been evaluated. In this study, we validated these two products comprehensively against 2153 stations over mainland China. MOD07_L2 was suggested by our study because it achieved higher accuracy in either of two frequently-used methods. To be specific, the root-mean-square error (RMSE) achieved by MOD07_L2 and MYD07_L2 was 5.82 and 7.42 °C, respectively. On this basis, a recent ground-based correction method was modified to further improve their accuracy. Our focus is to investigate whether this ground-based approach is applicable to large-scale remote sensing applications. The results show that this new method showed great potential for estimation independently from ground observations. Through the introduction of MODIS land surface products, the RMSE it achieved for MOD07_L2 and MYD07_L2 was 5.23 and 5.59 °C, respectively. Further analysis shows that it was particularly useful in capturing the annual average patterns. The R2 , RMSE, and bias of annual average daily mean estimates were 0.95, 1.84 °C, and 0.53 °C, and those achieved for annual average instantaneous estimates were 0.94, 2.09 °C, and 0.75 °C, respectively.


Introduction
Dew point temperature (T d ) is the temperature to which the air must be cooled to become saturated with water vapor given that the air pressure and water vapor remain constant (Merva 1975;Shank, Hoogenboom, and McClendon 2008).As a direct indicator of near-surface humidity, T d is one of the most important climatic variables controlling many biological and physical processes between the hydrosphere, atmosphere and biosphere (Ali, Fowler, and Mishra 2018;Dou et al. 2021;Pumo and Noto 2021).Consequently, accurate near-surface T d monitoring is vital for a wide range of applications such as agriculture production (Yue, Yang, and Wang 2022), terrestrial hydrology and ecology (Bello, Mailhot, and Paquin 2021;Wu et al. 2022), climate and environment change (Denson, Wasko, and Pee 2021;Sein, Ullah, and Iyakaremye 2022), public health (Iyakaremye et al. 2021;Ullah et al. 2022), and disease vectors propagating (Chen et al. 2022;Masoumi, van Genderen, and Mesgari 2020).In general, T d with high accuracy can be calculated easily from groundbased air temperature (T a ) and relative humidity (RH) observations.However, these observations measured as point samples have limited ability in reflecting the spatial heterogeneity of T d (Hashimoto et al. 2008;Ritter, Berkelhammer, and Beysens 2019;You et al. 2015).In contrast, reanalysis products provide continuous T d estimation with global coverage by combining models with observations, but their spatial resolution is usually too low for many local applications (Berg et al. 2003;Viggiano et al. 2021;Wang and Yuan 2022).There exists a key niche for T d information with fine resolution, because many hydrological, climatological, and ecological models seek to describe processes at scales of 1∼10 km (Famiglietti et al. 2018;Ji et al. 2023;Razafimaharo et al. 2020;Turner, Ollinger, and Kimball 2004;Wood et al. 2011;Zhu, Jia, and Lű 2017a).
Moderate Resolution Imaging Spectroradiometer (MODIS), provides an unprecedented global coverage of critical land surface and atmosphere parameters, which is a potentially valuable alternative to characterize fine-resolution T d patterns across large areas.This is particularly true for MOD07 Level-2 atmospheric profile product because it provides temperature and moisture profiles distributed at 20 vertical levels with 5 km pixel resolution (King et al. 2003;Sobrino et al. 2015).Consequently, the need for T d information with fine resolution has resulted in numerous studies looking into MOD07 product to meet this requirement (Hwang and Choi 2013;Jiao et al. 2015;Liu et al. 2021;Ryu et al. 2011;Zhu, Wang, and Jia 2023).According to these studies, depending on the vertical levels used, two approaches may be distinguished in the retrieval of near-surface T d from MOD07 product (Zhang et al. 2014).The first approach extracts T d at the lowest vertical pressure level as the surrogate for near-surface T d directly.For example, in the net radiation model developed by Bisht et al. (2005) and the evapotranspiration model developed by Venturini, Rodriguez, and Bisht (2011), T d at the pressure level of 1000 hPa from MOD07_L2 product was adopted as the representation of near-surface T d over the Southern Great Plains region in the USA.The second approach interpolates T d at the two lowest pressure levels to near-surface level based on the hydrostatic assumption (Bisht and Bras 2010;Famiglietti et al. 2018;Kim and Hogue 2008;Liu et al. 2021;Tang and Zhao 2008;Verma et al. 2016).Although both these two approaches have been adopted widely at regional scale, their accuracy over large scale has seldom been validated robustly, making the quality of T d retrieval in many modeling applications heretofore relatively unknown.Given this background, the accuracy of the second approach for global T d estimation was validated by Famiglietti et al. (2018) based on 109 ground meteorological stations.Their results suggest that further studies are required to improve the accuracy of T d retrieval from MODIS products.
For this purpose, we first tried to seek approaches from studies that are performed based on other satellite sensors.However, due to the lack of atmospheric profiles, few studies have been conducted to retrieve near-surface T d from other satellite observations.Fortunately, a large number of studies have been performed for near-surface T d estimation based on in-situ meteorological observations.Apart from those conducted on the basis of machine learning models such as Baghban et al. (2016), Park et al. (2021), Dong et al. (2022) and Qiu et al. (2022), a major assumption of these studies is that near-surface T d can be approximately estimated from daily minimum air temperature (T amin ).For example, in the early studies of Allen et al. (1998) and Mu et al. (2009), near-surface T d was assumed to be equivalent to T amin for the development of evapotranspiration models.Further studies show that this assumption generally holds for boreal and arctic regions under nonfrozen conditions, but leads to relatively large errors in warmer, drier climate conditions (Dong et al. 2022;Mahmood and Hubbard 2005;Paredes and Pereira 2019;Todorovic, Karic, and Pereira 2013).As a result, a variety of correction methods have been proposed subsequently to estimate T d accurately from T amin .A review of these methods is available in Paredes et al. (2020) and Qiu et al. (2022).These T amin -based correction methods at site scale have enlightening significance for the retrieval of near-surface T d from MODIS products, because numerous studies confirm that T amin estimates with high accuracy can be achieved by adopting MODIS nighttime land surface temperature (T s ) as proxy (Chen et al. 2021;Lin et al. 2012;Oyler et al. 2016;Shiff, Helman, and Lensky 2021;Vancutsem et al. 2010;Zhu, Lű, and Jia 2013).
Inspired by these studies, the main objective of our work is to perform a comprehensive validation of MODIS atmospheric profile products for the retrieval of near-surface T d and further improve their accuracy through the combination with MODIS land surface products.The validation was conducted using hourly observations from 2153 meteorological stations across mainland China.Although a similar validation has been performed at global scale in Famiglietti et al. (2018), they only evaluated the accuracy of MOD07_L2 product with hypsometric interpolation method based on 109 meteorological stations.In contrast, the evaluation in this study was conducted for both MOD07_L2 and MYD07_L2 products.In addition to hypsometric interpolation method, we also evaluated the validity of T d retrieved at the lowest vertical pressure level as the surrogate for near-surface T d .The improvement was achieved based on the correction method proposed by Qiu et al. (2021).The difference is that the original Qiu et al. (2021) was performed to estimate daily average T d on the basis of meteorological observations, while the focus of our research is to investigate whether this ground-based approach is applicable to large-scale remote sensing applications for both daily average and instantaneous T d estimation, aiming to improve the accuracy of MODIS atmospheric profile products independently from ground observations.

Study area and materials
The present study was conducted across mainland China over the course of two full years of 2016 and 2017.Hourly records from 2153 meteorological stations were adopted for the calibration and validation of T d estimates (Figure 1(a)).To be specific, hourly T d ( • C) and e a (kPa) were retrieved  The remote sensing observations used in this study for T d retrieval come from MODIS, an instrument onboard the National Aeronautics and Space Administration's (NASA's) Earth Observing System (EOS) Terra and Aqua satellites.Both of these two satellites are in circular sun-synchronous orbits.The Terra overpass time is around 10:30 AM in its descending mode and 10:30 PM in its ascending mode, while the overpass time of Aqua is around 1:30 PM in its ascending mode and 1:30 AM in its descending mode.Four MODIS Version 6.1 products were adopted in this study, including MOD07_L2 and MOD11A1 provided by Terra MODIS as well as MYD07_L2 and MYD11A1 provided by Aqua MODIS.All these four products are largely limited to clear sky conditions.Considering the uncertainties caused by cloud contamination, this study only selected pixels flagged as the highest quality in MODIS cloud masking (Li et al. 2023;Zeng et al. 2018;Zhao and Duan 2020).MOD11A1 and MYD11A1 are daily land surface temperature & emissivity products with 1 km spatial resolution (Wan 2019).MOD07_L2 and MYD07_L2 are daily atmospheric profile products with 5 km spatial resolution (Borbas et al. 2015).These two products consist of several parameters, of which atmospheric moisture profile and surface air pressure were used in our work.Its atmospheric moisture profile is distributed at 20 vertical atmospheric pressure levels, which are 1000, 950, 920, 850, 780, 700, 620, 500, 400, 300, 250, 200, 150, 100, 70, 50, 30, 20, 10, and 5 hPa, respectively.In addition to meteorological observations and MODIS products, the global aridity index (AI) dataset provided by the Consultative Group for International Agricultural Research-Consortium for Spatial Information (CGIAR-CSI) was used in this study.It defines AI as the ratio between mean annual precipitation and mean annual potential evapotranspiration.To be specific, this AI dataset with a spatial resolution of 30 arc seconds (∼ 1 km at the equator) is developed based on global raster climate datasets.These global climate datasets are available from WorldClim Global Climate Data (http://WorldClim.org),and the methods used for the derivation of AI are described in Zomer et al. (2007;2008).Following United Nations Environment Programme (UNEP 1997), mainland China was divided into four climate zones in this study on the basis of AI values: arid (AI < 0.2), semi-arid (0.2 < AI < 0.5), dry sub-humid (0.5 < AI < 0.65) and humid (AI > 0.65) zone (Figure 1(b)).

Methodology
The ultimate objective of this study is to improve the accuracy of MODIS atmospheric profile products for the estimation of near-surface T d .Four sections were designed as below to elaborate our methodology.Firstly, instantaneous T d at the daytime overpass time of Aqua and Terra satellites was retrieved from MODIS atmospheric profile products using two traditional methods so that a validation could be performed to evaluate the original accuracy of MOD07_L2 and MYD07_L2 products.The correction method proposed by Qiu et al. (2021) was designed to estimate daily mean T d from ground-based meteorological observations.Therefore, in the second section we estimated daily mean T d from a combination of MODIS land surface products and meteorological observations to test if this ground-based approach is suitable for large-scale remote sensing application.On the basis of the above two sections, this method was further modified in the third section to retrieve instantaneous T d through the combination of MODIS atmospheric profile and land surface products.Special attention was paid to investigate whether this modified approach could improve the original accuracy of MOD07_L2 and MYD07_L2 products and could be performed independently from ground observations.The fourth section describes the statistical metrics we used for accuracy evaluation.

Instantaneous T d retrieval from MODIS atmospheric profile products
As we reviewed in the introduction, according to the vertical levels used in previous studies, two approaches may be distinguished for the retrieval of near-surface T d from MODIS atmospheric profile products.Since MOD07_L2 and MYD07_L2 provide moisture profiles at 20 vertical pressure levels, the first approach extracts T d at the lowest vertical pressure (LVP) level as the surrogate for near-surface T d directly (Bisht et al. 2005;Byun, Liaqat, and Choi 2014;Venturini, Rodriguez, and Bisht 2011).The mathematical formula of this LVP method is expressed as: where T LVP d,int ( • C) represents near-surface T d estimates of the LVP method, the subscript 'int' denotes that this instantaneous T d was retrieved at the satellite overpass time, and T L d ( • C) represents T d at the lowest vertical pressure level of MOD07_L2 and MYD07_L2 products.It should be noted that the lowest vertical pressure level is dependent on surface pressure, which varies with time and space.As a result, mainland China with a wide range of elevation, has a heterogeneous distribution of the lowest vertical pressure level.For illustration, the annual average surface pressure retrieved from MOD07_L2 is presented in Figure 1(c) to show the lowest vertical pressure level we used.To be specific, mainland China had an annual average surface pressure ranging from 476.42 to 1019.78 hPa.Controlled by the elevation patterns in Figure 1(a), a clear boundary (black line) was observed in its spatial distribution.The surface pressure was much higher in the southeast part.Another obvious characteristic is that Qinghai-Tibet Plateau had the lowest surface pressure.Accordingly, nine pressure levels were adopted as the lowest vertical pressure level in Eq. (3), which were 1000, 950, 920, 850, 780, 700, 620, 500, and 400 hPa, respectively (Figure 1(d)).
Considering the difference between surface pressure (P S ) and the available lowest pressure level (P L ), the second approach takes the hydrostatic assumption into account and interpolates T d at the two lowest pressure levels to near-surface level.Unlike the hypsometric equation (HE) used in Famiglietti et al. (2018), the adiabatic lapse rate (ALR) method proposed by Bisht and Bras (2010) and Zhu et al. (2017b) was adopted in this study to demonstrate the second approach as: where ) and P U (hPa) are the dew point temperature and atmospheric pressure retrieved at the upper pressure level of P L , triangleH (m) is the height difference of these two pressure levels, r (kg/m 3 ) is the density of the air, and g (m/s 2 ) is the acceleration of gravity.The essence of HE and ARL methods is basically the same.Both of them interpolate moisture profile of MOD07_L2 and MYD07_L2 products into near-surface level by utilizing the two nearest pressure levels.The validity of HE method has already been evaluated by Famiglietti et al. (2018) on a global scale.That partly explains why this method was not adopted in our work.More importantly, both Famiglietti et al. (2018) and Zhang et al. (2021) suggest that the HE method would produce obvious outliers.Further statistical analysis is required to remove these outliers, which would eliminate 7.2% of all observations (Famiglietti et al. 2018).

Daily mean T d retrieval from MODIS and meteorological observations
The estimation of daily mean T d from daily minimum air temperature (T amin ) has been investigated in numerous studies based on in-situ meteorological observations (Dong et al. 2022;Mahmood and Hubbard 2005;Paredes et al. 2020;Todorovic, Karic, and Pereira 2013).Meanwhile, plenty of studies suggest that T amin estimates with high accuracy can be achieved by adopting MODIS nighttime land surface temperature (T s ) as proxy (Lin et al. 2012;Oyler et al. 2016;Vancutsem et al. 2010;Yoo et al. 2018;Zhu, Lű, and Jia 2013).The combination of these two aspects implies that MODIS land surface products have potential for the retrieval of near-surface T d .As we mentioned in the introduction, due to the influence of climate conditions on the accuracy of T amin -based T d estimation, a variety of correction methods have been proposed to lower this influence.Based on the correlation function between a dynamic factor (a T ) and aridity index (AI), Qiu et al. (2021) have recently developed a correction method regardless of climate zones.Their results show that this new method is superior to other correction methods in accuracy.Therefore, we adopted this method in our work to estimate near-surface T d from MODIS T s products.
The original method in Qiu et al. (2021) was developed to estimate daily mean T d using observations from 806 meteorological stations in mainland China.In order to perform a more robust validation and also lay a foundation for subsequent comparative analysis, we applied this method to 2153 meteorological stations across mainland China in this study.Specifically, this method was designed to estimate daily mean T d from T amin with a dynamic a T (°C) regardless of climate zones as: where e a,mean and T d,mean are the observed daily mean values of e a and T d , and a and b are the fitted coefficients.Compared with the studies that assume near-surface T d to be equivalent to T amin , a T was added by Qiu et al. (2021) in the above equation for correction purpose.To be specific, a T at site scale was determined by establishing its correlation function with AI based on a three-step procedure as below.Firstly, the annual average AI for each meteorological station was retrieved from the CGIAR-CSI AI dataset.Secondly, the a T value of each meteorological station was determined with a non-linear regression method on the basis of Eq. ( 6) using e a,mean and T amin observations of the year 2016.Thirdly, the correlation function between a T and AI was calibrated across all sites using the logarithmic regression model presented in Eq. ( 7), and then applied to T d,mean estimation of the year 2017 for validation based on Eq. ( 8).Qiu et al. (2021) have confirmed the validity of the above method at site scale using observations from 806 meteorological stations in mainland China.The focus of our work is to further investigate whether the observed T amin in Eq. ( 6) could be replaced by MODIS nighttime T s so that T d,mean could be retrieved from MODIS land surface products in a spatially continuous manner.Accordingly, the estimation of T d,mean from MODIS land surface products was also performed following the above three steps.The only difference is that T amin used for calibration and validation was retrieved from MODIS nighttime T s rather than ground-based T amin observations.Considering the difference in overpass time between Terra and Aqua satellites, we applied this method to both MOD11A1 and MYD11A1 to conduct a comprehensive assessment.

An improved method for instantaneous T d retrieval from MODIS products
After knowing the original accuracy of MODIS atmospheric profile products for T d, int retrieval and Qiu et al. ( 2021)'s method for T d,mean retrieval, our attention was shifted to investigate whether Qiu et al. ( 2021)'s method can be modified to improve the accuracy of T d, int retrieval.Accordingly, Eq. ( 6) and ( 8) were modified as below: The correlation function between a T and AI was also established following the three steps in section 3.2.However, depending on the inputs used for the retrieval of a T in Eq. ( 9), three models can be identified.In the first model, both T d,int and T amin were retrieved from meteorological observations to evaluate the performance of this correction method at site scale.In order to coincide with T d, int retrieved from MODIS atmospheric profile products, T d,hour observed at the time that most approximates satellite overpass time was used as the proxy for T d,int .It means that the time interval between meteorological and satellite observations was less than 30 min.In the second model, ground-based T amin observations of the first model were replaced by MODIS nighttime T s so that T d,int could be retrieved from MODIS land surface products in a spatially continuous manner.Thirdly, ground T d, int observations of the second model were further substituted by T LVP d, int and T ALR d, int to investigate whether this correction method can improve the original accuracy of MODIS atmospheric profile products independently from ground observations.Figure 2 shows the flowchart of the above T d, int estimation.

Statistical metrics
Following Famiglietti et al. (2018), we adopted three statistical metrics in this study for accuracy evaluation, including the coefficient of determination (R 2 ), root mean square error (RMSE), and bias (B).With the in-situ observations as input, R 2 was determined by fitting a regression line through the origin.The equations of RMSE and B are expressed as: where n represents the total sample size; subscript i represents the sample number at the i th position; x i and y i represent the estimated and observed near-surface T d ; and x i and y i represent the mean values of T d estimation and observation, respectively.It should be noted that the calibration of a T for each meteorological station was performed using T d and T amin observations of the year 2016.The evaluation of its validity was conducted using T d observations of the year 2017.

Original accuracy of MODIS atmospheric profile products
Table 1 shows the original accuracy of MOD07_L2 and MYD07_L2 products for the representation of instantaneous dew point temperature (T d, int ).For MOD07_L2, the ALR method was superior to LVP method in all statistical metrics.The R 2 , RMSE and B of T ALR d, int in two full years were 0.84, 5.82 °C, and 0.88 °C, and those of T LVP d, int estimates were 0.81, 6.50 °C, and −1.24 °C, respectively.The statistics of the individual year 2016 and 2017 have also confirmed the superiority of ALR method.However, when the same comparison was performed for MYD07_L2 product, the opposite situation was observed.All statistical metrics indicate that the LVP method provided a better T d, int retrieval.The R 2 , RMSE and B it achieved in two full years were 0.75, 7.42 °C, and 0.91 °C, and those achieved by the ALR method were 0.74, 7.70 °C, and 2.63 °C, respectively.Judged from B values, the LVP method underestimated T d, int for MOD07_L2 while overestimated T d, int for MYD07_L2.In contrast, the ALR method overestimated T d, int for both products.In general, MOD07_L2 was a better proxy for near-surface T d , because it provided T d, int estimation with higher accuracy in either of  these two methods.Even the accuracy achieved by the better method for MYD07_L2 was still lower than that produced by the worse method for MOD07_L2.The reasons behind can be explained by the satellite overpass time.The overpass time of Terra and Aqua is around 10:30 AM and 1:30 PM, respectively.It is obvious that the solar radiation at the Aqua overpass time is much stronger.Consequently, the complex interaction of the surface energy balance system makes the retrieval of T d from MYD07_L2 far from straightforward.
In order to investigate the spatial patterns of T d, int accuracy, Figure 3 presents the distribution of these three statistical metrics at site scale as well as their scatterplots achieved by different methods.Considering the significant advantage of MOD07_L2 over MYD07_L2, here we only adopted MOD07_L2 product to investigate the difference between LVP and ALR methods.In general, these methods have achieved similar patterns in the spatial distribution of these three metrics.Their correlation of R 2 , RMSE and B across 2153 stations was 0.97, 0.69, and 0.81, respectively.Both methods generated T d, int estimates correlated better with observations in the east of the mainland, which coincided well with the three major plains in China.Similarly, stations with relatively low R 2 values were mainly concentrated in Yunnan province.The RMSE of T LVP d, int and T ALR d, int ranged from 3.23 to 20.25 °C and from 2.65 to 14.46 °C, respectively.This confirms the superiority of ALR over LVP method from another perspective.The scatterplots of B reveal the key difference between these two methods.The B achieved by the ALR was always larger than that achieved by the LVP.It means that a positive adiabatic lapse rate was retrieved from MODIS atmospheric profile products in Eq. ( 4).Consequently, the interpolation from the lowest pressure level to near-surface level would increase T d, int from T LVP d, int to T ALR d, int .That also explains why T ALR d, int had a much smaller number of stations with negative B values than T LVP d, int .To be specific, the LVP method underestimated T d, int for 56.64% of the 2153 stations, which were widely distributed in Midwest China, Hainan province, Anhui province, Fujian province, and parts of Northeast China.In comparison, although the stations with negative B values produced by the ALR method were also located in the above regions, their number only accounted for 34.09% of the 2153 stations.Considering the altitude is obviously higher in Midwest China, the above results imply that the accuracy of MODIS atmospheric profile products is closely related to topography (Mendez 2004).

Accuracy of the new method for daily mean dew point temperature estimation
Based on a T retrieved from Eq. ( 6), the relationship between a T and AI across 2153 stations of the year 2016 was established using the logarithmic regression model of Eq. ( 7).The results are presented in Figure 4.For MOD11A1 product, the a and b calibrated by T d,mean and T amin observations were −2.01 and 1.39 with R 2 of 0.47 (Figure 4 4(d)).It should be noted that both Figure 4(a,c) were retrieved from ground-based T d,mean and T amin observations.The difference in their coefficients was completely caused by the difference in samples.In contrast, the difference in coefficients between Figure 4 (b,d) was mainly caused by the difference between MOD11A1 and MYD11A1 for the representation of T amin .Applying the above models to the year 2017, we validated and presented their accuracy in Figure 5.The accuracy of T amin as a direct surrogate for T d,mean is also presented for comparison.As expected, T amin as a direct proxy for T d,mean would lead to significant overestimation, with B and RMSE as high as 3.41∼3.47°C and 5.59∼5.63°C, respectively.This confirms the necessity of correction methods for the accurate T d estimation from T amin .To be specific, the ground-based correction method proposed by Qiu et al. (2021) has improved the accuracy significantly, with B and RMSE around 0.59∼0.60°C and 4.18∼4.19°C, respectively.In comparison, the B and RMSE achieved by MOD11A1-based correction method were 0.69 and 4.62 °C, and those achieved by MYD11A1 product were 0.63 and 4.42 °C, respectively.Both two statistics indicate that MODIS land surface products as proxy for T amin would reduce the accuracy of the above correction method slightly.
The above correction method was developed on the basis of AI.Therefore, it is necessary to investigate its performance in different climate zones.The results are presented in Table 2.For simplicity, the T d,mean estimated directly by T amin , by the correction method developed from T d,mean and T amin observations, and by the correction method developed from T d,mean observations and MODIS T s was renamed as method Ⅰ, Ⅱ, and Ⅲ, respectively.Both R 2 and RMSE show that the accuracy of these methods increased gradually from arid to humid regions.It implies that these T amin -based methods are more suitable for T d estimation in humid regions.Judged from B values, the accuracy of method Ⅰ still increased gradually from arid to humid regions.However, this finding did not hold true for the a T -based correction methods.As for the accuracy difference of these three methods, two a T -based correction methods were always superior to method Ⅰ in terms of B values.However, in terms of RMSE, although these two a T -based correction methods were still superior in most regions, their superiority seemed to decrease gradually from arid to humid zones.This is particularly obvious for method Ⅲ.Take MOD11A1 as an example.In arid, semi-arid, and dry subhumid zones, the RMSE of method Ⅰ was reduced by method Ⅲ from 8.46 to 5.91 °C, from 6.72 to 5.05 °C, and from 4.76 to 4.48 °C, respectively.It is obvious that the decrease rate became lower and lower.In humid zone, method Ⅰ was even superior to method Ⅲ in terms of RMSE.The advantage of method Ⅱ over method Ⅰ also decreased gradually from arid to humid zones, but the RMSE it produced was always lower than that produced by method Ⅰ.Therefore, the poor performance of method Ⅲ in humid region was mainly caused by the errors involved in the retrieval of T amin from MODIS land surface products.

Accuracy of the method for instantaneous dew point temperature estimation
The above analysis indicates that the correction method calibrated by MODIS T s has achieved similar accuracy with that calibrated by ground T amin observations for T d,mean estimation.This lays a good basis for its application in instantaneous T d estimation.Figure 6 shows the logarithmic regression models we calibrated for T d,int estimation based on the pooled data of the year 2016.Four models should be distinguished here: the model retrieved from ground-based T d,int and T amin observations (Figure 6(a,e)), the model retrieved from ground-based T d,int observations and MODIS T s (Figure 6(b,f)), the model retrieved from MODIS T LVP d, int and T s (Figure 6(c,g)), and the model retrieved from MODIS T ALR d, int and T s (Figure 6(d,h)).The first two models need meteorological observations for calibration.In contrast, the remaining two models were totally established based on MODIS products.Table 3 shows the accuracy of these models in the year 2017.The original accuracy of T LVP d, int and T ALR d, int is also presented for comparison.In general, similar accuracy has been achieved by these four models.The R 2 they produced was almost the same with each other.The RMSE achieved by Terra and Aqua MODIS ranged from 5.22 to 5.33 °C and from 5.47 to 6.21 °C, respectively.Although higher accuracy was still achieved by the first two groundbased models, their RMSE was only a little lower than that produced by the two models independently from ground observations.This is particularly obvious for Terra MODIS, the largest RMSE  difference of which was only 0.11 °C.Compared with the original T LVP d, int and T ALR d, int , all metrics show that these four models have improved their accuracy significantly.Take the two models that were totally developed from MODIS products as an example.The RMSE of T LVP d, int and T ALR d, int for Terra MODIS has been reduced from 6.89 to 5.23 °C and from 6.20 to 5.33 °C, respectively.For Aqua MODIS, its RMSE has been reduced from 8.07 to 5.59 °C and from 8.35 to 6.21 °C, respectively.Obviously, the accuracy of MODIS atmospheric profile products can be improved significantly through the introduction of MODIS land surface products.
The accuracy in Table 3 was achieved by applying the models calibrated in 2016 to the year 2017.This is necessary for the validation of the first two ground-based models.However, the remaining two models were totally established from MODIS products, which means that they are calibrationindependent.Accordingly, Table 4 shows the accuracy of these two models established from MODIS products of the year 2017.In view of the significant superiority of Terra MODIS over Aqua MODIS in Table 3, we only focused on Terra MODIS products here for further analysis.Across all stations the R 2 , RMSE, and B achieved by the T LVP d, int -based correction method were 0.85, 5.22 °C, and −0.65 °C, and those achieved by the T ALR d, int -based correction method were 0.85, 5.35 °C, and 1.36 °C, respectively.All these statistics were very close to those in Table 3, which demonstrates the portability of these correction methods in different years.Both R 2 and RMSE suggest that the accuracy of these a T -based correction methods increased gradually from arid to humid regions, which is consistent with the findings in Table 2.The R 2 produced by these two correction methods was the same with each other in all climate zones.However, the RMSE achieved by the T ALR d, int -based correction method was lower than that achieved by the T LVP d, int -based correction method in three out of the four climate zones.This basically follows the same order with the original accuracy of T LVP d, int and T ALR d, int .It suggests that the accuracy of these correction methods is still controlled by the original accuracy of T d,int estimation.Compared with the original T LVP d, int and T ALR d, int , both R 2 and RMSE indicate that these correction methods have distinct advantages in semi-arid, dry sub-humid, and humid zones.However, their RMSE in arid zone was much larger than that produced by the original T LVP d, int and T ALR d, int .This confirms again that these correction methods are more suitable for T d estimation in humid regions.
Figure 7 presents the distribution of these three metrics at site scale to investigate the spatial difference of these two correction methods.To be consistent with Figure 3, these metrics were also retrieved from T d,int estimations in two full years.Both two methods were developed from MOD11A1 nighttime T s in a form of T d,int = T s − a T .The only difference is that their a T was retrieved from T LVP d, int and T ALR d, int respectively.As a result, the R 2 they produced was the same with each other, which ranged from 0.21 to 0.96 with a median of 0.86.Compared with Figure 3, the correction methods have improved the R 2 values for most stations, which is particularly obvious in the middle and lower reaches of the Yangtze River, South China, and East China.These

Discussion
Implicitly or explicitly, the analysis in the above section has pointed out that the performance of Qiu et al. (2021)'s correction method is influenced by several factors in large-scale remote sensing application.We would go through these factors and the uncertainties involved in this section.
Firstly, MODIS nighttime T s was adopted in this study as the proxy for T amin so that the groundbased correction method can be performed over large scale based on MODIS products.The validity of this substitution is assessed in Figure 8.As expected, MYD11A1 was a better proxy for T amin .Its R 2 , RMSE, and B across all stations were 0.96, 2.78 °C, and −0.61 °C, and those achieved by MOD11A1 were 0.95, 3.00 °C, and 0.94 °C, respectively.That explains why the correction method developed from MYD11A1 performed a little better than that developed from MOD11A1 for T d,mean estimation (Figure 5).The overpass time of Terra and Aqua during the night is 10:30 pm and 1:30 am, respectively.Aqua has overpass time much closer to that the daily minimum air temperature occurs.That might be the reason why MYD11A1 was a better proxy for T amin (Vancutsem et al. 2010;Zhu, Lű, and Jia 2013).Figure 9(a) shows the scatterplots of the RMSE achieved by MYD11A1 for T amin estimation versus the RMSE achieved by correction method for T d,mean estimation.Their RMSE correlated well with each other with r as 0.49.An increase of 1 °C in the RMSE of T amin estimation would lead to an increase of 0.74 °C in the RMSE of T d,mean estimation on average.As for the correction method for T d, int estimation, the r between its RMSE and that achieved by MOD11A1 for T amin estimation was only 0.17 (Figure 9(b)).In contrast, its RMSE correlated significantly with the original RMSE of T ALR d, int retrieval with r as high as 0.58 (Figure 9(c)).Due to the effect of correction method, an increase of 1 °C in the original RMSE of T ALR d, int would only lead to an increase of 0.58 °C in the RMSE of final T d, int estimation on average.Therefore, compared with the accuracy of MODIS nighttime T s as the proxy for T amin , the original accuracy of MODIS atmospheric profile products had a much greater impact on the accuracy of our modified correction method for T d, int estimation.The comparison in Table 1 suggests that the T d, int retrieved from MOD07_L2 was superior to that retrieved from MYD07_L2 in accuracy.The combination of these findings explains why the correction method developed from MOD07_L2 was superior to that developed from MYD07_L2.
Besides the original accuracy of MODIS atmospheric profile and land surface products, AI was another main factor influencing the performance of these correction methods.Its impact at site scale has already been presented in Tables 2 and 4. Compared with the original method proposed by Qiu et al. (2021), the main contribution of our work is that their ground-based method was modified for large-scale remote sensing applications.Accordingly, T d could be estimated and mapped from MODIS products for whole mainland China.So here we further investigated the accuracy and uncertainties of these AI-based correction methods in a spatially continuous manner.correction methods have achieved good performance in capturing the annual average T d patterns.The R 2 , RMSE, and B of annual average T d,mean estimates were 0.95, 1.84, and 0.53 °C, and those achieved for annual average T d,int estimates were 0.94, 2.09 °C, and 0.75 °C, respectively.The accuracy of T d,int estimates was still a little lower than that achieved for T d,mean estimates, but it must be noted that the estimation of T d,int was performed independently from ground-based observations.In order to investigate the uncertainties involved in our above T d,mean and T d, int estimates in a spatially continuous manner, we first present the variations of their RMSE with AI across all sites (Figure 9(d,e), respectively).The RMSE of the original T ALR d, int versus AI is presented in Figure 9(f) for comparison.As expected, the RMSE of both T d,mean and T d, int estimation decreased generally with the increase of AI.What is unexpected is that the RMSE of the original T ALR d, int was also negatively correlated with AI.It implies that not only the correction methods but also the original MODIS atmospheric profile product were more suitable for T d estimation in humid regions.On this basis, we present the RMSE of our T d,mean and T d, int estimates in Figure 10(e,f) by applying the logarithmic regression models established in Figure 9(d,e) to mainland China.Specifically, in arid, semi-arid, dry sub-humid, and humid zone, the average RMSE of T d,mean estimates were 5.87, 4.36, 3.89, and 3.42 °C, and the average RMSE of T d,int estimates were 7.41, 5.24, 4.56, and 3.88 °C, respectively.Both T d,mean and T d,int estimates indicate that this AI-based correction method was more suitable for T d estimation in humid climate.
Influenced by the East Asian monsoon, both precipitation and temperature in China have obvious seasonal variations.It implies that the seasonal variability may have additionally affected the accuracy of T d estimation.Figure 11 shows the RMSE of T d,mean and T d,int estimates as well as the original T ALR d, int in different months.As expected, the accuracy of these three estimates had significant seasonal variations.All of them achieved the lowest RMSE in warm and humid months such as July and August.This confirms again that both the correction method and the original MODIS atmospheric profile product are more suitable for T d estimation in humid climate.The correction method adopted in this study was developed from an annual average AI provided by the CGIAR-CSI.As a result, the correction factor (a T ) derived from AI was a constant without seasonal variations.The above analysis suggests that a temporal dynamic a T retrieved from monthly climatic variables holds potential for the improvement of this correction method.As for the inter-comparison of these three estimates, the lowest RMSE was achieved by T d,mean estimates in all months except for May.It implies that the estimation of instantaneous T d was much more complex than the retrieval of its daily mean value.The accuracy of original T ALR d, int estimates has been improved by the correction method in eight months including January, February, March, July, September, October, November, and December.The accuracy improvement in dry and cold months was much more significant than that observed in humid and warm months.The RMSE of T ALR d, int estimates in April, May, June and August were even lower than that achieved by the correction method.This is consistent with the finding in Table 2 that the superiority of this correction method decreased gradually from arid to humid zones.All these results confirm the necessity of a temporal dynamic a T for the further development of this correction method.

Conclusion
Near-surface T d serves as indispensable input in many hydrological, climatological, and ecological models.The need for T d with fine resolution results in numerous studies looking into MODIS atmospheric profile products for help.However, the accuracy of these products over large scale has seldom been evaluated, making the quality of their T d retrieval for many modeling applications relatively unknown.Given this background, a large-scale validation was performed in this study across mainland China, aiming to evaluate the accuracy of these products for T d retrieval comprehensively.On this basis, a recent ground-based correction method was modified to further improve their accuracy.
The accuracy of MODIS atmospheric profile products varied with T d retrieval methods.For MOD07_L2, the ALR method was superior to LVP method in all evaluation metrics.However, all metrics show that the LVP method provided a better T d retrieval for MYD07_L2.Nonetheless, MOD07_L2 was suggested in this study because it achieved higher accuracy in either of these two methods.The RMSE it achieved by LVP and ALR method was 6.50 and 5.82 °C, respectively.In comparison, the RMSE achieved by LVP and ALR method for MYD07_L2 was 7.42 and 7.70 °C, respectively.After knowing the original accuracy of MODIS atmospheric profile products, we adopted and modified a ground-based correction method to further improve their accuracy.The focus of our modification is to investigate whether this ground-based approach is applicable to large-scale remote sensing applications.To be specific, this ground-based method was developed based on the correlation function between a correction factor (a T ) and AI. a T derived from AI was used to correct the estimation of T d from daily minimum air temperature (T amin ).In our modified approach, T amin was retrieved from MODIS nighttime land surface temperature rather than ground observation, so it is applicable to large-scale remote sensing applications.The results show that although the uncertainty increases generally with the decrease of AI, this modified method still holds great potential to improve the accuracy of T d estimates independently from ground observations.The RMSE achieved by LVP and ALR method for MOD07_L2 has been reduced to 5.23 and 5.33 °C, and the RMSE achieved by LVP and ALR method for MYD07_L2 has been reduced to 5.59 and 6.21 °C, respectively.It was particularly useful in capturing the annual average T d patterns.The R 2 , RMSE, and bias of annual average daily mean T d estimates were 0.95, 1.84 °C, and 0.53 °C, and those achieved for annual average instantaneous T d estimates were 0.94, 2.09 °C, and 0.75 °C, respectively.
Despite the above advantages, our modified approach still suffers from drawbacks.Firstly, its performance was influenced significantly by the original accuracy of MODIS atmospheric profile products.It will benefit from the development of new T d retrieval methods from MODIS atmospheric profile products (Famiglietti et al. 2018).Secondly, the validity of this AI-based correction method was sensitive to AI values (Qiu et al. 2021;Raziei and Pereira 2013).Its accuracy increased generally from arid to humid regions.Similarly, significant seasonal variations were observed in its performance.These findings imply that a temporal dynamic a T is necessary for the further development of this method.

Figure 1 .
Figure 1.Spatial distribution of 2153 meteorological stations and elevation (a), climate zones (b), annual average surface pressure (c), and the lowest vertical pressure level (d) in mainland China.

Figure 2 .
Figure 2. Flowchart of the methods we adopted for T d, int estimation.

Figure 3 .
Figure 3. Accuracy of T d, int retrieved from MOD07_L2 product at site scale and their scatterplots.(a) -(c) are R 2 , RMSE and B achieved by the LVP method, (d) -(f) are R 2 , RMSE and B achieved by the ALR method, and (g) -(i) are the scatterplots of R 2 , RMSE and B achieved by these two methods.
(a)), and the a and b calibrated by T d,mean observations and MOD11A1 nighttime T s were −1.97 and 2.31 with R 2 of 0.46 (Figure 4(b)).As for MYD11A1, the a and b calibrated by T d,mean and T amin observations were −1.98 and 1.32 with R 2 of 0.45 (Figure 4(c)), and the a and b calibrated by T d,mean observations and MYD11A1 nighttime T s were −1.65 and 0.98 with R 2 of 0.36 (Figure

Figure 4 .
Figure 4. Logarithmic regression models between a T and AI for T d, mean estimation.(a) and (c) represent models calibrated by T d,mean and T amin observations, (b) represents model calibrated by T d,mean observations and MOD11A1 nighttime T s , and (d) represents model calibrated by T d,mean observations and MYD11A1 nighttime T s .

Figure 5 .
Figure 5. Accuracy of T d, mean estimation achieved by different methods.(a) and (d) are the accuracy of T amin as a direct proxy for T d, mean , (b) and (e) are the accuracy of the correction method calibrated by T amin observations, and (c) and (f) are the accuracy of the correction method calibrated by MOD11A1 and MYD11A1 T s , respectively.

Figure 6 .
Figure 6.Logarithmic regression models between a T and AI for T d, int estimation.(a) -(d) are models for T d, int estimation at Terra overpass time, and (e) -(h) are models for T d, int estimation at Aqua overpass time.
correction methods also demonstrated significant advantage in terms of RMSE.The RMSE of T LVP d, int -based correction method ranged from 1.97 to 12.19 °C with a median of 4.57 °C, and that achieved by T ALR d, int -based correction method ranged from 1.50 to 13.69 °C with a median of 4.40 °C.In comparison, the median of RMSE produced by the original T LVP d, int and T ALR d, int was 5.50 and 5.22 °C, respectively.The B values achieved by these two correction methods correlated well with each other with r as high as 0.99.However, the T ALR d, int -based correction method always produced B values larger than T LVP d, int -based correction method.This followed the same order with the original B of T LVP d, int and T ALR d, int .Compared with Figure3, the T LVP d, int -based correction method has reduced the number of stations with serious underestimation significantly, which is particularly obvious in the southwest of China.As for the T ALR d, int -based correction method, it has not only reduced the number of stations with serious underestimation in the southwest of China, but also reduced the number of stations with obvious overestimation in Qinghai, Guangdong, Guangxi, and Liaoning province significantly.

Figure 7 .
Figure 7. Accuracy of T d, int estimates achieved by correction methods at site scale and their scatterplots.(a) -(c) are R 2 , RMSE and B achieved by the T LVP d, int -based correction method, (d) -(f) are R 2 , RMSE and B achieved by the T ALR d, int -based correction method, and (g) -(i) are the scatterplots of R 2 , RMSE and B achieved by these two methods.
Figure 8. Accuracy of MOD11A1 (a) and MYD11A1 (b) nighttime T s as the proxy for T amin .

Figure 9 .
Figure 9. Relationships of the RMSE achieved by different estimates with each other and their variations with AI.(a) and (b) are the scatterplots of the RMSE achieved for T d, mean and T d, int estimation with the RMSE achieved for T amin estimation, respectively.(c) is the scatterplots of the RMSE achieved for T d, int estimation with the RMSE achieved by T ALR d, int .(d) -(f) are the variations of the RMSE achieved by T d, mean , T d, int and T ALR d, int estimation with AI, respectively.

Figure 10 .
Figure 10.Spatial distribution of annual average T d estimation and RMSE as well as their accuracy across all sites.(a) and (b) are the spatial distribution of annual average T d, mean and T d, int estimation, (c) and (d) are the accuracy of annual average T d, mean and T d, int estimation across all sites, and (e) and (f) are the spatial distribution of the average RMSE retrieved from the logarithmic regression models in Figure 9(d,e).

Figure 11 .
Figure 11.Seasonal variations of the RMSE achieved by the ALR and correction methods.

Table 1 .
Accuracy of MODIS atmospheric profile products for T d, int estimation.

Table 2 .
Accuracy of daily mean dew pint temperature estimates achieved by different methods.The unit of RMSE and B is °C.Product Method Metrics All n = 2153 Arid n = 122 Semi-arid n = 684 Dry sub-humid n = 301 Humid n = 1046

Table 3 .
Accuracy of instantaneous dew pint temperature estimates achieved by different methods.For simplicity, the T d,int estimated from T d,int and T amin observations, from T d,int observations and T s , from T LVP d, int and T s , and from T ALR d, int and T s was renamed as T d, int − T amin , T d,int − T s , T LVP d, int − T s , and T ALR d, int − T s , respectively.The unit of RMSE and B is °C.

Table 4 .
Accuracy of instantaneous dew pint temperature estimates achieved by different methods for the year 2017.T LVP d, int − T s and T ALR d, int − T s represent the correction method developed from T LVP d, int and T s and from T ALR d, int and T s , respectively.The unit of RMSE and B is °C.