Fine-resolution mapping of the circumpolar Arctic Man-made impervious areas (CAMI) using sentinels, OpenStreetMap and ArcticDEM

ABSTRACT Man-made impervious areas map is of great demand in environmental and urbanization studies since impervious areas are considered as a key indication of urbanization, especially for circumpolar Arctic. However, to date, finer resolution and spatially continuous impervious areas information remains scarce in the Arctic. In this study, we developed an accurate and complete circumpolar Arctic Man-made impervious areas (CAMI) map at a resolution of 10 m by combining Sentinel-1 C-band Synthetic Aperture Radar, Sentinel-2 multispectral images, OpenStreetMap, and ArcticDEM via Google Earth Engine platform. A random forest classifier model was trained and used to generate corresponding impervious areas map for the year 2020. The evaluation results suggested that CAMI was the most accurate with an overall accuracy of 86.36% and kappa coefficient of 70.73% as against the three existing impervious areas products. Based on the generated map and OpenStreetMap, we estimated that total impervious areas area in the Arctic has achieved 807.80 , of which roads, industrial and resident land were three major land use types, accounting for 54.08%, 17.85% and 10.34%, respectively. The CAMI map will support for new application and provide advanced insight into the infrastructure vulnerability evaluation and environmental sustainability in the Arctic.


Introduction
As the key indicator quantifying human footprint on the Earth's land surface, artificial impervious areas plays a critical role in global environmental changes and human-environment interactions (Arnold & Gibbons, 1996;Weng, 2012). Driven by economic development and population increase, urbanization and associated land use/ cover change over the past decades have been projected to continue to intensify worldwide and become one of the most pressing issues for the humanity (Grimm et al., 2008;Seto, Güneralp, & Hutyra, 2012). Artificial impervious areas is a land cover that groups anthropogenic structures and it eliminates rainwater infiltration and natural groundwater recharge Slonecker, Jennings, & Garofalo, 2001). The proliferation of artificial impervious area alters surface runoff and storm flow, resulting in an increasing probability of storm flood (Brun & Band, 2000). Moreover, the increase of impervious land cover within a watershed or similar unit is usually accompanied by the decline of vegetation fraction (Song et al., 2018;Weng, 2012), which not only reduces the carbon sequestration capability but brings in an unpredictable consequence for regional and global carbon cycling (Feng et al., 2021). Therefore, it is of major ecological importance to understand the spatial pattern of man-made imperviousness as well as its variation.
Satellite remote sensing has revolutionized our capacity to map artificial impervious areas over large areas in a repeated, cost-efficient manner, and earlier attempts have been made for artificial impervious areas mapping at scales ranging from regions to the entire globe Liu et al., 2019;Sexton et al., 2013;Zhang et al., 2020). Specifically, the National Land Cover Dataset (NLCD) has mapped the first 30-m impervious surface in United States using Landsat imagery (Homer, Huang, Yang, Wylie, & Coan, 2004). Similarly, GMIS, HBASE, FROM_GLC, MSMT-IS30, GlobeLand30 as well as Esri-GLC products also provided the global impervious areas map (Brown de Colstoun et al., 2017;Chen et al., 2015;Gong et al., 2013;Karra et al., 2021;Wang, Huang, Brown de Colstoun, Tilton, & Tan, 2017;Zhang et al., 2020). Although these datasets are globally available, they are in general of too low spatial resolution and also lack thematic content to successfully capture the impervious features, especially for high-latitudes in the Arctic.
Despite these valuable efforts, quantification of artificial impervious surfaces area is still lacking for the Arctic, in which unprecedented environmental changes are undergoing driven by both warming climate and intensified human activities (AMAP, 2017). Previous studies suggest that artificial impervious areas are already distributed in some most remote corners of the Arctic (Beamish et al., 2020;Liang, Liu, Liu, Li, & Huang, 2019), but how these human-induced land surface disturbances translate to terrestrial ecosystem functions remains largely unknown until artificial impervious covers can be accurately estimated. Notwithstanding recent progress in land cover monitoring from space, circumpolar-scale man-made impervious areas mapping studies have been so far limited by (i) the relatively coarse spatial resolution of satellite imagery, which is unable to reliably estimate man-made impervious surfaces in many Arctic regions (Bartsch, Pointner, Ingeman-Nielsen, & Lu, 2020); and (ii) the uneven coverage of optical satellite data due to high cloud contamination (Bartsch et al., 2021). Hence, there is an urgent need to develop a fine resolution artificial impervious surface dataset that is reliable and spatially complete for multiple applications within the Arctic domain.
Fine resolution mapping of circumpolar Arctic Man-made impervious areas (CAMI hereafter) is a challenging task from almost every aspect of satellite remote sensing, including data acquisition (Bartsch et al., 2020(Bartsch et al., , 2021, feature selection , and land cover classification (Liang et al., 2019). Essentially, representation of heterogeneous artificial impervious landscape with only spectral information from optical sensors significantly simplifies the real-world composition, and this paradox is especially true for the Arctic due to uneven data coverage caused by cloud contamination and high solar zenith angles (Liu, Huang, Hui, Zhang, & Cheng, 2021). Moreover, the typical presence of small-scale, fragmented artificial impervious covers (e.g. oil and gas fields) in the Arctic gives rise to the issue of mixed pixels, which has been proved as a critical problem affecting the effective use of medium and coarse spatial resolution imagery (Gong et al., 2019). Therefore, it becomes increasingly clear that higher spatial resolution and the use of features from multiple domains may provide the key of more accurate man-made imperviousness information to capture CAMI pattern (Bartsch et al., 2020(Bartsch et al., , 2021. Observing the earth's terrestrial surface since 2014, the Sentinel satellite system developed by European Space Agency (ESA) provides unique advantages for delineating and analyzing land biophysical compositions at multiple scales. Among all launched Sentinel missions, perhaps the most well known two are Sentinel-1 and Sentinel-2, both of which are polarorbiting imaging missions carrying C-band Synthetic Aperture Radar and Multispectral Instrument respectively. Due to the high spatial resolution and complementary information on the spectral and structural characteristics, the integrated use of Sentinel-1 and Sentinel-2 data becomes increasingly popular in extracting artificial impervious areas information (Wu, Guo, & Cheng, 2020;Zhang, Lin, Li, Zhang, & Fang, 2016). Furthermore, recent years have witnessed an emerging of cloud computing platforms, including Google Earth Engine (GEE) (Gorelick et al., 2017), Amazon Web Services (AWS) (Shelestov et al., 2020) and Microsoft Azure (Wilder, 2012). These platforms enable planetary-scale data analysis, thus making fine resolution CAMI mapping feasible.
The purpose of this study is to develop a 10-m resolution CAMI map through synergistically integrating multi-sources data. In this study, four types of datasets including Sentinel-1 SAR GRD data, Sentinel-2 MSI imagery, ArcticDEM and OpenStreetMap were selected and collected for CAMI mapping. We designed a novel framework for automatic mapping of artificial impervious areas and implement it on GEE. We compared our results with existing global artificial impervious areas products as well as ground reference data for validation. The rest of this paper is structured as follows. In the next section, we introduce the study area and data sets used. Section 3 is mainly concerning on the methodology and technological processes for CAMI mapping. In section 4 we illustrate the experimental and validation results. In section 5, some extensive discussions are provided. At the end we offer a conclusive summary for this paper.

Study area
Our study area covers all of earth's northern lands for more than 7 million km 2 , including six regions: Alaska, Canada, Greenland, Iceland, Norway and Russia. The spatial extent of our study area is shown in the Figure 1. The study area includes the entire circumpolar area northward the Arctic treeline. Arctic treeline is the northernmost limit of Arctic Bioclimate Zone where trees can grow and the boundary between tundra and boreal forest, with length of 13,000 kilometers. Beyond the treeline that it is too cold all year round to sustain trees (Smol, Pienitz, & Douglas, 2004).

Multi-source remote sensing datasets
• Sentinel-1 SAR GRD Data The Sentinel-1 constellation (S1-A and S1-B) provides C-band SAR imagery at a variety of polarizations and resolutions, and offers a 6-day revisit time for A/B constellation. (Berger, Moreno, Johannessen, Levelt, & Hanssen, 2012). Among various Sentinel-1 products, we used the Ground Range Detected (GRD) data, which are acquired in interferometric wide swath with dual polarization (VV and VH) and have a 10-m pixel spacing resolution. The Sentinel-1 Level-1 GRD products make available and have been implemented on the GEE, which are suitable for the most of the land application (Balzter, Cole, Thiel, & Schmullius, 2015;Hu et al., 2021;Lehner, Naeimi, & Steinnocher, 2017). Each Sentinel-1 image on the GEE was preprocessed with the Sentinel-1 toolbox, including thermal noise removal, radiometric calibration ant terrain correction (https://developers. google.com/earth-engine/sentinel1). By calculating the mean at pixel level derived individually for ascending and descending orbits and VV and VH polarization respectively, we generated three seasonal time-series metrics (spring, summer, and winter) of backscatter using all GRD images available in 2020 (Todar, Attarchi, & Osati, 2021).
• Sentinel-2 MSI We also collected all the Sentinel-2 MSI Level-2A surface reflectance (SR) data available in 2020, and filtered them with a criterion of cloud cover less than 10%. Cloud and cloud shadows were mask according to the cloud mask band (QA60). Based on the Sentinel-2 composition, we generated all the corresponding median bands reflectance which are resampled to 10 m before (Blue, Green, Red, Red Edge1, Red Edge 2, Red Edge 3, Red Edge 4, NIR, SWIR1 and SWIR2). Then, the 10 th and 90 th percentile of NDVI (Normalized Difference Vegetation Index), NDBI (Normalized Built-up Index) and MNDWI (Modified Normalized Water Index) were calculated given that they have been demonstrated to help identify the impervious areas (Lin, Zhang, Lin, Gamba, & Liu, 2020;Liu et al., 2018;Xu, 2008Xu, , 2010. Meanwhile, we adopted a new variable (MNDWI minus NDBI) to highlight the difference between seasonal dry river bed and impervious areas.
Six textural metrics were generated according to a previous study on urban classification (Attarchi, 2020;Haralick, Shanmugam, & Dinstein, 1973). We calculated the contrast, variance, sum variance, dissimilarity, inertia and cluster prominence based on the Gray Level Co-occurrence Matrix (GLCM) for the Sentinel-2 NIR band with a 7 � 7 moving window kernels .

• ArcticDEM
ArcticDEM was released in 2016 with a high spatial resolution of 2 m, which derived from the optical stereo imagery, and the DEMs were compiled to reduce void areas and edge-matching artifacts. Three metrics, elevation, slope and aspect, were generated from the ArcticDEM .

• OpenStreetMap
OpenStreetMap (OSM) (www.openstreetmap.org) is a free map of the world created by an ever-growing community of volunteered mappers and it provides the main infrastructure information (Hjort et al., 2018). Considering its positional accuracy which had been assessed in the past (Fan, Zipf, Fu, & Neis, 2014;Girres & Touya, 2010;Haklay, Basiouka, Antoniou, & Ather, 2010), it is thought to be a firstorder positional accuracy geodata product for all data used and free-of-charged, especially in referenced geodata lacking areas, and OSM has been used for training data in pervious impervious mapping studies (Lin et al., 2020;Parekh et al., 2021). Although accuracy assessment of OSM is still lack in the Arctic regions, it is a simplification for large-scale analysis, which we assumed that our RF classified model can tolerate the noise caused by the OSM data. The geo-tapped information, i.e. OSM-Buildings, OSM-land use, OSM-Railways and OSM-Roads, covered the study area was used to produce a binary impervious areas map. Furthermore, we reclassified the OSM-land use into some sub-categories, including industrial, resident, public, military, commercial, transport, roads to indicate the impact of varies human activities.

Global impervious areas products
Considering almost no thematic product about circumpolar Arctic artificial impervious areas, two global land cover products in 2020 (GlobeLand30 and Esri-GLC) and a global thematic impervious areas product in 2015, MSMT-IS30, were selected to compare with our new CAMI product.
GlobeLand30 is a 30 m global land-cover dataset produced using the Pixel-Object-Knowledge-based (POK-based) method with a reported overall accuracy of 85.72% (Chen et al., 2015). Recently it was updated to the 2020 version. This new version GlobeLand30 consists of 10 first-class land cover types, and we extracted the impervious areas from 10 types.
• Esri-GLC Esri Land Cover is a latest 10-m global land-cover dataset, and it was produced using a deep learning AI classification model based on Microsoft Planetary Computer (Karra et al., 2021). It contains 10 land-cover classes and was reported with an overall accuracy of 85.96%. Similarly, we also extracted the impervious areas from that the original land cover map.
• MSMT-IS MSMT-IS is a 30-m global impervious areas product produced by multi-source and multi-temporal random forest classification method based on GEE platform . The overall accuracy of the map achieves 96.6%, and the impervious areas in our study area were extracted. Figure 2 presents the workflow of our CAMI mapping method on the GEE platform. Particularly, the framework includes three sequentially integrated steps. In the first part, we developed a training and validation samples collection strategy based on prior knowledge derived from Globeland30 and OSM data. In the second part, we implemented a region-specified Random Forest classification via the 33 features (Table 1) derived from Sentinel-1/2 imagery, Arctic DEM and OSM data. Then, Arctic thematic impervious areas map was generated after filtering the isolated bare lands pixels. Finally, we evaluated our CAMI map with validation samples and three existing impervious areas products. We also analyzed current land use pattern within our CAMI map.

Training and validation samples collection
A three-step training samples collection method was developed to refine training samples from GlobeLand30. Firstly, we randomly generated the geometric location of training samples for 10 land cover types using the stratified random sampling method based on their proportion in GlobeLand30. For each type, the number of the original training samples varies from 1000 to 5000. Among them, samples originally labeled as impervious in GlobaLand30 were assigned as impervious class, while other samples were assigned as non-impervious class. Then, in order to filter the outliers induced by mis-classified or omitted-classified in GlobaLand30, these training samples were clustered using the Self-organizing Map (SOM) clustering algorithm. Finally, the samples in the new clusters were visually check and  Compared with urban areas, the impervious areas in the Arctic are relatively small and scattered. The traditional random sampling method in such a situation may cause an overestimation of user accuracy. Good practices indicate that setting up some validation samples in buffer zones around the real impervious areas within an appropriate distance can reduce this overestimation . Here, we used OSM-Land use sub-data-set as the real impervious areas data and randomly generated a part of validation samples within the OSM patches. In addition, we selected some probability samples in the buffer zones. The detailed steps are illustrated in the following Figure 3.
For the first step, we directly created buffer zones within a specified distance around each OSM-Land use patch. The specified distances were determined so that the buffer zone has approximately the same area as the OSM-Land use patch. Here, we assumed all OSM-Land use patches as regular rectangle, the relationship between the buffer distance and OSM-Land use patch can be described as the following formula: buffer distance ¼ À P þ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi P 2 þ 16A p 8 (1) Where P and A are the perimeter and area of the OSM-Land use patch respectively. The area radio of the buffer zone and the OSM-Land use patch achieves 0.96 to 0.98 statistically. Then, we randomly generated validation samples from the OSM-Land use area and the buffer area respectively. The size of the impervious or non- impervious collection validation samples was the same. Lastly, all the samples were checked by interpretation via available high-resolution remote sensing imagery in Google Earth. Using the new sampling strategy, a total of 2693 validation samples were generated, including 1431 non-impervious samples and 1262 impervious samples.

Impervious areas extraction with random forest model
Random Forest (RF), which has obvious advantages in feature selection and classification, is considered to present a high level of robustness with respect to highdimension multicollinearity data and noise (Oshiro, Perez, & Baranauskas, 2012). RF was chosen here for its relatively low computational cost but high accuracy performance while handling a large number of features with large training sample sizes among several classification algorithms and has been used in previous impervious mapping studies Li, Gong, & Liang, 2015;Li, Wang, Wang, Hu, & Gong, 2014;Liu et al., 2019). There were 33 features derived from multi-source data, including 15 spectral features and 6 textural features from the Sentinel-2 imagery, six seasonal time-series metrics from SAR imagery, 3 topographical features and OSM binary feature, were feed into the RF model. Through a preliminary test, the optimal value for the number of trees was 500, and the number of variables to split on each node was set to 10. We divided the training sample pixels into two parts, 80% of them were used for training, and the remaining 20% were employed for testing the RF classification model. Meanwhile, to speed up the implement efficiency, we built RF classifiers for six regions: Alaska, Canada, Greenland, Russia, except for Iceland and Norway, which were combined due to these two regions have tiny shares of training samples.

Accuracy assessment
To assess the performance CAMI product, we used the validation samples collected by our proposed sampling strategy to calculate the accuracy. Four common accuracy metrics, including the overall accuracy (O.A.), the producer's accuracy (P.A.), user's accuracy (U.A.) as well as kappa coefficient were used to quantitatively assess the classification accuracy. Meanwhile, we also calculated the same accuracy indexes for other three existing impervious areas products (GlobeLand30, Esri-GLC and MSMT-IS) as references. Except for the quantitative estimation, we also analyze their detailed spatial patterns in small scale, in hope to better understand the pros and cons of our mapped results.

Assessment of the impervious areas extent
As shown in Figure 4, the majority of the impervious areas in the Arctic are fragmented and tiny in size, ranging from 42 m 2 to 132.29 km 2 . 94.50% impervious areas in the Arctic are under 20 km 2 , while 9.17% of them exceeds 80 km 2 , that mostly appear in Alaska and Western Russian Arctic. To better show the distribution pattern of the impervious areas in different regions, four representative regions are selected for comparison, including Alaska, Canadian Arctic, Russian Arctic and Greenland. Russian Arctic has the greatest number of impervious clusters, most of which distribute in the eastern and western inner land. However, in other three regions, impervious areas only locate along the coastline.
The meridional and zonal total impervious surface area for each 0:05 � longitude and latitude interval are shown in Figures 5 and 6. We found in the zonal statistics ( Figure 5) result that 90% of the impervious areas distribute between 60 � N and 80 � N and the impervious areas with medium to large size mostly locate between 64 � N and 72 � N. The meridional results ( Figure 6)   The total impervious surfaces area in the Arctic reaches 807.80 km 2 , and Figure 7 presents the impervious surfaces area at national scale. Russian Arctic has the biggest impervious surfaces area, accounting for more than 60.5%, followed by the 20.3% of the Canadian Arctic and 13.2% of the Alaskan Arctic. The three other countries account for less than 5% of the total area.

Accuracy assessment using validation samples
Our result achieves the highest overall accuracy of 86.36% and kappa coefficient of 70.73% as listed in Table 2. The overall accuracy and kappa coefficient are also calculated for the three existing impervious areas products, which are 83.76% and 64.82% for GlobeLand30, 77.50% and 49.20% for Esri-GLC, 76.69% and 52.89% for MSMT-IS30, respectively. Both overall accuracy and the kappa coefficient are lower than that of Esri-GLC and MSMT-IS. As we observed, the validation samples they used are absent or    P.I., producer's accuracy of impervious areas, U.I., user's accuracy of impervious areas, P.N., producer's accuracy of nonimpervious areas, U.N., user's accuracy of non-impervious areas. MSMT-IS30 lacked of impervious areas result in Greenland and Iceland.
Our result outperforms other existing land cover products from the perspective of the producer's accuracy and user's accuracy. Only MSMT-IS30 shows similar performance as CAMI according to the producer's accuracy. However, Esri-GLC presents a negligible higher user's accuracy than that of our products (91.57% VS 90.32%), its low producer's accuracy (48.34%) implies that a huge omission of the impervious areas.
Specifically, the producer's accuracy of impervious areas and kappa coefficient varies in different regions. Canadian Arctic, Greenland, Icelandic Arctic and Norwegian Arctic in our result have the highest producer's accuracy of impervious areas. GlobeLand30 has the best performance in Alaska and MSMT-IS30 performs well in Russia. The user's accuracy from our products exceeds 80% in all the regions.
To further compare the performance of these four products, seven representative plots were selected for comparison as shown in Figure 8. Our result performs analogously with GlobeLand30, but is more sufficient in detailed impervious information, especially for roads. Roads are continuous and connected in CAMI, while they are broken and scarce in other three products. Esri-GLC has the least cover of linear imperviousness among the four products. Moreover, Esri-GLC has been overestimated because that the urban green land and bare lands are also identified as impervious areas. MSMT-IS30 missed impervious areas data in Greenland and Iceland, although it performs well than Esri-GLC in other regions. Figure 8. Comparison between the method result (the third column) and the three other impervious products (the 4th to 6th columns corresponded to the GlobeLand30, Esri-GLC and MSMT_IS) for six plots. The second column is the corresponding RGB-composite Sentinel-2 images in 2020.

Land use of impervious areas in Arctic
Land use changes are caused by human-induced activities and further affect the environment and climate changes in entire Arctic. To further explore their land use status, we presented the land use pattern by calculating their area ratio based on OSM. The results are shown in Figures 9 and 10. Figure 9 shows that Arctic has the most land uses in road, industrial, resident types, except for the unclassified type. Particularly, Russian Arctic, Norwegian Arctic, Canadian Arctic and Alaskan Arctic have the high cover of roads, accounting for 62.65%, 43.42%, 48.48% and 41.84% respectively. Iceland has more resident land as well as Greenland.
As shown in Figure 10, Russian Arctic is the main contributor of industrial land, roads, resident and transport land. More than 50% of the public land portion in the Arctic is dominated by Alaska. Military and commercial land are mostly located in Greenland and Canadian Arctic.

The advantages of multi-source features
It is very hard to accurately map impervious areas using only optical remote sensing imagery due to the spectral heterogeneity. The incorporation of multi-source data features added more useful information except for spectral dimension. In this study, we found multi-source date made contributions to high accuracy of CAMI. OSM, DEM and optical imagery were the three high-contributed data sources. Our research further explored their relative contribution within all 33 features for impervious areas mapping among six regions, which is illustrated in Figure 11. The OSM metrics obviously outperformed in all of the features. One reason is that OSM provides the prior knowledge that might reduce the confusion error caused by bare soils to some extent. On the other hand, OSM contains detailed subcategory of impervious areas, which supplemented abundant impervious information, especially when the impervious areas are not easy to be detected. Although, not all the study area covers the OSM data, our robust RF training model by integrating the multi-source datasets, such as, optical images, SAR data and topographic data, could transfer the prior knowledge to other regions without OSM data via the decision tree model. Figure 12(b) shows the detailed spatial pattern of our impervious areas and it indicates that our method could dig all the impervious areas in the region with incomplete OSM.
Generally, the DEM-related features contributed the second highest, although their importance varies among different regions. Spectral-related features and four indexes (NDVI, NDBI, MNDWI, and the difference between MNDWI and NDBI) were ranked third. In addition, the importance of SAR features was inferior to optical features.
We noticed that the features from the SAR data does not show a high importance compared to the traditional impervious areas mapping in urban areas (Attarchi, 2020;Delgado Blasco, Fitrzyk, Patruno, Ruiz-Armenteros, & Marconcini, 2020). This can be partly explained by the limited coverage of SAR in the entire Arctic, especially for high latitudes. Furthermore, in urban areas, the diversity of the urban building, such as the linear arrangement of houses, the dimensions of the streets, the type of buildings (i.e. numerous planes of the roof), resulting in the higher backscattering than other land cover types, especially the higher double bounce backscattering induced by the dihedrals. While, in our study area, the impervious areas shows same roughness as other land cover types,  Figure 11. The importance of the input features derived from the random forest model using the training samples in six regions. Mean decrease in accuracy was used to assess the importance of the input features, which refers how much the model accuracy decreases if that variable was dropped. The bar with different colors refers the mean importance of the different data sources. such as bare lands near the seasonal dry river bed (Figure 13(a-c)). Meanwhile, the smooth concrete ground and the airport, a common impervious areas type in Arctic, also show similar low backscattering when compared with waterbodies ( Figure 13(d-f)). Relatively low roughness of the impervious areas in the Arctic might cause overestimation of the impervious areas due to the commission of bare lands, while, the omission of the smooth impervious areas might bring a underestimation of it.

Uncertainties and implications of our impervious surface map
Although the proposed method has been demonstrated to have the ability to map the impervious areas in Arctic with a high overall accuracy of 86.36%, there are still some uncertainties. First, Due to the difficulty of collecting training samples in field, our training samples is collected from GlobeLand30. Some scattered and low-density impervious samples might be omitted and this could cause further omission of the scattered and small impervious surfaces, such as the fragmented villages, pipelines and informal trails. Second, such comparably small settlements and linear features cannot be completely captured due to the medium spatial resolution imagery. Figure 14 shows that CAMI has advantages on detailed roads compared to the other products. Moreover, it is still rooms for CAMI to connect the broken roads or pipelines. Third, the existing impervious areas products generally lack of subcategories of impervious areas classes. Although we subdivided them into a given land use attribute based on OSM data, identifying their land use types based on the multi-sources remotely sensed imageries and point of interest data is worthy of more attention.
Despite this limitation, our CAMI product is still a good supplement to previous coarser spatial resolution and incomplete data in the Arctic. We believe that our finer resolution CAMI product can support some significate research, such as the vulnerability of the infrastructure, the urbanization and vegetation changes due to the human activities in this region (Hjort et al., 2018;Miles & Esau, 2017;Raynolds et al., 2014).

Conclusions
It is a challenging task to precisely map the impervious areas in the Arctic due to the small proportion, diversity of its composition, and uneven distribution of area. In this study, we developed a 10-m resolution CAMI map with the use of the Sentinel-1 and Figure 13. Examples of human-impacted areas in comparison to SAR images, Google Earth and classification result with SAR derived features only in Arctic: the first column refers the Sentinel-1 spring-summer-winter composition in 2020, the second column refers Google Earth online hybrid map and the third column refers the classification result based on the SAR derived features. The highlighted boxes in yellow indicate the true impervious areas and cyan boxes indicate the bare lands.
−2 time series data, OpenStreetMap and ArcticDEM via Google Earth Engine platform. Comparing with the existing global land cover products in the Arctic, the CAMI product has a high overall accuracy of 86.36% across the Arctic and this reveals the feasibility of our approach. Furthermore, our map estimates the total impervious surfaces area in the Arctic achieves 807.80 km 2 , and it has the most land uses of roads, industrial and resident lands. Our CAMI map can serve as a base data for studying Arctic infrastructure vulnerability and ecological sustainable development. Higher spatial resolution data source, such as GF series imagery, can be then considered for improving impervious areas mapping in the Arctic. And Further work should focus on monitoring impervious areas dynamic and analyzing its driving factors via the long time-series satellite images.

Disclosure statement
No potential conflict of interest was reported by the author(s).

Data availability statement
The CAMI map will be openly accessed at http://www.doi.org/10.11922/sciencedb.01435 Figure 14. Roads comparison between CAMI and the three impervious surfaces products (the 4 th to 6 th columns corresponded to the GlobeLand30, Esri-GLC and MSMT_IS30) for four plots. The second column is the corresponding RGB-composite Sentinel-2 images in 2020.