Urban land use land cover classification based on GF-6 satellite imagery and multi-feature optimization

Abstract Urban land use/land cover (LULC) classification has long been a hotspot for remote sensing applications. With high spatio-temporal resolution and multispectral, the recently launched GF-6 satellite provides ideal open imagery for LULC mapping. In this study, we utilized multitemporal GF-6 images to generate six types of land features, including spectral bands, texture features, built-up, waterbody, vegetation, and red-edge indices. The minimum Redundancy Maximum Relevance (mRMR) algorithm was employed to optimize feature selection. Subsequently, Random Forest (RF) and Extreme Gradient Boosting (XGBT) were assessed using different feature selections. Besides, various feature configurations were designed for LULC classification and comparison. The results indicate that the mRMR-based RF method achieved the highest overall accuracy of 91.37%. The temporal red-edge indices were important features for urban LULC classification and contributed mainly to grassland and cropland. These results supplement existing classification methods and assist in improving LULC mapping in urban areas with complex landscapes.


Introduction
Urbanization is a significant process of human social development.The urbanization process manifests in geospatial terms as the expansion of urban land, and the concentrated population activities and economic development in cities drive the rapid transformation of land use types, forming an urban environment with a complex surface landscape (Fu and Weng 2016;Dadashpoor et al. 2019).Urban land use/land cover (LULC) is the fundamental data for regional urban status monitoring, dynamics development, and planning management (Vadrevu et al. 2019).Remote sensing data is an efficient tool for mapping large-scale and high-precision LULC maps.In the application of remote sensing monitoring, urban LULC classification faces problems such as 'ecological complexity, phenological diversity, and spectral mixing' (Cai et al. 2019;Jozdani et al. 2019;Pflugmacher et al. 2019), and there are still challenges in obtaining accurate LULC classification.
To address the challenges faced by urban LULC classification, in previous studies, scholars have carried out this task mainly from multi-source features and classification algorithms.Multi-source features have generally been performed by multi-source images combination (Chen et al. 2017;Zhang et al. 2020), the fusion of special bands (Adelabu et al. 2014), and extraction of remote sensing indices and texture features (Stromann et al. 2019;Yanan et al. 2021), to achieve improved accuracy of LULC classification by enriching surface information (Lv et al. 2014).For classification algorithms to determine the urban land use type, the traditional classification models include the maximum likelihood method (MLC) (Patil et al. 2012), the K-mean mean method, the Iterative Selforganized Data Analysis (ISODATA) (Sharma et al. 2013), and the fuzzy clustering method.In the last decade, machine learning classifiers such as support vector machines (SVM) (Chi et al. 2008), tree-based models (Ghosh et al. 2014;Abdi and Sensing 2020), and neural networks (Zhai et al. 2020) have been widely used in LULC studies and emerged as more accurate and effective solutions than traditional parameterization methods.However, some algorithms require complex parameters and are difficult to automate (Rodriguez-Galiano et al. 2012b).The tree-based models employ ensemble learning approaches, utilizing multi-base classifiers to generate robust classifiers through decision rules or error minimization technique, which attracted a lot of interest due to their ease of use and higher accuracy (Belgiu and Dr agut ¸2016;Saini and Ghosh 2019).Random Forest (RF) and Extreme Gradient Boosting (XGBT) are representative tree-based models of bagging and boosting ensemble methods (Breiman 2001;Chen and Guestrin 2016), respectively, and have been used in a wide range of remote sensing classifications.Multiple features are used as input data for the classifier to build classification rules.The number and quality of the input features are related to the classification accuracy, but also to the operation time (Georganos et al. 2018;Zhang et al. 2019).For feature selection, the redundant and invalid features are removed without reducing the important information.Empirical studies have demonstrated that feature selection can strongly influence classification performance in most cases (Eisavi et al. 2015;Rodriguez-Galiano et al. 2012a).Indeed, the random forest embedded feature importance metric that uses out-ofbag (OOB) samples to assess mean decrease accuracy (MDA), and has been applied to feature selection for LULC (Zhang et al. 2019).However, Yang et al. (2017) argue that the MDA metric may be biased and variable.Fortunately, existing research has found that relevance-based feature selection method could achieve the desired classification accuracy using a reduced amount of data (Ma et al. 2017), especially for high-dimensional and redundant features.These characteristics make it more suitable for multi-featured urban cover classification.
In urban surface classification studies, the data sources can be divided into mono-temporal imagery and multi-temporal imagery.Multi-temporal imagery is able to supplement the seasonal change information of classes and assist in land use discrimination (H€ utt et al. 2016;Zhao et al. 2016).Senf et al. (2015) found that remote sensing images in spring and autumn are significant for vegetation cover classification.Multi-temporal images compensate for the lack of single-temporal ones.Still, their use is subject to two limitations: it is not easy to obtain high-quality multi-period images (Knight et al. 2006), and high-resolution images are usually not accessible within a short period (Yang et al. 2015).
With the development of remote sensing technology, satellite observation systems have evolved toward high spatial, temporal, and spectral resolution (Houborg and McCabe 2018), of which the Gaofen series satellite independently developed by China is representative.The GF-6 satellite belongs to the optical remote sensing satellite of the Gaofen series, launched in June 2018, with the high spatial and temporal resolution, multi-spectrum, and wide-width imaging (Zhang et al. 2021).Its multispectral imagery has eight spectral bands, four bands as the GF-1 satellite image, two red-edge bands, a yellow band, and a purple band, and is freely available.The satellite revisit cycle is four days.High spatiotemporal resolution and multispectral imagery can provide sufficient and practical information for urban surface coverage classification.At present, studies on GF-6 have been carried out to assess its potential for agricultural landscape mapping (Xia et al. 2022).However, there are presently no studies that are evaluating the performance of LULC classification of GF-6 satellite images in complex urban landscapes.Based on the data advantages of GF-6 imagery, the classification method of long-time series images and multiple features using GF-6 data is worth exploring.
In order to address the challenge of accurately classifying complex urban land use, this study utilizes the time-series data from GF-6 MFV as the data source.A comprehensive feature set comprising various types of features, including band features, spectral index features, red-edge features, and texture features, is extracted from GF-6 data.The mRMR algorithm is employed for feature selection.Two robust tree classifiers, RF and XGBT, are evaluated using different sets of selected features.Additionally, the study explores the potential of different types of feature combinations in LULC classification.The findings of this research pinpoint the optimal classification scheme using GF-6 images, serving as a valuable reference for achieving high accuracy in LULC monitoring.

Study area
Nanchang, the capital of Jiangxi Province, is one of the central cities in the Yangtze River metropolitan area (Zhang and Xu 2015).It is located between 115 49 0 -116 3 0 E and 28 34 0 -28 48 0 N, and has a subtropical monsoon climate with abundant rain and heat.The primary urban districts, including Xihu, Donghu, Qingyunpu, and Qingshanhuqu, was chosen as study area (Figure 1).It has a slight slope from the northeast to the southwest, with a dense water network.Besides, Nanchang is known as the 'land of fish and rice' because of its well-developed agriculture (Zheng and Qingyun 2021).As a built-up area, the study area is complex in terms of landform characteristics, with various landforms interspersed.According to the actual situation of land cover in the study area, and referring to the widely adopted classification scheme (Brewster et al. 1999) for land use in remote sensing, the types of LULC in this study were classified into six types: forestland, grassland, cropland, built-up land, unused land, and water.

Data
GF-6 Satellite is a low-orbit optical imaging remote sensing satellite.It carries a high-resolution camera (PMS) and a multispectral wide-format camera (MFV).The MFV sensor innovatively realizes the Chinese CMOS detection of 8-band multispectral products (Xia et al. 2022).The bands of GF-6 are expanded to 8, for which band 1-4 are similar as GF-1 and GF-2, and band5-8 are added bands, including two red-edge bands, one yellow band and one purple band.In this study, time series MFV images were used for urban land use classification.The spectral band information of the GF-6 MFV images is shown in Table 1.
We obtained all GF-6 images from the National Resource Satellite Application Platform (http://www.cresda.com).To obtain more information about the phenological changes, we collected six MFV images with acquisition dates corresponding to 19 February 2021, 30 April 2021, 31 August 2021, 28 September 2021, 25 November 2021, and 19 December 2021.They contain seasonal information on the four seasons: spring, summer, autumn, and winter.Radiometric calibration, atmospheric correction, and orthorectification were addressed in the time-serious images using the ENV 5.3 platform.In addition, geographic alignment was performed with Automated and Robust Open-Source Image Co-Registration Software (AROSICS) to match the spatial positions of the different images, and the special error was validated within 1 pixel (Scheffler et al. 2017).We obtained administrative boundary data from the National Geomatics Center of China (http://www.ngcc.cn/ngcc/).

Training and testing samples
It is commonly believed that sample quality could directly affect the accuracy of classification results, so representative pixels should be selected (Liu et al. 2017).To collect

Method
Choosing enough features for data classification is very important, as it can provide as much as possible effective information of different kinds of objects for identifying them.
In this study, we combined bands features, index features and texture features to find optimal feature sets for classification.Within it, red-edge index features are calculated with red-edge bands, as band 5 and 6 of GF-6.We used multi-temporal GF-6 images for urban LULC classification to create time-series vegetation indices, with the image from April 30, 2021 as the main image.First, six GF-6 images are taken to build multiple features.Second, all these features are optimized with the mRMR algorithm to downscale features, meanwhile, two well-known tree machine learning models, RF and XGBT, are used to compare the impact of different feature configurations.Third, seven feature sets are developed to assess the impact of different feature types on urban LULC classification.Finally, the classification algorithms are trained and evaluated on different sets of classification features and the suitable set is explored to improve LULC classification.The main process is shown in Figure 2.

Feature variable set
Spectral bands are the pixel values of different objects generated by the sensor receiving electromagnetic waves reflected from the ground.Usually, ground objects do not differ significantly in terms of individual band information, and index features are able to emphasize the differences among ground objects through band calculation (Yang et al. 2014).In addition, texture features measure the relationship between multiple pixels within an area.The texture features obtained from the grayscale co-occurrence matrix are statistical pixel values in the local image information, that reflect the distribution and variation of pixels with their spatial neighbourhoods, and can attenuate the 'same spectrum and different objects' or 'same object and different spectrum' caused by 'misleading' (Hall-Beyer 2017).In order to fully exploit the heterogeneity characteristics of different ground objects, we selected these six types of features: spectral bands, built-up index, vegetation index, water body index, red-edge index, and texture features according to the land-cover variables based on the GF-6 images.Spectral bands refer to the GF-6 WFV image.The original image includes blue, green, red, near-infrared, red edge 1, red edge 2, purple and yellow bands, a total of 8 band features.
Using the spectral bands of the original image, this study extracted the built-up index, vegetation index, water body index, and red edge index to gain information between different ground objects.The Built-up index (BAI) has good performance in the surface detection of concrete and asphalt, and is a reflection index of construction land (Shao et al. 2016).The vegetation index includes four indicators: normalized vegetation index (NDVI) (Carlson and Ripley 1997), ratio vegetation index (RVI) (Duveiller et al. 2011), soil adjustment index (SAVI) (Huete 1988) and enhanced vegetation index (EVI) (Ahamed et al. 2011).The commonly used normalized difference water index (NDWI) (Gao 1996), mixed water index (CIWI) (Zhang et al. 2018), and simple ratio water index (SRWI) (Zarco-Tejada et al. 2003) were calculated as water index.The red-edge band has advantages in the accurate monitoring of vegetation.The normalized differential red-edge index (NDre) (Zhang et al. 2019) was calculated to enhance the information of the rededge band.The normalized red-edge vegetation index (NDVIre) (Ehammer et al. 2010) is an improved vegetation index that employs the red-edge band instead of the near-infrared band.
We used the grey co-occurrence matrix to extract texture features.Due to the consistency of the ground objects' texture in the same study area, PCA transformation was performed on the main image, and the first two principal components images were extracted to calculate the eight commonly used statistical texture indicators in GLDM, respectively mean, contrast, variance, entropy, homogeneity, second moment, autocorrelation, and dissimilarity.Referring to Zhang's proposal that 7 Â 7 was a suitable window size for texture extraction from medium resolution images (Zhang et al. 2014), this study selects four window sizes of 3, 5, 7, and 9 for comparative experiments.The results show that the window size of 5 Â 5 can extract more texture information from GF-6 images, the step size was set to 1, and the grey level to 64.
Studies have shown that NDVI exhibits regularity in the vegetation growth cycle, and NDVIre has high sensitivity in vegetation identification (Wang et al. 2008).Since seasonal difference is an essential characteristic of vegetation, we used multiple images to construct the time series of NDVI and NDVIre as seasonal features.Taking the GF-6 image on April 30, 2021 as the main data, a total of 8 spectral bands, 26 index features and 16 texture features were calculated as the feature sets for surface coverage classification.The feature list is shown in Table 2.

Feature selection
The abundant image features are conducive to a more comprehensive description of the ground object, but numerous classification features increase the computational complexity and reduce the efficiency of the classifier (Georganos et al. 2018).Among them, the features that contribute little to the classification result could become noise and reduce the classifier accuracy.The purpose of feature selection is to extract practical features for classification from numerous classification features and remove redundant features.The max-relevance min-redundancy (mRMR) algorithm presented by Peng et al. (2005), a filter-type feature selection algorithm, measures the relevance and redundancy of features by calculating the mutual information.The mRMR algorithm specifies two cost functions of information difference (Eq.( 1)) and information entropy (Eq.( 2)), whereupon the information gain of features and target variables is derived to evaluate the relevance.
The redundancy is obtained from the dependency degree between the features.Subsequently, the features are ranked according to their scores.The goal of feature selection is to obtain a set of optimal feature subsets with maximum correlation and redundancy removal.The maximum correlation and minimum redundancy are calculated according to Eqs. ( 3) and (4), respectively.Among them, Iði, hÞ is the mutual information of the feature i and the feature h, S is the feature subset, x i is the i-th feature, and c is the categorical variable.The incremental search method is used to determine the optimal solution of feature comprehensive evaluation operator uðD, RÞ, that is, assuming that Sm-1 is the acquired feature subset, then find the maximum value of uðD, RÞ in the remaining N-Sm-1 features, and select as a new feature.

Classification schemes
We use the optimal feature set selected with mRMR method as one classification sample set, also design six other classification sample sets to compare classification effects with each other.The objectives of the classification schemes in this study cover three aspects: 1) to analyse the classification contribution of the additional spectral bands of the GF-6 MFV imagery compared with the 4-band data, 2) to compare the effects of different feature types on the extraction of urban LULC from the GF-6 satellite images, and 3) to select best feature sets to develop an urban LULC classification method from the GF-6 MFV images.Therefore, using the feature sets of GF-6 images, seven feature schemes were established as shown in Table 3 for classification tests.The results were evaluated based on confusion matrix.Overall accuracy (OA), Kappa coefficient, producer's accuracy (PA) and user's accuracy (UA) are accuracy evaluation metrics.

Classification algorithms
In this study, two tree-based machine learning models are used for classification, RF and XGBT.RF is a non-parametric bagging ensemble algorithm, which is one of the most recognized excellent models in LULC classification using remote sensing data.XGBoost is a relatively new boosting algorithm combined with gradient descent optimization, and it stands out in the field of machine learning due to its robust performance in massive data.In contrast to deep learning models, tree-based models possess certain distinct advantages in low computation cost and stable feature learning ability (Abdi and Sensing 2020).

Random Forest
Random forest consists of the decision tree as the basic unit, and uses the random subspace method to extract samples.The random samples are trained to construct the mapping relationship between features of the sample and targets, and then multiple decision trees are obtained.This helps to reduce the overfitting.After that, the decision trees are assembled through the bagging method, and the classification results are selected by voting.4,5,6,7,8,9,10,12,13,14,17,18,22,24,26,27,29,36 2. Extreme Gradient Boosting XGBT initially works by fitting a decision tree as a base model.Different from the random forest, the samples are trained iteratively in boosting, where each predictor corrects the residual errors made by its predecessor.The gradient descent optimization is employed to build the additive model by using a specified loss function.The XGBT also incorporates parallel techniques to support the speed and accuracy of the process.The final prediction of the model is obtained by weighted summation of each decision tree, and the weights are estimated from the decision tree performance.
In this study, RF and XGBT implemented with Scikit-Learn, which involves some adjustable parameters.In RF, the number of decision trees (Ntree) and the number of extracted features in node splitting (Mtry) are the two most important parameters.Rodriguez-Galiano et al. (2012b) suggested that Mtry be equal to the arithmetic square root of the number of features.The most important parameters of XGBT are the number of iterations (nrounds), the maximum depth of trees (depth), subsample ratio of the training instance (subsample), and regularization parameters (gamma).
The tree-based models usually preform stability when the number of trees or the number of iterations is above 100 (Samat et al. 2020).In this study, grid search is performed using 5-fold cross-validation, and the optimal models are obtained by selecting the best hyperparameters among the pre-selected parameters (Figure 3).

Features selection and classifier comparison
There were two kinds of features importance ranking provided by the mRMR algorithm, one evaluates the relevance between target and variables, and the other method combines the redundancy between variables.In this study, we calculated the variation in classification accuracy for the selected feature type and number using both feature selection methods in RF and XGBT classifiers, respectively.To reduce the computational effort of solving the probability function for high-dimensional features, 50 features were discretized by equal frequency intervals in mRMR algorithm.Then computed max-relevance score and max-relevance and min-redundancy score (mRMR score) were obtained.The scores are plotted as two histograms (Figures 4  and 5), with the order of features on the horizontal axis indicating the order of relevance and the order of features selection, respectively.
As seen in Figure 4, NDVI1 were the most relevant variables, with a score of 1.2.PC1COR and PC1V had a low correlation with values less than 0.018.Among the 20 most relevant variables, seven variables were red-edge features.According to the mRMR ranking, NDVI1, PC2MEAN, and Rededge1 contribute significantly to urban LULC classification.Then,  variables were added one by one to the two tree-based models according to mRMR ranking and Relevance ranking, respectively.The OA and Kappa coefficient were then calculated to identify the relationship between feature configuration and accuracy (Figure 6).
Based on the mRMR ranking, when the number of features grows gradually in the first stage (the number of features < 7), the classification accuracy of both RF and XGBT improves rapidly.Then, as the number of features continues to increase, RF outperforms XGBT, and when the number of features equals 20, RF reaches the highest classification accuracy of 91.37 and Kappa equals 0.90.The best accuracy of XGBT appears to be more RF lagged with an input feature of 23, and the OA and Kappa are 91.30,0.90, respectively.With the further increase in the number of features, RF and XGBT performed consistently, with a certain decrease in classification accuracy, followed by stabilization.In addition, as for the relevance score (Figure 7), The accuracy trends of both algorithms are similar to the former, but with a slow growth trend.RF and XGBT reached the highest OA value of 91% when the first 33 features of the relevance ranking were input.
The results show that employing the mRMR feature selection method is advantageous for both RF and XGBT classifiers, achieving higher accuracy with a small subset of features.Notably, RF outperformed XGBT by attaining the highest overall accuracy (OA) with a reduced feature set of 20.However, as the number of features increased, XGBT demonstrated comparable stability to RF.In this study, the top 20 features in the mRMR ranking were selected as the optimal feature set to extract the urban LULC classification.

Accuracy of different classification schemes
Due to the outstanding and stable performance of RF, it is employed to classify feature fusion images from seven schemes (Figure 8). Figure 9 shows the overall accuracy and Kappa coefficient of the seven feature schemes.Scheme 1 achieved a moderate level of LULC classification accuracy (OA ¼ 81.86%) by solely utilizing the original bands of GF-6.This performance was marginally superior to scheme 2 (OA ¼ 80.91%), which relied on the first four bands of GF-6 (analogous to the 4-band of GF-1 images).Notably, the inclusion of spectral indices or red-edge indices resulted in a significant OA improvement of 8.22% and 8.43% for Scheme 3 and Scheme 5, respectively, yielding overall accuracies of 90.08% and 90.29%.In contrast, Scheme 4 exhibited the lowest OA and Kappa.The OA was reduced by 3.06% compared to the original image classification.Scheme 6, incorporating all available features, achieved an OA of 90.63%.Among all the feature schemes,  Scheme 7 based on the selected optimal features, performed with the highest overall accuracy (OA ¼ 91.37%).The above results suggest that our optimal features, which were selected from original bands, band index, texture, and time-series vegetation index by mRMR, better complement information to distinguish the urban LULC categories.

Comparison of different classification schemes
We evaluated the image classification results of seven feature sets through both visual interpretation and quantitative evaluation.To present the classification details, LULC map was zoomed in Area1 (Figure 10), which contains six feature types.There was obvious misclassification among different features in Scheme 1 and Scheme 2, especially in cropland, grassland, and built-up land.Scheme 4 added GLCM texture; the boundaries of various land features became distinct.However, with the exception of water bodies, the other five categories of features were seriously confused.The classification outcomes of Scheme 3 and Scheme 5 exhibit relatively consistent alignment with the reference images.Additionally, there was a distinct advantage over Scheme 1, Scheme 2, and Scheme 4 in terms of distinguishing between different types of vegetation, which are obvious in Area 2 (a park area).Schemes 6 and 7 had the most effective visual classification, characterized by coherent classes and minimal speckling.
The PA and UA for each category of these seven schemes are shown in Figures 11 and 12.The four additional bands of GF-6 MFV images (Scheme 1) improved the PA and UA on unused land and grassland compared with Scheme 2. Compared to the original image, the utilization of ground object indices (Scheme 3) and the red-edge indices (Scheme 5) resulted in a significant improvement in the UA for various vegetation categories (4.97%-29.67%).Also, the incorporation of ground object indices enhanced the classification accuracy for built-up land and water bodies.Among all the schemes, Scheme 5 exhibited the highest producer's accuracy (PA) for cropland.Similar with the results presented in Figure 9, Scheme 4 produced the lowest PA in various land types (except forestland).In Scheme 6, which encompassed all features, the classification accuracy (both PA and UA) for grassland, unused land, and cropland was comparatively lower than that of Scheme 7.This indicates that all features have an impact on the classification accuracy.The classification results obtained using the optimal features (Scheme 7) exhibit that PA and UA were ranked highly in each category.Specifically, these results ranked in the top two of the seven schemes evaluated, also with good agreement.

Performance assessment of the multi-feature optimization classification
Table 4 shows the confusion matrix of multi-feature optimization classification based on GF-6 MFV imagery.Among all land types, the water and unused land were the top two with PA equal to 100% and 98.21%, respectively.Built-up land achieved the ideal classification results, with both PA and UA of more than 90%.The forestland was also accurate in terms of the PA and UA (88.52% and 96.76%, respectively).Compared to forestland, grassland was not in good consistency with validation data.The PA and UA for grassland were 80.17% and 78.81%, respectively.The confusion between forestland and grassland in the city is a problematic aspect of the land-cover classification, which is related to their complex surface distribution in the urban area.In this study, the confusion between forestland and grassland was controlled at 7.41%, which is a relatively desirable value.The PA and UA for cropland were in consistent equal to 89.84%.The temporal vegetation index (the red edge index included) could considerably extract cropland.
We also compared our classification with the ESA WorldCover (Zanaga et al. 2022) and GlobeLand30 (Jun et al. 2014).These two classified products of the study area, produced in 2021 and 2020 with a resolution of 10 m and 30 m respectively, were used as samples.To ensure a meaningful and fair comparison, it was necessary to account for the varying LULC categories of the different products.Among the three products, we focused on six land categories in this study that were common and comparable.While, 'wetland'  category was not included in the comparison due to its limited extent.Figure 13 shows the selected sub-region in three products (same place with Figure 10).The complete LULC maps of the study area was shown in Appendix B. And the confusion between cropland and grassland poses a significant challenge in the LULC classification (Schulz et al. 2021).Therefore, it is understandable that there are distinctions between these two land types across various datasets.Comparison with GlobeLand30 reveals that both ESA WorldCover and our multi-feature optimization method performed better in the details of urban LULC classification.When examining the riverbank area, ESA WorldCover and GlobeLand30 have classified it primarily as agricultural land, whereas our findings primarily mapped natural vegetation.A likely reason for this phenomenon is that grassland or cropland along the braided bars of rivers is more sensitive to seasonal changes.Regarding the built-up portion of the subregion, ESA WorldCover tends to overestimate cropland, and there is also confusion between construction land and bare soil.As for our results, a limitation is that the interface between vegetation and buildings is often interpreted as grassland.In addition, the OA of the different classification results was computed, with multi-feature optimization classification ranked first in the accuracy among the three classifications, followed by ESA WorldCover and GlobeLand30, with accuracies of 77.16% and 53% respectively.

Discussion
Accurate land use is crucial for numerous applications, but effectively mapping complex urban landscapes remains serious challenges.However, with the advent of diverse remote sensing technologies, there are now valuable opportunities to achieve large-scale and highly accurate urban LULC mapping.In this study, we used time-series GF-6 MFV images to extract multiple indices and grayscale co-occurrence matrices to capture the spectral and texture variations of various features.And a total of 50 features were created.Subsequently, mRMR performed two types of feature selection.And we employed two tree-based models and combined different input features to evaluate their classification accuracy.Finally, we fused various types of feature combinations for mapping to compare their performance in urban LULC classification.
Our results demonstrate that both RF and XGBT can enhance model performance when using reduced features under optimal parameter selection, which supports the finding of Belgiu and Dr agut ¸(2016) that eliminating redundant features is significant for LULC classification.Comparing the two feature selection methods, we found that the minimum redundancy maximum relevance feature selection requires fewer features than the correlation-based feature selection to achieve the highest accuracy.Incorporating the mutual information between features and feature-target can extract features more efficiently thus enhancing sample reliability and make classification, which further validates the findings of Kiala et al. (2019).In this study, the mRMR-based RF classifier was the superior model, using only 20 variables to achieve higher classification accuracy.While, XGBT exhibited lower performance than RF when fewer features were employed.Both models showed some degradation in performance but maintained at a certain level when all features are processed, which is consistent with the theory that tree-based models are less prone to overfitting (Georganos et al. 2018;Schulz et al. 2021).
Based on the feature importance of mRMR, the four new added bands of GF-6, particularly the two red-edge bands, exhibited a strong ability to distinguish LULC.In comparison to the purple and yellow bands, the red-edge bands contain more information for classifying LULC (Saini and Ghosh 2019).Furthermore, in the classification results with various feature types, it was observed that the four additional bands enhance the accuracy of identifying grassland and bare land.Additionally, the utilization of ground object indices proves beneficial in effectively distinguishing different land types.Moreover, the red-edge features derived from the time series exhibit remarkable capability in classifying different vegetation and enhancing the classification of other features.However, it is important to note that not all features can contribute to the LULC classification (Bai et al. 2021).In this study, the redundant texture features introduced the interference in the LULC classification process.These findings corroborate previous research (H€ utt et al. 2016;Stromann et al. 2019;Schulz et al. 2021), underscoring the importance of considering feature diversity and temporal variation for achieving favourable results in LULC classification.
The combination of GF-6 images and feature optimization showed significant potential for accurately mapping LULC in urban areas, with an overall accuracy of 91.37%.Specifically, our LULC mapping results exhibit high classification accuracy for built-up land, unused land, and water bodies, all exceeding 90%.Meanwhile, it shows consistent and accurate mapping on cropland.This result indicates that the GF-6 images stand out in identifying various phenological characteristics of agricultural crops, aligning with the observations made by Xia et al. (2022).However, the accuracy of grassland was comparatively lower, and the confusion between grassland and forest land is high, probably due to the random scattered distribution of dense or sparse vegetation in urban areas (Guo et al. 2011;Kowe et al. 2020).
Although our findings offer valuable insights into the classification of urban LULC using GF-6 images, there are certain limitations that need to be addressed.Firstly, the LULC types used in this study are limited in scope.It would be worthwhile to further explore the classification potential of the proposed classification method and the GF-6 feature set across a broader range of LULC types.Additionally, future studies should consider incorporating other comparable satellite observations such as Landsat 8 and Sentinel-2.By comparing different data sources and investigating the synergy between these sources and GF-6 data, the classification accuracy of urban LULCs can be further enhanced.

Conclusions
The GF-6 MFV sensor offers a publicly accessible data source that consists of high-resolution images with both spatial and temporal precision.It includes additional two rededge bands, purple and yellow band, and provides imagery with a temporal resolution of 4 days.This study focuses on investigating the capabilities of GF-6 imagery for urban LULC mapping.We developed a feature filtering classification framework that involves extracting multi-temporal image features.To identify the most informative features, we employed the minimum Redundancy Maximum Relevance (mRMR) algorithm to generate subsets of these features, combined with two robust classifiers, RF and XGBT, for classification task.The various feature configurations were compared to assess their performance on LULC classification.
Our results indicate that the inclusion of more features does not necessarily lead to an improvement in the accuracy of LULC classification.By utilizing mutual information, the mRMR method effectively selects features, consequently enhancing the classification performance of RF and XGBT.The characteristics of GF-6 for LULC classification show improvements in various aspects.Firstly, in terms of bands, the four new bands of GF-6 MFV can contributes to reducing the confusion problem of vegetation and bare land.Secondly, temporal features.The utilization of spectral indices across multiple time series windows is a key to accurate LULC classification.In particular, the inclusion of temporal red-edge features can prove valuable in distinguishing different feature classes.The evaluation of the LULC classification based on mRMR and GF-6 images exhibit an overall accuracy of 91.34%.And most of the feature classes have PA and UA above 85%.These results affirm the significant potential of GF-6 imagery for LULC classification in complex and diverse urban areas.

Figure 1 .
Figure 1.Location of study area, with true colour composite map illustrating full extent of the study area.

Figure 2 .
Figure 2. The overall framework of the urban land use/land cover classifications using GF-6 imagery.

Figure 3 .
Figure 3.The classification process for Random Forests and Extreme Gradient Boosting.

Figure 4 .
Figure 4. Relevance ranking based on mRMR algorithm.The horizontal axis represents the feature name.The vertical axis represents the relevance score of each feature.

Figure 5 .
Figure 5. Max-relevance and min-redundancy ranking based on mRMR algorithm.The horizontal axis represents the feature name.The vertical axis represents the mRMR score of each feature.

Figure 6 .
Figure 6.Relationship between number of features, OA and Kappa based on max-relevance and min-redundancy ranking for the two tree-based classifier.

Figure 8 .
Figure 8. Classification results of seven feature schemes; the black rectangle is area 1, the black ellipse is area 2.

Figure 7 .
Figure 7. Relationship between number of features, OA and Kappa based on relevance ranking for the two treebased classifier.

Figure 9 .
Figure 9. Comparisons of overall accuracy and Kappa derived from RF in different feature schemes.

Figure 10 .
Figure 10.Detailed classification results of seven schemes in area 1.

Figure 11 .
Figure 11.The producer's accuracy of different categories in 7 schemes.

Figure 12 .
Figure 12.The User's accuracy of different categories in 7 schemes.

Table 1 .
Spectral bands of the GF-6 satellite.

Table 3 .
Experimental schemes for seven feature sets.

Table 4 .
Confusion matrix of the classification results using the optimal features.Forestland, B -Built-up land, G -Grassland, U -Unused land, C -Cropland, W -Water.