Assessing the Parameterization of RADARSAT-2 Dual-polarized ScanSAR Scenes on the Accuracy of a Convolutional Neural Network for Sea Ice Classification: Case Study over Coronation Gulf, Canada

Abstract Arctic amplification has many impacts on sea-ice extent, thickness, and flux. It becomes critical to monitor sea-ice conditions at finer spatio-temporal resolution. We used a simple convolutional neural network (CNN) on the RADARSAT-2 dual-polarized ScanSAR wide archive available over Coronation Gulf, Canada, to assess which SAR parameter improves model performances to classify sea ice from water on a large volume of data covering 11 years of ice and surface water conditions. An overall accuracy of 90.1% was achieved on 989 scenes of 100% ice cover or ice-free conditions. An accuracy of 86.3% was achieved on the last year of data (134 scenes) which was kept out of the training process to test the model on an independent dataset. A better accuracy is obtained at lower incidence angles and the HH polarization provides the most information to classify ice from water. To achieve the best accuracy, the incidence angle and the noise equivalent sigma-nought had to be included as input to the model. A comparison done with the ASI passive microwave product shows similar errors in total sea ice concentration when using the Canadian Ice Service regional charts as reference. Nonetheless, errors from both datasets differ and the CNN outputs show greater potential to reduce masked areas, given the better spatial resolution, enabling data classification closer to land and identify features not captured by the ASI dataset.

L'amplification arctique a eu de nombreux impacts sur l' etendue, l' epaisseur et le flux de la glace de mer.Il devient critique de surveiller les conditions des glaces de mer a une r esolution spatio-temporelle plus fine.Nous avons utilis e un simple r eseau neuronal convolutif (CNN) sur l'archive ScanSAR a double polarization RADARSAT-2 disponible sur le golfe du Couronnement, Canada, pour evaluer quels param etres RSO am eliorent les performances du mod ele pour s eparer la glace de mer de l'eau sur un grand volume de donn ees couvrant 11 ans.Une pr ecision globale de 90,1% a et e obtenue sur 989 sc enes de couverture de glace a 100% ou de conditions sans glace.Une pr ecision de 86,3% a et e obtenue sur la derni ere ann ee de donn ees (134 sc enes) qui a et e tenue a l' ecart du processus d'entraînement pour tester le mod ele sur un ensemble de donn ees ind ependant.Une meilleure pr ecision est obtenue a des angles d'incidence plus faibles et la polarisation HH fournit le plus d'informations pour distinguer la glace de l'eau.Pour obtenir la meilleure pr ecision, l'angle d'incidence et l' equivalent de bruit sigma-z ero ont dû être inclus comme intrants au mod ele.Une comparaison avec le produit de concentration de glace de mer ASI d eriv ees des micro-ondes passives montre des erreurs similaires dans la concentration totale de glace Introduction Arctic air temperatures have been rising twice as fast as the global average (Meredith et al. 2019;Richter-Menge et al. 2017;Notz and Stroeve 2016) and projections indicate further warming in the Arctic (Overland et al. 2019).This is especially true in the Canadian Arctic where observed air temperatures increases are 3 times higher than the global average (Bush and Lemmen 2019).With this rapid increase in air temperatures, major changes in Arctic sea-ice extent, thickness and flux have been reported (Derksen et al. 2019;Howell and Brady 2019;Mudryk et al. 2018;Tivy et al. 2011).Maximum sea ice extent is decreasing, greater multi-year ice (MYI) fluxes has been measured from the Arctic Ocean to the Canadian Arctic Archipelago (CAA) and less seasonal ice survives a summer melt season (Howell et al. 2023;Howell et al. 2015;AMAP 2017).Arctic ice-free conditions scenarios are becoming more probable even under global temperature rise of 2 C (Screen 2018;Sigmond et al. 2018).Attention has been given toward the maximum and minimum extents of sea ice in summer but ice conditions throughout the winter season has seen extensive changes over the last decades (Notz and Stroeve 2016).These responses of the sea ice to warmer temperatures all year round have major impacts on the climate, ecosystems, communities, wildlife, and the economy in the Arctic (Huntington et al. 2022).
Currently, sea-ice charting produced by national ice centers relies mainly on manual digitization of sea-ice features such as concentration, stage of development (proxy for thickness categories) and floe size (Tivy et al. 2011) interpreted visually by trained experts.The main objective of these charts is to facilitate shipping activity, including icebreaking in ice-infested waters.This implies that the main charted areas are around known shipping routes.Dawson et al. (2018) have shown that shipping activity has increased in the CAA since the 1990s due to later freeze-up and earlier break-up of the sea ice, extending the period where ships can navigate these waters.With the current climate projections, other routes will open (Mudryk et al. 2021).This poses a challenge for national ice services who will require additional resources to analyze data over a large area and for longer periods of the year.Even though reports say that sea-ice extent and thickness is reducing, sea-ice conditions must still be considered severe and be closely monitored (Haas and Howell 2015) especially in the CAA where more hazardous ice like MYI floes have been seen (Howell et al. 2015), sometimes even exceeding the outflow of MYI from Nares Strait (Howell et al. 2023).
Though less sea ice might have positive impacts on Arctic economy (Melia et al. 2016), it is having negative impacts on Arctic wildlife like sea birds (Wong et al. 2018), polar bears (Florko et al. 2020), caribou (Environment and Climate Change Canada 2021) and narwhals (Hauser et al. 2018).While daily and ondemand higher-resolution charts are produced by the ice services, they are limited to areas with active shipping.These products are not designed for wildlife habitat monitoring.For large scale and long-term studies, they can prove useful but for local scale and short-term monitoring, the spatial and temporal resolution of these products are not adapted to link the sea-ice conditions to wildlife behaviors.
Other products such as passive microwave remote sensing products achieve up to daily temporal resolution and up to 3.125 km spatial resolution (Beitsch et al. 2014).The spatial resolution is more applicable to wildlife habitat monitoring and the temporal resolution is very suitable for such studies.Nonetheless, passive microwave products are known to have many limitations like; melt ponds detected as water, thin ice seen as reduced concentration and require atmospheric corrections (Ivanova et al. 2015, Spreen et al. 2008).
National ice services like the Canadian Ice Service (CIS) of Environment and Climate Change Canada (ECCC) use synthetic aperture radar (SAR) satellite imagery as their main source of data to generate ice charts.C-band SAR data was chosen for the Canadian RADARSAT program for its ability to provide contrast between the different ice types and water but also its ability to see through clouds and be independent of lighting conditions since it has its own illumination source (Parashar et al. 1993).SAR data provide higher resolution imagery at a lower spatio-temporal coverage compared to passive microwave products, but they are less affected by atmospheric conditions (Joshi et al. 2016).For example, the pixel spacing of 50 m of the ScanSAR mode of RADARSAT-2 provides higher resolution data.Currently, CIS uses imagery from the Radarsat Constellation Mission (RCM) as its main source of SAR data to produce ice charts.With the multiplication of C-band SAR data like Sentinel-1 and the recently launched RCM, more data is made available, improving the spatial and temporal coverage of SAR data, enabling daily coverage over many areas.While the regional ice charts serve as a weekly snapshot of ice conditions, to serve as a basis for establishing the World Meteorological Organization (WMO) mandated sea-ice climatology and sea ice normals, the daily and on-demand ice charts only target areas with active shipping, at finer scales.Though this provides a rich database of information, it is often not sufficient for other applications such as sea ice safety monitoring for northern communities and wildlife habitat.
Recently, many studies have shown the potential of using machine learning algorithms with the large volume of C-band SAR data available to map sea ice conditions.Zakhvatkina et al. (2019a) published an overview of the different machine learning techniques that were developed since the 90s to map sea ice conditions using SAR data since the launch of the European Remote Sensing Satellite À1 (ERS-1).More recently, focus was given to deep learning algorithms like convolutional neural networks to produce many of the variables included in ice charts (Stokholm et al. 2022;Radhakrishnan et al. 2021;Zakhvatkina et al. 2019b;Wang and Li 2021;Wang et al. 2016).These studies have shown that using CNN on dual-polarized RADARSAT-2 (RSAT-2) and/or Sentinel-1 (S-1) provides comparable results to other products in terms of sea-ice concentration (SIC) such as the ones derived from Multisensor Snow and Ice Mapping System (Wang and Li 2021) and the Danish Meteorological Institute (Stokholm et al. 2022).Some studies combined passive and active microwave imagery to produce SIC that perform better than the Arctic Radiation and Turbulence Interaction Study (ARTIST) Sea Ice (ASI) and the NASA team 2 (NT2) products (Radhakrishnan et al. 2021).
This study uses a similar CNN architecture implemented by Wang et al. (2016) on the dual-polarized RSAT-2 image archive available over the marine region of Coronation Gulf, Canada.The focus of this study, compared to other similar studies, is to identify which component of the SAR imagery, e.g., which polarization or static feature like incidence angle or antenna noise profile, improves model accuracy using more than a decade of sea-ice and water observations.The study area, data used, and model will be presented in the method section and model accuracies and image parameterization will be discussed in the results section.A comparison with existing SIC products will also be presented in the latter section to discuss how the use of an ice/water classification produced from a simple CNN can be useful for applications beyond shipping.

Method
This section describes the study area, the reasoning behind its selection for this study, how the training data was selected and how the CNN model was developed.

Study area
The region of interest for this study is Coronation Gulf (CG) in the Canadian Arctic Archipelago.This area is situated between Victoria Island and mainland Nunavut in Canada and is considered part of the North-West Passage shipping route (Figure 1).According to CIS weekly regional charts (Figure 2, Table 1), the area is completely covered in land fast ice for many months in the winter and is completely ice free for many months in the summer.Not having to segment selected imagery due to their simple ice conditions (ice free and fully ice covered) facilitated data selection.

Data sets
The CNN was trained to classify sea ice from water, using RSAT-2 ScanSAR data acquired from 2008 to 2020 (Table 2).The selection of the training and validation data as well as their labels, i.e., ice or water, was done using the CIS regional ice charts (Table 1) available on the CIS website (https://iceweb1.cis.ec.gc.ca/Archive/page1.xhtml?lang=en).The CIS regional charts were also used as reference to compare SIC outputs from the CNN model and the Arctic Radiation and Turbulence Interaction Study (ARTIST) Sea Ice (ASI) algorithm (Spreen et al. 2008), for CG between 2008 and 2020, to assess the potential of the CNN model to derive SIC in different ice-cover conditions than the ones it was trained on.

Canadian ice service charts
Mainly for climatology, the CIS publishes weekly ice charts, valid every Monday, covering large regions of Canada: Western Arctic, Eastern Arctic, Hudson Bay, East Coast and Great Lakes (Tivy et al. 2011).Figure 2 shows an example of a regional chart of the Canadian Western Arctic.The ice concentrations, types and floe sizes are expressed in the egg code (Cheng et al. 2020;Tivy et al. 2011).This code can also be expressed in the WMO Sea Ice GRID (SIGRID) format (IICWG: SIGRID-3 2014).
These charts were used to determine the periods when CG was fully covered in land fast ice (10/10 ice concentration) and when it was ice free (0/10 ice concentration).Table 1 shows the start and end dates of each year from which the RSAT-2 scenes were selected.The year indicated in Table 1 correspond to the end of ice season, e.g., ice free period for 2010 started in the summer of 2009 and the ice cover period started in 2009 and ended in 2010.A 2-week buffer was used for the imagery selection, i.e., 2 weeks after full freeze-up (start of 10/10 ice concentration for all of CG) and 2 weeks before ice break-up for the full ice cover and idem for the ice-free periods; to ensure pure ice only and water only image samples.This accelerated the labeling process of the image samples since they did not require manual inspection to determine which belonged to the ice and water classes.
The CIS does produce ice analyses from single SAR images (Cheng et al. 2020) but since their main focus is charting high traffic areas (Canadian Ice Service 2015) and not many ships travel in the Coronation Gulf in months where it is not ice-free (Mudryk et al. 2021;Dawson et al. 2018) not many image analyses are available for the study area, hence the use of the weekly regional charts for this study.

RADARSAT-2
RSAT-2 is an operational Canadian C-band SAR launched on December 14, 2007.Since then, the Radarsat Constellation Mission (RCM) was launched in June 2019 and the Government of Canada (GoC) has transitioned to RCM in 2020 as its primary source of C-band SAR data.This is why the available data for this study spans from the early days of RSAT-2 (2008) to the transition time of the GoC to RCM (2020) even though RSAT-2 is still operational.
Though RSAT-2 has many beam modes available comprising of different viewing geometries,  polarizations, spatial resolutions and swath widths, this study focuses on the ScanSAR wide (SCWA) beam mode which is the most used beam mode by the CIS to map sea ice conditions and is the most abundant data type available of the RSAT-2 data archive accessible over sea ice.
Incidence angles of the SCWA beam mode vary from 19 in the near range to 49 in the far range of the image.Signal intensity of both sea ice and water are highly dependent on incidence angle (Dierking 2013).This can prove challenging and many techniques to normalize signal intensity have been developed (Stokholm et al. 2022;Mahmud et al. 2018;Zakhvatkina et al. 2017).Influence of incidence angle on CNN accuracy will be shown.
ScanSAR modes of RSAT-2 only allow dual or single polarization acquisition.In dual polarization, 1 polarization is transmitted (H or V) and both are received, e.g., HH and HV for H transmitted and VV and VH for V transmitted.
The pixel spacing of a SCWA scene is of 50 m.To assess the potential of CNNs to be used for other applications than shipping, each scene was divided into chips of 20 Â 20 pixels, without overlap, to achieve a 1-km output resolution (Henderson et al. 2021).Given the presence of small islands in CG and the exclusion of samples containing land pixels in them for this study, the sample size was chosen in order to avoid masking large portions of CG from the analysis.Each chip includes 4 bands.One band per polarization (HH and HV), a third one for incidence angle and one for the noise equivalent sigma-nought (NESZ).Both incidence angle and NESZ layers are interpolated across the range of the image from a reduced number of samples (i.e., 2 and 100 values for angle and NESZ respectively) provided in the image product metadata.The impact of each band on the CNN accuracy will be discussed.
The 2 polarization bands were calibrated using the sigma-nought (r 0 ) look-up table (LUT).The other 2 calibration LUTs, gamma-nought (c 0 ) and betanought (b 0 ) were tested and model accuracies using the different LUTs are presented.

Training, validation, and testing datasets
To cover as many ice and water conditions and SAR parameters, 1,123 ScanSAR scenes were selected between August 21, 2009 and July 16, 2020 (Table 2).A total of 586 scenes were selected during the full icecovered periods and 537 scenes for the ice-free periods.Of those, the last year of data (2019-2020, 45 and 89 scenes for the ice and water classes respectively) was selected to test the model performance and were not included in the training/validation of the CNN model to test it on an independent dataset.The 989 scenes used to generate the CNN model, a total of 12 749 697 chips, were split 70/30 between training and validation of the model.To this training set, 1,010 scenes were added for SIC comparison analysis where mixed ice/water conditions was present according to the CIS regional charts.The 2-week buffer periods described in the previous section were included in these 1,010 scenes to have a continuous time series over the studied period.Table 2 gives the distribution of available scenes per year and class.
The distribution of the training/validation image chips gives a class ratio (q) of 1.38 larger for the ice class which can be considered as a balanced training dataset especially with such a large number of samples (Buda et al. 2018).To reduce any influence from spatio-temporal autocorrelation between image chips, all   1 where the mixed coverage corresponds to the images available after the full-ice coverage (10/10 landfast ice) period, i.e., the break-up period of the given year.
training/validation chips were randomly shuffled before being used as inputs to train the CNN.The final pre-processing applied to all layers of the chips was image normalization, to improve the mode training accuracy and efficiency (Ioffe and Szegedy 2015): Where Îi is the normalized pixel value, I i the pixel value, I i and r i the mean and standard deviation respectively of all chips and classes of the training/validation dataset for a given band i.
Any duplicate RSAT-2 image products were discarded to keep 1 single image for a given acquisition.No overlap between the image chips was created.For the training/validation datasets, the edges of the images that did not have 20 Â 20 pixels and chips with land and/or no data pixels were excluded.

Convolutional neural network
The convolutional neural network (CNN) used in this study is very similar to the one developed by Wang et al. (2016).The CNN was chosen for this study since, like trained human analysts, the algorithm is trained to recognize spatial patterns and features of specific targets at different spatial scales (LeCun et al. 1999).For sea-ice mapping, unlike expert analysts, the CNN used here does not include temporal information of past ice conditions or auxiliary data such as weather and climatology information or other sources of earth observation data.This is done willingly to determine the SAR data parameterization that affects the performance of a CNN to detect sea ice from water using a single SAR image.
The CNN model was coded in Python 3.6 using Keras 1.0.8library of Tensorflow 2.0.0 (Abadi et al. 2015).Table 3 shows the architecture of the CNN used.The effective spatial resolutions are given in brackets in Table 3.The CNN has 2 convolution layers using 128 different filters in a 5 Â 5 sliding window.Both layers are followed by a max pooling layer using a 2 Â 2 sliding window to reduce the dimensionality.The fully connected layers (FCLs) were added, the first having 1,024 nodes, the second 128 nodes and the last, a single node for the binary output probability.All layers except the pooling layers require an activation function.The rectified linear unit (ReLU) function was selected for all layers except the last FCL.This function removes negative values by setting them to zero and is a good compromise between computation efficiency and overall generalization accuracy (Krizhevsky et al. 2012).This is the most commonly used activation function in recent sea-ice classification studies using a CNN (Stokholm et al. 2022;Wang et al. 2016).To ensure a smoother, monotonic, and continuous output probability of the CNN, the sigmoid function was used in the last FCL (Han and Moraga 1995).The values of a sigmoid function range from 0 to 1 with a smooth S-shape transition between the 2 extremes making it a good option for a binary classification like the one presented here.
The output of the CNN corresponds to the probability of the image chip to be water (P water ) and its inverse, the probability of the chip to be sea ice (P ice ): (2) Given the large number of chips to classify (12 749 697), a 70/30 split was used in the available data for model training and validation respectively.In order to avoid overfitting individual nodes of the first 2 FCLs, 10% of the nodes were randomly dropped at every training iteration (epoch) (Tompson et al. 2015).The stochastic gradient method using the binary crossentropy was used for its algorithm stability (Hardt et al. 2016).To improve the model training accuracy and efficiency, an adaptive learning rate was used, and the training was also forced to stop if no significant change was observed in the validation accuracy after 10 epochs (Bengio 2012).The learning rate was reduced by 30% after 5 epochs if the training accuracy of the model did not improve by more than 3%.Finally, the metric used to monitor the improvement of the mode for each epoch was the accuracy metric (Acc): Where TP i is the total number of true positives for a given class i, i.e., where the probability of class i (Equations 2 and 3) is greater than 0.5 and N the total number of chips classified.Since the objective of this study was not to obtain the best model to classify ice from water, model output probabilities were not calibrated, and the simple 50/50 cutoff threshold was used to classify ice from water (Guo et al. 2017).
To assess the impact of each of the SAR bands, CNN models for the 2 main channels (HH and HV) were generated individually and together.Models were also generated for the different band combinations where HH and/or HV is always included.Results of the CNN models performance will be discussed in the results section.

Results
In this section, the results from exploring a big dataset of 1,123 RSAT-2 images will be shown and general relationships between target backscattering and sensor properties will be presented.CNN model training and validation results will be shown and its performance in relationship with the SAR parameters for each class will be presented.Finally, a comparison between 3 sea-ice products will be presented to assess the potential of a simple CNN, like the one presented here, to estimate SIC for a larger marine region like CG over a period of more than a decade.

Big data distribution analysis
Given the large dataset used in this study, it is important to assess the distribution of the dataset with the different imaging parameters.Figure 3 shows box plot distributions of the r 0 intensity for bins of 1 of incidence angle of the HH polarization for ice (green) and water (blue) classes.The orange line within the box corresponds to the median value of the distribution.The box edges correspond to the lower and upper quartiles of the distribution and the whiskers correspond to the 5 th and 95 th percentiles of the distribution.The NESZ curve is shown in Figure 3 to give a sense of the signal to noise ratio of the backscattered signal.
Relationships between the backscattered signal of the HH polarization and the incidence angle for both classes are also shown in Figure 3 and are obtained with Equations ( 6) and ( 7).Coefficient of determination between the median values and the incidence angles of R 2 ¼ 0.995 and R 2 ¼ 0.999 were obtained for the ice and water classes respectively.Given the selected periods for the full-ice coverage over CG (Table 1), this relationship only applies to thicker sea ice where the thinnest ice type recorded by CIS regional charts is thin FYI (30-70 cm).Also, CG has seen very little MYI in the studied period.This means Equation ( 6) is only applicable to SAR normalization of FYI.Similarly, with the bottom whiskers being dependent on NESZ for the water class (Figure 3), this means that Equation ( 7) is only applicable to the specific beam mode of RSAT-2 used in this study.Figure 3 also shows that the ice signal has less of a slope with incidence angle compared to water meaning the backscatter from a water surface is less sensitive with incidence angle.
Figure 4 shows the same boxplot distribution per degree of incidence angle as Figure 3 but for the HV polarization.NESZ is also shown in Figure 4 to show that since the backscattered signal is very low for the HV polarization, the intensity follows an incidence angle dependency close to the NESZ.This is why no relationship similar to the HH polarization (Figure 3) could be obtained.Correlation coefficients of R 2 ¼ 0.164 and 0.166 were obtained between the HV and NESZ signals for the ice and water classes respectively.This shows that even though the 2 signals have similar dependencies with incidence angles, they are not correlated.It should be noted that the NESZ curve is an interpolation of 100 values dispersed over more than 10 000 pixels in range.It may thus not represent the exact NESZ curve, thus explains why parts of the distributions are below the noise floor.This was also noted by previous studies where the backscattering values in the cross-polarized channels were close to the noise floor (Aldenhoff et al. 2020).

CNN training and validation
Figure 5 shows the evolution of the binary crossentropy loss function and CNN model accuracy for each epoch.The model was optimized through backpropagation and the performance was assessed using the validation dataset.Training stopped after 60 epochs since there were no improvement in the validation accuracy between epochs 50 to 60 even though training accuracy was still rising.This forced stop prevented the model to be overfitted on the training dataset.An overall accuracy of 91.1% and 90.1% were achieved for training and validation datasets respectively (Table 4).

RADARSAT-2 parameterization accuracies
With the trained CNN model using all 4 bands, the entire training/validation dataset was classified, and Figure 6 shows the prediction distributions per class for each degree of incidence angle.
From Figure 6, we see that for all the incidence angle range, the CNN performs better for the ice class compared to the water class.The median values are higher, and the distribution is less spread for the ice class.Better prediction accuracy is observed at lower incidence angles.To see the impacts of the other SAR image bands on the model accuracy, Table 4 shows the accuracies and losses for the different SAR parameterizations for the same training and validation datasets.
Results show that the calibration between r 0 , b 0 and c 0 does not have any impact on the CNN results.This is to be expected since over marine regions, there is no terrain compared to high topography on land, resulting in a uniform geometry.For all the other channel combinations, a McNemar test (Iban and Sekertekin 2022;Fang et al. 2020;McNemar 1947) was conducted with the 4-band model as reference, to see if the reduced number of errors in the 4channel model were statistically significant.The results (statistics not shown in Table 4) suggest that the accuracy improvements are statistically significant with p-values < 0.01.The channel giving the best results alone between the 2 SAR polarizations is the HH channel compared to the HV channel.Nonetheless, with a validation accuracy of 0.693 (Table 4), this indicates that there is significant information provided by the cross-polarization channel.This is also shown with the 6.5% increase in the validation accuracy when combining the 2 polarizations compared to HH alone, knowing that the improved accuracy is statistically significant with a p-value < 0.01 of the McNemar test.
Adding the incidence angle information to the single polarization channels slightly improves the validation accuracy of the HH model but has a more significant impact on the HV model with a 10.1% increase in validation accuracy.Likewise, adding the NESZ information to HH and HV, rather than the incidence angle has a similar impact on the model accuracy but with a lesser impact on both polarizations.
Combining the incidence angle information and the NESZ to the single bands does provide better results indicating that both layers provide useful information to the model predictions.When combining both polarizations to either the incidence angle or NESZ, similar validation accuracies were obtained.
All RSAT-2 data acquired for the year 2020 was kept out of the training/validation exercise to test the model accuracy on a completely independent dataset and independent target properties.Table 5 shows the  4).results of all datasets overall and for each class independently.
The model did not perform as well for the test dataset but still provided good accuracy considering the large dataset used to train the model.The accuracy is similar to what other studies using deep learning models have demonstrated, ranging from 80 to 95%, depending on the data used and the number of classes classified (Stokholm et al. 2022;Radhakrishnan et al. 2021;Zakhvatkina et al. 2019a).As shown in Figure 6, it is clear that there is more confusion for the water class resulting in a lower overall accuracy for the test dataset.

Comparing SIC over the RADARSAT-2 archive
In this section, the outputs of the 4-bands CNN model (Table 4) will be visually analyzed against a cloud free Moderate Resolution Imaging Spectroradiometer (MODIS) image in order to clearly identify ice and water and compared to other sea-ice products to determine the potential of such a simple CNN to be applied to other applications than navigation.
Figure 7 shows a visual output of the CIS regional chart (7a), the ASI SIC concentration product (7b), a true color composite of MODIS (7c), a RSAT-2 HH-HV-HV RGB color-composite image (7d) and the output of the CNN model from 4 bands of the Figure 7d image (7e).All products are from July 13, 2020, except for the MODIS image, which was acquired July 14, 2020, because of cloud cover.For the purpose of discussion, Figure 7e shows the CNN predictions of the 1 km 2 chips.Since the model was trained on chips of pure classes (ice or water), the 50/50 threshold on the uncalibrated CNN prediction output was used to determine if the chip was ice or water and then used for SIC calculations (Figures 8  and 9).
The CIS chart presents a more generalized overview of the SIC in CG and the other products provide more details on the ice features present in CG.The MODIS imagery shows some larger portions of CG seem to be open water which is only captured by lower ice concentration polygons or absorbed by larger surrounding polygons due to the inherent limitations of the CIS product.Since the main source of imagery of this chart is RSAT-2 products, it is worthwhile to look at Figure 7d and it is clear that some open water areas are very difficult to distinguish from the main ice pack, e.g., the southeastern shore of CG.  4) predictions distribution per incidence angle.Orange line is the median, boxes correspond to the 1st and 3rd quartile and whiskers represent the 5th and 95th percentile.The CNN model does pick up open water in that area with a very good prediction probability (>80%).Also, where there are leads in the main close pack ice, the CNN predictions are between 30% and 50% of being water.
The ASI SIC product shows a gradual decrease of the SIC westward which tends to follow the change in ice concentration of the CIS chart.Since this product has a resolution of 6.25 km, there is often a mix of ice and water in that footprint as well as melt ponds on the surface of the sea ice which can have an impact on its accuracy (Spreen et al. 2008).We also see in this example that shallow areas with a higher density of small islands are considered as land, e.g., southwestern part of CG.There is some data loss (white areas) probably due to atmospheric effects, cloud cover and wind roughened open water (Spreen et al. 2008).ASI products are also known to be problematic in marginal ice zones (Radhakrishnan et al. 2021).
Figure 8 shows the weekly SIC time series over the GoC RSAT-2 data archive period.The CIS regional charts are used as reference and the ASI and RSAT-2 SIC products are compared.It should be noted that the CIS charts report SIC in tenths and were converted in increments of 10% for comparison with other products.The average SIC and CNN prediction probability over each pixel was first calculated for all ASI and RSAT-2 output respectively for a given week.The average SIC of the pixel averaged ASI product is then taken for all pixels comprised in the CG AOI to compute the weekly SIC since all pixels have the same footprint.For the RSAT-2, number of pixels having a lower averaged prediction probability of 0.5 is divided by the total number of pixels comprised in CG and used to calculate the RSAT-2 SIC.
A first observation is that the SIC for the ASI product never drops to 0% during summer where we know from the CIS charts that CG is ice free for many months in the summer.As for the RSAT-2 product, it seems that the larger errors occur in the ice-free period compared to the full ice cover period.For both products, there is some differences observed with the charts during the freeze-up and melt periods.
Figure 9 shows the distributions of the weekly SIC bias for a given month for both ASI and RSAT-2 outputs.Overall, the RSAT-2 output has a lower median monthly bias and a narrower spread distribution.Only for the month of December is the median monthly bias lower and less dispersed for the ASI output.During the summer months, the ASI output biases are less dispersed than the RSAT-2 output but  the median values are still higher which is due to the less variable summer SIC values that never reach 0% (Figure 8).Though the distributions are slightly better during the winter months for the RSAT-2 output, both outputs seem to provide similar errors.
Overall, a mean bias of À4.0% and À0.4% and an RMSE of 10.8% and 8.4% for the ASI and RSAT-2 products respectively were achieved when compared to the CIS charts.

Big data distribution analysis
With the big data available for this study, the reported incidence angle dependencies of the HH signal were confirmed (Aldenhoff et al. 2020;Komarov and Buehner 2019;Mahmud et al. 2018;M€ akynen and Karvonen 2017;Pogson et al. 2017).These studies reported ranges from À0.40 dB/1 in the near range to À0.05 dB/1 in the far range which are captured by Equations ( 6) and ( 7).Another relationship confirmed from Figures 3 and 4 was the lower sensitivity of sea ice to incidence angle compared to water.This is due to ice being "radar rough" at the C-band wavelength scale, i.e., its surface root-mean-square height is higher than one-eighth of the sensor's wavelength (5.5 cm/8 ¼ 0.7 cm) thus scattering part of the radar incident signal in all directions (Lillesand and Kiefer, 1994).The HH response over water is mainly dependent on wind speeds (Hersbach 2010) which explain the larger spread of the data.This sensitivity to wind speeds is considerably reduced with higher incidence angles where reduced variability in the distributions is observed.It should be noted that the bottom whiskers are linked to the NESZ and are most likely associated to backscatter values for water surfaces with calm winds where specular scattering is dominant.The slope is most likely dependent on the prevailing winds over the studied area and thus Equation ( 7) might not be applicable to other studied areas.
Looking at Figure 3, we see that only the whiskers overlap at low (<25 ) incidence angles.This indicates that the 2 classes are more separable, during the studied period, in the near range than higher incidence angles with machine learning algorithms.This is also reflected in Figure 6 where the uncalibrated output class probabilities are mostly concentrated around 1.There is also less overlap between the distributions at the higher angles ( >45), but this is not reflected by higher confidence in the output class probabilities of Figure 6.Nonetheless, this aligns with observations made by Geldsetzer et al. (2015) where better class separability between FYI and water was obtained at low and high incidence angles for C-band compact polarimetry SAR data.
Figure 4 shows that the median values of r 0 HV are higher for ice than water for all incidence angle which is expected since there is more volume scattering coming from the sea ice where the water is mostly affected by specular reflection and Bragg scattering (Geldsetzer et al. 2019).Similar to the HH signal over water, the HV signal is mainly dependent on wind speeds, but would only exceed the NESZ under strong wind conditions (Vachon and Wolfe 2010).The observed relationship over water is also dependent on the prevalent winds over the studied area.Even though Figure 4 shows that the median HV values are close to the NESZ, there is still valuable information that could help improve class predictability when combined to the HH polarization information (Table 4).Noise subtraction was not used in this study since, as mentioned above, the NESZ is an interpolation and can lead to negative intensity values in linear units.This can also lead to some artifacts at the beam edges as reported by Zakhvatkina et al (2017).In doing so, the pre-processing computation time is considerably reduced while retaining the improved model performances by simply adding the information as inputs to the model.

CNN training and validation
As mentioned before, this study did not focus on getting the best model output possible but rather understanding which SAR parameter improved model performances.This is why the simpler CNN architecture proposed by Wang et al., (2016) was chosen to reduce computation needs and improve understanding of model behaviors with the different configurations.Nonetheless, many recent studies have shown that different CNN architectures like U-Nets (Stokholm et al. 2022;Radhakrishnan et al. 2021) are very suitable and provide better results in classifying sea ice when comparing with national ice services products.
Also, some intermediate steps in CNN architectures, like batch normalization, can prove very useful in improving efficiency and accuracy of the model (Ioffe and Szegedy 2015).Finally, even though the model outputs and accuracy shown in Figures 5 and 6 provide relatively good results to classify ice from water, adding an extra step in calibrating the output probabilities could improve the overall model performance (Guo et al. 2017).
To assess if the model was overfitted on the training dataset, the force stop condition was tested at a more conservative value of 3 epochs with no improvement in the validation accuracy.Accuracies on the test dataset were the same (Table 5) with this more conservative constraint showing it did not have an impact on model performances.
To assess whether the lower test accuracy was dependent on the selected year, training and testing were performed with 2 different configurations.The second configuration tested was trained on all years except 2011, where it kept for testing.The third configuration was trained on all years except 2014.The 2 years were selected randomly to avoid any potential human bias.For all configurations, training accuracies were the same, i.e., 0.901 (Table 5).Interestingly though, test years provided very different accuracies.2011 had accuracies of 0.943, 0.771 and, 0.888 for the ice, water, and overall classes respectively.2014 had accuracies of 0.917, 0.893 and, 0.906 for the ice, water, and overall classes respectively.This indicates that the training/validation of the model is not as sensitive to the selected years when sufficiently large volumes of data is provided.Test results do show that the overall accuracy of the tests is highly dependent on the selected year and can be variable as shown in Figure 8.This supports that to achieve greater and more robust models, auxiliary data is needed to support the SAR data.

RADARSAT-2 parameterization accuracies
Comparing the different model configurations (Table 4), we see little improvement in model performances in adding the incidence angle or the NESZ information to the HH backscatter signal.This can be explained by the fact that, both classes have strong relationships with incidence angle (Equations 6 and 7) meaning the incidence angle is included in the r 0 information and the model picks up those relationships without requiring the static layer.This behavior was also reported by Wang et al. (2016).For the NESZ, the strong signal-to-noise ratio of the HH channel for both classes indicates that the signal is not sensitive to the noise floor.There is thus no additional information to be gained from the model to include this static layer when having only the HH channel.This is not the case with HV.As shown in Figure 4, the distributions greatly overlap for different incidence angles.Adding the angle information improves the confidence to identify the classes based on the distribution differences for a given angle and not for the entire dataset.Also, the HV signal follows closely the NESZ and similarly to incidence angle, identical NESZ values for 2 different incidence angles can be obtain.This can bring confusion in the CNN prediction and is reflected in the improved model performances when adding NESZ to the HV channel (Table 4).Since NESZ and incidence angle are uncorrelated, adding both static layers reduce model confusion and improve overall model performance.To reduce computation costs, one could consider dropping one of the 2 static channels at the cost of lower model accuracy.
Comparing SIC over the RADARSAT-2 archive Figure 7 shows how different products are meant for specific applications with their respective strengths and weaknesses.For example, since the CIS chart is a climatology product, the finer details like fractures and leads in the ice are not captured.Open water areas are often amalgamated with bigger neighboring polygons with a lower total SIC.This is to be expected since the scale of the product is coarse, and its intended application does not focus on these smaller features.Similarly, with the given spatial resolution of the ASI product, such small features are not captured.The ASI does show some small open water areas the CIS chart does not.Nonetheless, these products, when studied over several decades, have proven useful for polar bear habitat monitoring in the Hudson Bay, Canada (Castro De La Guardia et al. 2017) and longterm climate change studies (Liu et al. 2021).
The CNN model presented here, with its basic architecture, shows that it has the potential to capture small features such as leads and fractures (Figure 7).Even with the uncalibrated model outputs, these features are captured as water in Figure 7.By calibrating the outputs or by adapting the calibration to a specific application, the model classification could be adapted to be more sensitive to a specific class, i.e., ice or water in this case.This shows good potential for different applications requiring higher spatio-temporal information such as wildlife-habitat behavior studies (Henderson et al. 2021) or for community safety which rely on safe ice conditions at a very fine resolution to travel, hunt and fish (van Luijk et al. 2022).The CNN output could also help expert analysts improve charting for climatology and shipping purposes by acting as an automated first guess of the input SAR imagery.
When looking at the time series analysis (Figure 8), both the ASI and RSAT-2 products have higher errors during the summer period.This is reflected, for the RSAT-2 product, in the lower model performances for the water class (Table 5) and can be explained by the larger backscatter distribution of both polarization (Figures 3 and 4) and their dependency on wind speeds (Hersbach 2010;Vachon and Wolfe 2010) which can be highly variable from one scene to another.This was also reflected in the different test years selection exercise where lower accuracies were obtained for the water class.Adding auxiliary information like reanalysis wind speeds or SAR derived wind speed information (Vachon and Dobson 2000) might greatly improve such a product.
During the melt period, both products can be affected by the various melt stages of the snow and sea ice like wet snow and melt pond formation.This could lead to misclassifying ice for water where trained analysts have auxiliary data to identify ice during the melt period.The ASI product seems especially sensitive to higher atmospheric moisture content during the melt season (Sotiropoulou et al. 2016) and SIC seems to drop more rapidly than the other products given its known limitations in marginal ice zones (Radhakrishnan et al. 2021).
Overall, some of these errors in both ASI and RSAT-2 products, could be explained by the ± one tenth (1/10) on the precision of the CIS charts (Cheng et al. 2020) which is comparable to the RMSE.This could be especially true during the melt and refreeze periods but not so during the full-ice cover and ice-free periods.Overall, the 2 products have very similar errors with their respective strengths and weaknesses.This is why a combination of both datasets is more commonly used to achieve better automated SIC products (Radhakrishnan et al. 2021;Saldo et al. 2020).
As shown in Figure 7, the CNN output using RSAT-2 data could be adapted based on the intended application depending on the spatial resolution needed for the output as well.Also, a more complex CNN can be developed to fit to a specific product such as an ice chart generated by expert analysts and improve the accuracy of the automated output (Stokholm et al. 2022;Wang and Li 2021;Radhakrishnan et al. 2021).

Conclusion
In this study, we have shown that a simple convolutional neural network (CNN) could achieve high accuracy to classify sea ice from water within a 1 km 2 RSAT-2 image chip.To achieve this, the RSAT-2 dual-polarized ScanSAR wide image archive over Coronation Gulf marine region available to the Government of Canada was used to train and validate this CNN.A total of 1,123 scenes acquired between August 21, 2009 and July 16, 2020 were used to train and validate the model.This allowed for 10 years of diverse ice/water conditions to train this model.The last year of data (2019-2020 ice season) was kept out of the training process and used to test the CNN on ice/water conditions independent from previous years.An overall accuracy of 90.1% was achieved on the validation dataset and 86.3% on the test dataset.
It was shown that HH was the SAR channel that provided the most information out of the 2 polarizations.Nonetheless, the cross-pol channel HV added significant information to the model which improved its accuracy.Adding the noise floor (NESZ) and incidence angle as extra bands to the CNN inputs had a statistically significant impact on its accuracy.This suggests that the added information helps the CNN decode the cross-pol signal since combining the angle and NESZ to HH alone did not significantly improve the accuracy of the model.
It was also shown that the model accuracy varies greatly with incidence angle.In fact, higher accuracy was achieved at lower incidence angle (<25 ).It is thus suggested that lower incidence angle ranges be used to improve the performance of automated methods to discriminate ice from water.
A visual analysis was done on the output of the CNN model and compared to other sea-ice products currently available to determine its applicability to different studies.It was shown that the binary prediction values generated from the CNN provided information on the ice conditions and could inform on specific features such as smaller leads in the ice which are currently not represented in ice charts such as the ones provided by the Canadian Ice Service (CIS) or the sea ice concentration products available from passive microwave remote sensing data.It was also shown that the CNN detected open water areas which were difficult to identify by expert analysts.The CNN could thus provide a good automated first guess to expert analysts to improve the charts for shipping.
Finally, an 11-year time series analysis was conducted.Results showed that over the 11-year span, the mean bias and RMSE in SIC percentage was relatively similar for both RSAT-2 and ASI SIC outputs with slightly lower errors for the RSAT-2 output.A monthly analysis showed that the strongest errors occur during the shoulder seasons (melt and refreeze periods) with a higher bias variability for the ASI output.During the consolidated landfast ice period, both outputs showed a median error below 10% which is comparable to the precision of ice charts.Overall, both SIC outputs had their respective strengths and weaknesses.A combination of both passive and active microwave remote sensing data could prove beneficial for such applications.
With the new generations of C-band SAR sensors such as the Sentinel-1 constellation and the RADARSAT Constellation Mission providing a very large volume of data, deep learning methods become very attractive to develop automated sea-ice mapping products.Since each sensor has its respective engineering and acquisition properties, i.e., incidence angle range, NESZ and spatial resolution, more work will need to be done to assess the performance of each sensor to classify ice from water.This study brings solutions and indications on how to harness the potential of deep learning and SAR data across all platforms.

Figure 2 .
Figure 2. Example of a CIS weekly ice chart of Canadian Western Arctic.Full ice coverage (10 tenths) is shown in (a) (October 18, 2010) and ice free (0 tenths) is shown in (b) (August 16, 2010).Red rectangles highlight the Coronation Gulf marine region.

Figure 3 .
Figure 3. Backscatter intensity (r 0 ) distribution per degree of incidence angle for the HH channel.Orange line is the median, boxes correspond to the 1st and 3rd quartile and whiskers represent the 5th and 95th percentile.Red line corresponds to the ScanSAR noise floor (NESZ).

Figure 4 .
Figure 4. Backscatter intensity (r 0 ) distribution per degree of incidence angle for the HV channel.Orange line is the median, boxes correspond to the 1st and 3rd quartile and whiskers represent the 5th and 95th percentile.Red line corresponds to the ScanSAR noise floor (NESZ).

Figure 5 .
Figure 5. Convolutional neural network training and validation accuracy and binary cross-entropy loss function for each training epoch for the 4-bands input model (Table4).

Figure 6 .
Figure 6.4-band input model (Table4) predictions distribution per incidence angle.Orange line is the median, boxes correspond to the 1st and 3rd quartile and whiskers represent the 5th and 95th percentile.

Figure 8 .
Figure 8. Timeseries of weekly SIC for all of Coronation Gulf from the CIS regional charts, ASI passive microwave product and output of the RSAT-2 images using the 4-band input CNN model.The values are valid on the Monday for each week.The ASI and RSAT-2 products consist of weekly pixel average.

Figure 9 .
Figure 9. Box plot of the weekly SIC biases from Figure 8 for each month for the entire time series.The CIS charts were taken as reference for the bias calculation.

Table 1 .
Sampling periods for full ice coverage (10/10 landfast ice) and ice-free conditions.Year shown corresponds to the end of the ice season.Imaging started at the end of 2008 hence no imagery was available for the icefree period of 2008.

Table 2 .
Number of RSAT-2 SCWA scenes available per class per year.Year corresponds to the same years as Table

Table 3 .
Architecture of the convolutional neural network used.

Table 4 .
Training and validation accuracies based on SAR input parameterization and different input layers.

Table 5 .
CNN training, validation and overall accuracies for ice and water classes independently and combined for the 4input layer model (Table4).