Evaluating the application of the statistical index method in flood susceptibility mapping and its comparison with frequency ratio and logistic regression methods

Abstract Statistical methods are the most popular techniques to model and map flood-prone areas. Although a wide range of statistical methods have been used, application of the statistical index (Wi) method has not been examined in flood susceptibility mapping. The aim of this research was to assess the efficiency of the Wi method and compare its outcomes with the results of frequency ratio (FR) and logistic regression (LR) methods. Thirteen factors, namely, altitude, slope, aspect, curvature, geology, soil, landuse/cover (LULC), topographic wetness index (TWI), stream power index (SPI), terrain roughness index (TRI), sediment transport index (STI), and distance from rivers and roads, were utilized. A flood inventory was constructed from data captured from the destructive flood that occurred in Brisbane, Australia, in 2011. Model performances were compared using the area under the curve (AUC), Kappa index and five other statistical evaluation tools. The AUC prediction rates acquired for LR, Wi and FR were 79.45%, 78.18%, and 67.33%, respectively. A more realistic representation of the flood-prone area distribution was produced by the Wi method compared to those of the other two techniques. Our research shows that the Wi method can be used as an efficient approach to perform flood susceptibility analysis.


Introduction
Floods are the most prevalent natural disasters worldwide (Khosravi et al. 2016;G omez-Palacios et al. 2017). This phenomenon is defined as a heavy rainfall that causes rivers to overflow and temporarily cover the neighbouring regions (Merz et al. 2007). Landuse/cover (LULC) and climate change are the two main causes of increases in flood occurrences (Bronstert 2003;Dang and Kumar 2017;Kjeldsen 2010). Climate change has altered the current precipitation pattern, which creates heavy rainfalls in a very short period, quickly forming floods as the extent of rainfall exceeds the permeability capability of the soil. Evidence-based management is required to minimize both biodiversity loss and impacts on human populations and infrastructure from natural disasters (Cinderby and Forrester 2016). If land use and land management practices have the potential to increase flooding, it follows that these also have the potential to mitigate this risk through reduced runoff generation and altered land management (Juarez-Lucas et al. 2016;Shabani et al. 2014). This will require spatially explicit and catchment-scale flood models to test landscape change and rainfall runoff scenarios to reduce the impacts from flooding natural disasters and maintain healthy socio-ecological systems under changing catchment and climate conditions (Tehrany et al. 2015b). Remote sensing and GIS technologies, together with the latest modelling techniques, can contribute to our ability to predict and manage floods (Forte et al. 2006;Pradhan 2010). The existing uncontrolled negative influences of flooding on river coastal socio-ecological communities can be reduced by appropriate preventative actions (Novelo-Casanova and Rodr ıguez-Vangort 2016).
Flood-prone area mapping has been implemented using various methods in numerous studies. Some of the most popular methods can be categorized into four main groups of hydrological-based (Liu and De Smedt 2004;Jayakrishnan et al. 2005), quantitative (Pradhan and Youssef 2011;Tehrany et al. 2014b;Rahmati et al. 2016), qualitative Stefanidis and Stathis 2013) and machine learning (Liong and Sivapragasam 2002;Tehrany et al. 2015a) techniques. Among different groups of flood models presented in the literature, artificial neural networks (ANNs), frequency ratio (FR), logistic regression (LR), decision trees (DT), and support vector machines (SVMs) are the most popular techniques that have been utilized in flood domain mapping (Tehrany et al. 2015b;Mojaddadi et al. 2017). Although flood susceptibility mapping models are available, the reliability of flood prediction maps still remains a critical issue. Each method has different capabilities and can be affected by different sources of uncertainties (Shrestha and Nestmann 2009). Hence, a full understanding of the strength of each model and its uncertainties help us to make a proper choice for each application.

Previous studies
Hydrological methods are simple and are based on a nonlinear concept; therefore, they are less effective to model complex features such as the catchments (Sahoo et al. 2009). Traditional flood models have been gradually improved or replaced by rulebased and automated techniques that are more suitable for hazard analyses (Hostache et al. 2013). Some hydrological models, such as SWAT (Anjum et al. 2016) and WetSpa (Nurmohamed et al. 2012) integrate RS and GIS to enhance the accuracy of spatial analysis. However, more robust and precise methods are required to overcome the disadvantages of the traditional hydrological methods. Qualitative methods, such as an analytic hierarchy process (AHP), assesses the flood susceptibility using a multicriteria analysis framework (Karimi et al. 2018). The AHP method has been used by Dahri and Abida (2017) to map the flood-prone areas on the Kassandra Peninsula in northern Greece. These kind of methods require the knowledge of specialists in that specific field. Hence, they cannot be used as reliable techniques due to the involvement of the expert's opinion in their analysis (Rahmati et al. 2016).
Machine learning techniques, such as ANN (Maier and Dandy 2000), SVM (Tehrany et al. 2015b), and DT (Sun et al. 2011), are widely used in flood analysis. However, a considerable processing time, the requirement of having high performance computing systems along with specific software, and strict selection criteria for input parameters make machine learning methods less usable for a wide range of users (Ghalkhani et al. 2013;Tehrany et al. 2013). SVM, as one of the robust machine learning techniques, has been used by Tehrany et al. (2015b) and Mojaddadi et al. (2017) to map the flood-susceptible areas in various cases of study. The experience required to select a proper SVM kernel, setting the criteria using trial and error, and the complexity of the process make these methods less usable in flood modelling compared to the statistical techniques. Another example is related to the DT technique that provides the tree structure of conditioning factors with corresponding probability weights (Tehrany et al. 2013). There are different forms of decision trees available, such as Logistic Model Trees (LMT), Reduced Error Pruning Trees (REPT), NaïveBayes Trees (NBT), and Alternating Decision Trees (ADT) (Khosravi et al. 2018), that can be used in the spatial modelling. However, similar to SVM, the variety of DT algorithm choices and the requirement for a statistical expert can be considered some of the disadvantages of this technique.
Poor predictions due to the dataset size and dissimilar value ranges of the validation and training datasets are the weak points of ANN . Due to the weak points of ANN, a neural fuzzy method has been proposed and utilized in several natural hazard applications (Chang and Tsai 2016;G€ uçl€ u and Şen 2016). However, the neural fuzzy method has its own restrictions such as its inability to discover optimal weight variables that considerably effect the prediction performance of the model, a slow training speed and a high sensitivity to noise in hydrological modelling (Hong and White 2009). Some hybrid methods, such as the adaptive-networkbased fuzzy inference system (ANFIS) and genetic algorithm-based artificial neural network (ANN-GA), have also been applied in flood studies (Chen et al. 2006;Fathzadeh et al. 2017). However, they entail many parameters that restrict their use and reduce their popularity due to data collection difficulties. Moreover, they have long computation times and extra modelling parameters are needed (Dawson et al. 2006).
Deterministic and statistical are two types of quantitative methods. The requirement of having an extensive dataset makes deterministic methods more useful for mapping small regions (Ayalew and Yamagishi 2005). Statistical methods, on the other hand, can be understood with ease within a reasonable period of time (Liao and Carin 2009). As it has been stated by Chau et al. (2005), it is imperative to utilize a quick, understandable, and accurate method for flood modelling. Statistical methods have no specific requirement regarding the input data, software, computer capacity, etc. They can be either bivariate statistical analysis (BSA) (Chen et al. 2018b), multivariate statistical analysis (MSA) (Ayalew and Yamagishi 2005), or a combination of both. BSA evaluates the impact of each class of every conditioning factor on flood occurrence. In contrast, MSA assesses the influence of each factor on flooding, without considering the impact of each class. FR (Khosravi, et al. 2016;Youssef et al. 2016), Weight of Evidence (WoE) (Tehrany et al. 2014b) and Evidential Belief Functions (EBFs) are a few examples of BSA methods. Both WoE and EBF are based on the Dempster-Shafer theory that was first introduced by Shafer (1976). One of the advantages of Dempster-Shaferbased methods is their capability to handle an incomplete dataset. Moreover, they produce predictive flood mapping zones and the degree of uncertainty of the same zone. However, they have a few disadvantages. To use the model, first, the flood conditioning factors should be transformed into evidential data layers (using specific calculations) that can be integrated (by the Dempster-Shafer rule of combination) to generate a predictive flood susceptibility map. This requires several calculation steps and data transformations. In addition, the output result of the Dempster-Shafer parameters (belief, disbelief, uncertainty (doubt), and plausibility) have to be defined as accurately as possible in order to achieve a reasonable final map (Dempster 2008). While these methods are highly useful in some environmental applications, they are less preferable in flood modelling due to the importance of time for data processing.
FR has been utilized by Lee et al. (2012) to map the flood-prone regions in Korea. They stated that the FR method can be quickly and easily applied to areas with little map data, and at low cost. LR is one of the most popular MSA techniques for a variety of applications such as landslides , floods , land subsidence (Kim et al. 2006), earthquakes (Umar et al. 2014), mineral mapping (Xiong and Zuo 2018), and ground water mapping (Chen et al. 2018a). Unlike most of the statistical methods, LR does not require any pre-analysis assumptions. Additionally, it accepts all data types such as scale, nominal, and categorical. Both FR (Khosravi et al. 2016;Youssef et al. 2016) and LR (Pradhan 2010) have been used in several flood studies; however, the application of the Statistical Index (Wi) has not been tested in this domain. The Wi method is a BSA technique that has been frequently and successfully used in landslide analysis (Yalcin 2008;Bui et al. 2011); however, its application has not been evaluated in other natural hazard studies such as floods, forest fires, groundwater, and soil erosion susceptibility mapping. Hence, the aim of this paper is to apply and examine the proficiency of Wi in flood-prone area mapping. There is no question that natural hazard studies, such as those on flood phenomena, require time and effort in order to provide a better understanding of the flooding problem, and any improvement, no matter how small, will have a considerable impact on the lives of people and on biodiversity.

Study area
In January 2011, continuous extensive precipitation in the Brisbane Catchment, Australia reduced the infiltration time around the rivers and caused destructive flooding. A lack of proper prevention actions, unorganized need for flood management, deforestation, and urban expansion are the main reasons for an increase in flooding in this region (Bohensky and Leitch 2014). An estimated 200,000 people were affected throughout Queensland during this period, causing damage of an estimated AUD $1 billion (http://www.bom.gov.au/qld/flood/fld_history/brisbane_history.shtml). Most of the area around the Brisbane River flooded, and considerable destruction and loss of life occurred. The Queensland Government undertook a survey regarding the reasons for Brisbane's 2011 flood and other flood events in the state (QFCI 2012) and the flooding resulted in Australia's largest class actions to date.
The study areas selected are between 27 23 0 50.272 00 S and 27 45 0 25.078 00 S latitude and 152 46 0 17.738 00 E and 153 11 0 25.504 00 E longitude ( Figure 1). During the summer, average temperatures range from 21 to 29.8 C and the city experiences its highest rainfall, which can bring thunderstorms and occasional floods. The average rainfall during this time is 426.6 mm. This area receives approximately 1168 mm of precipitation annually. The minimum amount of precipitation occurs in September (34 mm). February receives the highest amount of rainfall, with an average of 167 mm (http:// en.climate-data.org/).

Methodology
The methodology for the current research consists of splitting the flood inventory into training and testing datasets; selecting the appropriate flood-conditioning factors; and undertaking three statistical techniques of FR, LR, and Wi for flood susceptibility and their validations and comparisons.

Flood inventory mapping
A flood inventory map is required in order to recognize the correlation among flood conditioning factors and flood incidences Tehrany and Jones 2017). To produce susceptibility maps, having an accurate and precise record of the past flood events is essential (Merz et al. 2007). Documents regarding the flood event that took place in January 2011 were the main source of the flood inventory used in this research. These data are usually divided into two categories, training, and testing, in order to train the model and validate the outcomes, respectively. Following previous flood modelling studies (Lee et al. 2012;Tehrany et al. 2013;Khosravi et al. 2016), the flood inventory data were randomly grouped into 70% for training and 30% for testing. The locations and distributions of the flood points can be seen in Figure 1.

Flood conditioning factors
The selection of the flood causative factors, known as conditioning factors, is the most influential stage in developing the final flood susceptibility maps and has the highest impact on the precision of the output maps (Kia et al. 2012). Although a lack of a framework or an agreement on how to select the flood conditioning factors still remains, the most relevant and repeatedly used flood conditioning factors by other researchers (Lee et al. 2012;Tehrany et al. 2014a;Rahmati et al. 2016) were used in this study. A flood conditioning factors dataset was constructed using altitude, slope, aspect, curvature, geology, soil, landuse/cover (LULC), topographic wetness index (TWI), stream power index (SPI), terrain roughness index (TRI), sediment transport index (STI), and distance from rivers and roads ( Figure 2). A DEM with a 5-meter spatial resolution produced from Light Detection and Ranging (LiDAR) data was used to derive other related topographical and hydrological parameters. ArcGIS (10.2) and SAGA GIS (2.2) software were used to produce the aforementioned conditioning factors. All factors were created in raster format with a 5 m Â 5 m pixel size. LR supports all kinds of data types (categorical, scale, nominal, etc.); however, all input factors should be categorized for FR and Wi analysis. The reason for this categorization is that BSA methods, such as FR and Wi, evaluate the impact of each class of every conditioning factor on flood occurrence. Hence, a popular quantile classification method was used to classify scaled factors (altitude, slope, TWI, SPI, TRI, STI, and distance from roads and rivers) into 10 equal classes (Ayalew et al. 2004;Shabani et al. 2018;. Topographical factors of slope, aspect, and curvature were derived from the DEM. Altitude and slope are two factors that have a considerable impact on flood creation (Pradhan 2009). Floods are generated in the low-lying and flat regions, and it is not possible to have flooding at the peak of mountains (Tehrany et al. 2013). Steep slopes increase the speed of the surface run-off and reduce the time available for the soil to absorb the water. In the case of the aspect factor, this parameter has an impact on the received rainfall and sunshine amount of the terrain ). Curvature has three classes, flat, convex, and concave, and is another influential parameter in flood studies (Lee and Pradhan 2006). SAGA GIS software was used to generate SPI, TWI, STI, and TRI from the DEM according to the following equations (Jaafari et al. 2014;Jebur et al. 2014):   ( 3) where A s is the area of the catchment (m 2 ) and b (radians) is the slope gradient. TWI is used to measure topographic control of hydrological procedures (Chen and Yu 2011) and greater TWI values are usually found in flooded areas. SPI represents the erosion power of the stream in the catchment (De Risi 2013). STI defines the movements of the sediments due to the water movement. TRI is one of the morphological factors associated with flooding (Werner et al. 2005). This parameter can be calculated using the following equation: where max and min are the largest and smallest values of the cells in the 3 Â 3 rectangular neighbourhoods of altitude, respectively. Soil (1:250,000 scale) and geology (1:100,000 scale) data were obtained from the CSIRO and Australian government websites. The study area is covered by different types of soil formations, such as clay and sandstone. Geology is another important flood conditioning factor because it has a considerable impact on the variation in hydrology and sediment production in the catchment (Khosravi et al. 2016). Terrain infiltration, runoff speed, and extent are highly affected by the LULC factor (Kassa 2014). A detailed LULC map was received from the Queensland Land Use Mapping Program (QLUMP) and was produced by the Queensland Government. This map was created by classifying SPOT5 imagery, high spatial resolution orthophotography and scanned aerial photos and using local expert knowledge.

Flood susceptibility map produced by frequency ratio (FR)
FR is one of the most cited BSA methods in natural hazard studies, such as flood mapping (Jothibasu and Anbazhagan 2016), landslide mapping (Li et al. 2016), ground water mapping (Manap et al. 2014), mineral potential mapping (Yusoff et al. 2015), and soil erosion mapping (Khosrokhani and Pradhan 2014). Its popularity is related to its simple and rapid calculation process (Lee et al. 2012). Equation (5) shows the calculation of FR for a single conditioning factor, and the flood probability index can be measured by summing the FR of all of the factors (Equation (6)).
where the number of flooded pixels in class i of the factor X is represented by N pix SX i ð Þ; the total number of pixels within factor X j is represented by N pix X j ð Þ; m is the number of classes in factor X i ; and n is the number of factors in the study area (Regmi et al. 2014).
Microsoft Excel was used to calculate FR for each flood conditioning factor and each factor was reclassified using the derived FR values in ArcGIS 10.2 using a spatial analyst tool. The reclassified factors were added using the raster calculator to produce the flood probability index.

Flood susceptibility map produced by statistical index (Wi)
Among all the BSA methods, Wi is one of the least used methods in natural hazards modelling and has not been tested in flood susceptibility mapping. The procedure for this method is fast and reasonably simple, which makes it suitable for natural hazard modelling (Aghdam et al. 2016). Wi weights can be described as the natural logarithm of the flood existence in each class of a conditioning factor divided by the total flood density in the study area (Bourenane et al. 2015). The following equation is used to calculate Wi weights for each factor (Chen et al. 2016): where the weight received for class i of the conditioning factor j is given by W ij ; the flood density in class i of the conditioning factor j is given by D ij ; the total flood density within the study area is given by D; the number of pixels with flooding in class i of the conditioning factor j is given by N ij ; the total number of pixels in class i of the conditioning factor jis given by S ij ; N and S are the total number of floods and total number of pixels in the whole study area, respectively. After deriving the Wi weights for each flood conditioning factor, each factor was reclassified using the derived Wi values in ArcGIS 10.2 using the Spatial Analyst tool. The reclassified factors were added using the raster calculator to produce the flood probability index.

Flood susceptibility map produced by logistic regression (LR)
LR is an MSA technique that has been frequently used in a wide range of natural hazards analysis (Kang and Zhang 2016;Nandi et al. 2016;Zhang et al. 2016). Defining the specific assumptions are necessary for most of the statistical methods; however, LR does not require any pre-analysis assumptions (Ayalew and Yamagishi 2005). Moreover, all types of conditioning factors (continuous, nominal, categorical, etc.) are supported by LR analysis (Lee 2005). LR evaluates a dependent variable of the specific event (flood) and recognizes the correlation between that event and more than one independent variable (conditioning factor) that may influence the probability of the event. Through LR binary analysis, a regression association among the flood inventory and flood conditioning factors will be undertaken (Mathew et al. 2009). Meaning that, the flood inventory is a binary factor representing the existence and non-existence of the flooding. The flood probability index, which is represented on a S-shaped curve in the range of [0, 1], is the output of this technique. Using the resultant weights (logistic coefficients), the flood probability index (p) was measured as follows (Bai et al. 2012): where Z is a linear combination, thus LR involves fitting an equation of the following form to the data: where the intercept of the method is represented by the constant b 0 , b n ðn ¼ 0; 1; 2; :::; nÞ represents the LR weights, and x n ðn ¼ 0; 1; 2; :::; nÞ shows the flood conditioning factors, including slope, aspect, and distance from roads. In the current research, all conditioning factors and the flood inventory map were transformed from raster to ASCII format and transferred to SPSS to perform LR analyses.

Accuracy assessment
Accuracy assessment is an essential step in every modelling (Bui et al. 2011). In this study, the area under the curve (AUC) method was used to evaluate the efficiency and reliability of the derived three flood probability maps from the FR, Wi, and LR methods. AUC is widely used in natural hazard studies due to its comprehensive, reasonable, and visually understandable method of validation (Yilmaz 2009;Nefeslioglu et al. 2010). It begins with arranging the flood probability index in descending order. Then, the arranged flood probability index is classified into 100 categories on the y-axis, with cumulative 1% breaks on the x-axis. It continues with overlaying the flood inventory on the flood probability index. The presence of the flood points (training and testing) in each class is evaluated, and prediction and success rates are derived (Pourghasemi et al. 2012). Success and prediction rates are two products of the AUC technique. The flood training and testing datasets were used to produce success and prediction rates, respectively. AUC produces a range from zero to one. The method is 100% successful if the AUC value is equal to 1. Hence, the closer the AUC value is to one, the more accurate the technique. The statistical evaluation measures of overall accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV) were applied to measure comparative performance of our models (Tien Bui et al. 2016). Overall accuracy, sensitivity, and specificity measure the proportion of training and testing, flooded, and non-flooded samples, respectively, that are correctly classified. PPV and NPV estimate the probability of training and testing dataset samples correctly classified to the flooded class and non-flooded class, respectively.

Overall accuracy
where True Positive (

FR outcomes
FR was measured for each class of every flood conditioning factor by dividing the flood occurrence ratio by the area ratio. The derived FR result for each factor is given in Table 1. A greater FR weight shows a stronger association among that class and flood occurrence, and subsequently represents a higher probability of flood occurrence in that class. For instance, the results showed that the first four classes of altitude received the highest FR values. This confirms the concept that flooding mostly occurs in the low-elevated regions. The class '0-0.47 ' in the slope map gained the highest FR value of 269.88 among the various classes of slope. Another example is related to the distance from rivers; the closer the distance to the river, the higher the FR weight. The FR-derived flood susceptibility map (Figure 3(a)) was produced by classifying the flood probability index achieved from FR analysis. The flood probability index was categorized into five susceptible zones of very low, low, moderate, high, and very high using the quantile method. The class 'very high' covered 9.92% of the study area. The flood-susceptible area distribution in the flood susceptibility map is 9.94% of the area under 'high', and 19.93%, 19.98%, and 40.2% of the study area it occupies is 'moderate', 'low', and 'very low', respectively.

Wi outcomes
Wi weights for each flood conditioning factor were calculated in Microsoft Excel and ArcGIS 10.2 and are listed in Table 1. The greater the Wi weight for each class of  every conditioning factor, the higher the flood occurrence possibility is within that class. Moreover, the negative Wi weights indicate the negative correlation among the class and flood occurrence. For instance, the first three classes of the slope received positive Wi weights; however, the rest of its seven classes gained minus values. This means that by increasing the slope degree, the possibility of flood occurrence decreases. The Wi technique is based on a statistical correlation of the flood inventory layer with characteristics of the flood conditioning factors classes. Hence, the Wi weights are only measured for flood occurrence classes. If the class does not hold any flood occurrence, it does not have any association with the flood inventory (Bui et al. 2011). Table 1 shows that the highest elevated areas, regions far from rivers, 'Andesitic' and 'Volcanics' classes of geology; 'sedimentary rocks' and 'hard acidic yellow and red mottled soils' classes of soil map; 'marsh', 'estuary', and 'horticulture' classes of LULC did not show any correlation with the flood inventory map in this study. Using the weighted sum option in the Spatial Analyst tool of ArcGIS, the final flood probability index was obtained (Figure 3(b)). Similar to the FR analysis, the Wi-derived flood probability index was classified into five susceptible classes using the quantile method. The very high, high, moderate, low, and very low flood-susceptibility zones have area percentages of 7.23%, 12.66%, 20.00%, 19.99%, and 40.12%, respectively.
The calculated Z parameter was entered into Equation (8) and the final LR-derived flood probability index was produced. Subsequently, similar to the other two methods of FR and Wi, the derived flood probability index was classified into five susceptible zones with the following area percentage for each class; very high (2.46%), high (4.69%), moderate (24.59%), low (32.91%), and very low (35.33%).

Model validation evaluation and their comparison
A flood probability index of quality assessment values was calculated for each of the three models using the training dataset to compare performance and the testing dataset for validation. The TP, TN, FP, FN, PPV (%), NPV (%), sensitivity (%), specificity (%), ACC (%), Kappa, and AUC values of these three models based on the testing dataset are shown in Table 2. Evaluating the classification of flooded pixels, the FR model indicated superior sensitivity (89.66%), followed by the Wi model (89.36%) and then the LR model (85.03%). However, in terms of the classification of nonflooded pixels, Wi demonstrated the greatest specificity (89.71%), followed by LR (87.26%) and FR (86.67%). In terms of overall accuracy, Wi exceeded the other two models, with an 89.51% rating. Wi also returned the highest Kappa index (0.852), followed by LR (0.801) and FR (0.763), highlighting the significant mirroring of reality in all of the models. Figure 4 shows both the success and prediction rates for each model's outcome. The lowest accuracies were achieved for FR, possibly due to its simple calculation procedure. FR can be more applicable in mapping linear features than modelling a complex and nonlinear event, such as flooding. The measured prediction rates for both LR and Wi were almost similar; however, Wi achieved a higher success rate of 82.34% compared to LR (80.75%). In addition, flood testing points were overlaid on the three flood susceptibility maps. The numbers of existing flood points that fell into the 'very high' susceptibility class were measured. The results were as follows: the total testing points that fell into the 'very high' class of Wi, LR, and FR susceptibility maps were 73%, 68%, and 57%, respectively, again reinforcing the superior performance of the Wi method. Flood susceptibility maps represent the areas with the highest potential of flooding, which helps in managing and preventing this disaster in the future. However, the susceptibility maps only show the predicted spatial distribution of flood occurrences and do not give information regarding its temporal probability (Bui et al. 2011). Overall, Wi impressed, outperforming the other methods and producing the most accurate outcomes, although all three models proved to be acceptable for susceptibility mapping in the study area. It can be concluded that Wi displayed the best performance in this study, and the robustness of this method was demonstrated in flood susceptibility modelling and prediction.

Conclusion
Flood susceptibility maps are a fundamental step in flood hazard, vulnerability, and risk analysis. Therefore, it is essential to produce the most precise and reliable flood susceptibility maps possible. The application of several statistical methods has previously been tested in flood modelling; however, the Wi method has not been evaluated in this domain. The Wi technique has been found to be easily understandable and cost-effective for mapping landslide prone areas. Hence, the question is whether it would perform at similar levels of precision for flood susceptibility mapping. In the current research, the Wi method was applied, and its outcome was compared with the performance of two popular statistical techniques, FR and LR. The Wi method recognizes the existence of the flooded points in each class of every conditioning factor and assigns weights to them individually. The final flood susceptibility map was produced by combining all of the weight conditioning factors. The validation results from AUC showed that the flood susceptibility maps generated from Wi and LR are more reliable compared to the FR results. The prediction rates were 79.45%, 78.18%, and 67.33% for the LR, Wi, and FR models, respectively. The accuracies indicated that the two models of LR and Wi have almost equal prediction capabilities; however, the Wi method had a higher success rate (82.34%) compared to the 80.75% obtained by the LR method. This shows the superior performance of the Wi method compared to both FR and LR. Flood susceptibility maps have been shown to be of great assistance in properly defining urban expansion and planning strategies. As the outcomes are produced in medium-scale maps, in situ measurements are required in order to have more accurate information about potential flooding regions. However, production of probability maps, such as in this research, can greatly aid in understanding flood risks and probabilities, leading to more preparedness for future flooding events.

Disclosure statement
No potential conflict of interest was reported by the authors.