Developing a GIS-based rough fuzzy set granulation model to handle spatial uncertainty for hydrocarbon structure classification, case study: Fars domain, Iran

ABSTRACT It is well agreed that geologic risk occurs during hydrocarbon exploration because diverse uncertainties accompany the entire hydrocarbon system parameters such as the source rock, reservoir rock, trap and seal rock. In order to overcome such attributes with uncertainties, a number of soft computing methods are used. Information granules could be provided by the Rough Fuzzy Set Granulation (RFSG) with a thorough quality evaluation. This is capable of attribute reduction that has been claimed to be essential in investigating the hydrocarbon systems. This paper is an endeavor to recommend a Geospatial Information System (GIS)-based model with the aim of categorizing the hydrocarbon structures map consistent with the uncertainty range concepts of geologic risk in the rough fuzzy sets and granular computing. The model used the RFSG for the attribute reduction by a Decision Logic language (DL-language). The RFSG was employed in order to classify hydrocarbon structures according to geological risk and extract the fuzzy rules with a predefined range of uncertainty. In order to assess the precisions of the fuzzy decisions on the hydrocarbon structure classification, the fuzzy entropy and fuzzy cross-entropy are applied. The proposed RFSG model applied for 62 structures as the training data, average fuzzy entropy has been calculated as 0.85, whereas the average fuzzy cross-entropy has been calculated 0.18. As it can be discerned, just seven structures had cross-entropies greater than 0.1, while three structures were larger than 0.3. It is implied that the precision of the proposed model is about 89%. The results yielded two reductions for the condition attributes and 11 fuzzy rules being filtered by the granular computing values.


Introduction
In general, the life cycle to investigate the hydrocarbon systems in a sedimentary basin would cost a huge amount of investment and take a lengthy time. Furthermore, throughout its life cycle, because of being a multi-criteria decision-making procedure, it encompasses numerous risks. They include technical, economic, environmental, geological, portfolio and financial risks. The current research attempts to take into account the geological risk meaning the risk of the existence of a producible hydrocarbon accumulation. The professionals dealing with hydrocarbon exploration processes are undeniably expected to be aware of the magnitude of the geological risks in each geologic structure in a sedimentary basin. Numerous petroleum systems exist and every system possesses various attributes; for this reason, to obtain an efficacious exploration, it is necessary to conduct more accurate investigations.
In the process of hydrocarbon exploration, geological risk occurs due to the concerned experts' unsatisfactory knowledge. In order to realize and control imperfect knowledge, numerous classical methods could be adopted. According to Malvic (2017), these methods had been categorized into stochastic and deterministic approaches. The geological deterministic models are often numerical such as calculation of Probability of Success (POS; e.g. Malvic, Novak, and Zelenika (2015), which this paper borrowed this concept as the core of its risk assessment process. As Rose (2000) described, the geologic risk is evaluated by taking into consideration the probability that the subsequent four independent hydrocarbon system factors as Equation (1) explained: where P source is the presence of a mature source rock, P reservoir is the presence of the reservoir rock, P strap is the presence of a trap and P dynamics is for the Play dynamics.
A conceptual model of uncertainty in spatial data which was proposed by Klir and Yuan (1995) distinguishes between error embedded in data and vagueness which is encapsulated in them. The main focus of this research is to handle uncertainties in hydrocarbon system parameters which exist because of vagueness and must be modeled by soft computing theories such as rough sets and fuzzy sets not classical probability theories.
Vagueness arises in several complex issues of engineering, science and many other fields such as Heštera (2020) and Siahuei et al. (2021). These issues cannot be resolved using crisp mathematical methods in which defines the values of set members either 0 or 1. Many well-known theories, including probability theory, Fuzzy Set Theory (FST), Intuitionistic Fuzzy Set Theory (IFST), Vague Set Theory (VST), Interval Mathematics Theory (IMT) and Rough Set Theory (RST) have been established by Akram and Zafar (2018) to explain uncertainty.
As Dubois and Prade (1990) mentioned, we have discovered two types of widely implemented combined methodologies in uncertainty handling related to classification domains including fuzzy rough sets and rough fuzzy sets. Rough sets, as a tool for granular computing (Anh, Vo, and Pedrycz 2017;Hu, Xie, and Yu 2007), offer a unique induction structure in artificial intelligence (Wang and Tao 2003). Using this methodology, we granulate the universe of discourse into a group of granules to describe the objects and apply these concepts to determine random subsets of the universe. Feature evaluating, variable selection, attribute reduction (Hu, Xie, and Yu 2007;Qian, Liang, and Wei 2013), rule extraction (Leung et al. 2008) and uncertainty reasoning are the main developments encountered in rough sets theory (Yang and John 2008). Of these, the fuzzy set theory has been developed by Zadeh (1965), which is referred to as a very efficacious method in many applications (Ashraf, Akram, and Sarwar 2014). Discriminating distinctive elements is really daunting once an issue encompasses incomplete, uncertain or vague information and then we are obliged to take into account the granules. It is the information processing science at different levels of granularity (e.g. Zadeh 1965;Pawlak 1982;Hobbs 1985;Zadeh 1997;Nguyen, Skowron, and Stepaniuk 2001;Miao and Fan 2002;Bargiela and Pedrycz 2003;Keet 2008 andYao 2008a). By altering the equivalence relation, different types of rough set models were obtained (Wu, Mi, and Zhang 2003).
The current study has adopted granulation as a classification task to scrutinize the geological risks and opportunities in hydrocarbon exploration. The knowledge from either supervised or unsupervised data would be inextricably associated with uncertainty (Nguyen, Skowron, and Stepaniuk 2001). Therefore, such uncertainty along with the ones intrinsic to the data has been taken into account throughout the process. In line with this, it is well established that for the classification processes, granular computing would be good enough to deal with the uncertainty (Leung et al. 2008). Rough Fuzzy Set Granulation (RFSG) has been long considered as a successful method in solving the mentioned issue. It is of note that the RFSG, as an efficient approach, could be applied to elucidate and examine how decisions are made on classifying the hydrocarbon structures employing the qualified rules uncertainty exists.
The current research endeavors to introduce a Geospatial Information System (GIS)-assisted framework on the RFSG to examine the hydrocarbon system parameters of the hydrocarbon fields for envisaging the geologic risk associated with each structure with its relevant uncertainty. The recommended framework in this research can execute the spatial rule extraction from the spatial data with uncertainty in a GIS based RFSG process. According to the findings of a research elaborating on the hydrocarbon structures of Fars province of Zagros sedimentary basin in Iran, the latent knowledge is presented to choose the critical hydrocarbon system notion. It is an affiliated notion that embraces all of the various elements and processes of hydrocarbon system, including: the main elements (source, reservoir, seal and trap) and processes (trap formation, retention and exploration). Source rock refers to tiny-meshed sediment with sufficient amount of organic ingredients, which is capable of generating movable hydrocarbons. A reservoir rock is an underground rock that has proper porosity and permeability to permit the occurrence of hydrocarbon accumulation. Seal rock points to an impermeable rock all around the reservoir rock to maintain the hydrocarbon. Closure is the maximum distance between summit of trap and lowest level of it. Trap relates to accumulation process of hydrocarbon where retention is about the maintenance circumstances of hydrocarbon in an optimum mood. The extraction elements clarify the accessibility to hydrocarbon from earth to subsurface depth (Rose 2000). This provides the GIS-assisted strategies to be originated to classify the structures in line with the geologic risk for drilling. The paramount point here is that this risk would not be a crisp value and is a fuzzy range having lower and upper approximation borders because there are many uncertainties in hydrocarbon system parameters.
Indeed, a plethora of research has been conducted so far which resembles that of ours. For instance, the GIS capabilities were explained by Bin Mohammed, Javed, and Alshayef (2017) with the intention of appraising the spatial multi-criteria approaches for hydrocarbon exploration which lend their basis to the fuzzy logic. Moreover, the offshore/subsea pipeline routing constraint was studied by Balogun et al. (2017). They employed a hybrid decision system where a GIS is integrated with the fuzzy logic approximate reasoning models with the aim of obtaining an optimum performance. Likewise, an approach was introduced by Liu et al. (2007) for oil and gas potential predictions in the seismic reservoirs working on the basis of the rough set algorithm. In another survey undertaken by Zhou, Yu, and Gu (2017), the fuzzy logic and a GIS were employed with the purpose of fusing the geological and geophysical analyses when plotting the gas potential of a field in Kazakhstan carboniferous system that functioned on the basis of the assumed gas accumulation model. An approach was utilized by Han et al. (2014) which worked based on the analysis of the rough set theory by the Support Vector Machine (SVM) method. The objective was to boost the ability of identifying the oil zones process. Furthermore, in order to help decision-makers to utilize a model meant for hydrocarbon exploration in Zagros sedimentary basin in Delavar (2018a, 2018b) integrated the soft computing, the GIS and multi-criteria decision analysis. The mentioned expedient experimentations have been taken into account by this research as paramount guiding principles, yet, it offers several considerations that are explained through the subsequent sections.
This research has several contributions. First of all, entire hydrocarbon parameters are involved together in calculating geologic risk for each structure instead of surveying parameters separately. Second, the proposed model applied different soft computing methods as fuzzy sets, rough sets and granular computing in a unique framework to take advantage of all the exclusive capabilities of each of these methods so that a hydrocarbon structure in a sedimentary basin may be a member of both the lower approximation and upper approximation of the geologic chance concepts to some degree. The last contribution is related to the ability of model in extraction of fuzzy rules with a certain degree of confidence for the hydrocarbon system to investigate the status of each structure.

Rough fuzzy set granulation
In order to elaborate on the calculus of human thoughts problem, many tools have been devised by the mathematicians so far. Regrettably, the devised tools in crisp mathematics, especially the ones in the classical mathematical logic have failed to assist recognizing the natural language which is utilized by the human concept with the intention of uttering judgments and reasoning about them as a perception which could help devices really intelligent systems. A cause might be the one that the mankind can effectively resolve numerous real-life complications while being capable of uttering thoughts via equivocal, uncertain and inaccurate perceptions and then reasoning about them. In order to go beyond computation with numbers to computation with words, and additionally to go beyond manipulations of measurements toward manipulations of perceptions, it was recommended by Zadeh (1965) that it is better to base the thoughts calculus via the fuzzy logic. In line with this, "information granule" was recommended by (Zadeh 1965(Zadeh , 1979(Zadeh , 1999 which refer to a cluster of diverse objects which are combined based on indistinguishability, similarity, or functionality. The granular computing method categorizes the information into subsets or granules of information (Yao 2002;Yao 2008a;Lin 2003). Having yielded an information table concerning the problem, the granules are created by the granular computing by taking into account the resemblance of the attribute values of the objects. Later on, it concentrates on introduction of the classification rules.
Equation (2) provides the definition of an information table: (Pawlak 1991;Yao and Zhong 1999) where U is a finite non-empty set of objects, AT is for a finite non-empty condition attribute set that comprises mc elements, D is a nonempty finite decision attribute set that encompasses md elements (m = mc + md) and AT∩D is empty. In rough sets, we have some additional information related to the elements. Elements that have same information are kept in one equivalence class, and with the help of these equivalence classes, we take lower and upper approximations of the set which leads to uncertainty . According to the rough sets and the fuzzy sets, the rough fuzzy set theory was introduced by Dubois and Prade (1990) in dealing with Fuzzy Decision Information System (FDIS). In the rough fuzzy set theory, two operators are named the upper approximation � R F ð Þ and lower approximation, � RðFÞ are designed to roughly signify the fuzzy concept F. � R F ð Þ and � RðFÞ which are two fuzzy sets on U with membership functions defined by Equations (3) and (4) When using the RFSG, two principal inspirations of the attribute reduction exists which are expulsion of the redundant attributes and enhancement of decision rules. The positive region is a vital conception in the attribute reduction in the RFSG. Taking into account sets C, D ⊆ A, and X ⊆ U, C and D would be the equivalence relations over U. The C-positive region of D characterizes the set of objects being appropriately categorized into D-elementary sets created by IND(D) by making use of the knowledge from C. IND(D) implies indiscernibility which is a property for certain specific attributes in an information table. It is an equivalence relation defined on U. In particular for x i and x j , they are D-indiscernible with respect to D, which can be defined as follows (Zadeh 1979) The C-positive region of D is defined as (Chen et al. 2015): No superfluous attribute is allowed to be present in the reduction. No attribute could be eliminated from R without altering the equivalence classes. To put it simply, a reduction would be an essential constituent of the knowledge that is capable of capturing the entire basic concepts which scrutinized utilizing the indispensable attributes. In line with this, five kinds of reduction were presented by Zadeh (1999) to be employed for the rule extraction in the RFSG. These could be induced on the basis of the discernibility matrix proposed by Yao (2002). Consistent with what Yao (2008a) claimed, a reduction type defined below was implemented in the current research because it was able to maintain the decision type as well as its precision; it is of note that the other four reductions fail to do so (Hu, Xie, and Yu 2007) For "x 2 U, EA x ð Þ can be defined by using Equation (7): If AT is an x_E consistent set of FI, and no proper subset of A is also an x_E ⊆ consistent set, then A is referred to as an x_E reduction of F. By Equation (7), the decision having the maximum lower approximation membership value and the membership value of the objects under the decision are unaffected as well.
The decision rules in the RFSG which entail certain and possible rules could be induced from the concept of lower and upper approximation. By the time a reduction is acquired, the fuzzy decision rules could be effortlessly created by overlaying the reduction on the original decision table. Consistent with Han et al. (2014), a decision rule in the FDIS is the expression Ф→Ψ, implying that if Ф then Ψ, where Ф and Ψ are the antecedent and consequent of the rule, respectively. Ф is a crisp set on U and Ψ is a fuzzy membership value. Specific indexes, namely, the generality and absolute support explained by Equation (8) and (9) are employed to authenticate the quality of the derived rules using the RFSG.
It is of note that the generality assesses the rule applicability, representing the extent of the objects or the relative number of the objects, to which the rule could be implemented. The generality has been defined by Yao (2008b) as follows: In granular computing, G stands for the support of the rule. The correctness of the rule is also assessed by the absolute support, defined by Yao (2008b) as follows: In granular computing, the absolute support implies the confidence or accuracy of the rule which could be considered as the conditional probability of a randomly selected element satisfying on the condition that the element satisfies Ф. In set-theoretic terms, this is known as the extent to which m (Ф) is contained within m (Ψ) (Yao 2008b).

Methods
This research has three main objectives: • A hydrocarbon structure in a sedimentary basin may be a member of both the lower and upper approximations of the geologic chance concepts to some degree which are uncertain concepts about spatial boundary of a structure. So the first goal is to calculate two optimistic and pessimistic values for each hydrocarbon structure. Based on these values, classification of structures will be possible in a specified range of uncertainty. • Decision making attributes reduce in a sedimentary basin during hydrocarbon system determination by applying attribute reduction procedure. • Fixed fuzzy rules are extracted from the information table which is related to hydrocarbon system. These rules are independent from structures and can be used for all over the sedimentary basin. They may be applied as a part of Spatial Decision Support Systems (SDSS) for future explorations.
The proposed RFSG model (Figure 1) integrates the merits associated with both rough set granulation and fuzzy sets. This model specifically handles the data tables with crisp conditions and fuzzy decision attributes of related hydrocarbon system. The RFSG modeling process requires the fuzzification of the decision attributes. In this research, the hydrocarbon system parameters were employed as the condition attributes and the geologic risk as the decision attribute. The hydrocarbon structures ought to be categorized randomly into training and testing datasets in order to construct the model. If it is intended to handle the hydrocarbon system parameters more accurately, the relevant FDIS needs to be generated while applying the attribute reduction on the FDIS. Having attained an attribute reduction, the rules are generated by overlaying the reductions on the original decision table and reading off the values. By making use of the dataset, it is required to inspect the validity of the candidate rules originated from the training dataset. Particularly, to execute the mentioned inspection, both the generality and absolute support indices were employed. The candidate rules being acknowledged by both of the indices which would be considered as the qualified rules. These rules could be later revised by the domain experts, and could be uttered in the natural and comprehensible if-then statements .
Several distinct steps are involved on the basis of the RFSG for handling uncertainty during the hydrocarbon structures classification. They include identifying the problem, designing a geodatabase, analyzing the RFSG, and producing classification, entropy and cross-entropy maps as results. Figure 1 depicts the meticulous steps which are needed by each stage and are then explained fastidiously in the subsequent sections.
The most focal stage needed when extracting the spatial rule using a GIS-assisted RFSG model is to understand the criteria and objectives during the hydrocarbon exploration procedure. Spatial factors relevant to the hydrocarbon system could be cautiously investigated by the specialists. Having defined the relative attributes of the hydrocarbon system, numerous spatial and non-spatial data could be gathered as a geodatabase. The data has to be processed appropriately in order to obtain a high quality rule extraction analysis. Data pre-processing encompasses diverse stages like investigating the data distribution and outliers, as well as managing the missing values, plus converting data into proper formats and coordinate systems.
An information table was designed as FDIS, according to the geodatabase information layers. The columns of information table include availability likelihood of hydrocarbon system parameters. Based on this table, three mentioned objectives of this research would be accessible by integration of rough set, fuzzy set and granular computing concepts. In order to condition attributes reduction, positive region as a tool for indiscernibility assessment, was applied in a rough set framework. This reduction helps experts make decisions about the classification of hydrocarbon structures due to their geologic chance in an uncomplicated space. Decision attributes were calculated by a GIS-based Delaunay as membership functions. This fuzzification facilitated handling uncertainties which exist in classification process by an uncertainty interval. Finally, two functions from granular computing method were applied for fuzzy rule extraction.
In reliability judgment phase, by the aim of calculating fuzzy entropy and fuzzy cross-entropy values, the precision of classification of hydrocarbon structures was investigated. The extracted rules are required to be ultimately converted into valuable knowledge, meaning that the qualified rules have to be revised and deduced with the aid of proficient experts. Regarding the unusual patterns or results which fail to match with the common practice, it is essential that they be inspected prior to their implementation. The ultimate results will be the knowledge about the classification of the geologic structures consistent with their geologic risks and relevant interval uncertainties.

Case study and implementation
Fars area located in Zagros fold belt in which is illustrated in Figure 2, Iran entails around 15% of the confirmed gas reserves in the world (Motamedi, Sherkati, and Sepehr 2012). Located in Zagros fold-thrust belt in southern Iran, the anticlines in Fars region have been regarded to be exceedingly valuable because they have plentiful hydrocarbons. Furthermore, the region is readily recognized by the North West-South East trending parallel anticlines which verge to the South West (Maleki and Jahadgar 2015). According to the geological classification, the study area includes Interior Fars, Sub Coastal Fars, Coastal Fars and Bandar Abbas area. Four hydrocarbon systems, namely, Permian, Jurassic, Tertiary and Cretaceous could be detected in Fars region. Over the past years, various surveys have scrutinized the hydrocarbon system in Fars embayment. By doing so, tectonic and The mainstream of the recognized reservoirs in the area are carbonated, but Asmari and Khami group formations outcrop in the area. Dehram group being linked to Permian would make the mainstream of the examination objective in Fars area. It is of note that Khami, Bangestan and Asmari come next.
A hydrocarbon system involves a pod of active source rock plus the entire associated oil and gas. Also, it embraces the entire necessary elements and processes which are required for the existence of the oil and gas accumulations (Magoon and Dow 1994). The indispensable components include the source, reservoir, seal and overburden rocks, whereas the processes consist of trap formation and the generationmigration-accumulation of the petroleum. The entire necessary components are urged to be placed in time and space by which the processes needed to form a hydrocarbon accumulation would befall. Chronostratigraphic chart of Fars province (Figure 3) shows the age of rocks strata of the study area in relation to time.
Diverse methods could be applied for hydrocarbon prospects classification from the viewpoint of the geological risk and it is noted that every single approach  could have their own limitations and advantages. The focal issues were the bulky quantity of the prospects, formations and parameters, which could influence the establishment of the hydrocarbon system. To counter these effects, a novel solution was recommended by this research. As such, the geological risk could be computed for every prospect as Bazan and Szczuka (2001)claimed in one formation, and by taking into account only one parameter of the hydrocarbon system each time and following that the process would proceed for the other prospects and the other parameters, alternately. The formation group stays unchanged and then the Dehram group was concentrated on due to the fact that it forms the majority of the exploration target in Fars area.
The mentioned approach is, however, disadvantaged mainly about uncertainty existing in the knowledge-based process, meaning that once the experts select a membership value to a parameter that named decision attribute in the information table, this number would commonly have a number of uncertainties with itself. Dealing with uncertainty means the uncertainty accompanying an information product has to be stated meaningful terms and measures (Koromorowski et al. 1999). This issue, yet, was considered as vagueness relevant to both the fuzzy and rough sets.
This research was undertaken on the hydrocarbon prospects of Fars province in Iran. According to the geologic surveys, among the 246 structures existed in the area; 102 structures were selected to be manipulated in the GIS environment as the train and test data. Every structure had a capability for the hydrocarbon exploration. It is estimated that approximately ten million USD would be required to drill an offshore hydrocarbon well on each structure. Consequently, any error involved in site selection of the drilling place can result in the huge waste of capital and time. Increasing the quantity of the wells on structures to appropriate locations has been claimed to be a really paramount for the prosperity of the hydrocarbon exploration corporations.
In keeping with numerous hydrocarbon geologic theories and the aid of the professionals from diverse fields, the vital hydrocarbon system parameters such as the source rock factors, the reservoir factors, the trap factors, and the seal rock factors were scrutinized and seven hydrocarbon system variables were examined as the condition attribute for the information table. The mentioned factors include "trap", "reservoir", "source", "seal", "porosity", "retention" and "extraction". The data pre-processing was undertaken to improve the data quality. The spatial data concerning the structures, wells, the sedimentary basin, and their attribute tables were gathered as a geodatabase as a basis for the needed GIS analysis. The extent of each condition attributed gained from five experts and the mean of all the quantities was selected for each attribute. This process was reiterated for all the 102 structures and the measures concerning the chance for the existence of each factor were transformed to four classes as represented in Table 1. Table 2 represents the information table of the first five structures (Figure 4) from the original data to demonstrate the classified quantities.
The hydrocarbon exploration in Fars area was spatially clustered, which was substantiated by assessing Moran's Index by spatial autocorrelation tool in the GIS environment for the sample drilled wells ( Figure 5). Its p-value was roughly 0.09 and the z-score was around 1.7, implying that there was fewer than 10% probability that the incidence of the hydrocarbon system is spatially randomly distributed. Consequently, in keeping with the popular hydrocarbon systems, the adjacent structures would probably have hydrocarbon or not. This could be attributed to the first law of geography (Tobler 1970).
Consequently, in Fars province, we can determine if a structure has hydrocarbon with a high or low geologic chance via the incidence of the hydrocarbon exploration in the adjacent structures. Nonetheless, several exceptions could be  regarded for this law in actual fact. The major denial can be attributed to the faults. They can extinguish the conditions of the reservoir rocks as a central portion of two adjacent structures or even one structure. This problem was handled in this study by applying barriers for fault feature class in the geodatabase. The adjacency concept explained at an earlier juncture could be modeled through the fuzzy sets which could be employed as the decision attribute in two levels, namely, Bad Geologic Chance (BGC) and Good Geologic Chance (GGC). The decision attributes membership could be specified as a scenario that is detailed as follows: Primarily, the geologic  Table 2 Figure 5. Spatial autocorrelation report and its results.
structures in Fars area could be categorized into two classes, namely the ones with hydrocarbon and without hydrocarbon. A new condition attribute field "Adjacent Structures" would be created for every structure by making use of the Thiessen Polygon analysis in the GIS environment ( Figure 6 ) by counting the number of the adjacent structures having hydrocarbon. For every likely value of the "Adjacent Structures", the quantity of the structures that contain hydrocarbon could be estimated. The quantity was utilized to signify the corresponding fuzzy membership. The transformed FDIS of the first five structures have been depicted in Table 2. The fuzzification of the decision attribute substitutes the "Adjacent Structures" in the table with related fuzzy membership values (Table 3).

RFSG analysis
The RFSG analysis could be executed having designed the geodatabase and construction of the FDIS table. In this research, the hydrocarbon system parameters were utilized as the condition attributes while two fuzzy decision attributes were chosen as "low approximation of Geologic Chance" and "high approximation of Geologic Chance". We could use the identified rough set toolkits in order to create positive areas, attribute reductions, and rules. The data has to be categorized into training and test datasets to construct the model and test the validity of the extracted rules, respectively. The derived rules ought to be filtered to facilitate the measures in granular computing. Such measures include generality and absolute support. A threshold was defined for every measure as a filter  for the extracted rules. The candidate rules acknowledged by both the generality and absolute support thresholds are considered as the qualified rules to be additionally revised by the experts. 62 structures out of the 102 structures were randomly chosen as the train dataset whereas the 40 remained structures were utilized as the testing dataset. Seven condition attributes existed which include hydrocarbon system parameters and two fuzzy decision attributes concerning the geologic chance of the hydrocarbon exploration. The training dataset was employed to create the reductions and rules by utilizing the positive region concept Equation (5) for each attribute. Two reductions were derived and 27 rules were created. The algorithm was executed on a genetic based reduction to detect a minimal reduction of the FDIS. The reduction sets were generated as {Trap, Reservoir, Source, Retention, Extraction} and {Trap, Source, Seal, Retention, Extraction} for the FDIS. The conclusion to draw is that removing the porosity in both reductions would be someway reasonable due (Yao 2002) to the fact that it is one of reservoir implicit features. However, it is impossible to exclude the reservoir and porosity instantaneously in actual fact. Therefore, it is necessary to choose the first reduction as the possible situation. By having several trials and errors, it was planned to filter 27 extracted rules by their generality and absolute support values. While benefiting from supervision of five experts from diverse fields and the rule shortening process with granular computing parameters, we could acquire 11 candidate rules. In order to measure the generality and absolute support, we counted the total matching numbers of the structures. Any candidate rules having a generality and absolute support beyond 0.75 was acknowledged as a qualified rule. As a final point, 11 qualified rules along with their related generality and absolute support values were presented in Table 3.
It is possible to express the qualified rules in the natural and understandable form of if-then statements. As Chen et al. (2015) stated, such rules are a generalized form of the logical rules that are referred to as the association rules. Even though such keywords as "if" and "then" are employed, the rules are cautioned not to be interpreted as expressing logical implications. As an alternative, they should be employed to simply connect the concepts together (Pawlak and Skowron 2007). It should be asserted that, diverse sorts of uncertainties and the properties accompanying the rule are qualified by generality and absolute support.

Discussion
It is of note that the entropy-based approaches are employed to appraise the accuracy of the results. Formulas proposed by Shannon (1948) were employed in this research to estimate the entropy and cross-entropy to evaluate the accuracy of fuzzy classification. The entropy formula of a fuzzy set is defined as follows: where μ k x s ð Þ shows the fuzzy degree of x s related to class k; x s s represents the individual objects; n stands for the total number of the objects; k denotes the individual categories; and c represents the total number of categories. A high value of entropy means an object fit in many classes and therefore is fuzzy; contrarily, a low value means the object has a high membership of only one class and is crisp accordingly. The cross-entropy measures the complementary effect of two different magnitudes: the posterior entropy of the reference probability and the divergence of the classifier posterior from the reference posterior. Here crossentropy is proposed as a means of evaluating the accuracy of a rough fuzzy classification, by illustrating how closely a rough fuzzy classification represents geologic risk when fuzzy membership is a feature of both the training and test data sets. We could make use of the cross-entropy with the aim to evaluating the accuracy of the fuzzy classification outputs in regard to the fuzzy reference data (Foody 1995): A small value of the cross-entropy implies an accurate classification. In this research, 62 structures were existed as the training data. The sum of all the fuzzy entropies was 51, whereas the average fuzzy entropy was 0.85. Moreover, the standard deviation was 0.031. The sum of all the fuzzy cross-entropy was 10.8, whereas the average fuzzy cross-entropy was 0.18. Furthermore, the standard deviation was 0.148. Figure 7(a,b) exhibits the fuzzy entropies and fuzzy cross-entropies of the structures in the training data.
As it can be discerned, just seven structures had crossentropies greater than 0.1 while three structures were larger than 0.3. It is implied that just some structures existed with results significantly different from the train data.
In the RFSG-based model of this study, a structure may be a member of both the lower approximation and upper approximation of the concepts to some degree; in other words, they are fuzzy concepts. The lower and upper approximation for the BGC and GGC are exhibited in Figure 8.
The domain experts revised and talked over 11 extracted fuzzy rules with the aim of interpreting the rules and devising a strategy. It is interesting to know that fuzzy decision rules could be generated by our RFSG framework due to the fact that the RFSG is able to approximate the fuzzy concepts. The conclusion to draw is that the RFSG rules encompass more information in comparison with the classical rules. Consequently, the decision maker can acquire additional information in comparison with the classical rules (Seraj, Delavar, and Rezaee 2019).

Conclusion
Selecting an appropriate spot for drilling the exploration well on a structure has been claimed to be essential in that any fault in computing the drilling place can result in wasting the capital and time. Notwithstanding such a great deal of investment, hydrocarbon exploration has been referred to as a conventional instance for making decision under risk and uncertainty conditions. In geospatial science, the uncertainty can evidently bring unwanted results to the users thanks to making improper decisions. For this reason, classification of hydrocarbon structures under the uncertainty conditions is a vital tool during hydrocarbon exploration procedure. It facilitates the prioritization of hydrocarbon structure for drilling.
This research has illustrated how the rough-fuzzy concept of a hydrocarbon system parameters could be quantified using RFSG. The work deals with classifying hydrocarbon structures in an information system in which there are fuzzy decision attributes and it proposes a novel approach for handling spatial uncertainty in classification of hydrocarbon structures. The classic rough set or the fuzzy approaches distinctly recommend the location of drilling; however, they are faced by theoretical restrictions which could result in unsuccessful or deceptive interpretations. As a ruleextraction model, the RFSG which is on the basis of the rough fuzzy sets is recommended to extract the spatial fuzzy rules for classifying the hydrocarbon structures in Fars province. Those fuzzy spatial decision rules which are detected by the model are employed in recognizing the uncertainty degree of the existing hydrocarbon in a structure. Therefore, according to condition and decision attributes, there should be more attention to hydrocarbon system parameters when an exploration for hydrocarbon is going on an area. For example, due to our proposed model, rule number 10 could explicate as below: • "IF a structure has 75% chance for trap, • AND the chance for reservoir existence is 100%, • AND source rock entity chance is 75%, • AND extraction chance is 100% • THEN the structure will have 0.34 BGC and 0.66 GGC for hydrocarbon system existence".
The generality and absolute support values for this rule were 1 and 1, respectively. By computing the reductions as a vital stage throughout the decision making process, they could be applied to detect minimal conditional attributes among numerous parameters associated with the hydrocarbon system, upon which the decision attributes are contingent. The produced fuzzy rules could be applied to foresee or categorize the hydrocarbon structures.
Finally, in our RFSG model, a spatial structure can belong to both the lower and upper approximation of concepts, i.e. these values of concepts are fuzzy. Although our information table can be converted into a classical decision table, it may lose some of the information of the fuzzy sets. For instance, the fuzzy sets cannot be declined too roughly to stipulate appropriate partitioning of the structures which would finally influence the accuracy of classification negatively. Indeed, if the fuzzy sets are divided into too many crisp subsets, the rules extracted from the decision table may be fitted in an exaggerative situation. In considering these, it can be seen that proposed RFSG model is more suitable for hydrocarbon structure classification when our decision table has fuzzy decision attributes.
The model proposed in this work helps in developing tools with more precision in the field of assessing geologic risk of hydrocarbon structures. RFSG considers structures with minimum range of uncertainty (difference between upper and lower approximation), entropy and cross-entropy as best class in terms of drilling. Vice versa it classifies structures with maximum range of uncertainty entropy and cross-entropy as worst class for drilling. Making decisions under these uncertainties is risky. This work opens a way for experts to develop soft computing tools for their future applications.

Data availability statement
The data that support the findings of this study are available from the corresponding author, upon reasonable request (https://www.researchgate.net/profile/Sahand-Seraj-2).