Co-occurrence and abundance of pollinators and pests in horticultural systems in Africa using an integrated Earth observation-based approach

Flower-visiting insects that are pollinators play a critical role in promoting biodiversity in agroecosystems and agricultural food production through their pollination ecosystem service. However, several factors affect the survival of these pollinators and flower visitors, including the heavy and indiscriminate application of agrochemicals to control crop insect pests, which is impacted by various cropping patterns in a landscape and by shifting environmental conditions. Thus, this study focused on investigating the influence of cropping patterns on the spatial distribution of pollinators ( Apis mellifera , Hymenoptera other than A. mellifera

pollinators for the production of fruits, vegetables or seeds (Klein et al. 2007).
Some studies have referred to a possible decline of these pollinators in various geographical setups (Novais et al. 2016;Rhodes 2018), while other studies have shown that managed bees are on the increase in different set-ups, as estimated from Food and Agriculture Organization (FAO) of the United Nations datasets from 1961 to 2017 (Phiri, Fèvre, and Hidano 2022).However, these insect pollinators are still endangered by factors such as landscape simplification, increasingly influenced by monoculture cropping systems; intensive use of agrochemicals such as synthetic pesticides; climate variability; and increased occurrence of pollinator pests and diseases (Moreaux et al. 2022).
Flower-visiting insects have been used to serve as an indication of pollination services in several crops, including avocado (Persea americana) (Garibaldi et al. 2020).Various insects visit the avocado flower; however, the insect flower visitors that have been known to be the most efficient pollinators for avocado are the Western honeybee Apis mellifera (Dymond et al. 2021).Consequently, distinguishing pollinators from other generic flower visitors is important in avocado production for the successful management of key pollinator species (Sagwe et al. 2022).
Avocado, a highly pollinator-dependent crop, is a vital horticultural commodity in Kenya and is largely cultivated by small-scale farmers.Eighty percent of avocado produced in Kenya is consumed in the domestic market, while the rest is exported as fresh or processed fruits/oils (Kathula 2021).However, the presence of insect pests like the false codling moth (Thaumatotibia leucotreta) and the oriental fruit fly Bactrocera dorsalis has a severe impact on the production of avocado in Kenya (Toukem et al. 2020).Integrated pest management (IPM) has been implemented in avocado production systems to combat these pests while limiting the use of chemical pesticides (Onsomu 2019), but excluding pollinator management strategies.Nevertheless, the inclusion of pollinator management strategies through integrated pest and pollinator management (IPPM) would improve yields of pollinator-dependent crops, while sustaining biodiversity (Biddinger and Rajotte 2015).
Spatial characterization of pollinators, flower visitors and pests is an essential cornerstone of IPPM.
Currently, up-to-date spatial prediction of fruit crop pollinators, flower visitors and pests is scarce in agriculture-promising countries like Kenya.To the best of our knowledge, only a few studies have assessed the spatial pattern of fruit crop insect pollinators, flower visitors and pests at a localized scale.For instance, Makori et al. (2022) used multisource spatial data to understand the spatial distribution and change patterns of stingless bees in Kenya and revealed a higher probability of their decline than of their proliferation.In another study, Mandela et al. (2018) analyzed the diversity and abundance of camphor basil (Ocimum kilimandscharicum) flower visitors in Kakamega forest in Kenya and reported that species diversity of the flower visitors increased with closeness to the forest edge.
Additionally, Zingore et al. (2020) predicted the potential expansion of the peach fruit fly Bactrocera zonata and found that, under changing climatic conditions, the pest could invade wider regions in Africa and South America.Furthermore, Mahmoud et al. (2020) determined the habitat suitability of two fruit fly species (i.e.B. zonata and B. dorsalis) and their suitable co-occurrence range in Sudan, and found that the two pests were spread across a wide area in the country.Furthermore, Stotter (2009) assessed the spatial-temporal distribution of the T. leucotreta in South Africa in the citrus crop and found that male T. leucotreta were mostly confined to citrus orchards, thus providing insights into the local distribution of the T. leucotreta across the agricultural landscape.Statistical models have also been employed to estimate B. dorsalis and T. leucotreta abundance in avocado orchards by using climatic variables (Odanga et al. 2018) and a landscape productivity indicator (i.e.normalized difference vegetation index: NDVI) (Toukem et al. 2020).Both studies demonstrated the importance of climatic and NDVI variables in estimating B. dorsalis and T. leucotreta abundance, respectively.The spatial distribution of pollinators, flower visitors, and pests is also influenced by the landscape structure, which could be tailored in terms of land use land cover (LULC) or cropping patterns (Mudereri et al. 2020;Ochungo et al. 2019).
Although various milestones have been achieved by earlier studies, most have not predicted spatial distribution probability for the avocado pollinators, flower visitors, and pests in a localized avocado production system such as Murang'a County.Furthermore, few studies have assessed the use of geospatial modeling approaches for determining the suitable sites for the co-occurrence of two or more pollinators, flower visitors, or pest species (Mahmoud et al. 2020).Moreover, studies have not looked at the role or effect of remotely sensed cropping patterns and environmental variables in estimating pollinator, flower visitor and pest distribution, cooccurrence and/or abundance.
Therefore, this study sought to illustrate the synergy provided by remotely sensed outputs, e.g.cropping patterns in applications such as insect studies.Specifically, the aim of this study was twofold: (1) to predict the spatial distribution probability of avocado pollinators, flower visitors, and pests using cropping patterns, and environmental and topographic variables, together with an ecological niche modeling approach (maximum entropy: MaxEnt), and to determine the suitable co-occurrence range of the avocado pollinators, flower visitors and pests; and (2) to estimate and analyze the abundance of avocado pollinators, flower visitors and pests using environmental variables and generalized linear models (GLMs).

Study area
The study was conducted in the County of Murang'a in Kenya (Figure 1).The County lies between latitudes 0° 34' 00'' S and 1° 07' 00'' S, and longitudes 36° 00' 00'' E and 37° 27' 00''E.There are two distinct rainfall patterns in the area: long rains from March to May and short rains from October to November every year.The annual temperature ranges from 12°C to 20°C, while the annual rainfall ranges from 800 to 2600 mm (Ovuka and Lindqvist 2000).Murang'a County has a complex heterogeneous landscape, translating into heterogeneous cultivation of crops like avocado, common bean (Phaseolus vulgaris), sweet potato (Ipomoea batatas), mango (Mangifera indica), maize (Zea mays), macadamia (Macadamia integrifolia), arrowroot (Maranta arundinacea), pineapple (Ananas comosus), banana (Musa spp), coffee (Coffea arabica) and tea (Camellia sinensis).Horticultural crops, e.g.avocado, depend on pollinators such as Hymenoptera Apis mellifera.The peak avocado flowering season begins in August (Sagwe et al. 2022), while the fruiting period occurs in February (Toukem et al. 2020).Gatanga, Kandara and Maragua sub-Counties in Murang'a County were selected, since they are critical for avocado farming in Kenya.Further information regarding the study area has been described in Aduvukha et al. (2021).

Methodology
Figure 2 demonstrates the methodology used for species distribution modeling (SDM), co-occurrence analysis, and species abundance modeling (SAM).In summary: i) SDM involved using the MaxEnt model while integrating remotely sensed data of cropping patterns and non-croplands variables mapped in  A detailed description of the datasets and analysis is presented herein.processing was carried out using the Sentinel application platform (SNAP) toolbox and they included the application of the precise orbit file, thermal noise and image border, radiometric calibration, and speckle filtering (Filipponi 2019).In addition, the 90 m shuttle radar topography was bilinearly resampled to 10 m for terrain correction of the Sentinel-1 data.A subset composite image VV and VH image of each season was then obtained after stacking the individual processed images (Aduvukha et al. 2021) 3, 4, 5, 6, 7, 8a, 11, and 12 (Aduvukha et al. 2021).(Ahamed et al. 2011) 3 Green normalized difference vegetation index (GNDVI)

2.2.
NIRÀ Green NIRþGreen (Gitelson, Kaufman, and Merzlyak 1996) 4 Modified soil adjusted vegetation index (MSAVI) (Qi et al. 1994) 5 Normalized difference vegetation index (NDVI) NIRÀ Red NIRþRed (Tucker et al. 1979) 6 Normalized difference water index (NDWI) NIRÀ SWIR3 NIRþSWIR3 (Gao 1996) 7 Soil adjusted vegetation index (SAVI) pineapple, while the non-croplands included areas of water, forest, shrubland, and built-up areas (Aduvukha et al. 2021).A stratified random sampling method was used to collect the ground truth field data as points (i.e.pixels) by using a mobile-based global positioning system (GPS), and GPS Essentials (GPS Essentials 2020) with a maximum allowable error of ±3 m.Furthermore, the collected field data points were set at a sampling distance of ≥20 m each to avoid spatial autocorrelation instances with respect to the 10 m resolution of Sentinel-2 imagery bands used.On-screen digitizing on high-resolution imagery provided by Google Earth imagery (Google Earth 2020) was then employed to convert the pixels of reference data to homogenous units (i.e.polygons) to be used for classification.Studies have shown that polygon-based training areas perform better than pixel-based training areas do (King'ori et al. 2023).A summary of the number of points and their corresponding pixels within the polygons of each class is shown in Aduvukha et al. (2021).

Cropping pattern and non-croplands classification.
This included selecting the most impor- tant variables among the remote sensing dataset combinations (Table 3) and thereafter using a machine-learning algorithm for classification.Specifically, a guided regularized random forest (GRRF) algorithm was used to select important variables in each of the eight remotely sensed data combination scenarios (Table 3) (Aduvukha et al. 2021).
The GRRF for selecting most important variables has been found to perform better than other methods, such as regularized random forest (RRF) and random forest (RF) algorithms, do (Deng and Runger 2013).
The limitation of RF in selecting the most important variables lies in its susceptibility to select highly correlated variables, while RRF may select variables that are not robustly relevant (Deng and Runger 2013).On the other hand, the strength of GRRF in selecting the most robust variables lies in its facility to subject each feature to a penalty coefficient by altering the coefficient of importance of gamma (γ) value of 0 to 1, while maintaining the base coefficient of lambda (λ) value of 1 (Deng and Runger 2013).The importance of the variables was assessed using the mean decrease accuracy ranking method (Han, Guo, and Yu 2016).Further explanation of GRRF, RF and RRF in variable selection can be found in Deng and Runger (2013).
Thereafter, a RF classification algorithm was used to delineate the different cropping patterns and noncroplands (Aduvukha et al. 2021).The RF algorithm was preferred to other supervised classification methods, such as maximum likelihood.This is because RF is non-parametric, i.e. it does not assume the data distribution but it learns first from the "seen" data and predicts the pattern of the "unseen" data (Breiman   (Sisodia, Tiwari, and Kumar 2014).Data were partitioned as 70% for training and 30% for testing (Aduvukha et al. 2021).The variable selection and cropping patterns and non-croplands classification were implemented in R software using the caret package (R Core Team 2019).Consequently, the area under class method (Olofsson et al. 2013) was used to construct the confusion matrix for accuracy assessment to estimate the user's accuracy (UA), producer's accuracy (PA), overall accuracy (OA), and kappa coefficient (Aduvukha et al. 2021).

Pollinators, flower visitors, and pests field data collection 2.2.2.1. NDVI field characterization for sampling avocado farms.
Time-series Sentinel-2 images of both dry and wet seasons accessed from the Google Earth Engine cloud computing platform were used (Gorelick et al. 2017).The seasons were determined from Climate Hazards Group InfraRed Precipitation with Station Data (CHIRPS) rainfall data (Adan et al. 2021).Ninety images for the dry season were obtained from 1 January 2018 to 28 February 2018, while 50 images for the wet season were obtained from 1 March 2018 to 31 March 2018.One hundred random points were generated within the study area, and a K-means clustering technique was used to categorize the extracted NDVI values into three levels, i.e. high, medium, and low (Table 4) (Adan et al. 2021).
The dry and wet season NDVI images were combined to provide a composite NDVI image (Adan et al. 2021).Using expert knowledge (observing texture, pattern, shade, tone, and color hue) (Li et al. 2020), the classification accuracy of the composite NDVI was assessed.An overall accuracy of 86.2% was attained, as detailed in Adan et al. (2021).

Sampling protocol for farms.
The vegetation intensity classes of low, medium, and high obtained from NDVI as described in Section 2.2.2.1 were used as sampling strata, in which pollinator, flower visitor and pest data were collected in the study area.The sampling was carried out during the avocado peak flowering (26 August 2019-4 September 2019) and peak fruiting (27 January 2020-13 February 2020) seasons, respectively, in farm sizes of approximately 0.2-0.4ha (Sagwe et al. 2021).Thirty-five farms were selected across the low, medium, and high NDVI regions, using a multi-stage sampling protocol, as detailed in Adan et al. (2021).In summary, the avocado farms selection protocol in each stratum included: (1) minimal number of avocado trees per farm at seven; (2) socio-economic data on farmers' willingness-to-pay for IPPM technologies (IPM only where biological treatment of pests was introduced, pollinators only (P) where managed bees were introduced, IPPM where both managed bees and biological treatment of pests were introduced, and control where neither treatments of pests nor managed bees were introduced); and (3) setting specific distances among avocado farms with the different IPPM technologies.The specific distances between farms with the different technologies were as follows (i) IPPM and P were at least 1.5-3.0km separate from each other, (ii) IPM and control were at least 0.5 km away from each other, and (iii) IPPM or P were at least 3.5 km away from either IPM or control sites (Adan et al. 2021).The implementation of the different IPPM technologies was assessed in a separate study by Toukem et al. (2022) to establish the effect of the inclusion of pollinators in pest management of crops such as avocado.

Avocado pollinator, flower visitor, and pest occurrence and abundance data.
For purposes of this study, flower-visiting insects that are good at pollinating avocado crops were defined as "pollinators," while those flower-visiting insects that are poor at pollinating avocado crops were called "flower visitors" (Sagwe et al. 2022).Furthermore, in this study, "occurrences" were defined as geolocated instances of the observed pollinator, flower visitor, or pest, while "abundance" was defined as the relative number (count) of a pollinator, flower visitor, or pest trapped per unit area of the farm size.Therefore, for avocado pollinators and flower visitors, three avocado trees, spaced 20 m apart within each of the selected farms, were randomly selected, and each of the three avocado trees was observed for 5 min from 0800 to 1700 h (Greenwich Mean Time: GMT + 3).The pollinators and/or flower visitors were captured using sweep nets (white in color) and were forthwith preserved in 70% ethanol (Sagwe et al. 2022).The sampled pollinators and flower visitors were identified by Robert Copeland, icipe, Nairobi, Kenya, and categorized into four groups: (1) A. mellifera (order Hymenoptera), (2) Hymenoptera excluding A. mellifera (order Hymenoptera), ( 3) Syrphidae (order Diptera) and ( 4) Calliphoridae (order Diptera).Other categories of pollinators or flower visitors were also sampled, but were of very low count, and thus were not included in this study.
Regarding avocado pests, traps for the specific pests were set on two separate avocado trees at a distance of ≥20 m apart in each of the selected farms (Toukem et al. 2020).Lynfield traps (icipe, Nairobi, Kenya) with para-pheromone methyl eugenol (River Bioscience, Addo, South Africa) were used to trap B. dorsalis, while T. leucotreta were trapped using white delta-shaped traps, lured with the relevant sex pheromone (Kenya Biologics, Nairobi, Kenya).Insects from the Lynfield traps were kept in 70% ethanol for preservation, while sheets with trapped insects from the white delta-shaped traps were enclosed in a polythene sheet before analysis in the laboratory, as detailed in Toukem et al. (2020).The pests were subsequently identified by Robert Copeland, icipe, Nairobi, Kenya.On the other hand, the abundance of the avocado pests was collected after 2 weeks within the sampling period only on 17 farms that were set as control treatments out of the total 35 selected farms, since the other 18 farms were put on other IPPM treatments before sampling the abundance of pests.
A mobile-based GPS application, i.e.GPS Essentials (Schollmeyer Software Engineering, Munich, Germany), was used to geolocate the specific trees within each avocado farm where pollinators, flower visitors, and pests were sampled.

Predictor variables 2.2.3.1. Environmental predictor variables and preprocessing.
The most influential environmental variables that affect the distribution and abundance of avocado pollinators, flower visitors, and pests were used as predictor variables in the modeling experiments (Kjøhl, Nielsen, and Stenseth 2011;Odanga et al. 2018).These were average temperature, precipitation, solar radiation, wind speed, morning relative humidity, afternoon relative humidity, and elevation (Table 5).Slope, hillshade, and aspect were derived from elevation.In addition, the variable "cropping patterns," which were derived from Aduvukha et al. (2021), were also included as comprising a predictor variable.These cropping patterns were monocrop tea, mixed crop avocado, mixed crop maize, monocrop avocado, monocrop coffee, monocrop maize, and monocrop pineapple and non-croplands.For the environmental variables, long-term mean average values of July, August, September, and October, estimated from 1961to 1990and 1970to 2000(Kriticos et al. 2012;Fick and Hijmans 2017), were used in predicting pollinator and flower visitor spatial distribution probability, as they coincide with the pre-peak, during and post- peak flowering seasons of avocado in the study area.This period also coincided with the pollinator and flower visitor field data collection.For pest spatial distribution probability modeling, long-term mean average values of the environmental variables, estimated from 1961 to 1990 and 1970 to 2000 for December, January, February, and March, were used.This matched with the avocado pre-peak, during and post-peak fruiting season in Kenya, during which the pests were sampled.
The bilinear interpolation method was used to resample the environmental predictor variables to 10 m × 10 m pixel size and then clipped to the size of the study area to be harmonized with the cropping pattern predictor variable that was developed by Aduvukha et al. (2021).

Predictor variable selection.
The variance inflation factor (VIF) was used to determine the most uncorrelated predictor variables to reduce multicollinearity (Robinson and Schumacker 2009).A VIF threshold of ≥10 was set as an indicator of multicollinearity and redundancy in the predictor variables (Pradhan 2016) (Table 6).The total number of predictor variables subjected to VIF were n = 11 (Table 1) for the pollinators, flower visitors, and pests, with seven predictor variables being retained for pollinators and flower visitors, while eight predictor variables were retained for the pests (Table 6).However, some of the key predictor variables, such as temperature that is known to influence the distribution of pollinators, flower visitors, and pests, were suggested for exclusion by the VIF test.However, some of these predictor variables were retained, for instance, average temperature, based on their biological relevance to avocado pollinators and flower visitors (average temperature with VIF of 135.99) and pests (average temperature with VIF of 70.71) distribution and abundance (EFSA 2011; Pradhan 2016).

Species distribution modelling.
2.2.4.1.Model settings.Before applying the SDMs to predict the spatial distribution probability of avocado pollinators, flower visitors, and pests, the geolocations of avocado pollinators, flower visitors, and pest observations were reprojected to the Universal Transverse Mercator (UTM) coordinate system, zone 37 south (Snyder 1987).This was done to ensure compatibility with the coordinate system of the predictor environmental variables.
The MaxEnt model, Version 3.4.1 (Phillips and Dudík 2008), was used to predict potentially suitable areas of avocado pollinators, flower visitors, and pest distributions.The model settings of the MaxEnt were majorly influenced by the number of occurrence points for each of the avocado pollinators, flower visitors, and pests (Phillips and Dudík 2008).Consequently, this influenced the MaxEnt feature types in modeling the avocado pollinators, flower visitors, and pests (Table S1 in Supplementary).Outliers were eliminated using a "ten percentile" training presence criterion, which declares the 10% most extreme presence observation as absent (Cord et al. 2014).Additionally, a regularization multiplier of two was employed to ensure a less localized prediction (Radosavljevic, Anderson, and Araújo 2014).Moreover, to ascertain a robust MaxEnt model, the replicate runs were set to 10 ( Makori et al. 2017).Cross-validation replication type was used for A. mellifera, Syrphidae, Calliphoridae, B. dorsalis and T. leucotreta because of its robustness (Kohavi 1995), while a bootstrap replication type for Hymenoptera excluding A. mellifera was used because of the small sample size (Merow, Smith, and Silander 2013).A sensitivity analysis of the variable contribution to the model was conducted using a jackknife test.The jackknife test assesses how each variable affects the performance of the model by determining changes in the accuracy of the model as it systematically eliminates

Model performance assessment.
Model performance of the prediction of spatial distribution probability for the occurrence of avocado pollinators, flower visitors, and pests was assessed using the receiver operating characteristic (ROC)'s thresholdindependent area under the curve (AUC) (Merow, Smith, and Silander 2013).The AUC informs the probability of whether presence (sensitivity) in comparison to absence (specificity) was ordered correctly by the model.The values of AUC range from 0 (no possibility of occurrence) to 1 (highest possibility of occurrence), with values greater than 0.7 being regarded as acceptable for predicting spatial distribution probability for the species (Araújo et al. 2005).

Co-occurrence spatial distribution of avocado pollinators, flower visitors, and pests.
Potential co-occurrence analysis of the sampled pollinators, flower visitors, and pests was carried out using the spatial distribution probability outputs generated from MaxEnt.The MaxEnt outputs were in the American Standard Code for Information Interchange (ASCII) file format, which were first converted to the raster image file format (tiff) and assigned three unique values: low, medium, or high.The tiff files were reclassified to low (0.01-0.35), medium (0.36-0.69), and high (0.70-0.99) classes.These classes represented the cluster thresholds of cooccurrence spatial distribution probability of the pollinators, flower visitors, and pests, from the cooccurrence analysis.Each of the tiff files was converted to a vector polygon file to perform an intersect analysis among the respective classes represented by the respective polygons.This was done to assess areas of similarity between the pollinators and flower visitors (only pollinators and flower visitors) and pests (only pests) and also to compare the co-occurrence with the occurrence of all pollinators, flower visitors, and pests.Intersecting polygons were converted to raster data, resulting in the delineation of cooccurrence spatial distribution probabilities of all the pollinators and flower visitors, all the pests, and all the combined pollinators, flower visitors, and pests.

Species abundance modelling
A generalized linear model (GLM) was implemented in R software, Version 3.6.1 (R Core Team 2019) to infer the relationship between the abundance of the avocado pollinators, flower visitors, and pests, and the selected environmental predictor variables.Specifically, the negative binomial distribution in GLM, which accommodates overdispersion of integer counts data, was used (Lindén and Mäntyniemi 2011).The coefficient estimates of all the environmental predictor variables and their significance level (p ≤ 0.05) were analyzed to assess their relationships with the abundance of avocado pollinators, flower visitors, and pests.Hymenoptera species excluding A. mellifera pollinators were not included in this analysis because of their very low counts (n = 8).

Cropping pattern and non-croplands
The best-performing classification scenario (OA = 94.33% and kappa = 0.93) (Table S2 in Supplementary), i.e.Sentinel-2 bands, vegetation indices and Sentinel-1 combination, was used in the spatial modeling of the avocado pollinators, flower visitors and pests (Aduvukha et al. 2021).The mapped cropping patterns were monocrop avocado, mixed crop avocado, monocrop maize, mixed crop maize, monocrop coffee, monocrop tea, and monocrop pineapple while the non-croplands were comprised of built-up area, grassland, forest, shrubland, and water (Figure 3).

Maximum entropy (MaxEnt) model performance
All the MaxEnt models used for predicting the spatial distribution probability of all the studied avocado pollinators (A.mellifera, Hymenoptera excluding A. mellifera and Syrphidae), flower visitors (Calliphoridae) and pests (B.dorsalis and T. leucotreta) demonstrated a good prediction performance within an AUC of 0.70-0.83(Figure 4).

Spatial distribution probability of avocado pollinators, flower visitors, and pests
The MaxEnt model for avocado pollinators (A.mellifera and Syrphidae) and avocado flower visitors (Calliphoridae) predicted a medium to a high spatial probability distribution in Kandara sub-County, and a low to a high spatial probability distribution in Maragua and Gatanga sub-Counties (Figure 5a-c and d).The majority of Kandara and portions of Maragua and Gatanga sub-Counties experienced the highest spatial distribution probability score of >0.9 for the presence of A. mellifera and Syrphidae pollinators, and Calliphoridae flower visitors.The MaxEnt model for Hymenoptera excluding A. mellifera pollinators (Figure 5b) also showed a low to a high spatial probability distribution score in the three sub-Counties.The MaxEnt model predicted high avocado pest spatial probability distribution scores (≥0.9) in the central and western sides of Maragua and Kandara sub-Counties, and low to high scores in Gatanga sub-County.A low distribution score was observed in the eastern side of Maragua and Gatanga sub-Counties for B. dorsalis and T. leucotreta (Figure 5e, f respectively).

Co-occurrence spatial distribution probability of avocado pollinators, flower visitors, and pests
The avocado pollinator and flower visitor co-occurrence analysis showed a low to high probability of cooccurrence in the three studied sub-Counties, with the majority of Kandara showing a medium to high probability of pollinator and flower visitor co-occurrence (Figure 6a).Further, avocado pest co-occurrence analysis showed a medium to high probability of co-occurrence in Kandara, while a low to a high probability of cooccurrence was present in Gatanga and Maragua (Figure 6b).On the other hand, the combined cooccurrence analysis of avocado pollinators, flower visitors, and pests showed a medium to a high probability of co-occurrence in Kandara and a low to a high probability of co-occurrence in Maragua and Gatanga.(Figure 6c).The accuracy of the co-occurrence analysis (Figure 6) is taken to be similar to those of Figure 5 (SDM) since the inputs used are derived from Figure 5.

Avocado pollinator, flower visitor, and pest abundance
The relative abundance of the avocado pollinators and flower visitors showed that A. mellifera was relatively more abundant (80.84%) than Calliphoridae (10.05%) and Syrphidae (9.11%).Among the avocado pests, the relative abundance of B. dorsalis (96.62%) was higher than that of T. leucotreta (3.38%) (Table 9).Distribution of the abundance avocado pollinators, flower visitors, and pests per their respective farms are summarized in Table S3 in Supplementary.

Generalized linear model
Abundance of A. mellifera had a positive relationship with precipitation, slope, and wind speed, but a negative relationship with aspect, hillshade, morning relative humidity, and average temperature (Table 10).Furthermore, the results showed that the abundance of Syrphidae pollinators had a positive relationship with aspect, precipitation, morning relative humidity, slope, average temperature, and wind speed, but a negative relationship with hillshade, while the abundance of Calliphoridae flower visitors had a positive relationship with aspect, hillshade, and wind speed, but a negative relationship with precipitation, morning relative humidity, slope, and average temperature.No variable was significant (p ≤ 0.05) for pollinators and flower visitors.
Regarding the avocado pests, the abundance of B. dorsalis had a positive relationship with hillshade, slope, average temperature, and afternoon relative humidity, but a negative relationship with aspect, precipitation, wind speed, and solar radiation (Table 10).However, only the relationship between the abundance of B. dorsalis with aspect, precipitation, and hillshade was statistically significant (p ≤ 0.05).Thaumatotibia leucotreta abundance showed a positive relationship with aspect, afternoon relative humidity, average temperature and wind speed, but a negative relationship with precipitation, hillshade, slope and solar radiation, with only solar radiation being statistically significant (p ≤ 0.05).

Discussion
This study predicted the spatial distribution of avocado pollinators, flower visitors, and pests in Murang'a County, Kenya using accurate remotely sensed cropping patterns (OA 94.33% and kappa 0.83), environmental factors as predictor variables, and the MaxEnt ecological niche modeling approach.In addition, the study revealed the co-occurrence spatial distribution probability of the studied avocado pollinators, flower visitors, and pests.Furthermore, the study also examined the relationship between the abundance of avocado pollinators, flower visitors, and pests with environmental variables by using a GLM model.

Species distribution modelling
The study identified highly suitable habitats for each of the avocado pollinators, flower visitors, and pests in the northern and central parts of Kandara, the northwestern part of Gatanga, and the south-eastern part of Maragua.These regions are characterized by conducive climatic conditions for avocado farming and thus demonstrated the importance of the LULC variable in predicting pollinator, flower visitor, and pest habitats or invasive plant species (Sittaro, Hutengs, and Vohland 2023;Tonnang et al. 2017).In terms of the influence of environmental variables on the distribution of the avocado pollinators and flower visitors, the present study suggests that the distribution of the pollinators and flower visitors had a negative correlation with high wind speed (>2.5 ms −1 , Figure S2 in Supplementary).There is a likelihood of increased resistance of the flight of A. mellifera and Syrphidae pollinators with high wind speed, especially in the opposite direction (not measured in this study).This may cause a reduced flower visitation rate, consequently reducing their distribution (Hennessy et al. 2020).Wind speed has also been observed to contribute to an increased rate of cooling of some Diptera pollinators (Inouye et al. 2015).The high precipitation (>40 mm, Figure S2 in Supplementary) reported in this study area may be unsuitable for the distribution of A. mellifera and Syrphidae pollinators, and Calliphoridae flower visitors.Previous studies have highlighted how high rainfall negatively affects the foraging and flight activities of A. mellifera pollinators, which may have resulted in their lower occurrence (González et al. 2009).
Average temperature can positively affect the survival, development, and reproduction of crop insect pests, thus affecting their distribution (Zingore et al. 2020).Studies have indicated that T. leucotreta survives and propagates successfully in a temperature range between 16°C and 30°C (Jager and Marthalise 2013).Likewise, Choi et al. (2020) reported that the same temperature range positively influenced the fertility of B. dorsalis in the laying of eggs.Moreover, slope (>0%, <10%, Figure S2 in Supplementary), which is a derivative of elevation, positively influenced the spatial distribution probability of both B. dorsalis and T. leucotreta.This result corroborates Odanga et al. (2018) who found that B. dorsalis distribution increased as elevation decreased, since B. dorsalis is a lowland pest.On the other hand, this study demonstrated that wind speed negatively affected the distribution of B. dorsalis, which implies that regions with high wind speed (≥3.2 ms −1 , Figure S2 in Supplementary) would, for instance, negatively affect the flight of B. dorsalis, hence affecting their distribution (Susanto et al. 2022).
Co-occurrence analysis revealed that a large area of Kandara exhibited medium to high-probability cooccurrence of the studied pollinators, flower visitors, and pests individually and combined.Sharing of the ecological niches between the studied pollinators, flower visitors, and pests may cause the competition for resources, especially by the dominant pollinator and pest thus affecting the populations of other pollinators, flower visitors, and pests (Mahmoud et al. 2020).Nonetheless, there is a possibility of "minor" pollinators, flower visitors and pests to adapt to new ecological niches for their survival (Hassani et al. 2022).

Species abundance and generalized linear modelling
In this study, A. mellifera (order Hymenoptera) was more abundant, compared with Syrphidae (order Diptera) pollinators and Calliphoridae flower visitors (order Diptera).Earlier studies have shown that A. mellifera has been observed to be the most abundant pollinator in fruit crop systems including avocado (Kjøhl, Nielsen, and Stenseth 2011), although A. mellifera is often a managed species.The B. dorsalis avocado pest was more abundant than T. leucotreta, presumably owing to the presence of other suitable host plants, e.g.mangos, which were also in their fruiting season during the time of the field data collection.
The relationship between average temperature and morning relative humidity was negative to the abundance of A. mellifera since these pollinators prefer to start most of their activity at lower temperatures (12°C to 13°C) and because moderate morning relative humidity provides warm and cooler conditions that also increase their activity (Nikolova et al. 2016).On the other hand, wind speed showed a positive relationship with the abundance of A. mellifera in this present study, but Hennessy et al. (2020) have reported that an increase in wind speed increased the resistance of the flight of bees, thereby causing fewer flowers to be visited.The negative relationship of abundance of A. mellifera with the aspect and hillshade, which are both derivatives of elevation, can be explained by the species-area relationship, whereby an increase in elevation may also affect the air temperature, which tends to be cooler than the minimum temperature for A. mellifera activity (Lefebvre et al. 2018).On the other hand, precipitation showed a positive relationship with the abundance of A. mellifera, although high precipitation may cause mechanical difficulty in the flight of A. mellifera, thereby affecting the visitation of these pollinatorshence lower count observations (Lawson and Rands 2019).Surprisingly, no environmental variable was significant to the abundance of A. mellifera supposedly because of the near indistinctive differences in the abundance counts among the different observed farms.
The positive relationship between the abundance of B. dorsalis and T. leucotreta and average temperature and afternoon relative humidity can be explained by the importance of temperature and moisture in improving pest egg formation (Potting and van der Straten 2010;Rashmi et al. 2020).Interestingly in this study, the abundance of B. dorsalis is shown to be positively correlated with hillshade and aspect, which are derivatives of elevation.This is contrary to Odanga et al. (2018), who found that B. dorsalis populations are negatively correlated with an increase in elevation.On the other hand, slope, and hillshade, which are derivatives of elevation, showed a negative relationship with the abundance of T. leucotreta.Odanga et al. (2018) demonstrated the fact that these pests exhibited no distinguishing characteristics across different altitudinal ranges owing to their wider tolerance to temperature ranges, which are also affected by altitude.
The negative relationship between the abundance of B. dorsalis and T. leucotreta and precipitation is presumably caused by the inhibition of survival of the larva-pupal stage of the pest, which is inhibited when soil moisture is beyond the field capacity, while T. leucotreta is more present in warmer humid areas (Montoya, Flores, and Toledo 2008;Potting and van der Straten 2010).Wind speed had a negative relationship with the abundance of B. dorsalis in the present study, in that an increase in the speed of wind, especially in the opposing direction, can cause flight resistance of the pest, thus affecting their flight to the targeted area (Susanto et al. 2022).On the contrary, Verghese et al. (2006) found that wind speed had a positive relationship with the abundance of B. dorsalis, as the wind can also be a medium of dispersing the metathyl food lure for B. dorsalis, hence attracting them, and subsequently increasing the abundance of B. dorsalis catches.An increase in solar radiation causes an increase in temperature, whereby extremely high temperatures negatively affect B. dorsalis (Shrestha, Thapa, and Gautam 2019).The relationship between the abundance of B. dorsalis and precipitation, aspect and hillshade was significant (p ≤ 0.05), supposedly because of the contribution of these variables to temperature variations.
The positive relationship between wind speed and the abundance of T. leucotreta could have influenced more numbers of T. leucotreta to be trapped, since the wind may aid the swinging of the traps for an increased number of T. leucotreta captures (Moore 2019).Presumably, the significance (p ≤ 0.05) of the relationship of the abundance of T. leucotreta with solar radiation demonstrated in the results was caused by the direct temperature effect on the degree of moisture, i.e. relative humidity, thus directly affecting the abundance of T. leucotreta (Potting and van der Straten 2010).
The relationships between the abundance of Syrphidae and Calliphoridae and the environmental variables observed in this study may be inconclusive.This is because their observed counts in the sampled farms did not exhibit population densities that varied significantly for proper interpretation.A longer sampling period could provide an opportunity to sample increased species abundance (Mandela et al. 2018).
This study summarized Hymenoptera excluding A. mellifera, Syrphidae and Calliphoridae into larger groups such as families to increase the sample size suitable for the spatial distribution modeling.This may result in broadening the ecological niches of species in the specific orders and families, and so fail to account for species-specific ecological niches.However, the findings can be useful in deducing the general behavior of the respective pollinators and flower visitors, in comparison with other pollinators, such as A. mellifera of the Apidae family.

Conclusions
This study revealed the possibility of combining accurate remotely sensed variables of cropping patterns and environmental variables in investigating the spatial distribution and abundance of the avocado pollinators, flower visitors, and pests (AUC > 0.70).Specifically, this study revealed that the studied pollinators, flower visitors, and pests had a medium to high probability of occurrence in Kandara and some parts of Maragua.Gatanga had a varied probability of their occurrence, from low to high.This study also highlighted the dominance of A. mellifera avocado pollinators and B. dorsalis as avocado pests.Moreover, the co-occurrence analysis of the avocado pollinators, flower visitors and pests also demonstrated the potential regions of their high, medium, and low simultaneous occurrences.This is crucial in providing key information necessary for the decisionmakers and farmers in the implementation of IPPM techniques at a landscape scale.Future studies could look into the spatial modeling of these insects, during off-peak and peak flowering and fruiting seasons, in regions of similar agroecological zone settings.This would help to capture the diversity of the avocado insects and compare the most influential variables in the abundance of these insects in the different seasons.Consequently, this would further aid in the development of recommender systems for intelligent implementation of sustainable development efforts, such as IPPM.

Figure 1 .
Figure 1.Map of the study area comprising Gatanga, Kandara and Maragua sub-Counties in Murang'a County, Kenya, with overlaid sampled avocado farms, elevation and other surface features.
Aduvukha et al., (2021), avocado pollinators, flower visitors and pests occurrences sampled in NDVI informed avocado farms(Adan et al. 2021;Sagwe et al. 2021;Toukem et al. 2020) and environmental variables; ii) co-occurrence analysis involved intersection analysis of the outputs from the MaxEnt model; and iii) SAM involved utilizing the GLM, i.e. negative binomial distribution, to analyze the effect of the environmental variables on the abundance of the avocado pollinators, flower visitors, and pests.

Figure 5 .
Figure 5. Spatial distribution probability of avocado pollinators (a) Apis mellifera, (b) hymenoptera excluding Apis mellifera, and (c) Syrphidae; avocado flower visitors (d) Calliphoridae and avocado pests (e) Bactrocera dorsalis (f) Thaumatotibia leucotreta predicted using the maximum entropy (MaxEnt) model.The dark blue color indicates a low spatial distribution probability, while the red color represents a high spatial distribution probability.The resolution of the maps is 10 m in relation to spatial resolution of the cropping pattern variable.

Figure 6 .
Figure 6.Spatial distribution probability of co-occurrence of (a) avocado pollinators and flower visitors (b) avocado pests and (c) avocado pollinators, flower visitors and pests.The light green color indicates a low probability of co-occurrence spatial distribution, while the yellow and the red colors represent a medium and a high probability of co-occurrence, respectively.The resolution of the maps is 10 m in relation to spatial resolution of the cropping pattern variable.

Earth observation remote-sensing data description and processing 2.2.1.1. Remote sensing datasets and field data collection. Remote
derived fromLevel-1 (ESA 2019).This study utilized the IW and Level 1 GRD products.Vertical transmit and vertical receive (VV) and vertical transmit and horizontal receive (VH) modes of dual polarization were utilized in Sentinel-1 images.The sensor image pre-

Table 2
(Aduvukha et al. 2021)blished work).A composite image for the vegetation phenological variables (n = 15) from each of the four seasons was created and used in the cropping pattern classification analysis (Table2).2.2.1.1.5.Cropping pattern and non-croplands field data collection.Field data for cropping pattern and non-croplands were sampled from 12 December 2018 to 19 December 2018(Aduvukha et al. 2021).The cropping patterns included monocrop maize, mixed crop maize, monocrop avocado, mixed crop avocado, monocrop coffee, monocrop tea and monocrop

Table 3 .
Aduvukha et al. (2021)asets combination scenarios and number of variables for the classification of cropping patterns and non-croplands, as highlighted inAduvukha et al. (2021).

Table 5 .
Predictor variables used in spatial distribution and abundance analysis of avocado pollinators, flower visitors, and pests.

Table 6 .
Variables selected after performing multicollinearity analysis using the variance inflation factor (VIF) of a minimum of 10 for avocado pollinators, flower visitors, and pests.

Table 7 .
Contribution (%) of predictor variables to Apis mellifera, Hymenoptera excluding A. mellifera, Syrphidae, and Calliphoridae spatial distribution probability from maximum entropy (MaxEnt) models using the jackknife test.

Table 8 .
Contribution (%) of predictor variables to Bactrocera dorsalis and Thaumatotibia leucotreta spatial distribution probability from maximum entropy (MaxEnt) models using the jackknife test.

Table 9 .
The abundance (n) and relative abundance (%) of avocado pollinators, flower visitors, and pests.

Table 10 .
Summary of the relationship between the environmental variables and the abundance of avocado pollinators, flower visitors, and pests depicting the regression coefficient estimates and p-value (p ≤ 0.05).