Distinctive macroinvertebrate communities in a subtropical river network

Abstract Macroinvertebrates are widely used as bio-indicators in streams and rivers, and it is usually assumed that their community composition is primarily controlled by local environmental conditions. We examined the distribution of macroinvertebrates within the Guadalupe River basin (3256 km2) in Central Texas across physiographic gradients. Spatial analysis with variables that considers flow direction, connectivity and distances between sites (asymmetric eigenvector maps, AEM) detected distinctive communities in the lower reaches of the mainstem, in spring-influenced reaches, and in a tributary with intermittent reaches. Variation partitioning with redundancy analysis showed that large-scale factors, i.e. riverine network patterns (large-scale AEM variables), climatic variation and ecoregion explained a significant proportion (28%) of the variation in community composition within a river basin. The riverine network patterns were the most important factor, explaining 12% alone. Local environmental factors were significant, but completely confounded within these spatial patterns. We propose that there are distinctive macroinvertebrate communities depending on the location in the river network and this may apply to other (subtropical) rivers, which should be tested by future studies. We recommend spatial analysis that considers distances and connectivity within a river network as a powerful tool to recognize multiscale riverine network patterns, which can help to identify priority areas for conservation and to develop sound monitoring programs.


Introduction
Biological monitoring is a widely accepted survey methodology to evaluate the ecological health of rivers and streams (Barbour et al. 1999). Although it is well known that both local and regional factors may affect the distribution of communities, one of the principle assumptions of macroinvertebrate biomonitoring is that local communities are primarily controlled by local environmental conditions. Therefore, when high dispersal rates among spatially connected communities (i.e. so-called mass effects) override the importance of local factors, biomonitoring may lead to inaccurate information about the environmental health of a local aquatic system (Vilmi et al. 2016). The relative importance of local environmental conditions versus dispersal for metacommunity structure may depend on the location within the river network. For example, an analysis of three river basins in Maryland, USA found that local environmental factors were most important for macroinvertebrate community structure in headwater sites, but that dispersal-driven processes were more important in riverine mainstem sites (Brown and Swan 2010, but see Schmera et al. 2018). In addition, the spatial location of sites within a riverine network influences composition and diversity across a drainage, with diversity typically being greater at the confluence points in the mainstem network and in the lower reaches of a drainage (Altermatt et al. 2013).
Numerous studies have examined the relative importance of local environmental versus spatial factors for the structuring of metacommunities (e.g. Cottenie 2005;Logue et al. 2011). Most studies conducted in streams and rivers found that local environmental factors are generally more important than spatial factors (Heino et al. 2015b; but see Heino et al. 2015a), their relative importance, however, may vary with distances between sites and the spatial extent surveyed (Heino et al. 2015b). In contrast, dispersal processes over larger time scale such as historical colonization events are often ignored, although they may play a major role for metacommunity structure (Castillo-Escriv a et al. 2017). It is also important to differentiate between the role of dispersal processes at the spatial scale of metacommunities (local communities linked by dispersal, Leibold et al. 2004) and a biogeographical scale (e.g. along a macroclimatic gradient, Gonçalves-Souza et al. 2014). In particular, studies examining patterns in community composition and diversity over larger spatial extents, e.g. across drainage basins, should also consider biogeographic patterns which are results of long-term dispersal effects (e.g. historical colonization events, historical dispersal barriers) and large-scale environmental differences (e.g. climatic gradient; Leibold et al. 2010;Heino et al. 2015bHeino et al. , 2017. The purpose of this study was to examine multiscale spatial patterns of benthic macroinvertebrates within a riverine network (hereafter referred to as 'riverine network patterns') in the Guadalupe River basin in central Texas, USA ( Figure 1). The spatial extent of the basin examined by this study allowed us to also examine larger scale spatial patterns of macroinvertebrate communities that are likely the result of biogeographic processes.
An increasing number of studies have used complex spatial analyses such as asymmetric eigenvector map (AEM) analysis for variation partitioning in metacommunity analyses of macroinvertebrates (e.g. G€ othe et al. 2013;Zhang et al. 2014;Cauvy-Frauni e et al. 2015). However, AEM analysis can also be used to identify multiscale riverine network patterns for macroinvertebrates at a larger spatial scale to identify locations, reaches, and segments of the river with a distinct macroinvertebrate community composition. This could serve as a useful starting point to discuss and further investigate potential factors that drive these patterns, instead of attributing any differences between monitoring sites to local environmental factors, and could ultimately lead to a more effective design of monitoring programs. Therefore, we addressed the following main questions: (1) What is the spatial pattern of macroinvertebrates communities in the Guadalupe Basin? Which multiscale riverine network patterns do they show? (2) Which groups of macroinvertebrates and environmental factors (local and larger scale) are associated with these patterns? (3) What is the relative importance of climatic variation, ecoregion and larger scale riverine network patterns? (4) To what extent are climatic variation, ecoregion, and riverine network patterns correlated with local environmental conditions and land use-land cover?

Study area
The Guadalupe River basin is relatively large (3256 km 2 ) and encompasses a pronounced regional physiographic gradient. Like the other Gulf coast rivers in Texas, the Guadalupe River flows from northwest to southeast, experiencing a climatic gradient with increased precipitation from west to east, and increasing temperatures from north to south (see below). Both high disturbance intensity and stable conditions occur in this basin, where flashfloods are common and where groundwater-fed tributaries with relatively consistent flow occur next to tributaries with intermittent reaches and higher variation in seasonal flow. The basin was never glaciated and also includes major springs, which are considered hotspots for endemic macroinvertebrates (Bowles and Arsuffi 1993).
The Guadalupe River basin contains portions of four level III ecoregions of North America (https://www.epa.gov/eco-research/ecoregions-north-america, Figure 1): the Edwards Plateau, the Texas Blackland Prairies, the East Central Texas Plains and the Western Gulf Coastal Plain. All four ecoregions can be characterized by homogeneity Figure 1. Sampling sites 1-28 in the Guadalupe River (green line), Texas and its tributaries, the Blanco River (pink), San Marcos River (blue), and Comal River (see insert), The four ecoregions are shown as differently colored areas. associated with both abioticsoils, vegetation, geology, climate and physiography (Omernik 1987;Griffith et al. 2004), and biotic factors, including algal coverage. The ecoregions do not follow a strict up to downstream pattern but two of the four ecoregions alternate in the middle and lower reaches of the Guadalupe (Figure 1).
The Edwards Plateau is dominated by karst limestone geology and many headwaters and stream reaches are strongly spring-influenced, containing clear water with high physicochemical stability. The Blackland Prairies is dominated by clays and silty soils and contains larger fraction of cropland and urban space. The East Central Texas Plains is largely composed of savanna and mostly used for pasture. The Western Gulf Coastal Plain is a low gradient plain that ends at the Gulf of Mexico. Rainfall varies across the basin, from a minimum of 406 mm per year in the north and western portions of the basin (Edwards Plateau) to a maximum of 1473 mm per year in the southern and eastern regions (Western Gulf Coast Plain).
Two of the major tributaries, the Comal and San Marcos rivers, are strongly groundwater influenced (i.e. these rivers are fed by large spring complexes at their headwaters) and thus exhibit stable physicochemical conditions and relatively more consistent seasonal flows. In contrast, the other tributary, the Blanco River and the upper portion of the Guadalupe River mainstem (sites 14-16 on the Guadalupe, Figure 1) exhibit much higher variation in seasonal flows with some sections of these rivers going dry during dry years or experiencing large-scale flooding during wet periods. Indeed, the Blanco River experienced large-scale and historic flooding in early May 2015 several months before we started sampling for this study.

Field data collection
We collected macroinvertebrates and local environmental data at 28 sites across the basin between 15 July and 5 October 2015 (each site sampled once). Sites were located in the main tributaries of the Guadalupe River including the Comal River (2 sites), the San Marcos River (4 sites), and the Blanco River (7 sites); the remaining 15 sites were distributed along the mainstem of the Guadalupe River stretching over 492 river kilometers from the most upstream (site 14, Figure 1) to the most downstream site (site 28, Figure 1, Table S1). Each site was sampled once across the mid-and late-summer growing season. In order to minimize the effects of temporally declining flows affecting local conditions across the basin during the sampling interval, lower discharge and lower stream order sites in the drainage were sampled at the beginning of the study period. Over the course of the study, flows across the basin generally declined, with discharge at the furthest downstream site (Guadalupe River at Victoria, Texas; USGS Station ID 08176500) receding from 52.9 to 18.5 m 3 /L (x ± 1 SD = 29.1 ± 9.2 m 3 /L) over the 82-d sampling interval (https://waterdata.usgs.gov/tx/nwis/rt).
Prior to collecting macroinvertebrate samples at each site, we measured pH, water temperature, dissolved oxygen (DO; mg/L), and conductivity (lS/cm) using a multiparameter probe (YSI 556). Water velocity immediately upstream from each sample point in a riffle was measured with a Hach flow meter (FH950). The percent sediment size composition at each sampling point was estimated using a modified Wentworth scale (Wentworth 1922) and percent algae cover was estimated using an underwater viewing window. Duplicate water samples were taken at each sampling location using 2-L brown Nalgene bottles which were rinsed three times with site water before sample collection. Water samples were placed in a cooler on ice and transported to the lab at the Texas State University, where samples were filtered and preserved within 48 hours of collection.
Local environmental conditions such as substrate type and composition and water velocity are known to affect macroinvertebrate community composition within a given sampling site, (Allen 1995). We collected invertebrate samples from haphazardly placed locations within riffles at each sampling site, which are the most ideal mesohabitat to sample when evaluating macroinvertebrates since it consistently contains higher diversity Brussock 1991, Barbour et al. 1999) and many environmental monitoring programs focus on macroinvertebrates from riffles (Carter and Resh 2001).
Macroinvertebrate samples were collected using a 500-lm Hess sampler (35-cm diameter). This mesh size is commonly used in biomonitoring programs, although it misses smaller benthic organisms, especially early stages of many macroinvertebrates. At each sampling site, four Hess samples were collected from within a riffle area, except for 5 of the 28 sites (3 Hess samples at sites 21, 24, 25 and 28, and 2 Hess samples at site 7). To account for differences in the number of Hess samples, macroinvertebrate densities for each taxon were expressed as number of individuals/m 2 . During sampling, substrate was agitated for a 2-minute interval and samples were preserved in 90% ethanol (EtOH) for processing in the laboratory. Macroinvertebrate samples were identified under a stereomicroscope (Nikon SMZ745T) to the lowest practical taxonomic level (typically genus) using relevant several taxonomic keys (Merritt et al. 2008, Diaz 2014. A total of 59 macroinvertebrate taxa were identified, including six non-insect taxa (Table S2). Non-insect taxa were identified to order and all other taxa were identified to genus, except Diptera, which were identified to family.

Land cover data
Land cover data were downloaded from the United States Geological Survey and overlaid on sample site locations in ArcGIS v10.4 using the National Land Cover Database (NLCD 2011 version). Land use-land cover (LULC) was determined as percent composition among 20 categories: developed open space, developed low intensity, developed medium intensity, developed high intensity, open water, perennial ice/snow, barren land (rock/sand/clay), deciduous forest, evergreen forest, mixed forest, dwarf scrub, shrub/ scrub, herbaceous grassland, herbaceous sedge, lichens, moss, pasture/hay, cultivated, woody wetlands and emergent herbaceous wetlands (NLCD 2011 Product Legend; https://www.mrlc.gov/nlcd2011.php). Three spatial scales of LULC for each sampling site were examined based on Allan (2004) and Becker et al. (2014): (1) a reach scale with land cover in a 100-m buffer on either side of the river with a 2 km buffer upstream from each site; (2) a riparian scale with land cover in a 100-m buffer for total distance upstream for each site; and (3) a catchment scale with land cover for the whole watershed upstream of the site. We followed the procedure outlined in Becker et al. (2014) to combine and reduce LULC into eight categories: urban, cultivated, evergreen forest, deciduous forest, mixed forest, rangeland, wetlands and open water. Barren land was removed from any analyses because it made up <1% of the coverage area (Dodds andOakes 2008, Becker et al. 2014). Ecoregions for each site were based upon USEPA Level-III Ecoregions, downloaded from the EPA (Griffith et al. 2004), and overlaid across the Guadalupe River Basin in ArcGIS. Estimates of river slope were generated using a digital elevation model (DEM), and river distances between sites were evaluated by using a river network map in ArcGIS. Mean annual precipitation data for each site were obtained from Texas Parks and Wildlife Department and reported as the annual mean during the 2000-2010 period.

Data analysis
Prior to analysis, values obtained from duplicate water samples for each analyte from each site were averaged. To avoid issues with multicollinearity in analyses, variables which were highly correlated (r > 0.70) were removed from the dataset. Mean, maximum, minimum and point slope estimates for each site were highly correlated, so only site mean slope was used in further analyses. TSS and NVSS were also highly correlated, thus TSS was used in further analyses. A Pearson correlation matrix for each group of predictor variable data set revealed that the riparian and catchment scales for LULC percent coverage were highly correlated for nearly all variables and the riparian LULC scale were removed from further analyses (Becker et al. 2014).
Twenty taxa were excluded from analysis because they contained <5% of taxa at all sites (Zhao et al. 2017). To address question (1) and to evaluate potentially complex multiscale spatial patterns within the river network, we used an AEM analysis, a spatial modeling technique that considers autocorrelation at different spatial scales. AEM analysis was developed for ecosystems such as rivers in which directional physical processes (water currents) can affect the distribution of organism (Blanchet et al. 2008a). We computed Eigenfunction-based spatial variables (eigenvectors) from a directional downstream distance matrix accounting for connectivity between sites (Blanchet et al. 2008a), and assigned weights to the edge matrix based on watercourse distances in kilometers between sites. A forward selection procedure was used to reduce the number of spatial eigenvectors to predict the variation in community composition (Blanchet et al. 2008b). We also used Moran's eigenvector map (MEM) analysis  to model hydrological connections between sites without considering the direction of the flow. However, the results were very similar to the AEM-analysis and were therefore not included here.
To evaluate the association of differences in community composition with higher densities of certain groups and with environmental factors (local environmental factors, LULC and climatic variables) (question 2, Table S3) we used redundancy analyses (RDA). To examine the relative importance (question 3) of climatic variation (water temperature, precipitation), ecoregion and large-scale riverine network patterns (AEM variables), we used variation decomposition based on redundancy analysis (RDA, Cottenie 2005). To focus on larger scale patterns, we only used the first three large-scale AEM variables selected by the forward selection (see above). The computed percentage of explained variation was adjusted for the number of explanatory variables (i.e. adjusted R 2 , Peres-Neto et al. 2006). The dependent abundances of macroinvertebrates (genera or family) were Hellinger transformed to minimize the disproportional influence of rare species on the redundancy analysis (Legendre and Gallagher 2001). To determine to what extent ecoregion, climatic variation, and large-scale AEM variables were correlated (and how much variation they shared) with local environmental conditions and land use-land cover (question 4), we ran pairwise variation decomposition based on RDA (see above), i.e. ecoregion vs. local environmental conditions, and ecoregion vs. land use-land cover variables, and the same for climatic variation and large-scale AEM variables. Comparisons among more than two groups of variables (e.g. variation decomposition with AEM variables, local environmental factors and land use-land cover) were not possible, because of high correlation between the variables. All analyses were done in R (R Development Core Team, 2017, version 3.4.0) using the package vegan (Oksanen et al. 2017).

Results
Multiscale riverine network patterns and associated macroinvertebrates and environmental factors (Questions 1 and 2) The analysis of AEM variables resulted in nine significant variables (Figure 2), which explained 52% (p ¼ 0.001) of the variation in macroinvertebrate community composition across the Guadalupe Basin. Most of the AEM variables represented large-scale spatial patterns (variables V1 to V6; Figure 2). Overall, the analysis revealed four notable patterns which were associated with different genera and environmental factors. First, there was a unique macroinvertebrate community composition in the lower portion of the Guadalupe River after the confluence with the San Marcos River (V1; Figure 2), characterized by higher densities of the mayfly genus Traverella, the predatory stonefly Neoperla, and the riffle beetle Hexacylleopus (Figure 3a). This spatial pattern largely corresponded with the climatic gradient in the basin, with greater precipitation (and higher water temperatures, Figure 3b) in the lower portions of the basin (Figure 3b). These changes in community structure in the lower portion of the Guadalupe River also correlated with several local environmental factors: higher TSS and Chla concentrations, slightly higher pH, and higher proportion of sand in benthic substrates (Figure 3c). With respect to LULC patterns in the basin, these taxonomic changes were also correlated with an increase in the percent coverage of wetlands and agriculture at the catchment scale (Figure 3b).
The second riverine network pattern was associated with the spring -influenced reaches along the Comal and San Marcos rivers (V21, Figure 3a both spring sites have the highest negative values for second RDA axis). These sites exhibited higher densities of macroinvertebrates with lower dispersal abilities or those lacking desiccation-resistant resting stages (i.e. Hyalella, the water penny Psephenus), and the riffle beetle Microcylleopus) (Figure 3a). In these reaches, the LULC patterns had higher percentages of urban, mixed forest, and open water (presumably associated with the headwater spring complexes).
A third distinctive community type in the basin was found along the Blanco River characterized by high abundances of the net spinning caddisfly Chimarra (variables V2, V6 and partly V4; Figures 2 and 3(a)). Local environmental conditions at these sites tended to have greater water depths and a higher proportion of cobble in benthic substrates (Figure 3(c)). Blanco sites also had slightly a higher proportion of ranchland use (up to 72%, compared to up to 60% elsewhere in the basin, Figure 3(b)).
The last major community type was associated with sites located within the Texas Blackland Prairies ecoregion (several sites in the lower San Marcos, the upper Guadalupe, and the lower Guadalupe rivers, compare large open squares just before and after the bigger confluence in V5, Figure 2 with blue Texas Blackland Prairies ecoregion in Figure 1). Although there were no obvious associations with environmental factors (Figure 3), there were increased densities of the mayflies Baetis and Leptohyphes at the sites located in this region.
Other riverine network patterns that were detected included a different pattern around the confluence of the San Marcos with the Guadalupe River (V3; Figure 2), and another smaller scale pattern with the most upstream reaches of the upper Guadalupe being different (V11; Figure 2).

Relative importance of climatic variation, ecoregion, and larger scale riverine network patterns (Question 3)
Large-scale AEM variables (the first three variables) explained 12% of the variation in community composition across the Guadalupe Basin after eliminating the shared effects of the other factors climate and ecoregion (i.e. pure effects). The pure effects of climate variation and ecoregion were 4% and 3%, respectively (Figure 4). In addition, ecoregion and AEM variables shared 7% of the variation, and all variables shared 5% of the variation. Due to multicollinearity between the factors, the total amount of variation explained was 28% instead of 31%.

Correlation with local environmental and LULC factors (Question 4)
Pairwise variation partitioning in RDA indicated that that ecoregion shared more variation with LULC factors (15%) than with local environmental factors (7%). Similarly, climatic factors shared a small proportion of the variation with LULC factors (7%), but none with local environmental factors. Not surprisingly, both LULC factors and local environmental factors were spatially structured. Local environmental factors shared 28% (all variation) with (large-scale) AEM variables, and LULC factors 22%.

Discussion
We found that a considerable portion of the variation in community composition of macroinvertebrates in the Guadalupe River basin was explained by variation in larger scale factors (i.e. climatic variation, ecoregion and riverine network patterns). Such factors are not routinely included in monitoring programs, despite findings that such large-scale patterns are important (e.g. Feminella 2000;Mykr€ a et al. 2004). Not surprisingly, part of the variation in climate and ecoregion in the Guadalupe basin were spatially structured (i.e. shared variation with large-scale AEM variables), but both climate and ecoregion also explained a small amount of the variation in community structure on its own. The pure effects of large-scale spatial variables were most important and all significant AEM variables combined explained $50% of the variation in macroinvertebrate community structure. This result suggests that the location of a site within a river network (e.g. the presence of spring influenced reaches, confluence points in the network) and historical and current connectivity were more important in determining macroinvertebrate community composition than the physiographic gradients across the basin. The importance of local environmental factors for metacommunity structuring has been shown by many metacommunity studies in rivers (Heino et al. 2015b; but see Heino et al. 2015a), suggesting that species sorting is the prevalent dynamic in rivers. It is important, however, to consider the spatial extent encompassed by a study and the distances between sites in relation to the dispersal abilities of the study organisms as different processes may act on different scales. For example, a study on neotropical lepidopterans and spiders found that environmental effects dominated at the metacommunity scale, whereas at the biogeographical scale dispersal-based processes were more important (Gonçalves-Souza et al. 2014). The biogeographic scale was also considered in a study on aquatic organisms across drainage basins in Finland, which showed that basin identity and local environmental variables were both important for community structure, whereas the spatial effects within a basin were usually negligible (Heino et al. 2017, area of three drainage basins: 63,609 km 2 .). In contrast to this, our study found that large-scale spatial patterns can also play a role within a basin (area: 3256 km 2 ). Unfortunately, we could not determine the relative importance of local environmental factors, as all local environmental factors were spatially structured (local environmental factors shared all explained variation with (large scale) AEM variables, see above). In order to avoid the local environmental factors to be completely confounded with large-scale spatial patterns, a different sampling design would have been needed with additional sites being placed closer to each other. It is also possible that local factors would explain additional variation in the distribution of macroinvertebrates, when more specific requirements of organisms were included in analyses, such as food availability and prey presence. For freshwater mussels (technically also macroinvertebrates, but usually not included in routine biomonitoring), the inclusion of a moving niche component, that is, the presence/absence of their host fish, they need for successful recruitment, explained substantially more of their distribution across river basins in Ontario, Canada (Schwalb et al. 2013). Maybe the biggest obstacle is the scarce information available about life history of many macroinvertebrates, hence more life history studies of macroinvertebrates are needed.
The AEM-variables revealed interesting multiscale riverine network patterns in macroinvertebrate community composition, which indicated a distinct community composition for the lower portion of the basin (lower Guadalupe mainstem), the spring-influenced reaches, the Blanco River, and the different ecoregions. The different community composition in the lower Guadalupe mainstem was associated with local environmental factors, but also larger scale climatic factors, which are usually associated with a biogeographical spatial scale. In particular, the plecopterans (mostly the genus Neoperla) were present in the lower mainstem Guadalupe sites, but were largely absent from sites extending up into the Edwards Plateau. The large-scale southeast-northwest spatial occurrence pattern of plecopterans in the study watershed is consistent with previous descriptions of plecopterans distributions in the region. This is likely related to precipitation patterns and its influence on stream permanence and the past biogeographic dispersal patterns of plecopterans in the region (Stewart et al. 1973;Szytko and Stewart 1977). In addition, larger scale differences in water quality and food availability likely influence the occurrence of species in the lower Guadalupe maintstem. The lower mainstem has higher suspended Chl-a and higher TSS concentrations, indicating greater food availability for filter feeders such as Traverella, which also occurred in higher abundances in the lower Guadalupe.
The spring influenced reaches of the basin were characterized by macroinvertebrate communities containing species with lower dispersal abilities or an inability to tolerate periods of desiccation, such as amphipods (Hyalella azteca) and smaller bodied riffle beetles (Microcylloepus pusillus). It is also noteworthy that the proportion of shredders was considerably higher near Spring Lake in the San Marcos River and near Comal Springs compared to the rest of the basin (0-3%, see Zawalski 2017 for more details). Diversity is typically higher in lower reaches of river networks (Altermatt et al. 2013) and that holds true for freshwater fishes and unionid mussels in Texas (Dascher et al. 2018). However, we found the highest diversity of macroinvertebrates near springs (Zawalski 2017). Spring-influenced reaches of riverine networks, especially in arid areas, play an important role as ecological and evolutionary refugia (Davis et al. 2013) in that some species found in these reaches may only be able to persist in these refugia during extended or severe drought periods. As such, protection of these segments from anthropogenic impacts, including restriction of groundwater pumping from regional aquifers provides crucial refuge and protection for these spring-associated and dispersal-limited taxa (Bowles and Arsuffi 1993) as well as maintaining locations of higher species diversity in the landscape.
Unmeasured factors such as flow permanence may be an important driver for the distinctive community in the Blanco River. Several reaches (at least 3 sites) were dry in 2014 and all of the sites in the Blanco River experienced an especially large flooding event in 2015, shortly before the sampling. The communities in this river were characterized by high abundances of Chimarra, which could be especially resilient to such disturbance as it was found in higher abundances in unstable substrate in the Ard eche river, France (when compared to other net-spinning caddisflies, Dol edec and Tachet 1989). A higher concentration of suspended food sources could also be a factor. Algal growth is enhanced in pools that become isolated or when flow is decreased. In addition, when intermittent reaches go dry, terrestrial organic matter can accumulate and aquatic plant material be decomposed and providing a high input of organic matter when these reaches are reflooded (Williams 2006), which could be a good food source for collectors such as net building caddisflies. Indeed, the majority of collector-filterers in the Guadalupe basin can be found in the Blanco River as well as the headwaters of the Guadalupe and Comal Rivers (Zawalski 2017). The intermittent reaches may also be a good habitat for macroinvertebrates with high colonization potential. Interestingly, 99% of the genera present in the intermittent reaches in the Blanco River (i.e. dry in 2014) had winged adult stages. The average for all sites in the Blanco River was 96% compared to the rest of the basin with 82% not including the spring sites. At spring sites the percentage of winged results were the lowest (13% and 37% for San Marcos and Comal springs, respectively).
It is well known that differences in regional species pools must be considered for biomonitoring and the development of biotic and multimetric indices, especially if they are based on biological attributes of species instead of functional metrics (Pont et al. 2006). Our data suggest that biogeographic differences can play a role not just between basins (e.g. Heino et al. 2017), but also within river basins. Furthermore, our results indicate that the distribution of macroinvertebrates may also depend on the location in the river network (near springs, lower reaches, tributary with intermittent reaches). The importance of location in the river network for the distribution of macroinvertebrates and metacommunity structuring has been previously shown, e.g. with focus on the arrangement of tributaries (Rice et al. 2001) and river network properties (Altermatt et al. 2013), and comparing headwaters and mainstem (Brown and Swan 2010). Based on our results, we predict that metacommunity structure and dynamics in a subtropical river network will vary because of different disturbance levels found in different parts of the river network (Table 1). The environmental conditions are most stable close to the springs, which allows macroinvertebrates with low dispersal abilities and those unable to tolerate periods of desiccation to become abundant. As these springs are evolutionary refugia species sorting should still occur despite limited dispersal over a sufficiently large time-scale. Such a species sorting with limited dispersal (sensu Winegardner et al. 2012) assumes a tradeoff between competition and dispersal, and the abundant low dispersal macroinvertebrates will co-exist with high dispersal species (Table 1). In contrast, in intermittent reaches with high disturbance levels due to drying and flash flooding, only macroinvertebrates with high dispersal abilities become abundant. The community composition is determined by colonization after disturbance events, followed by succession driven by local environmental condition and biotic interactions (i.e. species sorting with high dispersal, Table 1). However, the relative importance of dispersal vs. environmental filtering may vary with hydrological phases of dry, flowing, and non-flowing conditions in intermittent systems (Datry et al. 2016). Finally, the distinct community in the lower mainstem of the Guadalupe is subject to an intermediate disturbance level, where the community composition is mainly determined by abiotic conditions (species sorting with efficient dispersal, Table 1), but probably also by biogeographic patterns. The frequency of high-vs. low-dispersal groups of macroinvertebrates (e.g. percentage of winged adults) reflects the relative frequency found in the entire basin (Table 1). We propose that the changes we observed in macroinvertebrate communities are predictable based on the location in the river network and may apply to other (subtropical) rivers, which should be tested by future studies.
It has been postulated that ecologists should also consider distances along the river network (Heino et al. 2015b), and newly developed statistical methods allow for such a spatial analysis in river networks (Legendre and Legendre 2012). Spatial variables considering watercourse distances (i.e. AEM variables) have been used increasingly in ecological studies, but have not yet gained popularity in more applied studies. We recommend spatial analyses that consider distances and connectivity as a powerful tool to recognize multiscale riverine network patterns, and which may otherwise go undetected. For instance, a  Winegardner et al. 2012), and associated genera in relation to disturbance levels in different parts of the river network for which a distinctive community was detected in the Guadalupe basin. survey throughout the basin will be necessary to identify distinct communities, and those differences will then need to be considered when monitoring human impact. In addition, such an analysis can help to identify priority areas for conservation and management plans (such as spring -influenced reaches, see above). Our example uses macroinvertebrates, but it could be easily applied to fish or other groups of organisms. Using multiscale spatial analysis also helps to identify the relative importance of processes at different spatial scales, and may indicate mechanisms responsible for these patterns. Thus, it would be an important step for designing an effective monitoring program, detecting human impact, and developing mitigation plans.