Strength-weighted flow cluster method considering spatiotemporal contiguity to reveal interregional association patterns

ABSTRACT One of the most crucial topics in spatial interaction studies is mining patterns from extensive origin-destination (OD) flow data to capture interregional associations. However, prevailing methodologies tend to disregard the importance of using the relative closeness of interregional connections as weights, treat spatial and temporal dimensions independently, or overlook the temporal dimension completely. Consequently, the identified patterns are susceptible to inaccuracies, and the precise identification of pattern occurrence time and duration, despite their fundamental importance, remains elusive. In light of these challenges, this study proposes a strategy to calculate and combine the strength of weighted spatiotemporal flows, and develops a clustering method and evaluation metrics based on this framework. Compared to alternative density-based methods, the strength-based calculation approach demonstrates a capacity to identify flow patterns characterized by relatively high interregional closeness. Thus, the identification of flow patterns expands beyond density-based approaches, encompassing strength-based considerations and a shift from absolute to relative closeness between regions. Experiments using synthetic datasets conducted in this research demonstrate the effectiveness, efficiency, and extraction accuracy of the proposed method. Furthermore, a case study using real Chinese population migration data demonstrates the efficacy of the method in revealing implicit spatiotemporal association patterns between regions. The present study implements an interaction strength-based flow clustering and evaluation method that considers spatiotemporal continuity, making it applicable to spatial flow data analysis involving interaction volume and time attributes. As a result, this method holds promise for facilitating the modeling of intricate spatial flows within various contexts of study.


Introduction
Geographers have shown growing interest in investigating spatial interaction through social flow data, driven by advancements in information communication and Internet of Things technologies, which have generated diverse spatiotemporal flow data (Emch et al. 2012; Lu et al. 2016). The analysis of spatial flows encompassing human, vehicular, logistic, and information flows has assumed an increasingly significant role in the examination of geospatial phenomena, including regional association patterns, spatial network structures, and spatial diffusion processes (Andris, Liu, and Ferreira 2018; Giordano, Cole, and Le Noc 2022; Ye and Andris 2021). In the study of spatial interaction within social flows, scholars in the field of geographic information particularly emphasize the innovation and development of methodologies for extracting comprehensive insights from flow data. Notably, one of the key research domains focuses on the identification of regional association patterns by clustering OD (origin-destination) flow data.
Methods for extracting OD flow patterns are diverse. They include clustering methods based on one or more attributes of origin, destination, and flow direction (Bogataj, Bogataj, and Drobne 2019; Guo et al. 2021); clustering methods for identifying network communities (Chen, Xu, and Xu 2015; Wang, Wang, and Onega 2021); and methods for extracting spatial interaction flow patterns between regions (Kim et al. 2014). Among these methods, interregional movement pattern clustering is the most distinctive, being applicable to identifying regional association patterns, recognizing functional areas from a flow space perspective, and more. Various interregional movement pattern clustering methods and applications have been proposed and developed.
However, these methods primarily reflect flow density, overlooking the importance of using the relative closeness of interregional connections as weights. No interregional pattern extraction method incorporates the time dimension based on weighted flows. Weighted flow patterns reflect the strong heterogeneity of interaction strength between regions rather than the heterogeneity of interaction density. Existing density-based flow pattern extraction methods cannot be directly extended to extracting spatial flow patterns from weighted flows or to extracting spatiotemporal flow patterns. Therefore, this study proposes a spatiotemporally contiguous clustering approach for origin-destination flows weighted by interaction strength, to address this gap in existing research.
The primary innovations and contributions of this study are: (1) We propose a weighted origin-destination flow model to measure interregional interaction strength and an identification algorithm that discovers patterns formed by regions with strong relative flow associations. This represents a new perspective for identifying meaningful flow patterns, in contrast to prevailing density-based techniques that rely solely on absolute density thresholds to detect regions of high flow frequency. (2) Our flow pattern identification method accounts for spatiotemporal continuity, identifying the exact occurrence time and duration of each flow pattern.
This paper first explains the concepts related to spatiotemporal flows weighted by interaction strength (Section 3.1), together with the challenges of moving from density-based to strength-based and from spatial to spatiotemporal OD flows when extracting regional association patterns, and the problems addressed (Section 3.2). It then focuses on the algorithm logic and specific implementation of interaction strength-based spatiotemporal flow pattern identification (Section 4). The algorithm logic for identifying flow patterns is described (Section 4.1). Various characteristic statistical variables are constructed to elucidate multiple meanings of the flow pattern (Section 4.2). Flow pattern evaluation indicators from multiple perspectives are proposed (Section 4.3). Finally, the validity, accuracy, and application value of the proposed method are verified using synthetic and real datasets (Section 5).

OD flow-based spatial interaction analysis algorithm
OD flow pattern extraction methods have long played an important role in spatial interaction studies. Compared with other types of spatial features in GIS, such as points and areas, OD flow data are richer in characteristics and more complex in structure, so their pattern extraction methods are more difficult to implement and more diverse in form (Andrienko et al. 2017; Andris, Liu, and Ferreira 2018). Generally speaking, OD flow pattern extraction methods can be divided into three types according to the target object of clustering: methods based on the origins or destinations of OD flows, methods based on OD flow units, and methods based on networks composed of OD flows.
(1) The clustering method based on OD flow origins and destinations primarily considers origins and destinations separately. One approach clusters origins and destinations separately, then analyzes the association between the resulting origin and destination clusters (Pei et al. 2009, 2015; Wan et al. 2012).
Another approach clusters origins and destinations simultaneously, effectively identifying multiple cluster types formed by combining origin and destination regions of varying densities (Luo, Cats, and van Lint 2017; Randriamanamihaga et al. 2014). However, such methods disrupt the overall OD flow structure and weaken or ignore relationships between the origins and destinations of individual flows.
(2) The clustering method based on OD flow units treats each OD flow unit as a whole, identifying direction-based patterns such as convergence, diffusion, and co-direction (Guo et al. 2020; Van Nuffel 2007). These methods also group flows with similar directions, origins, and destinations into flow clusters. Aggregating and visualizing raw OD flow units using map generalization techniques provides another approach to flow analysis (Graser et al. 2019; Koylu, Tian, and Windsor 2023). However, although some methods treat each OD flow unit as a whole, they ignore location, time, and other attributes inherent to flows during clustering. These should not be considered spatial clustering methods for OD flows (Nie et al. 2015; Zhang et al. 2016).
(3) The network analysis method based on OD flows treats all flow units as a single analytical object to construct a spatial complex network. Non-spatial or spatial clustering of the flow network is then achieved using association partitioning methods without or with spatial constraints (Crivellari et al. 2022; Gao et al. 2013; Xu, Santi, and Ratti 2022). This divides the nodes of the flow network into multiple categories. These methods effectively explain the structure of regions (Louail et al. 2015). However, the diverse associations between regions remain difficult to ascertain.

Interregional flow pattern in flow pattern mining algorithm family
In the last decade, identifying flow patterns by clustering interregional OD flows from large OD datasets has gained attention. These methods analyze raw flows using clustering, optimization, statistics, and other algorithms or map synthesis, yielding origin and destination regions of arbitrary shapes with directional interrelationships. Kim et al. (2014) pioneered this clustering method, termed MZP. Chen et al. proposed an improved algorithm, MPFZ, to cluster subway, taxi, and other flow data and extract interregional movement patterns (Chen et al. 2022; Liu et al. 2022). Subsequently, interregional movement pattern algorithms based on intelligent optimization and probabilistic methods were developed and applied to analyzing residential mobility patterns in cities.
Based on probability calculation and clustering, Zhou et al. completed two studies. First, they introduced road network constraints into the mining model to identify interregional movement patterns (Zhou et al. 2019). Second, they developed a flow pattern identification method that accounts for variable OD flow densities (Zhou et al. 2019). Yao et al. (2018) added road network and K-function constraints to recognize interregional movement patterns subject to road network and proximity constraints. Song and Liu proposed an interregional flow pattern mining approach based on intelligent optimization and incorporating shared nearest neighbors (Liu et al. 2022; Song et al. 2019). These methods differ in their implementation but are similar in form: they extract interregional movement patterns from unweighted OD flow sets.

Flow pattern mining algorithm with introduction of weight or time
Characteristics other than OD flow origins, destinations, and directions, such as flow weights, temporal dimensions, and other attributes, have gained attention and been incorporated into interregional interaction pattern mining. Regarding flow pattern extraction considering weight, Zhang et al. introduced OD flow weight so that results reflected flow strength rather than density (Zhang et al. 2018). However, their flow unit merging method was flawed, compromising result accuracy. Tao and Thill (2016) used spatial statistical methods to construct an empirical spatial flow weight matrix, identifying anomalous interaction regions as clusters of very high or low flow values. Although their approach still relied on flow unit counts, its key contribution was obtaining statistically significant flow patterns through spatial statistics. For flow pattern mining considering time, time was introduced as a flow attribute, and spatiotemporal flow pattern extraction methods reflecting flow density over space and time were proposed (Zhou et al. 2019; Yao et al. 2018). Other approaches include origin-destination-time (ODT) matrices (Andris et al. 2018), time series-based origin-destination flow prediction (Hasanpour Jesri and Shirazi 2022), and areas of interest over time (Zhang, Liu, and Wang 2019). However, these methods ignored flow locations and were not spatial flow pattern extraction methods. Methods for extracting interregional flow patterns that account for weight or time remain limited.

Definition 1 (Weighted spatiotemporal OD flow):
A weighted spatiotemporal origin-destination flow unit […]
The spatial distance between flow units can be defined in various ways. The spatial distance defined here aims to determine the spatial proximity relationship between polygon elements and, from it, whether flow units are spatial neighbors. For polygon elements, spatial proximity can be determined directly from topological relationships or by other means, such as k-nearest neighbors and distance-based affected regions. To clarify, the spatial proximity between two flow units is illustrated through a simple example. In this study, the principle that two polygon elements sharing a border are spatial neighbors is adopted. Thus, because the origins $f_1^o$ and $f_2^o$ are spatial neighbors, and the destinations $f_1^d$ and $f_2^d$ are also spatial neighbors, the flow units $f_1$ and $f_2$ are spatial neighbors, as shown in Figure 2(a). Similarly, flow units $f_3$ and $f_4$, and $f_5$ and $f_6$, are spatial neighbors. As shown in Figure 2(b), the origins and destinations of any two flow units are not proximal, because $f_1^{\prime d}$ and $f_2^{\prime d}$, $f_3^{\prime o}$ and $f_4^{\prime o}$, $f_3^{\prime d}$ and $f_4^{\prime d}$, and $f_5^{\prime o}$ and $f_6^{\prime o}$ are not spatial neighbors. Therefore, in this study, we define the spatial distance between OD flows as follows:
Definition 2 (Spatial distance between OD flows)
For any two OD flows $f_i$ and $f_j$, the spatial distance between them can be defined as: […] (1) where $dist(f_i^{\ast}, f_j^{\ast})$ represents the spatial distance between $f_i^{\ast}$ and $f_j^{\ast}$. When the origins $f_i^o$ and $f_j^o$ have a shared border, […] In practical applications, other principles for spatial proximity, such as k-nearest neighbors, may be used. However, we introduce only one common principle in this study.
Definition 3 (Temporal distance between OD flows)
Similar to the definition of spatial distance, the temporal distance (time interval) between any two OD flows $f_i$ and $f_j$ can be defined as […] where, for flow units $f_i$ and $f_j$, […] Here $\tau$ is the threshold value of temporal distance. Following the definitions above, the spatial distance and time interval between any two OD flows, and hence the spatiotemporal distance between any two flow units, can be determined.
Definition 4 (Spatiotemporal proximity between OD flows)
When the threshold value $\tau$ of the time interval is given, and the spatial proximity of the origins and destinations of flow units is defined on the basis of a shared border, any two flow units $f_i$ and $f_j$ are spatiotemporally proximate only if they meet two conditions: $SD(f_i, f_j)$ […] Two weighted flow units exhibit spatial "colocation" and temporal "synchronicity," i.e. spatiotemporal proximity in the context of this study, when they satisfy the spatiotemporal proximity defined in section 3.3.2. The merging models proposed by Kim and Zhang cannot be applied to the strength-weighted spatiotemporal flows in this paper: the former is oriented to density, the latter to spatial rather than spatiotemporal flows, and their merging accuracy is low (Hasanpour Jesri and Shirazi 2022; Kim et al. 2014; Zhang et al. 2018).
Definition 5 (Spatiotemporal neighborhood of OD flow)
For a given flow unit $f_i$, the set of flow units with a spatiotemporal proximity relation to this flow unit can be defined as: […] All flow units in $NF(f_i)$ are neighbors of $f_i$.
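The neighborhood construction in Definitions 2 to 5 can be illustrated with a minimal Python sketch. It is not the authors' implementation. The `Flow` record, the `adjacency` dictionary, and the function names are hypothetical. The sketch also makes two assumptions that the text does not settle: a region counts as proximal to itself, and temporal proximity requires both the departure times and the arrival times to differ by at most $\tau$.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Flow:
    o: int      # origin region id
    d: int      # destination region id
    t_o: float  # departure time, in minutes
    t_d: float  # arrival time, in minutes
    w: float    # interaction volume

def regions_proximal(a, b, adjacency):
    # Assumption: a region is proximal to itself and to regions sharing a border.
    return a == b or b in adjacency.get(a, set())

def st_proximate(fi, fj, adjacency, tau):
    # Spatial condition: origins are proximal AND destinations are proximal.
    spatial = (regions_proximal(fi.o, fj.o, adjacency)
               and regions_proximal(fi.d, fj.d, adjacency))
    # Temporal condition (assumed form): both time gaps are within tau.
    temporal = abs(fi.t_o - fj.t_o) <= tau and abs(fi.t_d - fj.t_d) <= tau
    return spatial and temporal

def neighborhood(fi, flows, adjacency, tau):
    # NF(f_i): every other flow unit that is spatiotemporally proximate to f_i.
    return [fj for fj in flows if fj is not fi and st_proximate(fi, fj, adjacency, tau)]

# Hypothetical toy data: regions 1-2 share a border, as do regions 5-6.
adjacency = {1: {2}, 2: {1}, 5: {6}, 6: {5}}
f1 = Flow(1, 5, 0, 20, 10)
f2 = Flow(2, 6, 10, 30, 8)    # adjacent in space and close in time
f3 = Flow(2, 6, 90, 110, 8)   # adjacent in space, too far in time
f4 = Flow(1, 9, 0, 20, 3)     # destination 9 is not adjacent to 5
flows = [f1, f2, f3, f4]
nf_f1 = neighborhood(f1, flows, adjacency, tau=30)
```

With these toy values only `f2` falls inside the neighborhood of `f1`. Swapping `regions_proximal` for a k-nearest-neighbor rule would change the spatial condition without touching the rest of the sketch.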
Definition 6 (Strength-weighted OD flow)
OD flow strength describes the closeness of the association between the origin and destination of a flow. Strength is related not only to the interaction volume of the OD flow itself but also to the other flows leaving the origin $f_i^o$ at time $t_i^o$ and the other flows arriving at the destination $f_i^d$ at time $t_i^d$. It is calculated as follows:
$$ST(f_i) = \frac{f_i^w}{O_i^w} \times \frac{f_i^w}{D_i^w} \quad (4)$$
where $f_i^w$ is the interaction volume of $f_i$, $O_i^w$ is the total interaction volume leaving its origin, and $D_i^w$ is the total interaction volume arriving at its destination.
The flow density between origin and destination regions reflects the absolute closeness of their association. For example, in the flows shown in Figure 3(a,b), the number of flow units from region 1 to 2 is 200, while that from region 3 to 4 is 500, indicating that the latter density is greater and that the association between regions 3 and 4 is closer in absolute terms. However, in Figure 3(c,d), if the flow volume between origin and destination regions is regarded as the weight of a single flow unit, the flow strengths of $f_1$ and $f_2$ can be calculated using Equation (4). In Figure 3(c), the weighted flow $f_1$ is far more important for origin region 1 than the other flows leaving that region, accounting for $f_1^w / O_1^w = 0.7143$ of the outflow. Its importance for destination region 2 is also far greater than that of the other inflows, accounting for $f_1^w / D_1^w = 0.6897$. Similarly, the weighted flow $f_2$ accounts for $f_2^w / O_2^w = 0.4762$ and $f_2^w / D_2^w = 0.5882$ of the volumes at its origin and destination, respectively. Finally, the strengths of $f_1$ and $f_2$ are calculated as $ST(f_1) = 0.4926$ and $ST(f_2) = 0.2801$, respectively. Although the density (absolute closeness) of $f_2$ is far greater than that of $f_1$, the flow strength (relative closeness) of $f_1$ is greater than that of $f_2$, indicating that the association between the origin and destination regions of $f_1$ is closer; it is the strongest in the local region where this flow unit is located.
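The worked numbers in Definition 6 are consistent with strength being the product of a flow's share of its origin's outflow and its share of its destination's inflow (0.7143 × 0.6897 ≈ 0.4926 and 0.4762 × 0.5882 ≈ 0.2801). The Python sketch below reproduces them under that reading. The competing flows and their volumes are hypothetical, chosen only so that the shares match the reported ratios, and all flows are assumed to fall in the same time slice.

```python
from collections import defaultdict

def flow_strengths(flows):
    """Strength of each flow as (share of origin outflow) * (share of destination inflow).

    flows: list of (origin, destination, volume) tuples, all assumed to belong
    to the same time slice.
    """
    out_total = defaultdict(float)  # total volume leaving each origin
    in_total = defaultdict(float)   # total volume arriving at each destination
    for o, d, w in flows:
        out_total[o] += w
        in_total[d] += w
    return {(o, d): (w / out_total[o]) * (w / in_total[d]) for o, d, w in flows}

# Hypothetical volumes: only the 200 and 500 come from the text.
flows = [
    (1, 2, 200),   # f1: region 1 -> region 2
    (1, 7, 80),    # other outflow of region 1  (O_1 = 280)
    (8, 2, 90),    # other inflow of region 2   (D_2 = 290)
    (3, 4, 500),   # f2: region 3 -> region 4
    (3, 9, 550),   # other outflow of region 3  (O_3 = 1050)
    (10, 4, 350),  # other inflow of region 4   (D_4 = 850)
]
strengths = flow_strengths(flows)
print(round(strengths[(1, 2)], 4), round(strengths[(3, 4)], 4))  # 0.4926 0.2801
```

The flow with the smaller volume (200) ends up with the larger strength, which is the contrast between absolute and relative closeness that the definition draws.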
Definition 7 (Strength-reachability of flow unit pairs)
Any flow unit $f_j$ in $NF(f_i)$ can be part of a WST-FP only when the flow strength $P_{i,j}$ between $f_j$ and $f_i$ reaches the user-defined threshold $\delta$. Take the calculation of the flow strength between $f_i$ and $f_j$ as an example. By extending existing density-based flow pair merging methods (Kim et al. 2014; Zhang et al. 2018), we propose a strength-based flow pair merging method. Its flow strength is calculated as follows: […] where $S$ is the flow unit dataset and $(f_i^w + f_j^w)$ represents the sum of the interaction volumes of $f_i$ and $f_j$.
[…] represents the total interaction volume of the flows that leave the origin ($f_i^o$), with the start time limited to the time period […]. Similarly, […] represents the total interaction volume of the flows whose destination is $f_i^d$, with the reaching time limited to the time period […]. If $\delta$ denotes the user-defined strength threshold, flow units $f_i$ and $f_j$ are considered strength-reachable when $P_{i,j} > \delta$.

Problem definition
This paper addresses two problems, the first of which provides the basis for the second. Existing interregional association patterns reveal the absolute closeness between origin and destination regions through the density of OD flows but fail to capture potential association patterns with relatively high closeness between origin and destination regions despite relatively low absolute closeness. Therefore, the first problem to be solved can be defined as: (1) How to identify interregional association patterns with relatively high closeness from a set of volume-weighted flows?
To illustrate Problem 1, the region groups in Figure 4 are numbered RG-1 to RG-4. In Figure 4(a), there are many low volume-weighted flow units between RG-1 and RG-2. However, RG-1 has more high volume-weighted flow units flowing to other regions, so the flows from RG-1 to RG-2 make up a small proportion of the outflow of RG-1. Likewise, more high volume-weighted flow units from other regions flow into the destination region RG-2, so the flows from RG-1 to RG-2 also make up a small proportion of the inflow of RG-2. This results in a small flow strength from RG-1 to RG-2. In contrast, most flow units between RG-3 and RG-4 have a large flow strength, and these flow units are adjacent to each other, resulting in a large flow strength between RG-3 and RG-4. This forms an interregional association pattern, as shown in Figure 4(b). One of the objectives of this paper is to identify, through algorithms, such interregional association patterns with a relatively high degree of closeness.
Spatiotemporal OD flows contain not only spatial position information but also temporal information. Space and time are inseparable and should be viewed as an integral whole. Interregional association patterns extracted from spatiotemporal flows should also have spatiotemporal attributes. That is, it is necessary to obtain not only the set of flow units and the sets of origin and destination regions included in the interregional association pattern, but also the duration of the origin and destination region sets of the flow pattern. Achieving this goal constitutes the second major challenge of this paper, namely, (2) how to capture interregional association patterns with arbitrary spatial aggregation shapes and arbitrary durations by viewing the spatial and temporal dimensions of flow units as an integral whole?
To further clarify Problem 2, Figure 5 provides a specific explanation. We first set aside the complexity and challenges of mining interregional association patterns from massive OD flow data. Instead, we use a simple example to illustrate the complexity and importance of considering both the spatial and temporal proximity of flow units, as well as the serious drawbacks of existing simplistic treatments.
As shown in Figure 5(a), Layer 1 contains multiple flow units, and the entire region is divided into two classes of areal units (fine-grained and coarse-grained regions). If a flow pattern needs identification, the first task is to determine the origin and destination regions and provide a time range constraint. Since the origin-destination regions and hourly time intervals are predefined, this method has limitations in identifying more precise spatial regions and temporal periods of flow patterns. For example, FP1 is only known to occur between 6:00 and 7:00, although it may actually occur from 6:10 to 6:30. Similarly, the exact timing of FP2 within the 8:00 to 9:00 period is uncertain. Using other predefined time ranges cannot overcome this limitation, whether the ranges are small (e.g. 30 minutes) or large (e.g. 1 day). In summary, predefining spatial regions and temporal intervals inherently limits the precision in detecting origin-destination locations and timing of flow patterns.
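The FP1 example can be made concrete with a short Python sketch. The departure times below are hypothetical values placed inside the 6:10 to 6:30 window mentioned in the text, and the helper names are illustrative only. The sketch contrasts what a predefined hourly bin reports with the extent actually observed in the flow timestamps.

```python
def hourly_bin(minute_of_day):
    # Predefined one-hour interval containing the timestamp, as (start, end) in minutes.
    start = (minute_of_day // 60) * 60
    return start, start + 60

def observed_extent(times):
    # Actual temporal extent of the flow units, as (earliest, latest) in minutes.
    return min(times), max(times)

def fmt(minute_of_day):
    # Format minutes since midnight as H:MM.
    return f"{minute_of_day // 60}:{minute_of_day % 60:02d}"

# Hypothetical departure times of the flow units in FP1 (minutes since midnight).
fp1_times = [6 * 60 + 10, 6 * 60 + 15, 6 * 60 + 22, 6 * 60 + 30]

bins = {hourly_bin(t) for t in fp1_times}   # what a predefined hourly grid reports
extent = observed_extent(fp1_times)         # what the timestamps themselves show

for start, end in sorted(bins):
    print("hourly bin:", fmt(start), "to", fmt(end))
print("observed extent:", fmt(extent[0]), "to", fmt(extent[1]))
```

Shrinking the bin width narrows the reported interval but never aligns it with the observed extent in general, which is why the text argues for deriving the period from the flows rather than fixing it in advance.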
The example in Figure 5 […] regions of a flow pattern may be smaller or larger than in Figure 5(a). The duration may be a subperiod of the current hourly period or may span beyond it. This study proposes a spatiotemporally continuous clustering method to quickly and accurately detect strength-weighted spatiotemporal flow patterns (WST-FP) between OD regions of arbitrary shape over flexible time periods, without predetermining specific regions or periods.
The modifiable areal unit problem (MAUP) persists, as clustering results vary with the choice of spatial units. The introduced time dimension brings a modifiable temporal unit problem (MTUP): different time intervals affect analysis outcomes. To mitigate these effects, spatial units and time intervals should align with the characteristics of the data and the goals of the analysis. For example, in urban settings, flow cluster areas bounded by roads suit functionally homogeneous blocks better than grid cells. For time intervals, analysis objectives should guide selection. A 10-minute interval may sufficiently capture 24-hour taxi passenger flow aggregation patterns without excess sparsity or generalization. In summary, thoughtful spatial and temporal unit selection, tailored to the problem context, helps address MAUP and MTUP limitations. Concrete analysis of specific problems is needed, as real situations are complex.

Algorithm description
The process of mining all WST-FPs from massive flow units is briefly introduced here. As shown in Figure 6(a), the flow unit with the maximum strength is first selected from among all the flow units as the seed flow unit and marked as visited. Then, all the adjacent polygon elements of the origin $f_1^o$ and destination […] Lastly, any flow unit that meets the spatiotemporal proximity and flow reachability conditions is selected as the next seed flow unit and marked as visited, and the process shown in Figure 6(b-f) is repeated. As an example, […] is used as the seed flow unit and marked as visited. The process in Figure 6(g) is then entered, which returns to a process similar to that in Figure 6(b). The iteration continues until each flow unit in set WST-FP is marked as visited. Then, the flow units in FP jointly constitute a new WST-FP. Figure 6(h) is a schematic of the two WST-FPs obtained.
The pseudo-code for strength-weighted spatiotemporal flow pattern identification is shown in Figure 7, and the variables in the pseudo-code are consistent with those used in this study.
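The seed-and-grow procedure described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not a transcription of the pseudo-code in Figure 7. The function `grow_patterns` and its parameters are hypothetical; the spatiotemporal neighborhood and the pairwise strength $P_{i,j}$ are passed in as callables because their exact formulas are not reproduced here, and both are assumed to be symmetric.

```python
from collections import deque

def grow_patterns(flow_ids, strength, neighbors, pair_strength, delta):
    """Seeded growth of flow patterns.

    flow_ids:      iterable of hashable flow identifiers
    strength:      dict mapping flow id -> flow strength ST
    neighbors:     callable, flow id -> iterable of spatiotemporal neighbors
    pair_strength: callable, (id_i, id_j) -> pairwise strength (stand-in for P_ij)
    delta:         user-defined strength threshold
    """
    visited = set()
    patterns = []
    # Seeds are tried in descending order of strength.
    for seed in sorted(flow_ids, key=lambda f: strength[f], reverse=True):
        if seed in visited:
            continue
        visited.add(seed)
        pattern, queue = [seed], deque([seed])
        while queue:
            current = queue.popleft()
            for nb in neighbors(current):
                # Absorb unvisited neighbors that are strength-reachable.
                if nb not in visited and pair_strength(current, nb) > delta:
                    visited.add(nb)
                    pattern.append(nb)
                    queue.append(nb)
        # A pattern needs at least two flow units.
        if len(pattern) >= 2:
            patterns.append(pattern)
    return patterns

# Hypothetical toy input: a chain a-b-c-d plus an isolated flow e.
strength = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.3, "e": 0.6}
adjacent = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"], "e": []}
pairs = {frozenset("ab"): 0.7, frozenset("bc"): 0.65, frozenset("cd"): 0.2}

result = grow_patterns(
    strength.keys(),
    strength,
    neighbors=lambda f: adjacent[f],
    pair_strength=lambda i, j: pairs[frozenset((i, j))],
    delta=0.6,
)
print(result)  # [['a', 'b', 'c']]
```

In this toy run the flow `d` is adjacent to the pattern but is left out because its pairwise strength falls below the threshold, and `e` is left out because it has no neighbors. The breadth-first queue is a design choice of the sketch; the order in which reachable neighbors are expanded does not change which flows end up in a pattern when both callables are symmetric.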

Characteristic variables of flow pattern
A complete WST-FP contains at least two flow units, and the origin or destination of the flow pattern consists of at least two proximal regions. A WST-FP has many basic attribute variables, which are crucial for measuring the pattern and calculating various evaluation quantities. A WST-FP can be represented as $\text{WST-FP}_i = \{f_1, f_2, \ldots, f_n\} = O \rightarrow D$, where $n$ represents the number of flows in the WST-FP, and $O$ and $D$ represent the spatiotemporal attributes of the origin and destination region groups, respectively. Then, for any flow unit $f_i$ ($f_i \in \text{WST-FP}_i$), at least one flow unit $f_j$ ($f_j \in \text{WST-FP}_i$) satisfies the spatiotemporal proximity and the threshold condition ($P_{i,j} > \delta$). The characteristics at the two levels of space and time are analyzed (Figure 8).
At the spatial level, the set of origin regions of WST-FP i can be represented as

Statistical metrics of result evaluation
The metrics of coverage, closeness, and composite value for OD flow patterns between regions were first proposed by Kim et al. for evaluating clustering results based on flow density (Kim et al. 2014). They were later applied to the evaluation of clustering results of strength-weighted flows by Zhang et al. (2018). In this paper, they are further refined and extended to make them applicable to the evaluation of clustering results of spatiotemporal strength-weighted flow patterns.

(1) Coverage rate
Coverage rate is the ratio of the sum of interaction volumes in $\text{WST-FP}_i$ to the sum of the interaction volumes of all flow units in the analysis that share the starting time period or reaching time period of $\text{WST-FP}_i$. It reflects how important the flow volume of a target flow pattern is within the entire flow data for the specified time period. The coverage rate formula for $\text{WST-FP}_i$ can be represented as: […] where […]

(2) Closeness rate
The set of origin regions of any $\text{WST-FP}_i$ is $O_iR$, and the total flow from $O_iR$ is represented by $|O_iR|$. The set of destination regions is $D_iR$, and the total flow to the destination regions is represented by $|D_iR|$. For any $\text{WST-FP}_i$, the s-value represents the interaction closeness of the flow pattern, that is, the strength of correlation between its origin and destination regions. The calculation formula is […] where $|O_i|$ represents the total interaction volume of flow units whose origin region belongs to $O_iR$ and whose starting time falls in the time period $O_iT$, $|D_i|$ represents the total interaction volume of flow units whose destination region belongs to $D_iR$ and whose reaching time falls in the time period $D_iT$, and $|O_i| \times |D_i|$ represents the product of the total flow from the origin regions and the total flow to the destination regions. The greater the value of $s(\text{WST-FP}_i)$, the stronger the correlation between the origin and destination regions of the flow pattern; otherwise, the correlation is weaker.

(3) Composite value
Coverage rate reflects the scope of the pattern from the flow itself, and closeness reflects the strength of the pattern through the correlation between the origin $O_i$ and destination $D_i$ of the flow pattern. Each index evaluates the strength of the WST-FP from a partial perspective and is therefore limited. In this study, the composite value of coverage rate and closeness rate is adopted to comprehensively reflect the strength of a pattern. The specific formula is
$$c(\text{WST-FP}_i) = \sqrt{v(\text{WST-FP}_i) \times s(\text{WST-FP}_i)} \quad (15)$$
where $v$ and $s$ denote the coverage rate and the closeness rate, respectively.
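Elsewhere the paper describes the c-value as the square root of the product of the v-rate and the s-rate, which makes it a geometric mean of the two. The Python sketch below illustrates that combination. The function names and all input numbers are hypothetical, the coverage helper simply divides pattern volume by the volume of the matching time period as described in the text, and the closeness rate is passed in as a given value because its formula is not reproduced here.

```python
import math

def coverage_rate(pattern_volume, period_volume):
    # Share of the period's total interaction volume carried by the pattern.
    return pattern_volume / period_volume

def composite_value(v_rate, s_rate):
    # Geometric mean of coverage and closeness.
    return math.sqrt(v_rate * s_rate)

# Hypothetical pattern A: small coverage, high closeness.
v_a = coverage_rate(120, 600)          # 0.2
c_a = composite_value(v_a, 0.8)

# Hypothetical pattern B: balanced coverage and closeness.
v_b = coverage_rate(300, 600)          # 0.5
c_b = composite_value(v_b, 0.5)

print(round(c_a, 3), round(c_b, 3))    # 0.4 0.5
```

Both hypothetical patterns have the same arithmetic mean of v and s (0.5), yet the balanced one scores higher. That is the practical effect of using a geometric mean: a pattern cannot reach a high composite value on coverage or closeness alone.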

Test with synthetic data
To verify the effectiveness of the WST-FP mining method proposed in this paper, two synthetic datasets are designed. One of the datasets is simple and the other is complex (Figure 9 and Figure 10). Figure 9 shows a set of volume-weighted spatiotemporal OD flow units, and Table 1 shows the basic attributes of all OD flow units corresponding to Figure 9. There are 9 OD flow units in total, named f1, f2, . . ., f9. Figure 9(a) shows the spatiotemporal distribution of all OD flows, while the study area and the basic areal units with their number codes are shown in Figure 9(c). The number codes of the areal units at the origin and destination of each OD flow correspond to the O_ID and D_ID fields in Table 1. O_DATE and D_DATE indicate the occurrence time of each flow in the origin and destination units. The VAL field is the interaction volume of the flow. In addition, comparative experiments between the proposed method and existing density-based spatial flow and spatiotemporal flow clustering methods are provided, as shown in Figures 11 and 12.

Small-scale synthetic dataset
If a shared edge or corner is used as the spatial proximity rule, and half an hour as the time interval, we can see from Figure 9(b) that {f1, f2, f3, f4} is detected as a flow pattern, denoted as WST-FP1, and {f5, f6, f7} is detected as another flow pattern, denoted as WST-FP2. In Figure 9(b), f8 is spatially adjacent to the flow units in WST-FP1 but not temporally adjacent, and f9 is adjacent in neither space nor time to any pattern. Therefore, f8 and f9 are contained in neither WST-FP1 nor WST-FP2. Other OD flow units may meet the proximity rule in space and time, yet their interaction volumes may contribute little to the regions where the pattern is located. In that case, these OD flow units also cannot be regarded as part of the pattern. In the end, we expect to obtain the flow pattern result shown in Figure 9(d).
In this experiment, the time interval is set to 30 min, the merging threshold of interaction strength is set to 0.6, and the spatial proximity relationship adopts the shared edge or corner rule.The analysis result is consistent with the expected result.The resulting parameters of WST-FP1 and WST-FP2 are shown in Table 2. Other time interval thresholds and thresholds of interaction strength can also be used for pattern detection, which will influence the results.

Large-scale synthetic dataset
To further verify the algorithm proposed in this paper, a spatiotemporal network with 40 rows, 30 columns, and time periods from 12:00 to 13:40 is designed. A large-scale set of OD flow units containing random values of interaction volume is generated, as shown in Figure 10(a). This dataset also contains some labeled OD flow units that constitute the preset flow patterns, as shown in Figure 10 […] The evaluation parameters of the flow pattern are key to understanding and interpreting the characteristics of each pattern. Table 3 lists the evaluation parameters of the four patterns included in the dataset in Figure 10. The indicator v-rate explains the degree of coverage of the individual flow patterns. The pattern results show that WST-FP2 accounts for the largest proportion of the total interaction volume in the entire study area during the model time period, and WST-FP3 accounts for the smallest proportion. The weight is the total number of OD flows between the two grids in this case. The s-rate reflects the closeness of the association between the origin region and the destination region of the flow pattern. Among the four patterns, the order of degree of closeness from strong to weak is WST-FP4, WST-FP2, WST-FP1, and WST-FP3.
According to the principle of Section 4.2 about flow pattern result evaluation parameters, for the same flow pattern, the v-rate and s-rate values tend to be inversely correlated.Since the c-value is the square root of the product of v-rate and s-rate, it characterizes the balance between coverage and closeness of a single flow pattern.Here, WST-FP 4 has the largest balance value, while WST-FP 3 has the smallest.O_Duration and D_Duration, O_Count and D_Count represent the count of areal units and duration in the origin and destination regions of each flow pattern, respectively.
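Because the c-value is defined as the square root of the product of v-rate and s-rate, it is the geometric mean of the two indicators. A minimal Python sketch follows; the function name `c_value` and the input values are illustrative assumptions and are not taken from Table 3.

```python
import math


def c_value(v_rate: float, s_rate: float) -> float:
    """Geometric mean of coverage (v-rate) and closeness (s-rate)."""
    if v_rate < 0 or s_rate < 0:
        raise ValueError("v_rate and s_rate must be non-negative")
    return math.sqrt(v_rate * s_rate)


# Hypothetical patterns: one balanced, one with high coverage but low closeness.
balanced = c_value(0.5, 0.5)
lopsided = c_value(0.9, 0.1)
print(round(balanced, 3), round(lopsided, 3))
```

The balanced pattern scores 0.5 and the lopsided one about 0.3, even though both pairs of indicators sum to 1. This shows why the geometric mean rewards patterns that do well on coverage and closeness at the same time.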

Comparison with two state-of-the-art clustering methods
To validate the efficacy of the proposed algorithm WST-FP, we compare it with two state-of-the-art clustering methods, specifically the spatial flow L-function (SpatialflowL) based approach and the flow ST-DBSCAN (Birant and Kut 2007; Rus et al. 2022) based method. Whereas SpatialflowL (Shu et al. 2021) is solely able to identify interregional association patterns in space, the flow ST-DBSCAN algorithm identifies interregional associations concurrently in both the temporal and spatial dimensions. Therefore, the latter approach is capable of detecting spatiotemporal interregional association patterns. By integrating Figure 11(b,c), it can be observed that the two clusters overlap spatially but segregate temporally.
Given that SpatialflowL cannot incorporate the temporal dimension, the two clusters that are separated in time yet located closely in space, together with certain noise flows in spatial proximity, are identified as a single cluster, as illustrated in Figure 11(d). Although the ST-DBSCAN algorithm can concurrently discern the temporal and spatial dimensions, only Cluster 1, which has the higher density, is detected in the present experiment due to the subjectivity in threshold estimation, as evidenced in Figure 11(e). Figure 11(f) delineates the outcomes extracted by the WST-FP algorithm proposed in this paper. Since the interaction volumes of OD flows in Cluster 1 and Cluster 2 are substantial and constitute a large proportion of the total volumes of all other OD flows departing from the origin region or ending in the destination region, their strengths can be inferred to be strong according to Definition 6. Therefore, the WST-FP algorithm is capable of identifying both clusters simultaneously.
Similarly, Figure 12(a) visualizes the second comparative dataset, where a darker color indicates a greater interaction volume of a flow. As evidenced in Figure 12(b) and (c), although Cluster 1 exhibits higher density, numerous flows other than those in Cluster 1 depart from the origin region of Cluster 1, and many flows other than those in Cluster 1 end in the destination region of Cluster 1. This results in Cluster 1 having high density yet low strength. In contrast, Cluster 2 has high strength since only a small number of other flows depart from the origin region of Cluster 2 to other regions and only a few flows from other regions end in the destination region of Cluster 2.
Given that SpatialflowL cannot incorporate the temporal dimension, the Cluster 1 and Cluster 2 that it identifies contain some noise flows in close spatial proximity, as illustrated in Figure 12(d). In the present experiment, the ST-DBSCAN algorithm also detects only Cluster 1, which has the higher density, due to the subjectivity in threshold estimation, as shown in Figure 12(e). The WST-FP algorithm primarily takes into account the relative strength of association between origins and destinations. Therefore, the WST-FP algorithm is able to identify Cluster 2, which exhibits lower density yet higher strength, as evidenced in Figure 12(f).
Table 4 evaluates the patterns identified by the three methods in Figure 12. The patterns identified by SpatialflowL have higher v-rate values than those identified by the flow ST-DBSCAN and WST-FP algorithms. This is because SpatialflowL only considers the spatial continuity between flows and thus includes flows that are not temporally continuous. On the other hand, the s-rate considers not only the volume between the origin and destination regions, but also the total volume of the origin and destination regions. This is highly similar to the notion of flow strength, and WST-FP identifies patterns based on flow strength. Therefore, compared to SpatialflowL and flow ST-DBSCAN, the patterns identified by WST-FP have the highest s-rate values. The c-value comprehensively considers both v-rate and s-rate. Table 4 shows that the patterns identified by WST-FP have the maximum c-value.

Study area and data description
In this study, the Chinese mainland is selected as the study area, as shown in Figure 13.

Result and evaluation
A topological proximity rule was also used to model the spatial relationship between regions in the real-world dataset. Two regions were marked as spatial neighbors when they had a shared edge or corner. In this case, the duration threshold was set to 2 days, that is, flows were regarded as occurring in proximal time when they happened up to 2 days earlier or later than the current time. The flow merging threshold was set to 0.009. Approximately 50 flow patterns were found under the threshold constraints, and 21 of these patterns are visualized on the map, as shown in Figures 14 and 15. Here, for comparative analysis, flow patterns with closer distances between origin and destination regions are placed in Figure 14, and farther ones in Figure 15.
The 11 flow patterns shown in Figure 14 can be analyzed based on the spatial distribution and spatial relationship of the origin and destination regions of a single flow pattern. The overall distribution characteristics of all flow patterns can also be analyzed. They can be further described by combining the evaluation parameters of each flow pattern in Table 5. Figure 14 shows there is rarely a large difference in the number of spatial units in the origin and destination regions of a single flow pattern. For example, if oc and dc denote the number of spatial units constituting the origin
The advantage of the evaluation indicators is that they reflect, in a more quantitative statistical way, characteristics of the flow patterns that are difficult to visualize directly on the map. Table 5 shows the coverage degree of flow patterns 1, 2, 3, and 8 is much higher than that of other flow patterns. Among them, WST-FP8 has the highest coverage degree (v-rate). Combined with the spatial location of this pattern in Figure 14, the origin and destination regions of WST-FP8 are located in the capital Beijing and the Pearl River Delta Economic Development Zone, respectively. The patterns with the highest closeness degree (s-rate) are flow patterns 9 and 10, both of which have similar spatial locations and are interactions between East China and Northeast China, but in opposite directions. The balance (c-value) of the 11 flow patterns shown in Figure 14 does not differ much.
The characteristics of the spatial distribution, spatial relationships, and time periods of the flow patterns in
Table 6 shows the evaluation results of WST-FPs in Figure 15. Except for WST-FP10, the duration of all these flow patterns is short. Most patterns in Figure 15 have larger s-rate values, indicating a stronger interaction between the origin and destination of these flow patterns.
The northwest part of the study area is a low population density region, while the southeast part is a high population density region. In high population density regions, the WST-FP method can extract flow patterns such as those shown in Figure 16(f). The SpatialflowL method identifies similar patterns, as in Figure 16(d), but this is a spatial flow pattern rather than a spatiotemporal flow pattern, so the time periods when the pattern occurs cannot be obtained. The flow ST-DBSCAN method obtains two patterns, as shown in Figure 16(e), and their time periods can be determined. However, compared to the pattern in Figure 16(f), their implications are completely different. The flow patterns obtained by the method proposed in this paper have large flow strength, while those obtained by the other two methods have large flow density.

Revisiting the distinction between interaction volume and strength
It is important to understand the difference between interregional flow density and flow strength. To illustrate the critical role of flow strength in solving practical problems, as well as how it differs from flow density, consider the following example. Suppose there are three regions: Region A, Region B, and Region C, as shown in Figure 17. Figure 17(a,b) can be seen as a spatially networked abstraction of real-world supply and demand relationships. Region C provides many vital resources to both Region A and Region B. Specifically, the resource flow from Region C to Region A is 500 (the 500 here can be approximated as the value of the flow density), while the total resource flow to Region A from all other regions is only 100. In comparison, while the resource flow from Region C to Region B is also high at 600, the total resource flow to Region B from other regions is far higher, at 5000. If a disaster strikes Region C and cuts it off from interaction, this would severely impact production and livelihoods in Region A. For instance, if Region C has an accident and cannot supply resources to other regions, the impact on Region B is relatively minor, since the resources it receives from Region C are only 10.7% of its total. However, the same accident would have an enormous impact on Region A, since the resources it receives from Region C make up 83.3% of its total. This highlights how flow strength reflects the importance of, and dependency on, a certain interaction for a region, as distinct from the mere magnitude of flow density. Grasping flow strength accurately is critical for analyzing regional networks and responding to contingencies.
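The percentages in this example follow from dividing the flow received from Region C by the total inflow of the receiving region. The Python sketch below reproduces that arithmetic. It is a simplified destination-side share used only for illustration and is not claimed to be the flow strength of Definition 6; the function name `inflow_share` is hypothetical.

```python
def inflow_share(flow_volume: float, other_inflow: float) -> float:
    """Share of a region's total inflow that comes from one origin."""
    total = flow_volume + other_inflow
    if total <= 0:
        raise ValueError("total inflow must be positive")
    return flow_volume / total


share_a = inflow_share(500, 100)    # Region C -> Region A
share_b = inflow_share(600, 5000)   # Region C -> Region B
print(f"A depends on C for {share_a:.1%} of its inflow")
print(f"B depends on C for {share_b:.1%} of its inflow")
```

The flow to Region B is larger in absolute terms (600 versus 500), yet Region A's dependency on Region C is far higher (83.3% versus 10.7%), which is the distinction between density-like and strength-like measures drawn above.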

Application prospect of the algorithm
The conventional clustering results based on unweighted OD flows primarily reflect density characteristics, whereas the clustering results derived from weighted OD flows in this study capture the strength of association between regions. Unlike flow density, which reveals absolute closeness between regions in existing methods, flow strength in our method reveals local relative closeness between regions. This study introduces a flow cluster algorithm that progresses from traditional unweighted clustering to weighted clustering and from spatial clustering to spatiotemporal clustering, presenting a novel approach for integrated spatiotemporal flow pattern mining. Furthermore, this method can be viewed as a new map generalization technique for spatial OD flow data. The primary objective of this method is to address questions such as identifying regions with strong interactions or associations, determining the boundaries of origin and destination in flow patterns with significant associations, and understanding when these associations occur. This approach holds considerable potential in urban planning, transportation analysis, and regional planning, which involve spatial flow elements, such as human flow, logistics, traffic flow, and information flow from a broader flow space perspective. The notion of strength-based local relative closeness between regions enables the revelation of the priority and importance of connections between regions. In practical decision-making scenarios, where limited resources are available for regional development, it becomes crucial to ascertain which connections between regions are the most significant and should be prioritized for strengthening or protection. Local relative closeness facilitates the comparison of the closeness between different regions and their adjacent regions, enabling the identification of the most critical regional associations and offering decision-makers a basis for informed choices. For example, in urban transportation planning, determining the direction of new routes requires understanding which areas have the closest links. Local relative closeness can identify these areas, guiding the establishment of connectivity between them. Similarly, in disease prevention and control, determining the order of isolating high-risk areas necessitates considering the closeness of their connections to infected areas. Local relative closeness can guide the prioritization of isolating areas with the closest links to infected regions, effectively curbing the spread of the disease.
In summary, compared to simple absolute closeness measures, local relative closeness provides more accurate and realistic results for analyzing inter-regional relationships. In practice, this approach can assist planners in adopting more scientifically grounded and targeted strategies, such as implementing isolation measures in disease control or formulating transportation network plans in urban planning.

Limitations and future directions
The strength-weighted spatiotemporal flow clustering method proposed in this study exhibits notable advantages in terms of efficiency, accuracy, and robustness. However, certain limitations remain and warrant exploration in future research. Firstly, this method encounters challenges related to the modifiable areal (temporal) unit problem when mining pattern results. The spatial area unit and temporal interval unit used for analysis need to be predefined, which can impact the interpretation and generalizability of the findings. Secondly, the current definition of flow patterns in this study does not account for cases where the origin and destination regions of a single flow pattern partially or completely overlap. Exploring the inclusion of such patterns within the scope of flow patterns would be a valuable avenue for future investigation. Thirdly, different flow patterns may spatially overlap to some extent, making it difficult to visualize them effectively on the same map. Developing an effective flow pattern visualization method to address this challenge remains an open research problem. Addressing these aspects will be critical for future research in this field. The resolution of the modifiable areal unit problem, the inclusion of overlapping origin-destination regions in flow patterns, and the development of visualization techniques for overlapping flow patterns are key areas that require further exploration and innovation.

Conclusion
Spatial interaction is a concept that often manifests itself through the association between different regions. The analysis of spatial interaction involves examining key attributes of interaction events, such as their start time, end time, duration, and interaction strength. These attributes are crucial for understanding and studying spatial interaction patterns. When considering the proximity between regions, two aspects are commonly evaluated: absolute closeness and local relative closeness. Absolute closeness refers to the direct connection between regions, without taking into account the relationship between a region and its adjacent regions. On the other hand, local relative closeness considers the proximity of the connection between a region and its surrounding regions relative to other connections in the area. To effectively reveal the laws governing regional interactions, it is important to consider the interaction strength and time associated with OD flows. These flows serve as fundamental indicators in clustering and analyzing OD data, enabling a more accurate understanding of flow patterns and enhancing overall knowledge in the field. By incorporating these indicators, researchers can gain deeper insights into the spatial interaction dynamics between regions.
This paper proposes algorithms for efficiently mining flow patterns with spatiotemporal continuity based on interaction strength from large-scale OD flows. It also introduces metrics such as coverage, closeness, and tradeoff, which serve as means to evaluate the effectiveness and accuracy of flow patterns. The primary focus of the research is on addressing challenges related to the merging rule of spatiotemporal OD flow pairs weighted by interaction strength, the calculation of flow strength during the merging process, and the evaluation and interpretability of flow patterns using indicators.
When measuring absolute closeness, the common approach is to consider simple flow or the number of interactions between regions. However, in the case of local relative closeness, it becomes necessary to account for the neighborhood environment and the interaction range of a region. This requires the adoption of more complex indicators, such as flow strength, to assess the proximity between regions. The proposed algorithm is characterized by its efficiency, as it has a time complexity of less than $O(n^2)$ when constructing a spatiotemporal index. It is also designed to be applicable to various types of OD flow data, provided they include interaction volume and time attributes. The algorithm is well-parameterized, requiring only two input parameters: the spatiotemporal proximity rule and the strength reachability threshold. The robustness and practicality of the method are demonstrated through case experiments conducted using both synthetic and real datasets. These experiments serve to validate the effectiveness and applicability of the proposed algorithm in real-world scenarios.

Disclosure statement
No potential conflict of interest was reported by the authors.

Funding
five basic attributes, where $i$ represents the $i$th flow unit, and $f^i_o$ and $t^i_o$ represent the origin and the corresponding starting time of the flow unit. Similarly, $f^i_d$ and $t^i_d$ represent the destination and the corresponding reaching time of the flow unit. $f^i_w$ represents the weight of the flow unit, that is, the number of people who depart from $f^i_o$ at time $t^i_o$ and reach $f^i_d$ at time $t^i_d$. As shown in Figure 1(a), these are three examples of weighted spatiotemporal flow units. The weights of OD flows can represent different meanings, such as interaction volumes or interaction strengths. By default, the weight refers to the interaction volume unless otherwise specified. Based on the above definitions, OD flows with $t^i_o$ and $t^i_d$ as None are called spatial OD flows. Flow units with $f^i_w = \text{None}$ are called unweighted spatiotemporal OD flows, as shown in Figure 1(b). Unweighted OD flows do not indicate OD flows with a weight of 1 but instead treat those with weights greater than or equal to 1 as equivalent. In other words, their weights are all set to None.
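The five attributes of a flow unit map naturally onto a small record type. The sketch below is one possible Python representation, with hypothetical field and property names; it only encodes the conventions stated above, where missing times mark a spatial OD flow and a missing weight marks an unweighted flow.

```python
from dataclasses import dataclass
from typing import Hashable, Optional


@dataclass(frozen=True)
class FlowUnit:
    """One OD flow unit: origin, destination, two times, and a weight."""
    origin: Hashable                        # f_o
    destination: Hashable                   # f_d
    t_origin: Optional[float] = None        # t_o, starting time
    t_destination: Optional[float] = None   # t_d, reaching time
    weight: Optional[float] = None          # f_w, interaction volume by default

    @property
    def is_spatial_only(self) -> bool:
        """Spatial OD flow: both times are None."""
        return self.t_origin is None and self.t_destination is None

    @property
    def is_unweighted(self) -> bool:
        """Unweighted flow: the weight is None, not 1."""
        return self.weight is None


weighted = FlowUnit("cell_a", "cell_b", t_origin=720, t_destination=750, weight=12)
unweighted = FlowUnit("cell_a", "cell_b", t_origin=720, t_destination=750)
spatial = FlowUnit("cell_a", "cell_b", weight=12)
print(weighted.is_unweighted, unweighted.is_unweighted, spatial.is_spatial_only)
```

Using `None` rather than a default weight of 1 keeps the distinction made in the text: an unweighted flow carries no volume information at all.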

Figure 1. The basic form of weighted and unweighted spatiotemporal OD flow. (a) weighted OD flows and (b) unweighted OD flows.

Figure 2. Spatial distance measurement between flow units. (a) flow unit pairs that satisfy the spatial adjacency relationship, (b) flow unit pairs that do not satisfy the spatial adjacency relationship.

Figure 3. The density and strength of OD flows respectively characterize the absolute and relative closeness of association between their origin and destination. (a) Smaller number of unweighted flows between regions 1 and 2, (b) larger number of unweighted flows between regions 3 and 4, (c) weighted OD flow between regions 1 and 2 shows low interaction volume but high interaction strength, (d) weighted OD flow between regions 3 and 4 shows high interaction volume but low interaction strength.

Figure 4. Interregional association patterns with a relatively high degree of closeness. (a) visualization of flow dataset, (b) flow pattern with relatively high degree of closeness.

Figure 5. Spatiotemporal flow pattern. (a) traditional flow patterns with predefined spatial and temporal constraints, (b) flow patterns considering spatiotemporal continuity.

Figure 6. Recognition of strength-weighted spatiotemporal flow patterns from massive flow units: (a) select a seed flow unit $f^1$, (b) find the adjacent regions of the seed flow unit $f^1$, (c)-(d) retain the adjacent regions that contain the flow unit, (e) calculate whether each adjacent flow unit can be merged with the seed unit, (f) collections of flow units that can be merged and filtered, (g) randomly select a flow unit from the set of flow units that can be merged as the next seed unit and repeat steps (b)-(f), (h) the final extracted two WST-FPs.
$f^1_d$ of flow unit $f^1$ are found. As shown in Figure 6(b), c is an adjacent polygon element of the origin $a$ ($f^1_o$), and d is an adjacent polygon element of the destination $b$ ($f^1_d$). No flow unit exists between several neighbors of the origin and destination of $f^1$. Thus, such polygon elements should be eliminated from the neighbor set, and the set of flow units that connect adjacent polygon elements of the origin $a$ ($f^1_o$) with adjacent polygon elements of the destination $b$ ($f^1_d$) is obtained. $f^1$ and the other flow units in Figure 6(c) meet the accessible conditions of the spatial distance defined in Section 3.1. Then, each flow in the flow unit set above is checked, and only the flow units whose time distance from $f^1$ meets the threshold $\tau$ are retained; this set is called $NF(f^1)$, as shown in Figure 6(d). Subsequently, flow unit $f^1$ and the other flow units that exhibit a spatiotemporal proximity relationship with $f^1$ are combined, and the flow strength reachability of each combination is determined on the basis of the rules defined in Section 3.1, as shown in Figure 6(e). If the set of flow units of a new WST-FP is marked as FP, then $f^1$ and all other flow units that meet the accessible spatiotemporal distance and flow strength with $f^1$ are placed in set FP as flow unit members of this pattern.
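The seed-and-grow procedure described here can be sketched as a generic region-growing loop. The Python code below is only an illustration under several assumptions: `grow_pattern`, `is_neighbor`, and `can_merge` are hypothetical names, the next seed is taken in queue order rather than at random, candidates are found by scanning all flows instead of using a spatiotemporal index, and the toy `can_merge` rule (a minimum volume) merely stands in for the strength reachability rule of Section 3.1, which is not reproduced.

```python
from collections import deque


def grow_pattern(seed, flow_ids, is_neighbor, can_merge):
    """Grow one pattern from a seed by repeatedly merging eligible neighbors."""
    pattern = {seed}
    queue = deque([seed])
    while queue:
        current = queue.popleft()          # current seed unit
        for cand in flow_ids:
            if cand in pattern:
                continue
            # Merge only flows that are spatiotemporally adjacent to the
            # current seed and that pass the (placeholder) strength test.
            if is_neighbor(current, cand) and can_merge(pattern, cand):
                pattern.add(cand)
                queue.append(cand)         # merged flows become later seeds
    return pattern


# Toy data: id -> (origin cell, destination cell, time in minutes, volume).
flows = {
    "f1": ((0, 0), (5, 5), 0, 9),
    "f2": ((0, 1), (5, 6), 10, 8),
    "f3": ((1, 1), (6, 6), 20, 7),
    "f8": ((1, 2), (6, 7), 90, 9),    # spatially close, too late in time
    "f9": ((9, 9), (0, 9), 10, 9),    # far away in space
    "f10": ((0, 1), (5, 5), 10, 1),   # adjacent but weak
}


def queen(a, b):
    """Cells share an edge or corner (identical cells included)."""
    return max(abs(a[0] - b[0]), abs(a[1] - b[1])) <= 1


def is_neighbor(i, j, tau=30):
    (oi, di, ti, _), (oj, dj, tj, _) = flows[i], flows[j]
    return queen(oi, oj) and queen(di, dj) and abs(ti - tj) <= tau


def can_merge(pattern, j, min_volume=5):
    # Placeholder for the strength reachability rule (assumption).
    return flows[j][3] >= min_volume


result = grow_pattern("f1", flows, is_neighbor, can_merge)
print(sorted(result))
```

In this toy run the pattern grown from f1 absorbs f2 and f3 but excludes the temporally distant, spatially distant, and weak flows, which is the qualitative behavior the figure walks through.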
and the set of destination regions can be represented as $D_R = \{f^1_d, f^2_d, f^3_d, f^4_d\}$. Then, $O_R$ and $D_R$ jointly constitute the basic spatial characteristic quantities of the WST-FP$_i$. At the temporal level, the origin region of each flow unit corresponds to a starting moment $t^i_o$, and the destination of each flow unit corresponds to a reaching moment $t^i_d$. The time period of origin regions of the WST-FP$_i$ can be expressed as $O_T = [O_{T1}, O_{T2}]$, where $O_{T1} = \min(t^1_o, t^2_o, t^3_o, t^4_o)$ represents the earliest starting moment among all starting moments in the origin regions and $O_{T2} = \max(t^1_o, t^2_o, t^3_o, t^4_o)$ represents the latest starting moment. Meanwhile, the time period of destination regions can be expressed as $D_T = [D_{T1}, D_{T2}]$, where $D_{T1} = \min(t^1_d, t^2_d, t^3_d, t^4_d)$ represents the earliest reaching moment among all reaching moments in the destination regions and $D_{T2} = \max(t^1_d, t^2_d, t^3_d, t^4_d)$ represents the latest reaching moment. The duration of the origin regions of the WST-FP$_i$ is $\Delta t_o = |O_{T2} - O_{T1}|$, and that of its destination regions is $\Delta t_d = |D_{T2} - D_{T1}|$. The duration of the entire WST-FP$_i$ is $\Delta t_{od} = |D_{T2} - O_{T1}|$. $O_{T1}$, $O_{T2}$, $D_{T1}$, and $D_{T2}$, together with $\Delta t_o$, $\Delta t_d$, and $\Delta t_{od}$, jointly constitute the basic temporal characteristic quantities of the WST-FP$_i$.
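The temporal characteristic quantities reduce to minima, maxima, and absolute differences over the starting and reaching moments of a pattern's flow units. A minimal Python sketch, with a hypothetical function name and illustrative times in minutes:

```python
from typing import Dict, Sequence, Tuple, Union

Number = Union[int, float]


def temporal_characteristics(
    t_o: Sequence[Number], t_d: Sequence[Number]
) -> Dict[str, Union[Number, Tuple[Number, Number]]]:
    """Compute O_T, D_T and the three durations of one flow pattern."""
    o_t1, o_t2 = min(t_o), max(t_o)   # earliest and latest starting moments
    d_t1, d_t2 = min(t_d), max(t_d)   # earliest and latest reaching moments
    return {
        "O_T": (o_t1, o_t2),
        "D_T": (d_t1, d_t2),
        "dt_o": abs(o_t2 - o_t1),     # duration of the origin regions
        "dt_d": abs(d_t2 - d_t1),     # duration of the destination regions
        "dt_od": abs(d_t2 - o_t1),    # duration of the entire pattern
    }


# Four flow units, times as minutes after midnight (illustrative values).
stats = temporal_characteristics([720, 730, 725, 740], [750, 760, 770, 765])
print(stats)
```

For these values the origin and destination durations are both 20 min, while the whole pattern spans 50 min from the earliest departure to the latest arrival.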

Figure 9. Mining flow patterns from a small set of volume-weighted spatiotemporal flow units: (a) a set of flow units, (b) whether flow units are contained in flow patterns or not, (c) experimental area, and (d) two mined flow patterns.

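Figure 10. Massive synthetic spatiotemporal OD flow data and labeled flow patterns. (a) volume-weighted spatiotemporal OD flow, (b) labeled spatiotemporal OD flow, (c) four mined OD flow patterns.
The proposed algorithm is used to discover spatiotemporal flow patterns from this dataset. The merge threshold is set to 0.0009, and the time step is set to 10 min. The analysis results are shown in Figure 10(c). The mining results are similar to the preset flow patterns. The partially inconsistent grids result from the choice of time step and merging threshold. This experiment demonstrates the effectiveness of the algorithm constructed in this study in the flow pattern mining of large-scale volume-weighted spatiotemporal OD flows.
Figure 11(a) illustrates the visualization of the first comparative dataset, where the darker color of an OD flow indicates greater interaction volume. As evidenced in Figure 11(b), this dataset also encompasses certain labeled OD flows that constitute predetermined flow patterns. Figure 11(b) contains two clusters in total, with Cluster 1 exhibiting a higher density than Cluster 2. Figure 11(c) presents the visualization of the flow patterns in Figure 11(b) in a two-dimensional view.
There are approximately 300 prefecture-level cities, which are tertiary administrative regions. The data used are the flow data of people traveling by plane within China every day. The flow data are obtained through statistics, with prefecture-level cities of tertiary administrative divisions as basic spatial units. The data are obtained from Tencent Open Platform of Location Big Data. The daily human mobility data throughout 2018 are used, i.e. the flow unit data shown by OD lines in Figure 13 are the visualized results of all statistics in one day of 2018, and approximately 2.5 million OD flow data were obtained throughout 2018. Each record contains the occurrence date, origin, destination, and interaction volume of the flow unit. Prefecture-level cities are the units of origins and destinations. Statistics from the China Aviation Administration Company indicate the country's total passenger flow by air in 2018 was 610 million person-times, including outbound and inbound tourists. That is, the data accounted for 27.87% of the total. Figure 13 is cited from Zhang et al. (2018) and the same dataset as in Zhang's paper is used to validate the algorithm in this paper.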
Figure 11(a) illustrates the visualization of the first comparative dataset, where the darker color of an OD flow indicates greater interaction volume.As evidenced in Figure 11(b), this dataset also encompasses certain labeled OD flows that constitute predetermined flow patterns.Figure 11(b) contains two clusters in total, with Cluster 1 exhibiting a higher density than Cluster 2. Figure 11(c) presents the visualization of the flow patterns in Figure 11(b) in a twodimensional view.By integrating Figure . There are approximately 300 prefecture-level cities, which are tertiary administrative regions.The data used are the flow data of people traveling by plane within China every day.The flow data are obtained through statistics, with prefecture-level cities of tertiary administrative divisions as basic spatial units.The data are obtained from Tencent Open Platform of Location Big Data.The daily human mobility data throughout 2018 are used, i.e. the flow unit data shown by OD lines in Figure 13 are the visualized results of all statistics in one day of 2018, and approximately 2.5 million OD flow data were obtained throughout 2018.Each record contains the occurrence date, origin, destination, and interaction volume of the flow unit.Prefecture-level cities are the units of origins and destinations.Statistics from the China Aviation Administration Company indicate the country's total passenger flow by air in 2018 was 610 million persontimes, including outbound and inbound tourists.That is, the data accounted for 27.87% of the total.Figure 13 is cited from Zhang et al. (2018) and the same dataset as in Zhang's paper is used to validate the algorithm in this paper.

Figure 13. Study area and flow data visualization during one day (each flow unit contains origin city, destination city, and passenger count).
and destination regions in a single flow pattern, oc and dc in WST-FP1 are 7 and 3, respectively. The oc and dc in WST-FP2 are 5 and 5, and the oc and dc in WST-FP3 are 4 and 5, respectively. Furthermore, the origin or destination of most flow patterns is near the frontier. In terms of distance, flow patterns may form even between two regions that are very close to each other, e.g. WST-FP3, WST-FP4, WST-FP7, and WST-FP11 have very small distances between the origin and destination regions. In terms of spatial distribution, more flow patterns formed in border areas and fewer in inland areas. The flow patterns are the fewest in the central interior region and the most numerous in the southern frontier region. In terms of duration, most flow patterns have a relatively short duration, and only a few have a particularly long time span. For example, WST-FP4 lasted from 16 February 2018 to 7 July 2018 and is located on the northern side of the study area. WST-FP6 lasted from 1 January 2018 to 8 October 2018 and is located on the southern side of the study area. The time span of these two patterns is very long, lasting approximately 5 and 10 months, respectively. Another interesting phenomenon is that most of these short- and medium-distance flow patterns occur in the first half of the year.

Figure 14. Results of strength-weighted spatiotemporal flow patterns in a medium-close distance. (a) Mapping of flow patterns, (b) duration of each flow pattern.

Figure 15 have many similarities with Figure 14. These medium- and long-distance flow patterns are also mainly distributed near China's national border, and the central region still lacks flow patterns. Throughout the western and southern sides of the study area, most flow patterns are east-west oriented, e.g. flow patterns 1, 2, 4, and 5. On the east side of the study area, the flow patterns are mainly north-south oriented, such as flow patterns 6, 8, 9, and 10. The flow patterns in Figure 14 also have similar characteristics to Figure 15 in terms of spatial distribution and spatial relationships. In terms

Figure 15. Results of strength-weighted spatiotemporal flow patterns in a medium-long distance. (a) Mapping of flow patterns, (b) duration of each flow pattern.
Based on the definitions of flow density and flow strength, it can be found that flow pattern extraction methods based on flow density tend to identify flow patterns from high population density regions. In contrast, flow pattern extraction methods based on flow strength are not limited by population density, because flow strength is a relative measure, while flow density is absolute. To demonstrate the characteristics of the method proposed in this paper, SpatialflowL, flow ST-DBSCAN, and the WST-FP method are used to extract flow patterns from the real dataset. Some results are shown in Figure 16. In low population density regions, the WST-FP method can uncover flow patterns, as shown in Figure 16(c). The other two methods fail to detect flow patterns in the same area, as shown in Figure 16(a,b).

Figure 16. Comparative experiments based on real-world datasets. (a) result of SpatialflowL, (b) result of flow ST-DBSCAN and (c) result of WST-FP in low-density population regions. (d) result of SpatialflowL, (e) result of flow ST-DBSCAN and (f) result of WST-FP in high-density population regions.

Figure 17. An example demonstrating the practical application of flow strength. (a) interregional supply and demand networks in the real world, (b) interregional flow networks following abstraction.

Table 1. Attributes of flow units.

Table 2. Result parameters of flow pattern based on a small-scale dataset (time period unit: minute).

Table 3. Result parameters of flow pattern based on a large-scale dataset (time period unit: minute).

Table 4. Evaluation results of three methods in Figure 12.

Table 5. Evaluation results of WST-FPs in Figure 14.

Table 6. Evaluation results of WST-FPs in Figure 15.