Managing score heterogeneity between online consumer review websites

Abstract This paper describes an alternative approach to measuring score heterogeneity between online consumer review websites. This topic is important in tourism management and in the hospitality sector, where it is helpful to be aware of the ratings obtained by services, from information readily available on the website. We approach this issue by considering tests of multiple population means, assuming this question can be viewed as a clustering problem and that all feasible data configurations can be tested using a Bayesian procedure from which the posterior probabilities of each cluster model are computed. The proposed Bayesian model is a useful alternative to frequentist multiple testing methods, which neglect uncertainty regarding other potential configurations. We draw conclusions about the overall score parameter and propose a Bayesian model averaging model for estimation purposes. Finally, the proposed Bayesian framework is illustrated in detail using a real dataset.


Introduction
In recent years, increasing numbers of researchers in business, economics, management, social sciences and related disciplines have been using Bayesian statistics in their data analyses, an approach facilitated by the greater computational power now available with which to address the complexity of statistical models.In tourism studies, too, this method has proven beneficial and offering greater flexibility and enables more intuitive interpretations to be made.
Bayesian analysis has been introduced in tourism in the last two decades where many contributions have been made by Assaf and colleagues.As Assaf et al. (2018) pointed out, the Bayesian approach offers several advantages: 1) It allows previous information to be incorporated into the analysis; 2) It is more robust to small sample sizes; 3) It produces more reliable statistics for formal model comparison; 4) It provides a better approximation for managing uncertainty within the models.Topics addressed in their papers include analyses of cost efficiency and estimated business efficiency, using parametric and non-parametric Bayesian stochastic frontier and DEA approaches (Assaf, 2009(Assaf, , 2011(Assaf, , 2012;;Assaf & Tsionas, 2015;Assaf et al., 2021), efficiency in the US hotel industry (Assaf & Magnini, 2012), regression and forecasting models (Assaf & Tsionas, 2019, and references therein), and Bayesian methods and structural equation modelling in tourism research (Assaf et al., 2018).An excellent brief review of the recent literature on Bayesian statistics in tourism management can be found in Bianchi and Heo (2021) and references therein.
However, there is only a very limited body of research literature on the use of Bayesian techniques in tourism and hospitality-sector studies related to electronic word-of-mouth (eWOM), an area in which the frequentist approach is much more prevalent.Probably, the recent paper by S´anchez-Franco et al. (2019) is a prelude to the opportunities offered by such Bayesian methods in this area.
With the increasing incorporation of e-commerce into the tourism industry, however, many consumers now base their hotel selections and reservations on the information offered on tourism websites (Peng et al., 2018).The internet (platforms, applications, etc.) is now a major channel for booking accommodation (Buhalis & Law, 2008) and it is important both for customers and for hotel managers to have a good understanding of the rationale underlying travellers' booking preferences and their propensity to conduct direct online operations; among other benefits, this knowledge could facilitate significant cost reductions for both parties (Lei et al., 2019).In this respect, the use of a single-value rating is often complemented by multi-criteria ratings or text comments, among other resources (Chen et al., 2021;Nilashi et al., 2019;Zhao et al., 2019).Many studies have analysed the sociodemographic and trip-related characteristics of tourists, including Coenders et al. (2016) andBoto-Garc´ıa et al. (2021).In this respect, too, cognitive variables were used by Fong et al. (2017) to predict tourists' intention to reuse mobile apps for making hotel reservations, while Boto-Garc´ıa et al. ( 2021) examined the associations among trip features such as travel purpose, distance to origin, duration of stay and booking mode choice, and confirmed the existence of varying preferences among different population groups.
Many tourism and hospitality researchers have studied the use of eWOM.Some have evaluated online consumer reviews (OCR), such as TripAdvisor (Fang et al., 2016) or Booking.com(Mellinas & Martin-Fuentes, 2021;Mellinas et al., 2015Mellinas et al., , 2016)), while others compare review sites (Xiang et al., 2017).Customer reviews reflect their experience of the hospitality sector, which is usually summarised as a subjective numerical value, commonly ranging from 1 to 10.This area is one in which Bayesian statistics can usefully be considered, as the data already represent a subjective decision (Bianchi & Heo, 2021).
Understanding the heterogeneous preferences for hotel choice attributes across different customer segments is an interesting issue in the hospitality context.In this paper, we will show that the Bayesian approach provides a means of addressing the problem of between-OCR heterogeneity in hospitality management.Analyses of this topic must resolve two significant complications in this regard: firstly, the existence of heterogeneity among OCR webs, a variation that could bias parameter estimates.Furthermore, after this heterogeneity has been identified, it is necessary to syncretise the information in order to generate valid estimates of the quantities of interest and to ensure that the analysis incorporates all the uncertainty presented concerning between-OCR variation.The following motivating example is offered to clarify the problem.

A motivating case study
The Santa Catalina Hotel is an emblematic, five-star (grand luxury) hotel located in the city of Las Palmas de Gran Canaria in the Canary Islands, Spain, which has attracted national and international celebrities since its inauguration in 1890.The hotel has been a landmark throughout the history of Gran Canaria, an island renowned for its white sand and black lava beaches, and was named Best Historical Luxury Hotel in Europe and Best Luxury Cultural Hotel in Southern Europe at the 2019 World Luxury Hotel Awards.
The dataset used in this article was extracted in June 2022 using public data from Booking.com, TripAdvisor.comand Hoteles.com and cover the time period from 1 January to 31 December 2021.Table 1 summarises the scores received by the hotel from each of these OCR websites.
In the present study, our aim is to illustrate how several sources of OCR-scoring information may be aggregated coherently; concerns emerging from different scoring scales are not the subject of this paper (Mellinas et al., 2016).Our analysis uses the standard 1-10 scale, and the TripAdvisor and Hoteles sample data were elementary linear transformed to this conventional scale.Figure 1 shows the histograms for the sample populations.
The rating information includes summaries of the extent to which reviewers enjoyed a product (scored on a discrete scale ranging from the lowest to the highest possible rating), the number of individuals who have evaluated the product and the average rating of all reviews.
However, when evaluating OCR, concerns may arise regarding the number of individual ratings used to derive an average value; evidently, average ratings that are based on many opinions are more robust information sources than those based on just a few (Hoffart et al., 2019).Therefore, researchers wish to determine the behaviour of the parameter that represents the average ratings, termed λ; to obtain an overall expected score for the OCRs.Also, for each OCR website, λ i represents the average ratings obtained from population i.In the present case, i ¼ 1; 2; 3 represent the OCR websites of Booking, TripAdvisor and Hoteles, respectively.Thus, each sample population is identified by the parameter λ i .
For a reliable estimation of λ, several non-trivial questions must be answered.First, can we assume that all three samples come from the same population?That is, is there homogeneity among the samples or should we assume there are different populations?If so, and secondly, which different populations are there and how can we pool them to obtain a fair estimation of λ?The possibility of between-OCR statistical heterogeneity is a factor of major significance, as misleading inferences will be drawn if the data configuration is erroneously managed.This is a non-trivial statistical problem presented when inference about a parameter is needed and information is available from heterogeneous sampling populations.The presence or otherwise of statistical heterogeneity is determined by testing the null hypothesis that the true expected scores for all OCR websites are identical, i.e., H 0 : In general, this problem is related to the homogeneity test for different statistical models and the multiple testing options considered when the null hypothesis of homogeneity is rejected.Various multiple testing procedures have been proposed, but all present concerns and difficulties, such as how to ensure the correct interpretation is made of different pÀ values for multiple comparison, or the best means of determining the power behaviour of the Pearson's χ 2 -statistic, among many others (Farcomeni, 2008).
We suggest that clustering the samples and then calculating the posterior probability of the cluster models provides an alternative method for measuring the between-OCR variation.This approach is based on the assumption that some statistical heterogeneity is present in each of the OCR websites considered, with an intensity ranging from (total) homogeneity to (total) heterogeneity.In order to accurately estimate the OCR presented, the intermediate levels between these extremes must also be evaluated.
In this paper, we address the problem of hypothesis testing for several population means.This question is addressed from a novel standpoint, under the assumption that it can be regarded as a clustering problem.Therefore, as with the original homogeneity problem, all possible data configurations can be tested using a Bayesian procedure from which the posterior probabilities of each cluster model are computed.The rest of this paper is organised as follows.In Section 2, we describe our proposed homogeneity test for several populations.The third section provides a realdataset example, illustrating the use and versatility of this approach.Section 4 then presents a Bayesian model averaging procedure to estimate the overall expected score parameter λ: Finally, Section 5 presents some concluding remarks.The Supplementary Materials section available on the github website provides Mathematica and R codes that can be downloaded.These codes were developed to make our approach readily accessible and to enable the reproducibility of the results obtained.

Methodology
In this section, we present the statistical model created to answer the issues raised above, in order to investigate statistical distributions that are identified by their expected value.In the context of OCR, Mariani and Borghi (2018) in their study of Booking.comonline reviews as a data source, found that not only is the distribution of hotel ratings left-skewed, but the characteristics of this distribution vary depending on the category of hotel.
Therefore, the discrete random variable that models the online score awarded by a customer must present these two features: it must be identified by its population mean and it must be clearly skewed.Bianchi and Heo (2021) suggested that a Poisson sampling distribution for the score X; would be appropriate 1 : We now consider a general approach with data obtained from k OCR websites.First, assume that we have kð� 2Þ independent Poisson observations with λ i ¼ n i δ i and where X i represents the observed score from OCR website i recorded over time or space of size n i : The problem of interest is then to test In other words, we wish to compare the rate of occurrence of δ i events.In practice, however, for a given population i, we have samples of size n i from independent Poðδ i Þ and by sufficiency need only consider S i ¼ ∑ n i j¼1 X ij , which follows a Poisson distribution with mean n i δ i ;.Therefore, the role of X i in (1) can be played by S i .For simplicity, we use the simplest notation in model (1), as follows.
Let x ¼ ðx 1 ; . . .; x k Þ be independent samples of k Poisson populations Poðλ 1 Þ; . . .; Poðλ k Þ with sample sizes n 1 ; . . .; n k .To cluster the samples, we follow the product partition model suggested by Barry and Hartigan (1992) and consider c ¼ ðc 1 ; . . .; c s Þ is any configuration of length s, with 1 � s � k, that is, c is a partition of sample x in s blocks (the above case study example corresponds to the particular case k ¼ 3).
We denote by t i ¼ ∑ n i j¼1 x ij the total number of counts in sample Clearly, the likelihood function will have different expressions for each cluster configuration of c samples.Each model generated by c indicates a different heterogeneity structure of the sampling model, and its posterior probability informs us about the uncertainty for this configuration.For instance, in the case study presented above, x 1 corresponds to data sampled from Booking.com, x 2 refers to TripAdvisor data and x 3 refers to Hoteles sample data.There is a one-to-one correspondence between every partition of the parametric space (and models) and the partition of the sample data.For instance, the null hypothesis of equal means λ 1 ¼ λ 2 ¼ λ 3 (homogeneity model, M 1 ) corresponds to the single data set fx 1 [ x 2 [ x 3 g; which we denote ð123Þ (i.e, the whole observed data can be considered from the same statistical population).At the other extreme, when all samples are mutually heterogeneous, the configuration ð1j2j3Þ (heterogeneity model, M 5 ) is equivalent to the multiple hypothesis λ 1 �λ 2 �λ 3 : In fact, when k ¼ 3 populations are compared, intermediate heterogeneity models correspond to the models: M 2 : λ 1 �λ 2 ¼ λ 3 ; in notation ð1j23Þ; M 3 equivalent to the configuration ð2j13Þ, i.e. λ 2 �λ 1 ¼ λ 3 , and M 4 : λ 3 �λ 1 ¼ λ 2 ; in notation ð3j12Þ: In general, the total number of possible partitions is given by the Bell number of order k (Rota, 1964).In our case study, the five partitions corresponding to k ¼ 3; are fð123Þ; ð1j23Þ; ð2j13Þ; ð3j12Þ; ð1j2j3Þg; where ð1j23Þ represents the same partition as ð32j1Þ: The comparison of Poisson means is then equivalent to a homogeneity test with null model and the parameters of the competing model (hypothesis) H 1c are denoted by λ 1 ; . . .; λ s ; in order to simplify the notation (bearing in mind that these parameters are different).
In the Bayesian approach, for each configuration structure in a set of candidate models, model uncertainty is quantified according to its posterior model probability, and Bayes factors play a crucial role by discriminating among competing models.Assaf and Tsionas (2018) highlighted the advantages of using Bayes factors for hypothesis testing, in contrast with pÀ values.The Bayes factor for comparing model H 0 vs H 1c is given by where f 0 ðxÞ and f 1 ðcÞ are the marginal distributions of the data x and c, respectively, given by and where πðλÞ is the prior distribution on the hyperparameters λ, and f 1 ðcjλÞ is as defined below, and it depends on the hierarchical model that relates the λ 1 ; . . .; λ s of partition c with the parameter λ of the null hypothesis.
In general, the best model (equivalently, configuration c) is the one that maximises the posterior probability of the models where PrðcÞ denotes the prior probability of configuration c and C k is the set of all cluster configurations given k samples.A common objective prior for hypothesis testing purposes is the discrete uniform prior PrðcÞ ¼ 1 jC k j ; for all c 2 C k ; when jC k j denotes the cardinal of the set C k ; i.e., the number of all different models considered.For instance, for the case study considered above, k ¼ 3 and jC 3 j ¼ 5:

The marginal distributions
The likelihood function of the data x under the null hypothesis is where symbol / indicates "proportional to".
For any configuration c of length s; the likelihood function is where t j and m j are the total number of counts and the length of vector of sample c s ; respectively.Observe that the constant of proportionality in (7) coincides with that in expression (6).
However, the likelihood function f 1 ðcjλÞ in (4) cannot be obtained using the information from x j in sample j; which is related to λ j but not to λ: Therefore, a distribution πðλ j jλÞ is needed to link the λ j and the λ parameters.
As in Bayesian multiple testing, a link distribution πðλ j jλÞ should present the following desirable properties (Scott & Berger, 2006): (I) it should be related to that of the statistical model f ðxjλÞ; (II) it should be centred around the parameter λ; and (III) its variance (or any other measure of dispersion if variance does not exist) should reflect tentative existing differences among the λ j : A simple and numerically well-conducted link distribution relating parameter λ j to parameter λ is the conjugate Gammaðλ j jλ; 1Þ; i.e.
which has mean Eðλ j jλÞ ¼ λ and variance V ðλ j jλÞ ¼ λ: This link distribution does not require the intensive use of numerical procedures to compute the quantities of interest and presents all of the desirable properties given above.
A priori, the λ 1 ; . . .; λ s parameters of any partition can be regarded as exchangeable, that is, they are conditionally independent given the parameter of the null model λ, according to the link distribution πðλ j jλÞ.Hence, the joint link distribution for any partition of length s is given by Thus, from ( 7)-( 9) (see the Appendix A for technical details), we obtain the likelihood function f 1 ðcjλÞ as Finally, to complete the integrals in (3) and ( 4) and following Bianchi and Heo (2021), we assign a Gamma prior distribution for the parameter of the null λ; where hyperparameters α and β are given.To compare data structures, a non-informative distribution is usually well suited.As is well known, this case occurs when α ¼ 1=2 and β ¼ 0; i.e.
Hence the marginal distributions in (3) and ( 4) are given by, and As the arbitrary constants associated with the improper prior are the same in Equations ( 13) and ( 14), they are cancelled out in the expression of the Bayes factor in (2), and are obtained by The Bayes factor in ( 15) is computed numerically for each sample c.

Posterior distributions of $\lambda_i$
The posterior behaviour of the λ i parameters of interest is also relevant once the clustering procedure is applied to the data, and can help us to compare Poisson populations.The posterior distribution for λ i (i ¼ 1; . . .; k) can be computed either by numerical integration or by Markov chain Monte Carlo (MCMC) simulation.The MCMC implementation to find the posterior distributions is particularly easy to code and is preferred (see Appendix B for technical details).

Results
To illustrate the proposed procedure, we now analyse data from the case study above.The Mathematica and R codes for this case study are available in the Supplementary Materials section (github link).
The values in Table 2 are obtained from a uniform prior assigned to each configuration which assigns the same probability to every model, i.e., PrðcÞ ¼ 1 5 ; "c 2 C 3 : By comparison with the frequentist approach, the proposed Bayesian clustering procedure generates a much more detailed and accurate output with which to compare populations.As shown in Table 2, the most probable configuration corresponds to the model M 2 , with a posterior probability of 0:57, in which two of the samples considered are homogeneous fTripAdvisor; Hotelesg while the first, fBookingg, is a different sampling population.However, the structure of the clustering partition induced by the dataset in Table 2 indicates that different forms of heterogeneity could be present.
The posterior probability of the homogeneity partition (model M 1 ) is high but much lower than that of model M 2 .Furthermore, the configuration which represents the hypothesis that all samples are heterogeneous, or equivalently λ 1 �λ 2 �λ 3 (model M 5 ) has a small posterior probability of 0:07, meaning that full heterogeneity is clearly rejected.Observe that the common frequentist

Posterior probability
Homogeneity

Intermediate Heterogeneity
M 2 ð1j23Þ λ 1 �λ 2 ¼ λ 3 0.57 λ 1 �λ 2 �λ 3 0.07 configurations (homogeneity vs. heterogeneity) only represent 29% of the probability while the remaining 71% is concentrated in the intermediate heterogeneity cluster.This way of handling uncertainty is very appropriate for obtaining fair estimates of λ.In the frequentist approach, whether the samples are homogeneous or heterogeneous, any failure to consider intermediate situations would omit the highest probability cluster and produce misleading inferences.
By summing the probabilities of all the configurations in which each sample appears as a single cluster, we obtain their marginal probabilities.Thus, we can determine which population is most different from the others.In our example, population 1 (Booking) is the most different, with a probability of 0:64.This supports the finding for the configuration shown in model M 2 (the hypothesis λ 1 �λ 2 ¼ λ 3 is true with a probability of 0:57).The remaining marginal probabilities are 0:12 and 0:16 for populations 2 (TripAdvisor) and 3 (Hoteles), respectively.

Posterior distributions of λ i
As commented above, given the hierarchical structure of the link distribution πðλ i jλÞ a MCMC implementation to find the posterior distributions can be coded straightforwardly.In our study, a R code was performed using JAGS (Just Another Gibbs Sampler) through the rjags package (Plummer et al., 2016).For more details, see the Supplementary Material section, showing the codes used for posterior distributions and convergence diagnostics of the corresponding chains generated.
Figure 2 shows the posterior densities of the three Poisson parameters obtained from MCMC simulations.These densities overlap in a non-negligible interval, which indicates that the hypothesis that TripAdvisor and Hoteles are analogous populations cannot be rejected.The values of the three parameters are close to each other (see Table 3) except perhaps the parameter corresponding to the first sample λ 1 ; thus confirming the relatively high probability of marginal x 1 being obtained before (0:64).Table 3 shows the (posterior) mean and median estimations obtained, together with the 95% highest posterior density (HPD) intervals for the λ i parameters.2016) measured (positive) differences between Booking and Priceline systems, when testing for the existence of scoring differences between these systems (the null hypothesis), but with their frequentist procedure we can only accept or reject the null hypothesis based on its pÀ value, which, as we know, does not represent the chance that it is true (Assaf & Tsionas, 2018).The proposed Bayesian procedure provides a simple and intuitive solution to this concern, testing inequality hypotheses on the parameters as follows.Let us test the null hypothesis H 0 : λ 1 >λ 2 > . . .>λ s vs H 1 : the ordering of the parameters is any other different permutation, with 1 � s � k:

Mellinas et al. (
Under the hierarchical model considered, we need to compute the posterior probability of the null space Λ 0 ¼ ðλ 1 ; . . .; λ s Þ : λ 1 >λ 2 > . . .>λ s f g, which is found to be where n ¼ ðn 1 ; . . .; n k Þ and t ¼ ðt 1 ; . . .; t k Þ: MCMC methods are useful here, as they take advantage of the posterior conditional independence of the λ j given λ, i.e., simulating first from the posterior of the hyperparameter λ, and then from Gamma distributions. For instance, using the data of the motivating example, the (posterior) probability of λ 1 <λ 2 is 0:98.That is, there exists a probability of 0:98 that the average score from the Booking population was lower than that from the TripAdvisor population.Similarly, the posterior probability of λ 1 <λ 3 is 0:99.

Drawing inferences on the overall score parameter λ: a BMA proposal
As shown in Table 2, there are five heterogeneity configurations, and choosing any one of these may overlook model selection uncertainty and misrepresent parameter uncertainty.To overcome this problem, we propose the use of Bayesian model averaging (BMA) techniques.Thus, instead of selecting one model and drawing inferences from it, we use weighted averages over the set of models to obtain the expectations and quantities of interest.
The BMA approach to drawing inferences on parameter λ involves averaging over all models.For instance, the posterior distribution for the observed data is given by  that is, the average of the posterior distributions under each of the models considered, weighted by its posterior model probability.The posterior probability for the model M s can be obtained by: where is the integrated likelihood of the model M s ; λ s is the vector of parameters of model M s , Prðλ s jM s Þ is the prior density of λ s under model M s ; Prðdatajλ s ; M s Þ is the likelihood and PrðM s Þ is the prior probability of model M s : Inference on the parameter λ is also affected by the weights applied in expression (17).In particular, the posterior mean for λ is the weighted average of the posterior means in each model:

Results (continued)
Returning to the dataset considered in the motivating example, from Table 2 we have different posterior probabilities for the cluster structures, and these probabilities are incorporated in the BMA of the overall average λ score.All the measures of interest can be obtained from the BMA posterior distribution in (17).Table 4 shows the posterior summaries of the λ score under each heterogeneity configuration, together with the proposed BMA.
It is interesting to note the following.On the one hand, under the classical homogeneity partition, which assumes that all samples are obtained from the same population, the parameter of interest λ would be underestimated (see Table 4 for the estimate corresponding to model M 1 ) and the uncertainty reflected by the credible interval is significantly larger.In contrast, the model with a greater likelihood, M 2 , provides more accurate estimates with a significantly narrower credibility range than the prior model.Finally, the BMA model accumulates all the uncertainties associated with each of the potential models and combines them according to their probabilities of being true, as shown in Table 2.

Concluding remarks
In this paper, we employ a straightforward Bayesian approach to address the issue of comparing multiple populations of OCR scores.To determine the necessary Bayes factors for comparing each alternative with the baseline homogeneity model, we adopt a hierarchical model that assesses the homogeneity of the Poisson means.Bayesian practitioners base judgments on the posterior probability of all involved models, in contrast to the often inconclusive pÀ values used in frequentist approaches.This advantage allows Bayesian methods to quantify evidence supporting the null hypothesis of homogeneity and to simultaneously test multiple hypotheses-an area where frequentist methods fall short.The Bayesian approach bypasses these limitations, eliminating the need for ad-hoc thresholds, which can be essential in cases of small sample sizes (k), low parameter estimates (λ i ), or varying sample sizes.The proposed Bayesian method is applicable in each of these scenarios.
Our paper presents a Bayesian clustering application involving three key online rating platforms.Extending this method to more platforms is feasible, though scalability might raise computational issues beyond 15 platforms due to the considerable number of clusters to be examined, as indicated by Bell's number (Rota, 1964).Fortunately, Bign´e et al. (2020) recently reviewed numerous published studies analyzing eWOM ratings, with no more than 15 platforms included.Similar to Bign´e et al. (2020), which considered eight platforms, our paper offers implications for managers and analysts-emphasizing the importance of focusing on true overall average scores to optimize products and services, rather than excessive analysis of minor online rating fluctuations.Using a real-world case study to demonstrate the Bayesian model's practicality reveals that our methods could deliver considerable results in online consumer reviews.This paper could be considered a further step towards this final objective.
Beyond the work by A. G. Assaf and Tsionas (2019), who applied Bayesian Model Averaging (BMA) to estimate dynamic panel data models in tourism research, few studies in this field have utilized the BMA procedure.We propose BMA as an intuitively appealing approach to addressing uncertainty in heterogeneity.This procedure facilitates inference generation about the true overall average λ score by averaging heterogeneity models derived from clustering observed data.Furthermore, our proposed Bayesian model offers a natural method to determine whether one sample population's ratings surpass those of others.The clustering procedure described accounts for all heterogeneity structure uncertainties and identifies each feasible cluster model.This characteristic may also serve as a valuable starting point for exploring sources of heterogeneity before conducting simple model averaging, akin to Mellinas et al. (2016).
where Gammaðλ i jt i þ λ; n i þ 1Þ denotes the Gamma distribution of the corresponding parameters.
Therefore, the posterior distribution for λ i (i ¼ 1; . . .; k) can be computed either by numerical integration or by Markov chain Monte Carlo (MCMC) simulation.The MCMC implementation to find the posterior distributions is particularly easy to code and is preferred.
Figure 1.Feedback histograms for the three OCR websites for hotel Santa Catalina.

Figure 3
Figure3shows all the posterior distributions including the BMA posterior density of λ.It is apparent that the BMA posterior distribution of λ represents a compromise between the posterior distributions Figure Posterior distribution of λ for each heterogeneity configuration (models M i ; i ¼ 1; . . .; 5; in dashed grey lines) and the corresponding posterior BMA distribution (continuous black line).

Table 4 . Posterior summaries of λ under each model
M i ; i ¼ 1; . . .; 5 and the proposed BMA each heterogeneity model in which the weights from models M 2 and M 1 are greatest.As expected, this is reflected by a left-skewed posterior distribution, with a mode around λ ¼ 9. under