Optimized control charts using indifference regions

Abstract In statistical process monitoring, the CUSUM and EWMA control charts have received considerable attention because of their remarkable ability to detect small sustained shifts. In practice, small process variation and shifts are anticipated beforehand in many processes, so the focus should be on detecting a moderate to a large shift. The aforementioned charts identify minor changes in population parameters as out-of-control scenarios; thus, “small” and potentially practically insignificant shifts are producing signals. To counteract this, both charts are amended to accommodate an indifference region by optimizing the detection of a shift at the outer boundaries of the indifference region. The results show that the adapted CUSUM and EWMA monitoring schemes yield comparable results. On nearly all occasions, the CUSUM chart outperforms the EWMA chart, yet the EWMA chart seems more robust and is easier to interpret. Furthermore, we provide two practical examples to illustrate the use-case of optimized charts to mitigate small (unimportant) variations, such as seasonality and modest temporary shifts. Overall, this work provides a general approach tailored to practice in quality control, e.g., as prescribed by ISO standards. It also answers a recent call in statistical process monitoring literature to reconsider the design of control charts.


Introduction
Control charts are commonly used tools in statistical process monitoring (SPM).These charts aim to detect changes (e.g., shifts) or incidents in processes as a result of special causes.These changes are often evaluated in terms of shifts from some in-control process parameter, such as the mean or standard deviation.The aim is to detect such a shift as quickly as possible, which has been the starting point for a rich literature proposing various methods.For an overview of current directions in theory and applications of SPM, we refer to Woodall and Montgomery (2014).
The common practice for designing a control chart is to start with defining or modeling the in-control process behavior.For example, assuming normally distributed data, this can be done by estimating the in-control mean l 0 and standard deviation r 0 .When monitoring the mean of a variable, the process is typically considered out-of-control in the literature when the current process mean l ¼ l 0 þ dr 0 = ffiffiffi n p is unequal to l 0 (equivalently d 6 ¼ 0), where d represents the shift size and n, is the subgroup size.Control charts are typically designed to satisfy a specified false alarm rate (average run length) as long as the process is in control, which we denote as FAR 0 (ARL 0 ).Depending on the type of control chart, there may be options to tune the out-of-control performance to detect shifts of a specified size d more quickly.However, for any value of l other than l 0 , the process is usually considered out of control.
Recently, it has been questioned to reconsider this paradigm as it might misfit practice, for example, when applied in industry (Woodall and Faltin 2019).In their discussions, the authors appeal to amend the control chart design by using an in-control region in which a slight shift should not always be considered an out-of-control situation.They provide examples and explanations of when and why this approach would be beneficial.
The approach of using an indifference region has also received attention in the past, as can be found in Ewan and Kemp (1960); Freund (1957Freund ( , 1960)); Woodall (1985Woodall ( , 1986)); Yashchin (1985Yashchin ( , 2018)), amongst others.The returning underlying motivation is that many assignable causes may lead to real but small process changes that may not be important to practice.For example, many processes contain minor seasonal influences or other day-to-day variations (e.g., different operators) that cause small (temporal) shifts in the process parameters.These effects may not be part of, or go unnoticed in a Phase I study.For example, this can occur when the considered time period is too short to incorporate seasonal patterns or when an estimator cannot completely capture this information.These issues can also be caused by overdispersion, as discussed in Goedhart and Woodall (2022).Furthermore, suppose the variation is small relative to the process specification limits and is temporal (such as for seasonal variation).In that case, it may be desirable to have an approach that can disregard these types of shifts that normally would cause signals.Acting upon these irrelevant signals even aggravates process variation-a phenomenon that is called overadjustment.
Related to this framework are the complementary concepts of statistical and practical significance in control charts.A statistically significant result does not necessarily imply practical importance.For similar discussions in a hypothesis framework, see for example Wasserstein and Lazar (2016), Snee and Hoerl (2018), Woodall and Faltin (2019), and Blume et al. (2019).The consideration of using indifference regions also resonates with the international standard as prescribed by the International Organization for Standardization (ISO 2020).In this specific standard, the use of acceptance control charts (Freund 1957) is recommended, which exhibits a region wherein the user is indifferent about the chart's performance.An illustration of this is provided in Figure 1.In the figure, three regions centered around the target level are distinguished: 1.The rejectable (out-of-control) region, which is outside the upper and lower Rejectable Process Limit (R PL ). 2. The indifference region, which is between the Acceptable Process Limit (A PL ) and the R PL .3. The acceptable (in-control) region, which is between the upper and lower A PL .Within this region, the target or l 0 (center line) is indicated by the "4" in the middle.
Note that the displayed regions relate to limits for individual observations.However, as mentioned in the standard (ISO 2020), the focus is on process acceptability rather than product disposition, and the limits are to be determined accordingly.The consequence is that the in-control situation is no longer considered a single value of the process metric of interest but an interval.This is the essence of the three-region approach discussed by Woodall and Faltin (2019).Such an approach qualifies for situations where the variation in the process can be tolerated to a larger extent, for example, when the specification limits are much wider than typical process variation.For this purpose, Woodall and Faltin (2019) illustrate how this paradigm can be accommodated by a cumulative sum (CUSUM) chart.
Performance comparisons of various control charts in different regimes have been fertile research ground, e.g., Srivastava and Wu (1993); Hawkins and Wu (2014); Diko, Goedhart, and Does (2020).As an extension of this line of research, we demonstrate how the CUSUM, EWMA, and Shewhart control charts can be designed and optimized using indifference regions and compare their performance for various scenarios.
The paper is organized as follows.In Section 2, we outline the control charts considered, the objectives, and the notation.In Section 3, we evaluate and compare the performance and robustness of the optimized control charts for various indifference regions in different scenarios.Next, a discussion is provided in Section 4, followed by two examples in Section 5 that demonstrate its value to practice.Finally, we conclude in Section 6.

Definitions and monitoring schemes
In this section, we outline the Shewhart, CUSUM, and EWMA control charts and submit them to the indifference region approach.We consider two-sided Phase II control charts to monitor a process for which observations X ij are obtained each time period i in subgroups of size n, with j ¼ 1, :::, n: The X ij are assumed to follow a normal distribution with mean l 0 þ dr 0 = ffiffiffi n p and standard deviation r 0 , where d represents the (standardized) size of a possible shift, and where l 0 and r 0 are assumed to be known following a Phase I study.We refer to Jones-Farmer et al. (2014) for an overview of Phase I analysis for process improvement and control.Note that d ¼ 0 means that the process is typically considered in-control.
Throughout we consider the standardized variables Note that Z i follows a standard normal distribution for d ¼ 0 and a Nðd, 1Þ distribution for d 6 ¼ 0: If the in-control X ij are not normally distributed, the well-known Box-Cox approach (for a comprehensive overview, see Sakia (1992)), or the approach of Chou, Polansky, and Mason (1998) using a Johnson transformation can be used to achieve (approximate) normality of these data.Evidently, transformations should be used with care; see, for example, the work by Khakifirooz, Tercero-G omez, and Woodall (2021), who show that outliers can be masked in Phase I if transformations are used.Santiago and Smith (2013) showed clear shortcomings when using a transformation to normality when data stems from the exponential distribution.

Indifference regions and optimization framework
In Phase II, the monitoring phase, the process mean equals l ¼ l 0 þ dr 0 = ffiffiffi n p : In most applications, the process is considered out-of-control for any value of d 6 ¼ 0, which is different when using indifference regions.In particular, when considering the use of an in-control region as discussed in Woodall and Faltin (2019) and in the ISO standard (see Figure 1), a region is determined for which the process is considered in-control, even when l 6 ¼ l 0 : In the case of a two-sided control chart for detecting an increase or decrease in the process mean, the indifference region approach could be described as follows: The in-control region, 0 jdj d 0 , where d 0 is the acceptable (standardized) shift size (worst acceptable).The indifference region: d 0 < jdj d 1 , with d 1 the shift size desired to be detected quickly (best unacceptable).
The out-of-control region: jdj > d 1 : Comparing this set-up to that of Figure 1, then we observe that d So, both d 0 and d 1 are to be determined based on the practitioner's knowledge of which shifts are practical significance in the process.After this, choices need to be made on the desired properties of the control chart.Woodall and Faltin (2019) implement this three-region approach by adjusting the CUSUM chart to match the performance of the classical Shewhart control chart with standard limits at a given level d ¼ d 0 : In this paper, we adapt this approach by adjusting the control charts to match a pre-specified in-control value ARL 0 when d ¼ d 0 : Then, we optimize the outof-control performance to detect shifts of size d ¼ d 1 as quickly as possible.These two goals set the parameters of the CUSUM and EWMA control charts.Note that the Shewhart control chart depends on only one parameter, such that the value of ARL 0 dictates both the in-control and out-of-control performance directly; there is no optimization step.

Shewhart control chart
The standard Shewhart control chart comprises an upper control limit (UCL) and a lower control limit (LCL).It depends on a single parameter (the control limit constant) which we denote as L S .The charting statistic for the Shewhart control chart equals Z i , and the UCL and LCL equal L S and ÀL S , respectively.A signal is provided if the charting statistic is above the UCL or below the LCL.A common choice for the control limit constant is L S ¼ 3, which yields a false alarm rate (FAR 0 ) of 0.0027 or, equivalently, an in-control average run length (ARL 0 ) of 370.4.Since ARL 0 ¼ 1=FAR 0 for the Shewhart control chart, the required value of L S when using indifference regions as described in Section 2.1 is found by solving the following equation: where UðXÞ is the standard normal cumulative distribution function, and where Z i $ Nðd 0 , 1Þ:

The CUSUM control chart
The CUSUM control chart was originally introduced by Page (1954).In this paper, we consider the standardized CUSUM chart as described by Montgomery (2020), which consists of two charting statistics, namely: where , where L C represents the control limit constant for the CUSUM chart.
The CUSUM chart is thus based on two parameters, the reference value k and the control limit constant L C .The optimal reference value k to detect a shift of size d 1 has been proven to be k ¼ d 1 =2, see for example Woodall (1986); Woodall and Adams (1993).The value of L C can then be chosen to provide a specified value of ARL 0 : When considering the indifference region approach, note that an increasing shift toward d ¼ d 1 can be viewed as a shift of size ðd 1 À d 0 Þ from the target value d 0 , for which the optimal value of k would be k ¼ ðd 1 À d 0 Þ=2: The same holds, of course, for a decreasing shift from the target value Àd 0 to Àd 1 : This was also pointed out by Woodall and Faltin (2019), and can be incorporated as a target value in Equation (2) as follows: Since , the optimal CUSUM chart when using our indifference region approach is equivalent to using k ¼ ðd 1 þ d 0 Þ=2 in the standard CUSUM of Equation (2).The parameter L C can then be tuned to match the specified ARL 0 for d ¼ d 0 , which can be found using the approaches implemented in R by Knoth (2014).

The EWMA control chart
The EWMA control chart was introduced by Roberts (1959) as an alternative to the standard Shewhart control chart to be more capable of detecting small shifts.To do so, the charting statistic of the EWMA control chart (denoted as Y i ) takes a weighted average of the new observation and past performance and can be denoted as follows: where 0 < k < 1 represents the weight given to the most recent observation, and where Y 0 ¼ 0: A signal is given when where L E represents the control chart constant for the EWMA control chart.
Two parameters have to be chosen for the design of an EWMA monitoring scheme, namely k and L E .Note that when k ¼ 1, the EWMA chart is equivalent to the Shewhart control chart; for more details, see the seminal work by Lucas and Saccucci (1990).Typically, smaller values of k are used when focusing on detecting minor shifts.However, contrary to the CUSUM control chart, there is no closed-form solution for the optimal value for either of the two parameters of the EWMA control chart.Montgomery (2020) found that values for k in the interval 0:05 k 0:25 work well in practice, while also mentioning that values 0.05 and 0.1 with appropriate control limits perform very well for both normal and non-normal distributions.
An optimization variant of the EWMA chart has been studied in Aparisi and Garc ıa-D ıaz (2007).First, they set several criteria-either predetermined or found via using a Taguchi loss function-on the incontrol performance, possibly using an asymmetric indifference region.Next, they employ a genetic algorithm to optimize the EWMA chart while meeting the criteria at the outer ends of the in-control region.Our approach does not require the use of such a metaheuristic.It directly searches for the quickest detection of (out-of-control) shifts of a particular size d 1 while ensuring a specific in-control performance ARL 0 at d 0 , which determines the two parameters k and L E .To do so, similar to Hawkins and Wu (2014), we apply a grid search to determine the optimal combination of parameters for the chosen settings of ARL 0 , d 0 , and d 1 .We use the approaches implemented in R by Knoth (2014) for this part.Furthermore, to make the comparison with the CUSUM control chart fairer, we henceforth take the steady-state limits that are obtained when i ! 1 in Equation ( 5), which means a signal is only produced if Y i j j > L E ffiffiffiffiffiffi k 2Àk q : 3. Comparing Shewhart, CUSUM and EWMA control charts In the previous section, we outlined the methodology for adapting the Shewhart control chart and optimizing the EWMA and CUSUM monitoring schemes to incorporate the indifference region approach.Under these optimized settings for the different control charting schemes, we again use the routines provided by Knoth (2014) in R to compute the corresponding ARLs.We compare the various monitoring schemes in an extensive head-to-head comparison, considering the ARLs to detect an out-of-control shift.We also provide and discuss the optimal parameter values for the control chart designs.Subsequently, we will consider some robustness checks: zero-state versus steady-state performance in Section 3.2; dependency of the approach on Phase I estimates in Section 3.3; and finally, the impact of non-normality in Section 3.4.

Performance evaluation
For our evaluation we start with d 0 equal to 0 (only an in-control point), and also consider d 0 equal to 0.5, 1, and 2. For the out-of-control situations, we consider a tiny ( All charts are adjusted to yield a pre-specified value of ARL 0 when d ¼ d 0 , where we considered ARL 0 ¼ 100 and ARL 0 ¼ 500 to use a smaller and larger value for comparison purposes.
In Tables 1 and 2, we list the parameter values obtained from the calculations described in Section 2 for the three control charts considered for ARL 0 ¼ 100 and ARL 0 ¼ 500, respectively.For the Shewhart chart, a higher value of d 0 obviously leads to a larger value of L S .For the CUSUM chart, a larger value of d 0 comes with a larger value of k but a lower L C , rapidly decreasing when d 1 increases.Moreover, the L C parameter is hardly affected by an increase in d 0 when d 0 !0:5: To be specific, we observe similar patterns in the L C values of d 0 with equal size of the indifference region d 1 À d 0 ; e.g., L C ¼ 4:419, 4.418, and 4.418 for pairs ðd 0 , d 1 Þ ¼ ð0:5, 1Þ, ð1, 1:5Þ, and ð2, 2:5Þ, respectively.
To provide some intuition on this pattern, note that a unit increase of d 0 when d 0 is already large increases the limits of the Shewhart with the same size.For example, considering the cases d 0 ¼ 1 and d 0 ¼ 2, the values of L S are 3.327 and 4.326, respectively.For the CUSUM chart, this change is incorporated in the value k.For example, when d 0 increases from 1 to 2, k jumps by 1 while the L C values remain the same.This is because the charts essentially become one-sided for large jd 0 j: Focusing on the EWMA control chart, the optimal parameter patterns are more convoluted because of Equation ( 5).Larger d 0 values result in larger values of k and L E .The L E values increase in d 1 when d 0 ¼ 0, whereas when d 0 6 ¼ 0, L E decreases if d 1 increases.Finally, when d 1 increases, the EWMA control chart's parameters converge to its Shewhart counterpart, i.e., k close to 1 and L E ¼ L S , echoing the literature that has acknowledged the excellent capability of a Shewhart control chart to detect large shifts.
From a computational point of view, note that all optimizations were performed by relying on a standard grid search of ARL-computation functions of Knoth (2014).However, we found that a small modification was needed in two settings.To be precise, in the case of d 0 ¼ 2 and d 1 ¼ 2:5 for ARL 0 ¼ 100 and ARL 0 ¼ 500 the grid search had to be adjusted, bounded from below to circumvent it to generate unrealistic parameter combinations or errors.The bounds can easily be retrieved from neighboring settings.Second, in these cases, we also had to increase the number of nodes for the Gauss-Legendre quadrature underlying the Nystroem method to solve the related ARL integral equation; in these cases, they were increased to 120, whereas in all other experiments, 40 were already sufficient.
In the Tables 3, 4, 5, and 6 the ARL values are obtained by using the control chart schemes as proposed in this paper.They are generated by setting ARL 0 ¼ 500 at various d 0 levels (d 0 equal to 0, 0.5, 1, and 2, respectively).For clarity, the settings used to optimize the control charting schemes (i.e., the required ARL 0 at d 0 and the detection of a shift of size d 1 ) are displayed in bold type in reported performance across all tables.Note also that the choice of d 1 does not impact the Shewhart chart, as it has only one parameter (L S ).Note that the performance difference between the optimized CUSUM and the EWMA control charts is typically small, and a CUSUM control chart generally outperforms the EWMA chart.
Specifically, in Table 3 (d 0 ¼ 0), we observe that the EWMA control chart yields better performance when d 1 ¼ 0:5: But for d 1 ¼ 1, its performance of detecting a larger shift size deteriorates as reflected by the out-of-control ARL scores to supersede the CUSUM counterparts.For d 0 > 0, as exhibited by Tables 4, 5, and 6, the CUSUM control chart outperforms the EWMA chart uniformly, both for which they are designed to detect (the values in bold type) and the even larger (out-of-control) shift sizes.However, considering the in-control performance (d < d 0 ), the EWMA chart is favored because of the higher ARLs when the process is in control.The results when evaluating the same scenarios for ARL 0 ¼ 100 using the settings of Table 1 are consistent with the ones presented in the tables for ARL 0 ¼ 500; therefore, these experiments are not included.

Steady-State performance
Concerning the latter, in Table 7, we evaluated the incontrol ARLs of the corresponding charts in the steady state.By design, these ARLs should have a zero-state average run length near either ARL 0 ¼ 100 or ARL 0 ¼ 500: Except in the cases of no acceptable region and the interest for small shifts (d 0 ¼ 0 and d 1 ¼ 0:5 or 1) the differences are negligible.Furthermore, when either d 0 or d 1 increase, the steady-state and zero-state ARLs come closer together for both control charting schemes.

Dependency on Phase I estimates
Another critical concern is related to our procedure of optimizing the CUSUM or EWMA control chart.In our optimization attempts, we find the parameters that result in a specific performance on a particular shift (l ¼ l 0 þ d 0 r 0 ) while it is optimized to detect another, higher shift (l 0 þ d 1 r 0 ); both are in terms of several standard deviations r 0 of the mean l 0 : Thus, the framework presumes knowledge of l 0 and r 0 , whereas in practice the true values are hardly available.There are different approaches possible to deal with this, where we will focus on two.One approach is to set pre-determined values of d 0 and d 1 , and adjust the three regions based on the Phase I estimates.This is done in Section 3.3.1.Another approach is to use more practically informed limits where the three regions are fixed and where d 0 and d 1 are determined after Phase I estimation.This will be studied in Section 3.3.2.

Fixed design parameters (d 0 and d 1 )
Naturally, in a Phase I study, l 0 and r 0 are estimated.So, for determined values d 0 and d 1 the uncertainty in the parameters affects the position of the indifference region and corresponding limits.As an illustration, consider without loss of generality a standard normal process (i.e., l 0 ¼ 0 and r 0 ¼ 1) with d 0 ¼ 1 and d 1 ¼ 3: In Figure 2, we illustrate these values in the case of known parameters on the left-hand side.In practice, we have to adapt our procedures to account for the process to be estimated at a different level, using l0 , and to account for a different variation r0 : For example, while an actual mean shift of d 0 standard deviations would be a Phase II mean of l ¼ d 0 , our estimate of this shift size would be a mean of l ¼ d 0 À l0 : More importantly, it also entails that the performance for shifts of d and Àd is no longer identical.Therefore, in the case of estimated parameters, we evaluate the control chart performance for the shift furthest away from our estimated mean.
For implementation in R (with random seed 1), we evaluate the performance using estimated parameters by generating samples of sizes m from a N ð0, 1Þ Table 3. ARL values for ARL 0 ¼ 500 and d 0 ¼ 0: distribution.We use the sample average ( X) and standard deviation (s) to estimate l0 ¼ X and r0 ¼ s=c 4 ðmÞ: Note that the constant c 4 ðmÞ is often used to obtain an unbiased estimate of r 0 .Next, note that the distance between the original indifference lines (d 0 and Àd 0 ) to the center line of the estimated control charts is equal to jl 0 À d 0 j and jl 0 þ d 0 j, which are not identical (unless l0 equals 0).The maximum distance is thus equal to jl 0 j þ d 0 : Since part of the objective is to constitute a specified ARL (ARL 0 ) for a shift of size d 0 on either side, we choose to use the maximum distance as shift size in our evaluation.The ARL on the other side (smallest distance) will be larger, so the considered ARL values can be viewed as the minimum ARL for shifts d 0 or Àd 0 when parameters are estimated.
In addition, the estimation of r 0 needs to be incorporated; remark the larger bandwidth (r 0 > r 0 ) on Table 4. ARL values for ARL 0 ¼ 500 and d 0 ¼ 0:5: Table 5. ARL values for ARL 0 ¼ 500 and d 0 ¼ 1: Table 6.ARL values for ARL 0 ¼ 500 and d 0 ¼ 2: the right side of Figure 2.For the three charts under consideration, the following adjustments are made (cf.Section 2, where r 0 1): LS ¼ L S r0 for the Shewhart chart; LC ¼ L C r0 , and k ¼ kr 0 ¼ d 0 þd 1 2 r0 for the CUSUM chart; and LE ¼ L E r0 for the EWMA chart.
We evaluate the ARL performance conditional on Phase I estimates using these adjusted parameters.For several Phase I sample sizes m, we generate 10,000 simulated samples, which are used to obtain 10,000 conditional ARL values.In addition, the scenarios (in terms of d 0 and d 1 ) are varied.The results are summarized in Table 8, which reports the average conditional ARL (AARL) and the standard deviation of the conditional ARLs (SDARL) for each setting.
In Table 8, we vary the sample size m and study the dependency of our framework on Phase I estimates by reporting the AARL and SDARL values.Indeed, when m is low, we find that the ARLs are more volatile, as SDARLs are larger and AARLs vary more for different values of d 0 .As expected, when m increases, the values become more accurate, and the SDARL decreases.Moreover, the EWMA chart seems to mitigate the Phase I uncertainty better than the CUSUM control chart, as observed by the lower SDARLs values.A possible explanation for the worse CUSUM chart performance is that the uncertainty around r 0 is directly affecting both the control limits (via LC ) and the charting statistic (via k), whereas for the EWMA and Shewhart control charts it only affects the control limit-LE and LS , respectively-but not the charting statistic.Zooming in on their performances, we find that the EWMA chart is more robust when d 0 is small, i.e., below 1.
Still, we observe that for higher d 0 values, i.e., the values at which we fix the ARL 0 performance (here to 100), the problem is more persistent.So, using these Table 7.Average steady-state run lengths of the in-control (zero-state) CUSUM and EWMA control charts, wherein the ARL 0 ¼ 100 and ARL 0 ¼ 500 are set for the d 0 and are optimized for detection of a shift of size d 1 , see Tables 1 and 2).methods only in cases with sufficient data is advisable to ensure sufficiently accurate estimates of l 0 and r 0 , say around m ¼ 500 observations when d 0 ¼ 2: This might feel restrictive, but considering the use-case of an (optimized) indifference region, one likely has extensive track records when considering such an approach.
As an additional analysis of the effects of parameter estimation, we evaluate the out-of-control AARL and SDARL for one specific set of parameters (ARL 0 ¼ 100, d 0 ¼ 1, and d 1 ¼ 2).These results are shown in Table 9, where we also included the respective in-control (d ¼ 1, cf.Table 8) AARL and SDARL as well as the case of known parameters m ¼ 1: We observe that the differences in AARL for varying sample sizes are small, as all AARL values are comparable to the case of m ¼ 1: The main difference is in the SDARL values.As anticipated, the SDARL decreases when the sample or shift size increases.

Fixed indifference regions
So far, we studied the effect of Phase I estimates by adjusting the indifference regions using these estimates.It demonstrates the impact of estimation error when we have pre-determined values for d 0 and d 1 .This helps to understand how Phase I uncertainty impacts the performance and therefore helps the reader decide their d 0 and d 1 when parameters are estimated.
Note that in practice, it might be wise for a practitioner to compensate a larger estimate of r 0 by choosing a smaller value of d 0 .An alternative angle is thus to keep a fixed indifference region, where the choice of d 0 and d 1 are dependent on the Phase I estimates.This approach is outlined in Figure 3, where taking Phase I uncertainty into consideration leads to estimating d0 and d1 to ensure the same indifference region.So, after measuring l0 and r0 we establish d0 ¼ d 0 r 0 r0 and d1 ¼ d 1 r 0 r0 : In the example displayed in Figure 3, the upper limit of the indifference region is equal to l 0 þ 3, which results in d1 ¼ 3=r 0 : Similarly, we would find d0 ¼ 1=r 0 : Next, in a similar vein as in Section 3.3.1,we compute the AARL and SDARL using this approach.The corresponding tables, Table 10 for in-control and Table 11 for out-of-control performances, show similar patterns as the study that has been carried out in the previous section.In fact, we find that keeping the indifference regions fixed, by choosing suitable d0 and d1 , counteracts the Phase I uncertainty to some extent: a high/low r0 will decrease/increase d0 and d1 , yielding lower SDARL values in both tables.The out-of-control performance also improves, as can be observed by comparing Table 11 to Table 9.The explanation is that by fixing the indifference region, you already incorporate some prior knowledge about the process in the determination of the control limits.

Impact of Non-Normality
In addition to parameter estimation, many other aspects could be taken into account, such as the performance of the optimized charts under different data distributions.The EWMA control chart is known to be more robust to non-normality than the Shewhart control chart; see, for example, Borror, Montgomery, and Runger (1999).To assess the impact of non-normality on optimized control charts, we expand our experiments to cover some settings with skewed and heavy-tailed data distributions.The package SPC does not cover non-normal settings, so we resort to the well-established Markov chain approach.This approach is introduced to assess the performance of  the CUSUM control chart in Brook andEvans (1972), and, for example, used in Borror, Montgomery, andRunger (1999) to assess the robustness of the EWMA chart.
The non-normal distributions considered are the Gamma distribution and Student's t-distribution.It is known that for a Gamma distribution with parameters a (scale) and b (rate) the skewness is given by 2 ffiffi a p , while for the symmetric t-distribution the excess kurtosis equals 6 4À with the degrees of freedom.For comparison purposes, we added the normal distribution to the experiments in Table 12 as the middle case.As a result, moving upward from the normal distribution, the distribution becomes heavier tailed, while in the downward direction, it becomes more skewed.As a final note, the parameters for the Shewhart, CUSUM, and EWMA charts at each combination of d 0 and d 1 follow from Table 2 (ARL 0 ¼ 500).
Studying Table 12 in more detail, we find that the performance of the optimized control charts degrades quickly when deviating from normality, with the Shewhart chart showing the worst scores, in line with Borror, Montgomery, and Runger (1999) for nonoptimized EWMA charts.Here, for d 0 ¼ 1 or 2, and d 1 ¼ d 0 þ 2-two boldfaced blocks-the EWMA and CUSUM charts show comparable ARLs under nonnormality.Between these two charts, the EWMA chart receives fewer signals in the in-control region (above the boldfaced blocks).In contrast, in most cases, the CUSUM chart has faster detection in the out-of-control region (below the boldfaced blocks).

Discussion
The motivation of the three-region approach is to focus on detecting practically significant shifts, not small shifts caused by minor (perhaps temporary) common causes.Such situations frequently occur in practice, see Freund (1957), and have recently received renewed attention by Woodall and Faltin (2019).Answering their call, we implement this approach to the EWMA and CUSUM control charting schemes by allowing a slight common cause variation in the in-control parameter via d 0 .The evasion of too many signals has also resonated in the Table 12.The ARL values for different data distributions for ARL 0 ¼ 500 and various combinations of d 0 and d 1 .literature on the economic design of control charts, e.g., Rahlm (1985) and Lorenzen and Vance (1986).Still, we stress that the approach outlined here is fundamentally different as the extra margin chosen by the user is optimized and thus does not follow economic considerations of the cost of reacting to signals.Besides, these costs are often hard to estimate, and the approach has several flaws that may ultimately lead to poor control chart designs, see Woodall, Lorenzen, and Vance (1986).
As opposed to the Shewhart control chart with a single parameter, the EWMA and CUSUM charts have additional flexibility to optimize toward the detection of particular shifts of size d 1 , allowing the optimization of these control charting schemes.The performance comparison reveals that the CUSUM control chart appears best suited to this task, except when d 0 and d 1 are both small.However, the EWMA control chart is slightly more robust.Besides performance and robustness, some other considerations play a role when selecting the right control charting scheme, which we discuss below.

Optimization issues
Obtaining the optimal parameter values for the CUSUM control chart is straightforward, as the optimal value of k is known to be k ¼ ðd 0 þ d 1 Þ=2: The other parameter, L C , can then be chosen to match a specified ARL value.For the EWMA chart, optimization always requires an evaluation of both its parameters simultaneously.Moreover, with this optimization, some scenarios generate k values outside the desired region of k 2 ½0:05, 0:25 as prescribed by Montgomery (2020).While it is, of course, possible to restrict the optimal parameter search to this interval, it comes at the cost that it will deteriorate the theoretical performance of the EWMA control chart.In addition, trying to distill patterns in the optimal parameters as a function of d 0 and d 1 (boundaries of the indifference regions), we find that for the CUSUM chart, they behave monotonically, while for the EWMA chart that is not the case as its parameters greatly interact.
Another computational issue with the EWMA may come forward when d 0 is large (e.g., 2 or larger) and when d 1 is close to d 0 (e.g., d 0 þ 0:5Þ: While the optimal parameters for the CUSUM charts are still easily obtained in these situations, the optimization of EWMA may run into issues, as the determination of parameters could become too difficult to use standard settings of Knoth (2014), i.e., for computation the number of quadrature nodes used in the approximation should be increased.

Small indifference regions
The issue in optimizing the EWMA control chart that creates computational issues is a too-small indifference region combined with a relatively high d 0 ; the difference between d 0 and d 1 is only 0.5, while d 0 ¼ 2: In such a case, it comes down to two conflicting goals; at d 0 , we aim for a specific ARL 0 performance, while at d 1 , we optimize to detect such a shift quickly.So, as these goals are fundamentally different, choosing d 0 and d 1 close, i.e., having a tiny indifference region (especially for the EWMA chart), may lead to problems in this framework.
A strategy to avoid a too-small indifference region is to reconsider the settings of d 0 , d 1 , and ARL 0 from a practical point of view.The purpose of an indifference region is to have a range of values in which the performance is not so critical.Thus, a straightforward approach to alleviating potential computational issues of the EWMA chart would be to make the difference between d 0 and d 1 (much) larger.For example, choose a smaller value of d 0 , and set ARL 0 to a larger value.However, if the ARL 0 is set higher at a lower d 0 , one should collect more data to offset the possible estimation error, as seen in Section 3.3, where we study the impact of Phase I estimates in our approach.Another practical resolution would be to choose a larger value of d 1 .

Interpretability of the charts
Interpretability is another relevant aspect.Between the CUSUM and the EWMA control charts, the EWMA chart has the advantage that the charting statistic (although it is a weighted average) remains in the original unit of measure.The CUSUM chart does not have this property as it tracks a positive and negative cumulative sum, which only sums positive or negative differences from a reference value.Moreover, because no individual values are plotted in the CUSUM and EWMA charts, it cannot be observed directly from these charts whether an individual value is out-of-specification, which touches upon the difference between statistical and practical significance, see Woodall and Faltin (2019); Blume et al. (2019) for recent discussions.The Shewhart control chart does not have this discrepancy.However, due to their (much) better detection of persistent shifts, the modified CUSUM and EWMA charts are still recommended over the Shewhart control chart.In the next section, we apply the proposed methodology to some examples to highlight its value in practice.
Both charts are still amenable to various alternatives, which have not been studied to ensure a "fair" and comprehensible comparison.First, one could consider EWMA and CUSUM charts with different limits.Whereas we have taken fixed limits, one could opt for time-varying limits and other variations, see Knoth (2003) and Crosier (1986),  which are part of Knoth's R package spc.Second, in the spirit of the CUSUM chart to have two monitoring statistics, one could opt for two one-sided EWMA charts, see Gan (1993).

Practical examples
In this section, we consider two examples.The first example is inspired by the paper of Woodall and  Faltin ( 2019), which exhibits a more extreme use case of implementing an indifference region.The second example involves wafers from a hard-bake process, which can be found in Montgomery (2020).The statistical software Minitab is used to execute the experiments and to generate the figures.The figures for the EWMA chart are manually rendered in Minitab to be able to implement steady-state limits, which is not an option in Minitab.

Case 1: Critical quality attribute (CQA)
Our first example extends to the case study reported in Woodall and Faltin (2019).Inspired by that example, we synthesized data with the same characteristics in Phase I, e.g., mean 6 and standard deviation 0.06.So, we simulated 200 data points from a normal distribution with these characteristics; we added some minor seasonal variation with an amplitude of one standard deviation to mimic insignificant and temporary shifts.The resulting values are given in the control chart of Figure 4.Note that the seasonal effect is only considered present in Phase II.
In this case study, the specification limits are much wider than the control limits of the process, i.e., 4 and 8 versus 5.82 and 6.18, as observed in Figure 4. Therefore, it is sensible not to react to some small variation, e.g., small and expected seasonality, which can be considered common noise instead of special causes of variation.Thus, to modify the control charting schemes, we have to allow the process to return to "normal".Note that the absence of seasonality in a Phase I study and the presence of it in Phase II might occur when the Phase I study was carried out during a limited part of a seasonal cycle.Or, for example, when the moving range (MR) estimator is used to estimate the process dispersion.
Because there is a great discrepancy between the control and specification limits, we adapt the EWMA and CUSUM control charts using the proposed approach.Taking some extreme values from Table 2, i.e., ARL 0 ¼ 500 at d 0 ¼ 1 (as the amplitude of the seasonal variation is one standard deviation) and d 1 ¼ 3, we adjust the original time-weighted charts.Both charts are given in Figure 5.When comparing the standard to the optimized versions, we observe that the original charts easily detect seasonal effects and produce many signals.Note that the ARL 0 at d 0 ¼ 0 is 168 for the standard CUSUM control chart (k ¼ 0.5; L C ¼ 4) and 560 for the standard EWMA control chart (k ¼ 0:2; L E ¼ 3); by default, the CUSUM chart is thus set more sensitive than the EWMA chart.Applying our procedure to accommodate the seasonal variation results in only one or two signals.Naturally, setting d 0 to an even higher setting will fully mitigate the signals produced because of the seasonality.Yet, true shifts go unnoticed for a longer period, which will be part of our experiment in Section 5.2.
In a second adaptation of CQA, we excluded the seasonality but added two small temporary shifts and one larger persistent shift.Specifically, to our 200 data points, we added a shift of one standard deviation from observation 21 until 40, and we subtracted one standard deviation from observation 101 until 120.The persistent shift (3 standard deviations) occurs at observation 161.In line with the previous example, we use the parameters of Table 2 with d 0 ¼ 1 and d 1 ¼ 3, such that our modified charts are designed to ignore the temporary variation while detecting the true persistent shift as early as possible.Indeed, in  As seen in the left panels of Figure 7, the standard CUSUM and EWMA charts easily detect the temporary shifts that occur far before the true persistent shift, which is, of course, detected by both.The panels on the right-hand side show that the adjusted CUSUM and EWMA charts can fully disregard these temporary shifts and immediately detect the true persistent shift at 161.This example underpins the promise of modifying the CUSUM and EWMA charts to make them less sensitive to small "unimportant" shifts.2. 5.2.Case 2: Wafers from a hard-bake process Where our first example was specifically designed to illustrate the purpose of the proposed method, our next example relates to Chapter 6 of Montgomery (2020) about wafers from a hard-bake process.In the original example, subgroups of size 5 are selected; we converted these subgroups to individual values, for which the historical mean is l 0 ¼ 1:5056 and standard deviation r 0 ¼ 0:0625: Moreover, as the USL and LSL for this process are set at 1.0 and 2.0, so one could question whether this process should be monitored so closely.Therefore, we implement the standard Shewhart control chart and the adapted one based on the corresponding parameter from Table 1.
The application of a standard Shewhart control chart (limits at 3 standard deviations) signals at observation 43, whereas the adapted chart signals at observation 45, as seen in Figure 8. Next, we use our framework of indifference regions where in addition to requiring the ARL 0 to be 500 at d 0 , we optimize for detecting specific shifts at d 1 equal to 2 or 3, which is done in Figure 9.Note that in this scenario, the shift seems persistent, and thus should be deemed a true shift, which is also concluded in Montgomery (2020).Fortunately, our optimized versions are still capable of detection, but at a later stage.So, this example illustrates the tradeoff of employing an indifference region-to disregard unimportant shifts-and swiftness of detection in case of a true shift (not in the in-control region).The larger the indifference region, the later a true shift will be detected.
An interesting comparison is between Figures 9(c) and 9(d), where the EWMA chart only signals at 45, while the CUSUM chart still signals at 43.Finally, if d 1 ¼ 3 is set to 2, both charts signal only at observation 45.Interestingly, the EWMA bounds decrease as the k parameter has increased when comparing Figures 9(c) and 9(e).

Concluding remarks
In process reliability, this work leverages optimization to advance control chart schemes to better fit practice.Answering the call of Woodall and Faltin (2019) and aligning with the international standard, ISO (2020), we provide an approach accessible to the average practitioner approach that wants to utilize the leeway between the control limits (the range of the process fluctuations) and specification limits (the range in which products are acceptable).Often there is quite a gap between these different types of limits, which allow the sensitive time-series charts to return to normal; one can argue that for practice, many control charting schemes are overly sensitive and that somehow larger deviations from the target can be accounted for as normal process variation, i.e., common noise.The reduction of signals is also resonated in the literature on the economic design of control charts, but with a different motivation (Rahlm 1985;Lorenzen and Vance 1986).
To do so, we employ the framework of an indifference region to disregard signals from small shifts or minor seasonal effects.The CUSUM and EWMA control charts are adapted to meet a specified average run length ARL 0 at a mean shift of d 0 > 0 standard deviations.At the same time, they are optimized to detect a shift of d 1 ð> d 0 Þ standard deviations.In this way, the indifference region automatically demarcates two other regions; within l 0 6d 0 , we find the in-control region and further outward, beyond l 0 6d 1 , the out-of-control region.
Comparing the optimized EWMA and CUSUM charts to the standard Shewhart control chart and their unaltered counterparts, we show that they are particularly promising for practice when there is sufficient leeway compared to the specification limits of a process.Out of these two, we conclude that the CUSUM chart is the better choice as it generally outperforms the EWMA chart slightly and has some advantageous computational properties, i.e., one of its parameters can be immediately set, and thus only one parameter has to be determined.Although an EWMA chart is easier to interpret as it has the same unit of measure for its statistic as the original data, the EWMA parameter combinations can become tedious, especially when the indifference region is small.Finally, we find that in many instances, EWMA's weight parameter k lands outside the advised and common range reported in standard statistical literature (Montgomery 2020).
The optimization procedures and comparison of the Shewhart, optimized EWMA, and CUSUM charts are primarily focused on dealing with normally distributed data.When data do not come from a normal distribution, the EWMA and CUSUM charts outperform the Shewhart chart, as expected.However, the performances of these charts might be improved by using data transformations (Chou, Polansky, and Mason 1998).Finally, since the optimization procedures intrinsically rely on design choices and Phase I estimates, the robustness of the approach is assessed as well.Checking the sensitivity of the framework on Phase I estimates reveals that sufficient data should be available to have a reliable optimized EWMA or CUSUM chart-considering the use case of an indifference region, a condition that is easily met.

Figure 1 .
Figure 1.The theoretical setting of an indifference region (white); process variation manifests itself vertically.The upper and lower A PL demarcate the acceptable region for the underlying process, with the center line (dashed) indicating the target value.The two R PL lines demarcate the boundaries whereafter it is rejectable.Figure inspired by ISO 7870-3:2020.

Figure 2 .
Figure 2. Dependency of the framework on Phase I estimates, used in the computation of the AARL and SDARL metrics; in this exampled 0 ¼ 1 and d 1 ¼ 3 (l 0 ¼ 0 and r 0 ¼ 1).

Figure 3 .
Figure 3. Dependency of the framework on Phase I estimates when having a fixed indifference region.It is used in the computation of the AARL and SDARL metrics; in this example d 0 ¼ 1 and d 1 ¼ 3 (l 0 ¼ 0 and r 0 ¼ 1).

Figure 8 .
Figure 8. Shewhart control charts, standard and adapted, applied to the Flow Width data with parameters based on l 0 ¼ 1:5056 and r 0 ¼ 0:0625:

Figure 6
Figure 6 the standard Shewhart control chart detects extra signals, whereas the adjusted Shewhart with the limits raised to be at 3.878 standard deviations (r 0 ) of l 0 still finds one signal related to the temporary shift.As seen in the left panels of Figure7, the standard CUSUM and EWMA charts easily detect the temporary shifts that occur far before the true persistent shift,

Figure 9 .
Figure 9.An application of optimized control charts using indifference regions.In all altered schemes d 0 ¼ 1 where ARL 0 ¼ 500, while in (c) & (d) are optimized to detect a shift at d 1 ¼ 2 and in (e) & (f) at d 1 ¼ 3; see also Table2.

Table 8 .
The average (AARL) and standard deviation (SDARL) over the in-control ARLs when applying Shewhart, CUSUM, and EWMA control charts over different Phase I estimates, where d 1 ¼ d 0 þ 1 and ARL 0 ¼ 100 are fixed, but d 0 and the Phase I sample sizes m vary.

Table 9 .
The average (AARL) and standard deviation (SDARL) over the ARLs when applying Shewhart, CUSUM, and EWMA control charts over different Phase I estimates for different shift sizes d, where d 0 ¼ 1, d 1 ¼ 2, and ARL 0 ¼ 100 are fixed, but the Phase I sample sizes m vary.The case m ¼ 1 represents the case of known parameters.

Table 10 .
The average (AARL) and standard deviation (SDARL) over the in-control ARLs when applying Shewhart, CUSUM, and EWMA control charts over different Phase I estimates, where d 1 ¼ d 0 þ 1 and ARL 0 ¼ 100 are fixed, but d 0 and the Phase I sample sizes m vary.

Table 11 .
The average (AARL) and standard deviation (SDARL) over the ARLs when applying Shewhart, CUSUM, and EWMA control charts over different Phase I estimates for different shift sizes d, where d 0 ¼ 1, d 1 ¼ 2, and ARL 0 ¼ 100 are fixed, but the Phase I sample sizes m vary.The case m ¼ 1 represents the case of known parameters.