Visual feature binding in younger and older adults: encoding and suffix interference effects

ABSTRACT Three experiments investigated younger (18–25 yrs) and older (70–88 yrs) adults’ temporary memory for colour–shape combinations (binding). We focused upon estimating the magnitude of the binding cost for each age group across encoding time (Experiment 1; 900/1500 ms), presentation format (Experiment 2; simultaneous/sequential), and interference (Experiment 3; control/suffix) conditions. In Experiment 1, encoding time did not differentially influence binding in the two age groups. In Experiment 2, younger adults exhibited poorer binding performance with sequential relative to simultaneous presentation, and serial position analyses highlighted a particular age-related difficulty remembering the middle item of a series (for all memory conditions). Experiments 1–3 demonstrated small to medium binding effect sizes in older adults across all encoding conditions, with binding less accurate than shape memory. However, younger adults also displayed negative effects of binding (small to large) in two of the experiments. Even when older adults exhibited a greater suffix interference effect in Experiment 3, this was for all memory types, not just binding. We therefore conclude that there is no consistent evidence for a visual binding deficit in healthy older adults. This relative preservation contrasts with the specific and substantial deficits in visual feature binding found in several recent studies of Alzheimer's disease.

Older adults have reliably been shown to exhibit a deficit for associating a variety of different types of stimuli in episodic memory, including verbal and non-verbal, and static and dynamic information (Naveh-Benjamin, 2000;Old & Naveh-Benjamin, 2008a, 2008b. The deficit presents as a greater age-related decline in memory for the associations between features, such as names and faces, than for individual features. A general age-related decline in visual working memory performance has also been observed (e.g. Johnson, Logie, & Brockmole, 2010;Park et al., 2002). These findings have led to the hypothesis that older adults may have difficulty temporarily maintaining combinations of features such as colours, shapes, and locations (temporary binding). Such deficits could manifest as an age-related difficulty in recalling, for example, that the car was red and the van was blue, rather than vice versa, or whether the round or the oval white pill had just been taken.
Multiple studies have reported evidence for an agerelated deficit in temporarily binding objects to spatial locations (Borg, Leroy, Favre, Laurent, & Thomas-Antérion, 2011;Brockmole & Logie, 2013;Cowan, Naveh-Benjamin, Kilb, & Saults, 2006;Fandakova, Sander, Werkle-Bergner, & Shing, 2014;Mitchell, Johnson, Raye, Mather, & D'Esposito, 2000). There is also robust evidence for a binding deficit in Alzheimer's disease patients when only surface visual features such as colour and shape are involved (e.g. Parra, Abrahams, Fabi, et al., 2009;Parra, Abrahams, Logie, Mendez, et al. 2010). On the other hand, visual binding deficits have been much more elusive in healthy older adults, so much so that, using a change detection paradigm, the evidence is weighted towards older adults being unimpaired in this ability (Brockmole, Parra, Della Sala, & Logie, 2008;Read, Rogers, & Wilson, 2015;Rhodes, Parra, & Logie, 2016). However, Brown and Brockmole (2010) found that a small age-related visual binding deficit may indeed accompany the healthy ageing process, even when using the commonly employed change detection paradigm. In Experiment 2 of that study it was demonstrated that, only in older adults, memory for bound visual objects (coloured shapes) was significantly poorer than memory for the worst performing individual feature (shape). Similarly, Isella, Molteni, Mapelli, and Ferrarese (2015) appeared to observe significant age × memory condition interactions using A ′ (but not proportion correct) as their outcome variable (see Allen, Hitch, Mate, & Baddeley, 2012), with their older adult group displaying reduced binding memory accuracy. However, they argued for the absence of a specific binding deficit in older adults, because the pvalue for the interaction did not meet the .001 level. Furthermore, evidence of age-related temporary visual binding deficits has been reported using more complex paradigms (Chen & Naveh-Benjamin, 2012;Peich, Husain, & Bays, 2013). Thus, although this form of deficit is by no means robust across studies, it is observable. Given the theoretical and applied importance of this topic, and the inconsistency of findings reported in the literature pertaining to the effects of healthy ageing on temporary visual feature binding, our analyses will highlight the effect sizes and confidence intervals for the key comparisons of interest, in each experiment (Cumming, 2012(Cumming, , 2014. Rather than simply stating whether or not a statistically significant binding deficit exists, this approach, which has thus far been lacking in the literature, will establish the size of any effect (i.e. binding deficit) that is related to healthy ageing. Additionally, in order to demonstrate that temporary binding is not disproportionately affected by age, one must be able to obtain evidence for the null hypothesis, that there are no interactions between age group and condition. Therefore, we also report Bayes factors for crucial interactions of interest, using the default family of priors outlined by Rouder, Morey, Speckman, and Province (2012).
It is also unclear what cognitive mechanisms may underlie visual feature binding deficits when they are observed (see Allen, Brown, & Niven, 2013, for a review). Brown and Brockmole (2010) argued that attention deficits were not able to account for the observed deficit, and this is in parallel to findings in the long-term memory literature (Chen & Naveh-Benjamin, 2012;Cowan et al., 2006;Kilb & Naveh-Benjamin, 2007;Naveh-Benjamin, Guez, & Shulman, 2004). Examining the reliability of possible agerelated visual feature binding deficits across a variety of contexts has important implications for models of binding in working memory and for the characterisation of healthy cognitive ageing. The present study assesses the extent to which encoding and maintenance limitations may contribute to visual feature binding deficits in young and older adults. We addressed the potential role of three specific processes, which were identified as key variables of interest in the study of temporary visual feature binding. These are encoding duration (Exp. 1: 900/1500 ms), mode of presentation at encoding (Exp. 2: simultaneous/sequential), and interference after encoding (Exp. 3: control/suffix presentation). Cowan et al. (2006) suggested that older adults' binding deficits could be due to a lack of processing robustness (following Li et al., 2004; see also , emphasising the requirement for detailed investigations of the encoding stage of the visual binding process in younger and older adults. Specifically, processing speed has been shown to exert a strong influence on older adults' working memory performance (e.g. Salthouse, 1992Salthouse, , 1994Salthouse, , 1996 and could therefore differentially impact older adults' encoding success and subsequent binding memory performance. In addition, both presentation format at encoding, and suffix interference after encoding, were also important to consider in the present context, given their influence on visual binding that has been identified in the existing literature, particularly with younger adults, as well as their relationship with attentional processes (Allen, Baddeley, & Hitch, 2006, 2014Allen, Castellà, Ueno, Hitch, & Baddeley, 2015;Ueno, Allen, Baddeley, Hitch, & Saito, 2011;Ueno, Mate, Allen, Hitch, & Baddeley, 2011). Despite their importance in the recent binding literature, to the best of our knowledge, serial position effects in sequential presentation conditions and suffix interference effects have yet to be investigated in healthy older adults' visual binding. By investigating all of these variables, we will gain further insights into visual feature binding in both younger and older adults, and important evidence regarding the potential visual binding deficit in healthy ageing.

Experiment 1
In one previous study that demonstrated an age-related temporary visual binding deficit, the total encoding time was longer (1500 ms; Brown & Brockmole, 2010, Exp. 2) than had been used in previous similar experiments that did not reveal an age-related binding deficit (900 ms, Brown & Brockmole Exp. 1;Brockmole et al., 2008). Indeed, it is possible to predict that encoding time could influence the presence or absence of binding deficits. Recent studies suggest that temporary visual feature binding occurs rather automatically at a low level of the working memory system Logie, Brockmole, & Vandenbroucke, 2009) but that, by 1500 ms, a stable representation of the object has been encoded into visual short-term storage (Logie, Brockmole, & Jaswal, 2011; see also Peich et al., 2013). Thus, one could predict enhanced visual working memory for features and/or conjunctions with longer encoding durations, and such enhancement could benefit older adults to a greater extent than younger adults. Rhodes et al. (2016) recently investigated the effect of a rather long encoding time (2500 ms) for comparison with the conventional 900 ms, and found that this did not differentially affect the binding performance of older adults. Nevertheless, it is important to compare directly the 900 and 1500 ms encoding conditions, given the differential findings that have been reported in the literature previously, and in light of the evidence suggesting that 1500 ms is an important threshold for developing stable bound representations, at least in younger adults .
Furthermore, it is possible to predict that the slightly longer encoding duration could be problematic for older adults. Younger adults may disproportionately benefit from the extra available time, or older adults' performance could actually suffer. As declining frontal lobe functions may underlie cognitive ageing (Braver & West, 2008), one could predict that the less efficient use of task-limiting strategies could be observed in older adults when more time is available for encoding (Sander, Werkle-Bergner, & Lindenberger, 2011). Participants may supplement fast, relatively automatic, perceptual attention-driven binding processes (Hu, Hitch, Baddeley, Zhang, & Allen, 2014;Murre, Wolters, & Raffone, 2006) with higher level executive processes such as inhibition and strategy use (Hu et al., 2014;Logie et al., 2011;Naveh-Benjamin, Brav, & Levy, 2007;Sander, Lindenberger, & Werkle-Bergner, 2012). Task-limiting strategies could include focusing on a subset of stimuli at the expense of full, successful encoding and/or retention of the array. We hypothesised, then, that stimuli display duration may influence the presence or absence of age-related binding impairments.

Design
A 2 × 2 × 3 mixed factorial design was used to investigate the effects of age group (younger, older), encoding format (short-900 ms/long-1500 ms; within participants), and memory block type (colour, shape, binding; within participants) on memory performance. A single-probe change detection paradigm was employed, using A ′ as the dependent measure. As is typical for change detection paradigms, a hit was defined as the correct detection of change. A ′ ranges between 0 and 1, and .5 indicates chance level performance (Stanislaw & Todorov, 1999). To enhance comparability with future research, but also for ease of inclusion in future meta-analyses, Cohen's d was selected as the main standardised effect size measure (Cumming, 2012(Cumming, , 2014.

Participants
There were 48 participants in total. The younger sample comprised 8 males and 16 females, recruited from the University of Edinburgh community, aged 18-25 years (M = 19.92, SD = 1.91; mean years of education = 14.50, SD = 1.60). Their mean estimated full-scale IQ [Test of Premorbid Functioning -UK (ToPF); Pearson, 2009] was 101.17 (SD = 7.04). Across all experiments, the older adults volunteered on the basis of being healthy, living independently in the community, and agreeing to visit the University to participate. There were 13 male and 11 female older adults, aged 70-86 years (M = 76.08, SD = 4.24; mean years of education = 15.00, SD = 3.36). Their estimated IQ was 115.63 (SD = 7.44). The latter was significantly different between younger and older adults, t(46) = 6.91, p < .001, although this was in the opposite direction of any expected age effects, and likely reflects greater verbal knowledge in older adults (Park et al., 2002). Indeed, the number of years of formal education was not significantly different, t(33) = .66, p = .52. The Mini-Mental State Examination (MMSE; Folstein, Folstein, & McHugh, 1975) was used to determine that all older participants were cognitively healthy (M = 28.79, SD = 1.14, min = 27, max = 30). All participants reported normal or corrected-to-normal vision, and no memory problems. For this and the following experiments, the participants were unique to each experiment and had not taken part in any of the other experiments. Participants received either an honorarium or course credit.

Materials
Stimulus displays comprised three coloured shapes presented on a grey background. Each object was created from a pool of six colours (red, yellow, blue, green, cyan, purple) and six shapes (circle, triangle, diamond, heart, arrow, cross), by randomly selecting one colour and one shape, without replacement. Test arrays comprised either a single feature (a colour blob or a blank, grey shape with a black outline for the colour and shape memory blocks, respectively) or a coloured shape (for the binding memory block). Stimuli were presented on a 51 cm LCD monitor with a screen refresh rate of 60 Hz. Each stimulus measured approximately 2.2 cm 2 , and viewing distance was not constrained.

Procedure
All participants completed the ToPF (Pearson, 2009), and older participants also completed the MMSE (Folstein et al., 1975). The main task was then administered using a blocked, counterbalanced procedure. There were three blocks of trials, which varied by memory test type (colour, shape, binding), for each of the two encoding time conditions (short, long). The blocked procedure was adopted so as to minimise any potential influence of higher level cognitive mechanisms (e.g. confusion over task instructions, particularly across different memory conditions). Participants completed all three blocks of one encoding condition followed by the other. Encoding conditions and memory blocks were counterbalanced, with the constraint that each participant completed the different memory tasks in the same order for each encoding condition. Each trial block comprised six practice trials followed by 36 experimental trials. Trial order was randomised across participants. Figure 1(a) illustrates the paradigm. Participants pressed the space bar to begin a trial, whereupon a randomly generated two-digit number (between 20 and 99) was displayed for 2000 ms. Immediately, participants would begin articulating this number out loud continuously and consistently, throughout the trial, until they made a keypress response. This secondary task was intended to suppress articulation and verbal recoding of the stimuli (Baddeley, 2007;Larsen & Baddeley, 2003). After presentation of the two-digit number, a central fixation cross appeared for 1000 ms which was then accompanied by three coloured shapes, presented in a row in three fixed locations (left, centre, right), for either 900 (short) or 1500 ms (long), above the cross. After a 1000 ms delay, a single test item appeared below the fixation cross in one of three fixed locations (left, centre, right; randomly selected, but appearing in each position equiprobably). Participants responded yes or no, by pressing the z or m keys on the keyboard, to indicate whether or not the item had appeared in the memory array. Following previous studies (e.g. Allen et al., 2006;Wheeler & Treisman, 2002), target trials involved presenting a colour, shape, or colour-shape combination at test. In lure trials, new colours or shapes were introduced from the pool in feature memory blocks, and new colour-shape combinations were introduced in the binding memory blocks by swapping features that had appeared in the memory array (see also Brown & Brockmole, 2010). Half of the trials featured targets at test, while the other half featured lures. Table 1 displays the mean A ′ scores from each condition. These data were analysed using a 2 (age group) × 2 (encoding time) × 3 (memory type) mixed analysis of variance (ANOVA). There were effects of age group, F(1, 46) = 27.14, MSE = .008, p < .001, h 2 p = .37, encoding type, F(1, Figure 1. (a) Experiment 1 procedure. Participants carried out six trial blocks (i.e. each memory block typecolour, shape, bindingunder each encoding time condition -900, 1500 ms). (b) Experiment 3 proceduresuffix condition. Each participant carried out each memory type (colour, shape, binding) under each interference condition (control, suffix). The control condition was the same as the 900 ms condition of Experiment 1. In all experiments, stimuli were drawn from a pool of six different colours and shapes, and are not drawn to scale here. Fill effects represent different colours.
Focusing on comparing binding memory performance with that of the worst performing individual feature condition, shape memory, we assessed the size of the effect of retaining bound relative to individual features, using the A ′ scores. 2 The mean of the paired differences was calculated across shape and binding memory, and a negative value indicates poorer performance in binding. We first calculated d z ′, by dividing the mean of the paired differences between shape and binding performance (M z ) by the standard deviation of the difference scores (SD z ; Cohen, 1988). We then calculated Cohen's d using d = d z ′ √ 2 (Cohen, 1988). Cohen suggested the convention that an effect size of .2 be interpreted qualitatively as a small effect. A value of .5 is indicative of a medium effect, which would be noticeable when visually inspecting the data, while .8 reflects a large effect.
In the long encoding condition this was −.004 (SD = 0.11 [−0.05, 0.04]; d = −.05). Thus, the effect size is small in both age groups under both encoding conditions, and is negligible in the long encoding condition in older adults.
The argument that there is no disproportionate effect of age on temporary visual feature binding has previously been made on the basis of failure to reject the null hypothesis. Bayes factors offer an intuitive, and increasingly popular, way to state evidence for one proposition (or model) relative to another (Edwards, Lindman, & Savage, 1963). In this case, models are defined by the inclusion or omission of main or interaction effects. As mentioned above, we used the default family of priors outlined by Rouder et al. (2012), which place greater prior probability on effects of small magnitude (i.e. symmetrically around zero) but do not rule out larger effects. Our Bayesian ANOVA was conducted using the BayesFactor package in R (Morey & Rouder, 2015;R Core Team, 2015). The analysis compared a full model, with all main effects and interactions between experimental variables, to reduced models omitting a specific component at a time (for example, a model without an age group × memory type interaction). 3 The resulting Bayes factor therefore reflects the relative evidence for retaining the effect in the full model versus the reduced model, which does not represent the effect.
Relative to a full model, with main effects of age group, memory type, and presentation time, and the interactions between these variables, the reduced model, which omits the age × memory type interaction, was favoured by a factor of approximately 13 to 1. Thus, Experiment 1 provides fairly strong evidence against a differential effect of age across the three memory types. There was also evidence against the three-way interaction, as the reduced model without this component was preferred by over 6 to 1.

Discussion
As expected, younger adults outperformed the older adults overall, indicating an age-related deficit in visual working memory (Brown & Brockmole, 2010;Johnson et al., 2010;Park et al., 2002). Importantly, however, there was little evidence of a specific deficit in the temporary retention of bound visual objects, relative to memory for the most difficult individual feature condition (shape). The standardised effect sizes also support this conclusion. While there was evidence of a small binding-related effect size in the short encoding condition, the younger adults also exhibited this across both encoding conditions; therefore, there are no grounds to argue for a specific agerelated binding deficit within this particular experiment. We were also able to gauge the strength of evidence for interactions between age and memory condition, using Rouder et al.'s (2012) default Bayes factors. This analysis showed that models without interactions between age group and memory type were more likely, relative to the saturated, full model, which included them. When considered in relation to the wider working memory literature that is weighted against an age-related deficit in temporary surface visual feature binding using similar paradigms (Brockmole et al., 2008;Brown & Brockmole, 2010, Exp. 1;Parra, Abrahams, Fabi, et al., 2009;Rhodes et al., 2016), these results are in keeping with the suggestion that the performance of healthy older adults in binding conditions is primarily driven by their ability to detect changes in individual features, and does not reflect an impairment specifically of temporary binding of colour-shape combinations.
These results also provide evidence regarding the potential role of encoding time in older adults' temporary visual feature binding performance (Brown & Brockmole, 2010). The results clearly showed that encoding time cannot account for the differential findings in the previous literature on visual feature binding. Although older adults benefitted more than younger adults from a longer encoding time overall (e.g. Salthouse, 1996), the presence or absence of age-related visual binding deficits in working memory appears to be unrelated to the encoding duration, at least within the present experimental conditions (see also Rhodes et al., 2016).

Experiment 2
Sequential presentation appears to disrupt temporary visual feature binding in both younger and older adults (Allen et al., 2006;Brown & Brockmole, 2010; see also Gorgoraptis, Catalao, Bays, & Husain, 2011;. As sequential presentation requires creating new bindings while maintaining previously encountered ones, this disruption may result from a fragility of bound material in visual working memory Logie et al., 2009;. That is, relative to individual features in visual working memory, bound objects might be more readily overwritten by new information. We would therefore expect to replicate a general decrement in binding performance, when stimuli are presented sequentially. It is possible that older adults may be more vulnerable to this manipulation. Allen et al. (2014) have shown that visual working memory for sequentially presented objects involves two attentional components, which may be thought of as internal, or top-down, attentional control, and externally driven, bottom-up attention (Chun, Golomb, & Turk-Browne 2011). If older adults exhibit limited executive attentional resources (Braver & West, 2008;Buckner, 2004), then sequential presentation could cause difficulty. That said, Brown and Brockmole (2010; see also Read et al., 2015) provided evidence that older adults' visual binding was not differentially disrupted by sequential presentation, suggesting retention of the capacity to encode individual features and bindings serially.
A new avenue of exploration, however, was specifically to examine older adults' performance across the different serial positions during sequential presentation. Allen et al. (2006Allen et al. ( , 2014 showed that, in younger adults, memory for bound representations exhibited a particular deficit relative to individual feature memory in the earlier positions in the sequence while, for the final position in the sequence, bindings were retained as well as individual features. We therefore predicted that the poorest performance would be observed at an earlier serial position, which depends upon internally guided executive control. We could further predict the same pattern of serial position deficit in older adults, if they create and maintain serially presented bindings in the same way as younger adults. It is possible, however, to predict a different serial position curve in older adults. In a dual-task verbal working memory study, Foos (1989) reported similar serial position curves to Allen et al. (2006), in that the final of three items was best remembered. In older adults, however, a clear deficit existed for the middle item, whereas there was no effect of age for items 1 and 3. We could therefore predict a particularly clear agerelated deficit at the middle serial position in our task, which could potentially be more pronounced for bindings.

Design
A 2 × 2 × 3 mixed factorial design was used to investigate the effects of age group (younger, older), encoding format (simultaneous, sequential; within participants), and memory block type (colour, shape, binding; within participants) on A ′ , the main dependent measure of memory performance.

Results
The Bayesian ANOVA on the data from Experiment 2 revealed evidence for the age group × memory type interaction. The full model was marginally more likely, given the data, than a model omitting this interaction (B = 0.526, approximately 2 to 1 support for the interaction). However, removing the colour condition from the analysis -and contrasting shape and binding only-leads to favouring the reduced model, and hence the absence of the age × memory type interaction, by a factor of over 4 to 1.
For the three-way interaction, the full model is preferred over the reduced model whether the colour condition is included in the analysis (B = 0.401, approximately 2.5 to 1 support for the interaction) or not (B = 0.344, approximately 3 to 1 support for the interaction). However, in this case, it is clear that the slight evidence for the threeway interaction is borne out of younger adults' larger binding cost in the sequential condition (see above).
To follow up on the age group × position interaction, planned comparisons investigated the serial position effect (collapsed across memory blocks) within each age group. Within the younger adults, accuracy was not significantly different between serial positions 1 and 2, t (23) = .70, p = .49, and between positions 2 and 3, t(23) = .99, p = .33. In the older adults, however, accuracy was much poorer in serial position 2 as compared with both positions 1, t(23) = 4.73, p < .001, and 3, t(23) = 4.68, p < .001. Thus, while younger adults were unaffected by the serial position of the target object, older adults performed significantly worse, and at chance level, for the middle stimulus in the sequence, across all memory conditions.
Planned comparisons were also used to follow up the effects of memory test type within each serial position, collapsed across age groups. Within position 1, colour and shape were not significantly different, t(47) = 1.35, p = .18, whereas shape was performed much better than binding, t(47) = 3.13, p = .003. Within position 2, colour was performed better than shape, t(47) = 2.93, p = .005, but shape and binding were performed similarly, t(47) = .39, p = .70. Within position 3, colour and shape were not significantly different, t(47) = .77, p = .44, whereas shape was marginally significantly better than binding, t(47) = 2.01, p = .051. As predicted, a binding deficit is greatest and most reliable only at the earliest position in the sequence (Figure 2(b)), and this was the case for both age groups.

Discussion
The results are consistent with previous literature showing that, in younger adults, temporary visual binding is poorer when stimulus presentation is sequential rather than simultaneous (Allen et al., 2006;Brown & Brockmole, 2010). The effect of feature binding was large under the sequential condition, but only very small in the simultaneous condition. Results also showed that, for both age groups, the earliest bound item in the sequence is especially vulnerable (Allen et al., 2006), reflecting the fragility of bindings in memory, relative to individual features (Allen et al., 2006;Baddeley et al., 2011).
There was little evidence for an age-related binding deficit in the present experiment, and this was underlined by the effect sizes observed across both encoding conditions, which again were only small to medium in older adults. This result is therefore consistent with the present Experiment 1, and with previous findings in the literature (Brockmole et al., 2008;Brockmole & Logie, 2013;Parra, Abrahams, Fabi, et al., 2009). The present findings also support Brown and Brockmole's (2010) conclusions that any binding deficits in older adults do not result from, nor are exacerbated by, sequential presentation of the memory array (see also Chen & Naveh-Benjamin, 2012;Read et al., 2015). As the observed small effect of feature binding was not dependent on encoding format (and therefore present in the simultaneous condition), age and presentation format may also impact upon shape memory. However, considering that binding deficits are characterised by deficits over and above those in individual feature memory, in this case shape, then one must conclude that there is only a small to medium effect of feature binding on older adults' working memory and that sequential presentation format does not drive its presence.
A particularly interesting finding was the marked serial position curve in older adults, who exhibited specific difficulty remembering the middle stimulus in the sequences, across all memory types. In contrast, the younger adults produced a much flatter curve, possibly as the sequence length was within their working memory capacity. Older adults may have been more likely to lose the middle items due to increased susceptibility to interference (Braver & West, 2008;Lustig, Hasher, & Zacks, 2007), given that the middle objects were unique in being subjected to interference from both a previously encoded item (position 1) and the requirement to create a new representation (position 3). For Experiment 3 we go on to directly assess the role of interference in visual feature binding memory. However, it is important to note that the age-related limitation for serial position 2 could reflect a different process than the binding interference effect observed at position 1 for both age groups. It is more generalised (i.e. across all memory types) and also apparently exerts a much stronger effect, bringing performance near to chance. In older adults with reduced capacity, the middle item may have been differentially lost over time due to its lack of rehearsal (relative to position 1) and recency (relative to position 3; Foos, 1989;Foos & Wright, 1992). To elaborate, due to a capacity limit in internally guided executive attention, or processing speed deficits (Salthouse, 1996;Vaughan & Hartman, 2010), older adults may have strategically focused internal attention on the first item (see Hu et al., 2014), while item 3 was successfully served by external attention Hu et al., 2014). Although this would have allowed relatively successful performance for two of the three items, item 2 would be particularly vulnerable. We suggest that this would be a useful focus for future work investigating visual working memory limitations with ageing.

Experiment 3
To our knowledge, the impact of a to-be-ignored suffix on memory for features and bindings has not yet been investigated within the ageing context. In younger adults,  and Ueno, Mate, et al. (2011) (see also Allen et al., 2015) showed that binding maintenance is disrupted or overwritten by subsequent presentation of a tobe-ignored suffix. This perceptually driven interference particularly impacts on recently encountered items and/ or those that participants are attempting to prioritise in memory (Hu et al., 2014). It has been claimed that older adults' visual working memory may suffer interference from irrelevant information at encoding (Braver & West, 2008;Lustig et al., 2007;Pilotti, Beyer, & Yasunami, 2002). Therefore, if an underlying cause of age-related binding deficits is the susceptibility to interference, then a suffix should result in or exacerbate such a deficit. Specifically, we could predict a greater binding deficit, in terms of poorer performance in binding memory relative to shape memory, in older adults when carrying out the suffix interference condition.

Design
A 2 × 2 × 3 mixed factorial design was used to investigate the effects of age group (younger, older), suffix condition (control, suffix; within participants), and memory test type (colour, shape, binding; within participants) on change detection performance (A ′ ).

Materials and procedure
The stimuli and procedure were as reported for Experiment 1, except that the six trial blocks varied by suffix condition (control, interference) as well as memory test type (colour, shape, binding). Conditions were counterbalanced as per Experiment 1.
The control condition was administered as per the 900 ms condition in Experiment 1. In the suffix condition, once the memory array had disappeared, and after a further 250 ms, a new coloured shape was presented in place of the fixation cross (see Figure 1(b)). This suffix was created by combining a colour and a shape from the excess items of the feature pool; specifically, the object was plausible as an object related to the memory set Ueno, Mate, et al., 2011), but neither of the features would have appeared within the memory array for that trial. Participants were informed that the suffix was to be viewed but ignored. The fixation cross replaced the suffix for the remaining 500 ms of the delay period, before presentation of the test item. Table 1 presents the A ′ data, which were analysed using a 2 (age group; young, older) × 2 (suffix condition; control, suffix) × 3 (memory type; colour, shape, binding) mixed ANOVA. There were main effects of age, F(1, 46) = 37.43, MSE = .022, p < .001, h 2 p = .45, in which the younger adults (M = .90, SD = .04) outperformed the older adults (M = .79, SD = .08), and of suffix, F(1, 46) = 35.79, MSE = .006, p < .001, h 2 p = .44, in which the suffix reduced performance overall (M control = .88, SD = .07; M suffix = .82, SD = .10). There was a main effect of memory type, F(1.7, 77.0) = 71.64, MSE = .009, p < .001, h 2 p = .61. Collapsed across age group and suffix condition, planned comparisons revealed that the colour condition was performed better than shape, t(47) = 12.78, p < .001, but there was no difference between shape and binding memory performance, t(47) = 1.20, p = .24. The interactions between suffix and age group, F(1, 46) = 6.74, MSE = .006, p = .013, h 2 p = .13, memory type and age group, 6 F(2, 92) = 11.70, MSE = .007, p < .001, h 2 p = .20, and suffix and memory type, F(1.7, 78.7) = 4.29, MSE = .005, p = .022, h 2 p = .09, were all significant, while the three-way interaction was not 7 (p = .81).
The suffix × memory type interaction was driven by the smaller effect of the suffix within the colour memory task, most likely an artefact of a ceiling effect in this condition. Indeed, there is no suffix × memory type interaction when only shape and binding blocks are included in the analysis, F(1, 46) = .15, MSE = .006, p = .70, h 2 p = .003. Following up the age group × memory type interaction, as expected, colour memory was better than shape in both younger (M colour = .97, SD = .03; M shape = .85, SD = .06), t(23) = 9.05, p < .001, and older adults (M colour = .90, SD = .05; M shape = .77, SD = .09), t(23) = 8.95, p < .001. In younger adults, binding (M binding = .88, SD = .06) was performed slightly better than shape memory, t(23) = 2.23, p = .036. However, close inspection of the data in Table 1 reveals that, rather than binding memory performance being better in younger adults in Experiment 3, shape memory performance is slightly worse than in the previous two experiments. In older adults, binding (M binding = .71, SD = .14) was significantly poorer than shape memory, t (23) = 2.74, p = .012. This interaction still existed when only shape and binding memory blocks were included in the analysis, F(1, 46) = 12.04, MSE = .009, p = .001, h 2 p = .21.
In the younger adults, the mean of the paired differences between shape and binding memory accuracy in the control condition was .03 (SD = .07) with a 95% CI of [.002, .06] (d = .63). Within the suffix interference condition, this was .03 (SD = .09 [−.01, .07]; d = .43). In the older adults, the mean of the paired differences between shape and binding memory in the control condition was −.06 (SD = .12 [−.11, −.008]; d = −.69), whereas in the suffix condition this was −.08 (SD = .18 [−.15, .001]; d = −.59). Thus, there was a medium positive effect of binding in younger adults, whereas there was a medium negative effect of feature binding in older adults, regardless of the interference condition.
For Experiment 3, the presence of the two-way interaction between age group and memory type was strongly favoured over its omission (by over 14,000 to 1). Although restricting the analysis to the shape and binding conditions greatly reduced the strength of evidence for this interaction, it was still favoured by a factor of over 100 to 1. There was no suggestion that the visual suffix modulated this, as a model omitting the three-way interaction was favoured by over 7 to 1. This was also true when restricting analysis to the shape and binding memory conditions only (B = 3.436).

Discussion
The findings from Experiment 3 showed an age-related deficit for temporary visual feature binding. This goes against much of the pre-existing literature on temporary visual feature binding in older adults, which has shown this ability to be impervious to healthy ageing (Brockmole et al., 2008;Brown & Brockmole, 2010, Exp. 1;. It does, however, support the limited evidence that previously existed using the change detection paradigm (Brown & Brockmole, 2010, Exp. 2, see also Isella et al., 2015), as well as evidence using alternative paradigms (Chen & Naveh-Benjamin, 2012;Peich et al., 2013).
Inspection of the younger adult data presented in Table 1 highlights that the one clear difference between the findings of Experiment 3 and those from the previous two experiments is that, in Experiment 3, the younger adults showed poorer performance for shape compared with binding memory while, in Experiments 1 and 2, binding performance was poorer than shape. However, in younger adults, binding performance was relatively stable across the three experiments, whereas shape performance decreased slightly in Experiment 3. Thus, the effect in younger adults may be driven by poorer shape memory performance rather than better binding memory performance. Importantly, though, in older adults the observed effect sizes in Experiment 3 are consistent with the present Experiments 1 and 2, as these also demonstrated small to medium effects of feature binding in older adults (see Figure 4). The experiment also produced interesting findings regarding interference effects in visual working memory. We did not replicate a specific suffix effect on binding memory in younger adults Ueno, Mate, et al., 2011). A smaller array size was used due to the focus upon older adults in the present study, suggesting that memory array size may be important for this effect. Future work could explore the factors determining the magnitude of suffix interference effects and the role of capacity on interference effects in binding. Importantly, there were interference effects across all forms of memory in both age groups, but older adults exhibited a greater general suffix effect (Braver & West, 2008;Lustig et al., 2007;Pilotti et al., 2002). Older adults are therefore more susceptible to interference in visual working memory, even when the object is to be ignored (Pilotti et al., 2002). Finally, although older adults exhibited greater interference effects than younger adults, their binding performance was not dependent upon interference, as the effect clearly existed across both control and interference conditions.

General discussion
The three experiments reported here were focused upon understanding the role of encoding and maintenance processes in temporary memory for visual features and feature bindings. To summarise the main results, all three experiments provide evidence of a small to medium effect on feature binding performance, relative to individual feature memory, in older adults. However, small to large effects were also observed in the younger adults in Experiments 1 and 2, with a large effect in this group when faced with sequential presentation of the memory array. Indeed, our Bayesian analysis of the data from Experiments 1 and 2 supported the absence of an age group × memory condition interaction when contrasting binding and shape performance, thus going beyond previous failures to reject the null hypothesis (although see Rhodes et al., 2016). For Experiment 3, the weight of evidence was in favour of a disproportionate binding cost for older adults. However, effect sizes showed that the binding cost for older adults was no larger in Experiment 3 than was observed in any of the previous two experiments. It is therefore questionable the extent to which we may describe such a pattern of performance as reflecting an age-related binding deficit. Our experiments also highlighted age-related vulnerabilities in visual working memory more generally; namely, older adults differentially benefited from more encoding time, they exhibited a specific, marked impairment in memory for both single features and for colourshape bindings presented in the middle of three positions in sequential presentation, and they experienced a greater suffix interference effect.
There has been some debate regarding whether or not healthy older adults show impaired temporary binding of surface visual features and the present findings contribute new evidence in this regard. Particularly given the inconsistencies in this literature (i.e. presence/ absence of age-related binding deficits), it seems prudent and timely to use an effect size approach to answering this question (Cumming, 2012(Cumming, , 2014. Thus, rather than asking whether or not such a deficit exists, we may reframe this question to ask whether or not we have consistent evidence for a larger effect size of feature binding relative to individual feature memory in older adults as compared with younger adults. Effect size data may also usefully contribute to future meta-analyses as well as being readily comparable across similar studies. There have been recent observations of age-related binding deficits in visual working memory (Brown & Brockmole, 2010, Exp. 2;Chen & Naveh-Benjamin, 2012;Peich et al., 2013;see also Isealla et al., 2015). However, these findings contrast with other previously published experiments using change detection paradigms (Brockmole et al., 2008;Brown & Brockmole, 2010, Exp. 1;Parra, Abrahams, Fabi, et al., 2009;Read et al., 2015;Rhodes et al., 2016). Taken together, we would argue that the present evidence clearly shows that older adults do consistently demonstrate a small to medium visual feature binding effect size, but that younger adults tend to show this too (present Experiments 1 and 2). We therefore only have minimal evidence for an agerelated binding deficit, at least using a change detection paradigm. Certainly, where younger and older adults do not show parallel effects (e.g. present Exp. 3, Brown & Brockmole, Exp. 2), the effect in older adults appears only to be small to medium, in terms of paired differences between binding and shape memory performance. Finally, the cross-experiment analyses reported in Appendix show the consistency that exists across the samples used in the present three experiments, and support the argument that age-related binding deficits are not reliable in healthy older adults.
Researchers have argued that temporary visual feature binding is a relatively automatic process, carried out at a low level of the working memory system Logie, 2011;Logie et al., 2009). Thus, this process may be relatively resistant to decline in healthy ageing. One conclusion, when taking all of the evidence into consideration, is that binding in visual working memory is less susceptible to ageing than is object-location binding in working memory (Brockmole & Logie, 2013;Cowan et al., 2006;Mitchell, Johnson, Raye, Mather, et al., 2000), and associative long-term memory (e.g. Naveh-Benjamin, 2000; Old & Naveh-Benjamin, 2008a). Furthermore, temporary visual feature binding is markedly impaired in individuals suffering from Alzheimer's disease (AD; Parra, Abrahams, Fabi, et al., 2009;Parra, Abrahams, Logie, Mendez, et al., 2010). This suggests that the underlying pathology of this disease compromises the processes involved in an ability that is relatively unaffected by healthy ageing, and offers the possibility of a specific cognitive marker for the disease (e.g. Della Sala, Parra, Fabi, Luzzi, & Abrahams, 2012;Parra, Abrahams, Fabi, et al., 2009;Parra, Della Sala, Logie, & Morcom, 2014; for reviews see Allen et al., 2013;Logie, Horne, & Petit, 2015;Logie, Parra, & Della Sala, 2015). We would therefore argue that temporary visual feature binding is relatively well preserved in healthy compared with pathological ageing, and in comparison with other forms of binding, such as long-term associative binding in healthy older adults (Old & Naveh-Benjamin, 2008a).
Yet, age-related deficits, although relatively small, are observable. Indeed, Peich et al. (2013) recently observed binding deficits with healthy ageing, in the form of misbinding errors. Taking all of the available literature into account, they concluded that there is a quantitative difference between the extent of the visual feature binding deficit observed in healthy older adults and those affected by AD; our findings fully support such a conclusion and contribute standardised effect size data. As the deficit is clearly quantitatively smaller in healthy ageing than has been shown in previous studies of temporary surface visual feature binding in AD, whether or not older adults' visual feature binding performance may be differentiated from that of younger adults likely depends on the methodological approach being used (Chen & Naveh-Benjamin, 2012;Peich et al., 2013), and detection of an age-related binding impairment may require very large sample sizes (Brockmole & Logie, 2013). Further research into visual working memory as well as visual feature binding more specifically, focusing on the encoding deficits presently observed (related to encoding duration, serial position, and suffix presentation) should be useful in helping to clarify the cognitive mechanisms that underlie abilities and limitations in healthy cognitive ageing, but also in AD.
In the context that the presence of a temporary visual feature binding deficit has been suggested as a marker of AD, our findings clearly show that, using the change detection paradigm, only small to medium effect sizes are observable in healthy older adults' performance. Furthermore, this is consistent across a series of experiments, and is often comparable to the direction and sizes of effects observed in younger adults. Cumming (2012Cumming ( , 2014 highlighted that we should move away from our reliance upon the dichotomous (accept/reject) approach afforded by null hypothesis significance testing, as this is often untrustworthy when used across different experiments and samples of the population. Rather, a preferable approach is to use simpler experimental paradigms that allow for the estimation of effect sizes over numerous studies. In the case of the debate regarding age-related visual feature binding deficits, we have shown that effect sizes are useful in establishing the ability of older relative to younger adults. It is recommended that future research should aim to establish the effect sizes associated with visual feature binding in pathological ageing, and with other forms of binding in the healthy ageing context.

Notes
1. There is some discrepancy in the associative memory literature regarding the data that are compared in order to test for a memory deficit. In the long-term memory literature, associative memory scores are typically compared with the averaged performance of the two individual feature conditions. We have focused upon comparing shape, the worst performed individual feature condition, with binding performance primarily because (1) an averaged score is an artificial amalgamation of performance across two different single feature types and (2) while helping to overcome floor effects in the binding condition, the best performed individual feature conditionin this case, colourcan result in ceiling effects. Furthermore, this approach has been commonly used in the working memory literature (e.g. Allen et al., 2006;Brockmole et al., 2008;. However, for comparison with the wider literature, the analysis with averaged individual feature performance was also carried out, and an age-related binding deficit was not significant (p = .96). 2. We also analysed Cohen's d using the proportion correct measure. We observed similar effect sizes as with A ′ , and the same general pattern as depicted in Figure 4. 3. We used the anovaBF function with the whichModels argument set to top and the number of Monte Carlo samples set to 50,000. An additional 10,000 iterations were run until the proportional error associated with each Bayes factor was less than 5%. The full results from the Bayesian analyses from each experiment are available in the Supplementary file. 4. This interaction is not significant when comparing binding with the averaged colour-shape memory blocks (p = .20). 5. The 2 (age group) × 2 (encoding format) × 2 (memory type; shape, binding) ANOVA on the A ′ data revealed qualitatively the same findings regarding binding as did the analysis that included the colour memory condition, in that the three-way interaction remained significant, F(1, 46) = 5.17, MSE = .005, p = .028, h 2 p = .10. 6. The analysis involving the averaged colour and shape blocks for comparison with binding also revealed this significant interaction, F(1, 46) = 8.77, MSE = .004, p = .005, h 2 p = .16.
7. The age group × memory type interaction remains when only the data from the control condition were analysed, F(2, 92) = 9.10, MSE = .004, p < .001, h 2 p = .17, and with a range of other measures that were also analysed, including proportion correct, F(2, 92) = 5.00, MSE = 74.36, p = .009, h 2 p = .10, and corrected recognition (hits-false alarms), F(2, 92) = 6.68, MSE = .029, p = .002, h 2 p = .13. Furthermore, the two-way interaction remained in an analysis of covariance on the A ′ scores, which adjusted for the effects of sex, years of education, and Test of Premorbid Functioning (IQ estimate) score, F(1, 43) = 10.94, MSE = .009, p = .002, h 2 p = .20. Finally, the three-way interaction is still not significant when only the shape and binding memory blocks are included in the analysis (p = .81).

Disclosure statement
No potential conflict of interest was reported by the authors.

Data source
The underlying research materials for this article can be accessed via the UK Data Service, doi:10.5255/UKDA-SN-850670.

Funding
This work was supported by the Economic and Social Research Council [grant number RES-000-22-4661].