Consumer evaluation of scab-resistant apple cultivars in Sweden

The feasibility of using untrained consumers for sensory evaluation was investigated on Swedish-grown scab-resistant apple cultivars in 2002 (11 cultivars) and 2003 (15 cultivars). The consumers evaluated five cultivars each, including the control cultivar ‘Aroma’. Each cultivar was scored for appearance, texture, juiciness, taste and overall impression by a minimum of 100 consumers in 2002 and by 200 in 2003. All quality attributes were positively correlated when calculations were performed across all consumer scores, with overall impression and taste having the strongest association. ‘Rubinola’ and K:1160 emerged as the best liked cultivars overall. Cultivar means were analysed for relationships with some previously scored or measured variables and with consumer profile data. Positive correlations with overall impression were obtained for amount of red colour in 2002, and for sugar levels in 2003. Among consumer profile variables, age had the largest influence on consumer scores, with some indications that older consumers preferred (or at least tolerated) soft-fleshed cultivars better than did younger consumers. Minimum number of consumers needed to obtain reliable results was determined by computer simulations indicating that the error rate became stabilized already with 50 consumers.


Introduction
Most of the commonly grown apple cultivars are susceptible to detrimental fungal diseases like apple scab and powdery mildew and to various insects, thereby requiring frequent use of fungicides fungicides and pesticides.Recently, there has been an increas-Recently, there has been an increas-ing interest in disease-resistant apples that can be grown with a more restricted use of chemicals (Janse 1993, Kellerhals et al. 1997).All over the All over the world, apple breeding programs now have disease resistance as one of the main goals, and more than 100 new disease-resistant apple cultivars have been released since 1970 (Brown and Maloney 2003).
Certainly only a few of the new apple cultivars will become commercially important.The comparatively short and cool growing season in the Nordic countries makes it impossible to grow many of the new cultivars that have been developed elsewhere.The success of a new cultivar is, however, ultimately determined by consumer preferences, which are affected by many different parameters (review in Harker et al. 2003).Although appearance was the most important factor when choosing fruit and vegetables in the shop (von Alvensleben and Meier 1990), consumers rated both texture and taste as more important when given the possibility to taste the fruit prior to making a purchase (Daillant-Spinner et al. 1996).Moreover, consumer preferences can vary considerably over geographical distances as well as between groups within the population (Hampson andQuamme 2000, Richards 2000).A visual consumer test performed in Canada and New Zealand thus showed that preferences for the appearance of apples differed both between the two countries as well as between different cities in the same country (Cliff et al. 2002).
Sensory evaluations are often performed with trained panels (Watada et al. 1980, Daillant-Spinnler et al. 1996, Hampson et al. 2000, Weibel et al. 2003).These panels generally consist of at least 20-30 people, although Hampson et al. (2000) found 11 trained panellists to be sufficient for routine screening of breeding selections.The use of a trained panel can be rather costly and suitable panellists are not always easily available.Therefore, some other kind of consumer testing may be needed before deciding on e.g.commercial release of a new cultivar or the planting of many trees of a relatively unknown cultivar (Hampson et al. 2000).
The objective with our study was to investigate the feasibility of using relatively inexpensive and easily available mass testing by untrained consumers.This was therefore investigated during two years in Sweden.A set of 11 scab-resistant apple cultivars together with the control 'Aroma' was evaluated in connection with an apple fair in the year 2002.The evaluation was repeated in the following year with 15 scab-resistant cultivars.Con-sumer scores were subsequently analysed for relationships with some previously measured variables (shape, background colour, relative area covered by red colour, firmness and sugar content) and with consumer data (age, gender, access to homegrown apples, average apple consumtion, and interest in buying organically grown apples).

Material and methods
A large apple fair with apple-related entertainment for the general public is arranged each year in Kivik, situated in the apple district Österlen in SE Sweden.In connection with this fair, a sensory evaluation of 11 (28-29 September 2002) and15 (27-28 September 2003) scab-resistant apple cultivars was arranged at a small museum called �pp-�pplets hus (translated as the Apple house) in Kivik.
(translated as the Apple house) in Kivik.The well-known and appreciated but not scab-resistant Swedish cultivar 'Aroma' was included to provide a reference.Cultivars analysed in both years were Co-op 12, 'Dayton', 'Scarlet O'Hara' and 'William's Pride' from the US, 'Rubinola' and 'Vanda' from the Czech Republic, and 'Frida', 'Fredrik', K:1016, K:1160 and K:1210 from the Swedish apple breeding program at Balsgård.In addition to these, 'Waleria' from Poland, 'Redfree' from the US and 'Reglindis' from Germany were analysed in 2003 (Table 1).In this year, the evaluated K:1016 fruit was obtained from two different orchards and therefore treated as belonging to two different cultivars; K:1016 B (Balsgård) and K:1016 ÖS (Östra Sönnarslöv).Most of the analysed cultivars have the scab resistance gene V f from Malus floribunda.'Reglindis' has instead inherited polygenic scab resistance (V A ) from 'Antonovka'.
The cultivars were divided into three groups with five cultivars in 2002 and four groups with five cultivars in 2003.'Aroma' was included in each group.Fruit was kept at room temperature in 2002 and outdoors in 2003 for at least 12 hours before tasting.Each consumer tasted all five cultivars in one of these groups.The consumers were  Vol. 15 (2006): 388-401.belled "very poor" and "very good" at the left and right anchor points.For texture, the scale was labelled "very soft" and "very firm".For juiciness, the scale was labelled "very dry" and "very juicy".The consumers were also asked to fill in their age (10-15, >15-20, >20-30, >30-40, >40-50, >50-60 and above 60, although for calculations only three classes were used, 10-30, >30-50 and >50 years old), gender, whether they had access to home-grown apples, their average apple consumption (one or more apples a day, 2-5 apples a week, one apple a week, one apple a month or a maximum of five apples a year), if they prefer to buy organically grown apples and whether they would still prefer organic apples if the price was five Swedish crowns (0.6 Euro) higher per kilo than the price for conventionally grown apples.
On the day before the evaluation weekend, five apples of each cultivar were taken randomly for physical and chemical analysis.Each cultivar was classified into one of 9 different shape categories (Table 1).Background colour was scored visually on a 4-level scale: dark green, medium green, greenish yellow and yellow.Amount of covering red colour was estimated visually for each fruit to the nearest 10% of the whole surface.After removal of the skin, firmness was investigated with a fruit pressure tester, Facchini, mod.FT 327.Each fruit was penetrated twice, on opposite sides, to a depth of 8 mm with a probe with the diameter 11.3 mm.Sugar content (soluble solids) was measured with an Atago digital refractometer PR-1 and expressed as % SS.

Statistical analysis
Consumer scores for the quality attributes (appearance, texture, juiciness, taste and overall impression) were normally distributed and parametric statistics were therefore used.In 2003, scores for the reference cultivar 'Aroma' differed significantly between the four test groups, possibly affected by the sequence in which the cultivars were evaluated and by the perceived relative quality of the other cultivars in the group.In order to compare all cultivars, mean scores for 'Aroma' were subtracted from the mean scores for each of the other cultivars in the same test group.Subsequently, the overall mean for 'Aroma' (across all test groups) was added to each of the cultivar means.
Pairwise associations between consumer scores for the five attributes were analysed across the whole data set with Pearson correlation coefficients.The five attributes as well as measurements of the amount of covering colour, firmness and sugar content, were also compared using non-parametric Spearman rank correlation based on cultivar means.In addition, one-way ANOVA together with Tukey HSD were used to detect significant differences among the cultivars for each of the five attributes.
Possible influence of the consumer profile variables on the evaluation scores was analysed for each cultivar separately, using one-way ANOVA together with Tukey HSD (age and consumption) or Mann-Whitney U-test (gender, access to homegrown apples and interest in buying organic apples).
The 352 consumer scores for overall impression of 'Aroma' in 2002, were used to determine the minimum amount of consumers required for a reliable consumer test.Means were calculated for 100 random subsets consisting of 10,20,30,40,50,60,80,90 and 100 evaluation scores, respectively.Standard deviation was calculated for each group of subsets and then plotted as a function of the number of consumer scores in the group.
All calculations were carried out using SPSS 10 Data Analysis Package (Norusis 1990).

Associations between quality attributes
All five quality attributes were significantly correlated in 2002 when analyses were performed on all the individual test scores.Possibly there was a tendency for consumers to either like or dislike a certain cultivar in all respects.Nevertheless, it is in-Jönsson, Å. & Nybom, H. Consumer evaluation of apple teresting to note that the highest correlation was obtained between taste and overall impression (r = 0.84, P = 0.00).Relatively high correlations with overall impression were noted also for appearance (r = 0.50, P = 0.00) and juiciness (r = 0.41, P = 0.00).Fruit texture showed surprisingly low correlations with the other attributes, possibly because a firm texture could be perceived either as desirable (crispy) or undesirable (unripe and too hard).

Appearance
Cultivar means for appearance varied from 5.0 to 7.8 (a range of 2.8 units) in 2002 (Table 2) and from 6.3 to 8.7 (2.4 units) in 2003 (Table 3).K:1160 had the highest mean for appearance in 2002, followed by 'Aroma', while the two bottom cultivars were 'Vanda' and K:1210.In 2003, 'Rubinola' had a significantly higher mean than all others, whereas 'William's Pride' and 'Waleria' had the two lowest means within a group of eight cultivars that were not significantly separated.
Appearance was correlated with overall impression (r = 0.74, P = 0.01) and taste (r = 0.68, P = 0.02) when cultivar means for 2002 were compared (Table 4).In 2003, appearance was not associated with any of the other attributes (Table 5).No associations were detected between appearance on the one hand and shape or background colour on the other hand, and only a very weak association with the amount of red colour in 2002 (r = 0.49, P = 0.10) and none in 2003.

Texture
For texture, cultivar means ranged widely, from 3.5 to 7.2 (3.7 units) in 2002 (Table 2) and from 2.6 to 6.9 (4.3 units) in 2003 (Table 3).In 2002, 'Scarlet O'Hara' was scored as significantly harder than the other cultivars, and Co-op 12 as softer.In 2003, Co-op 12 again hade the lowest score while 'Frida' had the highest.
Cultivar means for texture were not associated with any of the other attributes in either of the two

Juiciness
Cultivar means for juiciness ranged only between 4.5 and 6.1 (1.6 units) in 2002 (Table 2) and between 4.5 and 5.8 (1.3 units) in 2003 (Table 3).'Aroma' had the highest value in both years but there were few significant differences between the cultivars.

Taste
Cultivar means for taste ranged rather widely, from 4.3 to 7.4 (3.1 units) in 2002 (Table 2) and from 4.6 to 8.7 (4.1 units) in 2003 (Table 3).Significant differences mostly involved the high-rating cultivars.'Rubinola' had the highest mean in 2002, followed by K:1160 and K:1016 while 'Dayton' had the lowest mean.Taste scores were very similar to the scores for overall impression (r = 0.98, P = 0.00); 'William's Pride' and K:1210 moved down one placement compared to overall impression whereas 'Vanda' and 'Dayton' moved up one placement.'Rubinola' had the highest taste score also in 2003 but the following placements differed somewhat from those obtained in the preceding year; 'Reglindis', 'Scarlet O'Hara', K:1160, 'Fredrik', 'William's Pride', 'Redfree' and 'Waleria' moved up 1-3 placements compared to overall impression.Correlation between the mean scores for overall impression and taste was again very high (r = 0.90, P = 0.00) (Table 5).
The taste means showed a significant, positive correlation with the measured sugar content (r = 0.63, P = 0.01) in 2003 (Table 5) whereas no association was found in 2002 (Table 4).Taste and amount of red colour were positively correlated in 2002 (r = 0.63, P = 0.03).The background colour of all the cultivars in the evaluation is greenish yellow or yellow when fully ripe.However, only four cultivars in 2002 and five cultivars in 2003 were rated as yellow (Table 1).Especially 'Dayton', 'Scarlet O'Hara', 'Vanda', K:1016 and K:1210 may have been perceived as not fully ripe by the majority of consumers in 2002 and would probably have gained higher scores if evaluated later in the season.In 2003, K:1016 B, K:1016 ÖS, 'Vanda' and 'Fredrik' did not appear to have reached optimum eating quality.Some consumers expressed a preference for these apples during the evaluation but none reached the top values.Fully ripe were 'William's Pride', K:1210, 'Dayton', 'Aroma', 'Rubinola', Co-op 12, 'Redfree' and 'Waleria'.

Overall impression
Overall impression of the different cultivars ranged from 4.8 to 7.5 (2.7 units) in 2002 (Table 2) and from 5.3 to 8.4 (3.1 units) in 2003 (Table 3).'Rubinola' obtained the highest mean value in 2002, followed by K:1160 while K:1210 and K:1016 had the lowest values.'Rubinola' obtained the highest value for overall impression also in 2003, followed by 'Aroma' while 'Redfree' and 'Waleria' had the lowest values.
Judging from the strength of the correlations with other scored attributes, taste is the most important factor for overall impression, followed by juiciness and appearance whereas texture had no influence at all.Overall impression was also correlated with sugar content (r = 0.54, P = 0.03) in 2003 (Table 5), probably due to the strong association between taste and overall impression, and with the amount of red colour (r = 0.62, P = 0.03) in 2002.

Consumer profile data
A total of 352 and 850 consumers, respectively, participated in the 2002 and 2003 testings.Some values were missing since the consumers did not always fill in the personal information on their score sheets.In 2002, there were 193 females and 148 males, and in 2003 485 females and 350 males.In 2002, 73, 121 and 150 consumers were 10-30, >30-50 and >50 years old, and in 2003, the corresponding values were 201, 278 and 362.More than half of the consumers, i.e. 205 in 2002 and 468 in 2003, had access to home-grown apples.Of the consumers from 2002, 104 ate at least one apple a day, 144 ate 2-5 apples a week, 63 ate one apple a week, 25 ate one apple a month and only 2 consumers ate less than 6 apples in a year.In 2003, the corresponding values were 317, 350, 106, 35 and 8. Thus more than 90% of the participating consumers ate at least one apple a week, and can therefore be regarded as potential future consumers of the tested cultivars.
The only parameter for which the two years differed appreciably, was interest in organically grown apples.In 2002, 64% of the female consumers and 61% of the male consumers preferred organic apples.Forty-two % of the females and 43% of the males were also prepared to pay a higher price.In 2003, 75% of the female consumers preferred organic apples and 64% of the male consumers.A total of 51% and 46%, respectively, were also prepared to pay the higher price.A significant increase was noted from 2002 to 2003 in both the number of consumers preferring organic apples (P = 0.00) and the number prepared to pay a higher price (P = 0.00).Preference for organic apples was not related to age (69, 69 and 70% in the three age groups preferred organic apples) nor to access to home-grown apples (69% had homegrown apples and 66% had not).
For each cultivar, possible effects of various consumer profile parameters on the evaluation scores were calculated.Altogether, 28 cultivaryear combinations (12 cultivars in 2002, 16 in 2003) were evaluated in the two years.Consequently, one or two cultivar-year combinations should be expected to show a significant result by chance only (i.e.type 1 error: rejection of a true null hypothesis) when using a probability level of P < 0.05.For the variables gender and access to home-grown apples, only one or two cultivar-year combinations appeared to have been significantly affected by consumer parameters.These had Pvalues between 0.01 and 0.05, therefore not ruling out a type 1 error.
For the variables concerning amount of apple consumption and interest in buying organically produced fruit, 1-4 cultivar-year combinations showed significant results.Tables with the results are available from the authors upon request.The most highly significant results involving the apple consumption (P = 0.00) suggested that consumers who ate few apples appreciated the taste of 'Rubinola' and the overall impression of 'Vanda' more than did consumers who ate more apples.The most highly significant results involving the interest in organic apples (P = 0.00) suggest that consumers with a high interest tended to like 'Redfree' and to dislike K:1210 compared to consumers with no interest in buying organic apples.However, when these results were compared to the other Jönsson, Å. & Nybom, H. Consumer evaluation of apple data, it was impossible to find any variables that could explain why some cultivars were better liked than others.
Consumer age was the variable that showed the highest impact on the scoring (Table 6).In 2002, older consumers preferred the appearance of 'Aroma', 'Frida' and 'Co-op12', results that could not be explained in terms of differences in shape or colour.For texture the medium-firm 'Aroma' was better liked among the older consumers.For both taste and overall impression, 'Fredrik' was most liked among the younger consumers whereas K:1160 and 'Co-op12' (the two softest cultivars!) were most liked among the oldest consumers.
Consumer age had some impact on scoring also in 2003.In texture, 'Rubinola' and 'Reglindis' were better liked by the older consumers.According to the penetrometer values, 'Reglindis' and 'Rubinola' were the two firmest cultivars but 'Reglindis' was perceived as one of the softest accord- Vol . 15 (2006): 388-401.ing to consumer scores.For juiciness, 'Reglindis' obtained higher scores from medium-old consumers than from the other consumer age groups.Mean scores for overall impression for the three consumer age categories can be compared in Table 7.Ten of the 15 test cultivars had the medium-age category score in the middle, indicating the possible existence of a linear relationship between age and some other variable but we were not able to identify any measured or scored variable which in itself could explain why some cultivars were better liked than others.

Number of consumers required for evaluation
The standard deviation for 100 calculations of the mean score for overall impression of 'Aroma' was plotted against number of scores analysed (Fig. 1).
From this graph, it appears that raising the number of consumers above 50 will not lower the variance appreciably and may therefore be considered as a sub-optimal use of resources, provided that the sampled consumers are sufficiently homogenous and also representative of the target population.ma' in 2002, subsets of 10, 20, 30, 40, 50, 60, 80, 90 and 100 evaluation scores were taken at random.Means of 100 subsets for each size class were calculated, and the standard deviation plotted as a function of the number of consumer scores in the group.

Discussion
Cultivar preference For overall impression, 'Rubinola' turned out to be the best liked cultivar both in 2002and 2003, followed by K:1160and 'Aroma' in 2002, and by 'Aroma' and 'Reglindis' in 2003.In a Danish study of 22 scab-resistant apple cultivars, 'Rubinola' also emerged as the one having the highest eating quality (Kühn and Thybo 2001a).

Associations between consumer-scored attributes
Previous reports have already shown that new apple cultivars must have a desirable appearance to catch the interest of the consumers (von Alvensleben and Meier 1990) but once the apple is bought, taste is more important and probably decides if this particular cultivar will be bought again, provided that the consumer knows and remembers the cultivar name or can recognize the cultivar through its distinctive appearance (Harker et al. 2003).
Taste was the most important factor contributing to overall impression, which agrees with results previously reported by Daillant-Spinner et al. (1996) and Jaeger et al. (1998).In our trial, both (1998).In our trial, both appearance and juiciness turned out to be of lower importance than taste but still very highly correlated with overall impression in one of the years.For appearance, none of the cultivars had a mean score below 5.0 in 2002, or below 6.0 in 2003, which suggests that all of them were acceptable in this respect.By contrast, the rating of texture had no importance for overall impression.

Colour and shape
For a given cultivar, yellow background colour is usually preferred since it is perceived to indicate riper, softer and sweeter fruits (Christensen 1983).All of the apple cultivars in our trial have a more or less yellow background colour when ripe.Therefore, cultivars having a green background colour at the time of the sensory evaluations may not have been quite ripe.We did, however, not find any association between background colour and any of the scored attributes.In an evaluation performed in Great Britain, no association was found between the colour of the skin (yellow or green background or red streaks) and consumer preference for different sensory properties of apples (Daillant-Spinnler et al. 1996).
The existence of a red blush or stripes over a large part of the fruit surface is usually perceived as desirable among apple breeders.Although considerable geographic differences were noted in preferences for green or yellow apples, about half of the consumers at each location (Canada, British Columbia & Nova Scotia, and New Zealand) preferred red apples when given a choice between self-coloured red, yellow and green apples (Cliff et al. 2002).We expected that the amount of red colour on the fruit would affect the evaluation of appearance but this was not the case in our trial.A large amount of red colour did however have a positive influence on the scoring of taste as well as overall impression in 2002, possibly because the more reddish apples had reached a level of ripeness that appealed to a majority of the consumers.
Very small effects of fruit shape have been noted for consumer preferences (Table 1), with most shapes being equal and somewhat lower scores being obtained only for oblong, ovate, cylindrical and cordate apples (Hampson and Quamme 2000).In another study, round and conical apples were equally liked whereas oblong apples were less liked (Cliff et al. 2002).Apples in our trial varied only from ovate to globose and conical, which may explain the fact that we did not find any effects of shape on the scoring of the different attributes.

Texture and firmness
Although not synonymous, a crispy texture as well as a hard texture have been shown to be pos- Vol. 15 (2006): 388-401.itively linked, and to contribute (especially crispiness) to the liking of apple cultivars (Hampson et al. 2000, King et al. 2001).Daillant-Spinnler et Daillant-Spinnler et al. (1996) found an association with texture and (1996) found an association with texture and taste attributes in a consumer evaluation of apples.Apart from scoring texture in sensory evaluations, attempts have also been made to use instrumental measurements.Penetrometer-derived values were thus correlated with sensory descriptor values for hardness and crispiness (King et al. 2001).Harker et al. (2002) report, however, that Harker et al. (2002) report, however, that (2002) report, however, that fruit lacking significant differences in instrumental firmness may still vary significantly when scored for texture properties by a sensory evaluation panel.
For simplicity, consumers were asked to evaluate texture using a single scale in our trial.Although we found an association between the texture scores and the penetrometer-derived firmness in both 2002 and 2003, there was no significant association between these data and the overall impression or any other scored or measured variable.Possibly, there was a slight tendency for older consumers to like (or at least tolerate) softer cultivars but more data is needed to ascertain this.Analyses of penetrometer-derived measures for firmness indicate that North-American consumers like apples around 60 N whereas apples below 45 N may be regarded as too soft (DeEll et al. 2001).In our trial, cultivar means ranged between 47.3 and 101.0 N in 2002 and between 37.7 and 112.8 in 2003, thus almost all cultivars may have been perceived as acceptable.
Juiciness may be regarded as a texture component but it is also a factor in the perception of taste, probably by acting as a carrier for flavour components.The strong correlation sometimes reported between juiciness and crispiness may be explained by the strength of inter-cellular bonds in the middle lamella of adjacent cells; fracturing of strong bonds is perceived as crispiness and simultaneously causes the release of juice (King et al. 2001).In our trial, juiciness was more closely associated with taste than with texture, possibly because texture might have been considered as hardness and not only (or mainly) crispiness.

Taste
Apple 'flavour' is to a major extent determined by volatile molecules present in the fruit flesh (Fellman et al. 2000) and by the sugar/acid balance.In consumer evaluation tests, properties like sweetness, sourness and aromatics are sometimes scored individually.All of these are associated with flavour liking in apples and appear to be more useful in predicting consumer preferences than e.g. % SS and titratable acid (Hampson et al. 2000).Kühn and Thybo (2001b) however report that high % SS and consumer perception of sweetness was positively related to taste preferences for children between 9 and 13 years old.
Our evaluation form included only one simple taste scoring in order to keep the testing simple and quick.This taste score proved to be the most important factor in determining overall impression.Interestingly, three of the four cultivars that had the highest scores for overall impression in 2002 also had the highest sugar content.The only high-sugar cultivar that did not obtain a high score for overall impression was 'Vanda' (on the 8th place) and the only low-sugar cultivar that received a high overall impression score was 'Aroma', which is well-known and appreciated for its very aromatic flavour.A strong aroma appears to be at least as desirable as sweetness, or even more so.The four cultivars in 2003 with highest overall impression scores included the two sweetest cultivars and two above-average sweet cultivars.Cultivars that differed in sugar content between the two years always placed better for taste and overall impression in the year of the highest sugar content.This may be partly due to a simultaneous development of both sugar and volatile compounds as the ripening process proceeds.Hampson et al. (2000) and Cliff et al. (1998) did not find any gender differences in their consumer evaluation studies, whereas women have been found to prefer smaller fruits in other studies (Redalen 1988, Hampson andQuamme 2000).We did Jönsson, Å. & Nybom, H. Consumer evaluation of apple not investigate fruit size, and the few significant gender-related differences for other variables in our study may have been due mainly to type 1 error.

Consumer profile data
According to our data, levels of apple consumption did not influence the consumer's opinion of the evaluated apples in any particular direction, and neither did access to home-grown apples.
Number of consumers who prefer to buy organic apples and are prepared to pay a higher price rose significantly from 2002 to 2003, and showed a strong gender difference; women were more interested in organic apples than men.In a household survey in the US, number of consumers willing to buy eco-labelled apples decreased as the price premium increased, but 40% would still buy eco-labelled apples with a 0.5 € higher price (Blend and Ravenswaay 1999).The amount of consumers willing to pay the price premium was even higher in our evaluation, 49% in 2002 and 58% in 2003.Consumers interested in buying organic apples did, however, not differ from other consumers in the way they scored the tested apple cultivars.
The only consumer characteristic that appears to have had a major impact on consumer evaluation scores is age.Still, it is difficult to find a single variable that explains why some cultivars are more liked by younger consumers and others by older.A certain tendency for older consumers to like (or at least tolerate) more soft-fleshed cultivars was discerned but more data is needed.The only age-related differences we have found in the literature concerns fruit size; older consumers have been reported to prefer smaller fruits (Hampson and Quamme 2000) whereas shape and colour showed no association with age-related preferences (Cliff et al. 2002).

Number of consumers required for evaluation
Trained taste panels are often regarded as important and useful tools for evaluating new cultivars and promising selections.Using a trained panel is, however, generally more expensive compared to a volunteer consumer evaluation.As a complement, mass-testing by volunteers may therefore provide valuable data to a lower cost.Sometimes there is also need for a survey of a larger number of consumers, especially if there are any indications of heterogeneity in the future target population.
The exact number of panellists or consumers needed of course depends on the type of study.Already 11 panellists were considered sufficient for discrimination at one point on a 0-9 scale (Hampson et al. 2000).Cliff et al. (1998) report that results from 50 consumers do not provide a reliable estimate of consumer preference.We however found that raising the number of consumers above 50 reduces the variance only to a minor extent, probably due to our consumers being relatively homogenous.

Fig. 1 .
Fig.1.From the 352 scores for overall impression of 'Aroma' in 2002, subsets of 10, 20, 30, 40, 50, 60, 80, 90  and 100 evaluation scores were taken at random.Means of 100 subsets for each size class were calculated, and the standard deviation plotted as a function of the number of consumer scores in the group.

Table 1 .
Presence of scab resistance genes according to the respective breeders, and parameters measured within this study: shape, visual scores of background colour, covering colour and percentage covering colour, sugar measured with a refractometer and firmness measured with a penetrometer (fruitpressure tester).

Table 3 .
The mean consumer scores for overall impression, taste, appearance, texture and juciness for the year 2003.Number of consumer scores varied between 197 and 215 for all cultivars except for Aroma (n = 829-839) and Fredrik (n = 418-431).

Table 4
. Correlation between cultivar means scored by consumers (appearance, texture, juiciness, taste and overall impression) and visually scoring (covering colour), measurements of sugar content with a refractometer and firmness with a penetrometer in 2002 conducted by the authors.results.In 2003, texture means were negatively correlated with the amount of red colour (r = -0.56,P = 0.02), possibly because several of the firmest apples had not yet ripened fully.Jönsson, Å. & Nybom, H. Consumer evaluation of apple

Table 5 .
Correlation between cultivar means scored by consumers (appearance, texture, juiciness, taste and overall impression) and visually scoring (covering colour), measurements of sugar content with a refractometer and firmness with a penetrometer in 2003 conducted by the authors.

Table 6 .
Comparison of the mean consumer scores for the age groups 10-30, >30-50, >50 years, for appearance, texture, juiciness, taste and overall impression of the cultivars in 2002 and 2003.Only cultivars with significant differences between the different age groups are shown.Number of consumer scores varied between 12 and 150 in 2002 and 32 and 154 in 2003.