Introduction
The Immunocompetence Handicap Hypothesis (ICHH;
Folstad and Karter, 1992) lies at the core of current theories regarding female preferences for male masculinity. The ICHH arises from Zahavi's notion of an honest signal (
Zahavi, 1975;
Zahavi and Zahavi, 1997), wherein an individual organism “signals” some underlying quality by demonstrating the ability to withstand a serious handicap. Under the ICHH as commonly interpreted, androgen-dependent secondary sexual characteristics may signal an individual's superior immune functioning insofar as they indicate an ability to withstand the immunosuppressive effects of the sex hormones responsible for their development. Human facial dimorphism (masculinity), then, is theorized to signal immunocompetence in the face of high levels of testosterone (T), and females at peak fertility are believed to prefer more masculine faces because, despite the apparent or perceived costs of masculine mates in terms of paternal investment and prosociality (e.g.,
Boothroyd, Jones, Burt, and Perrett, 2007;
Johnston, Hagel, Franklin, Fink, and Grammer, 2001;
Perrett et al., 1998), they can acquire “good” genes conferring heritable immunity for their offspring, and increase their long term inclusive fitness.
In order for this hypothesized chain of causality to be supported, at least three key lines of evidence are required (
Bussière, Tinsley, and Laugen, 2013;
Scott, Clark, Boothroyd, and Penton-Voak, 2013). Firstly, testosterone must be immunosuppressive in humans, secondly, an association between facial masculinity and circulating testosterone levels must exist, and thirdly, it should be possible to link, empirically, masculinity with some measure of immune functioning. There is some evidence that facial masculinity in men is associated with circulating testosterone levels (
Penton-Voak and Chen, 2004;
Pound, Penton-Voak, and Surridge, 2009;
Roney, Hanson, Durante, and Maestripieri, 2006) and consequently the current paper will focus on the first and third of these questions, and will report the results of an empirical study addressing the third question specifically.
Although the evolutionary psychology literature has sometimes treated immunocompetence as a unitary, linear trait, and testosterone as a positive cue of immunity, in recent years more complex perspectives have emerged (
Muehlenbein and Bribiescas, 2005; see
Scott et al., 2013 for review). These propose that testosterone levels may reflect a strategic
reorganization of resources, rather than (or in addition to) reflecting an abundance of them (due to heritable disease resistance). Resources may be reorganized either across stages of the lifespan or across immune functions, and individual variation in testosterone profile may reflect multiple factors, some of which may be related to genetic quality, but others at least partially independent of it. These latter factors may include (developmental) exposure to conflict, aggression, and other exogenous sources of mortality risk (
Archer, 2006;
Mazur and Booth, 1998;
Qvarnstrom and Forsgren, 1998). Indeed, there is now a fairly substantial body of literature linking factors such as social conflict and deprivation with higher testosterone, early reproduction, and increased senescence (
Archer, 2006;
Daly and Wilson, 1988,
2001;
Mazur and Booth, 1998).
The view of testosterone-immunity link as a complex one is further supported by an increasing acknowledgement of the distinct and sometimes independent components of the immune system (
Muehlenbein and Bribiescas, 2005). The two broad systems of innate immunity (a rapid, non-specific system) and acquired immunity (involving recognition of and responses to specific pathogens) each break down into a vast array of different subcomponents (see
Figure 1 for a basic schematic summary, and
Muehlenbein and Bribiescas, 2005, for further description). These different subcomponents, however, are often not correlated with each other across individuals (see
Adamo, 2004, for a review), and reductions in activity within one component of the immune system may be associated with increased activity in another component (e.g.,
Keil, Luebke, and Pruett, 2001). Indeed, some aspects of the immune system, such as the Th-1 and Th-2 morphs of helper T lymphocytes, are mutually antagonistic. It is perhaps not surprising, then, that empirical tests of how testosterone levels relate to disease susceptibility and immune parameters have yielded varied and, at times, contradictory results.
Empirical research findings for humans are summarized in
Table 1 and show that higher levels of endogenous testosterone are associated in different studies with both greater and lesser likelihood of contracting sometimes closely related diseases, and with higher levels of T cells and lower levels of C-reactive protein (a general marker of inflammation), with mixed or null results for several other parameters. At the time of writing, the only correlational study to have looked at acute responses to a particular immune challenge (rather than baseline immunological parameters) found that men with higher testosterone showed a larger antibody response to immunization with a particular antigen (
Rantala et al., 2012). There is, however, some intriguing evidence suggesting that in chronic viral infection (hepatitis B), increased testosterone levels arising during puberty are associated with reductions in viral load (
Wu et al., 2010) – that is to say, testosterone may facilitate rather than suppress immune function in this instance. Only two experimental studies (
Bhasin et al., 2000,
Singh et al., 2002) have investigated the impact of exogenous testosterone administration on immunological parameters, and neither found any effect, despite one of those parameters showing a correlational relationship with circulating testosterone in one study (although not another). However, it is notable that with the exception of the Rantala study, effect sizes in correlational studies of associations between testosterone and immune parameters have been extremely small, with no
r value reaching .1.
Overall, then, the extant literature shows a pattern of male testosterone potentially being both a positive and a negative factor in immune functioning, or perhaps having no causal role at all (as per the experimental data) but rather sharing one or more antecedents with some different elements of the system.
The findings summarized above then raise the question of whether male facial masculinity is a consistent predictor of male health outcomes across different disease measures and research designs. Surprisingly, however, little published data exists to address this issue.
Rhodes, Chan, Zebrowitz, and Simmons (2003) used a prospective design with archival data and found that males whose photographs (taken in adolescence) were rated as more masculine had medical records which were independently rated as showing better health between ages 11 and 18 than those males rated as less masculine. More masculine looking faces were also rated as looking healthier. However, this composite medical health score is as much a reflection of what individuals will report to their doctor as it is an objective measure of health; furthermore, the composite nature of the general health score may mask any more variable relationships as seen in the testosterone studies above. Finally, it should be noted that the purpose of the Rhodes study was to examine associations between sexually dimorphic aspects of face
shape and health. Consequently, black and white photographs were used, meaning shape and some skin texture cues were available to raters, but not skin color, which may be important for perception of health in faces (see, e.g.,
Stephen, Coetzee, and Perrett, 2011).
Thornhill and Gangestad (2006) used a retrospective design and found that undergraduate men with more anatomically masculine faces (i.e., assessed through measurements of sexually dimorphic facial proportions rather than rated by observers) tended to report less respiratory illness and less use of antibiotics in the preceding 3 years, but showed no differences in the incidence of stomach illness. Although the images in this study and the measure of masculinity were superior to those utilized by Rhodes et al., the purely retrospective design remains a shortcoming.
Finally,
Rantala et al. (2013) examined the physical masculinity of the same men (mean age = 23,
SD = 3.9) from whom they had gathered the immune response data discussed above. They found that although some indices of dimorphism did correlate with immune response strength to the vaccine, this could not explain anything female observers found attractive about those men. Thus, although the data offers partial support for the ICHH, it falls short of providing evidence that any immune function superiority inherent in more masculine men is utilized in mate choice.
The above research designs varied not only in terms of the health and disease variables measured, but also on a number of other dimensions, including whether they were prospective or retrospective, and whether masculinity was assessed through ratings or objective measurements. The prospective/retrospective distinction may be important given that, as described above, testosterone may be involved in the reallocation of resources over the life course. The distinction between rated and objectively measured masculinity may also have important implications given that rated masculinity has been proposed to be more closely linked to male attractiveness than measured masculinity, and possibly even attributed to faces on that basis (
Scott, Pound, Stephen, Clark, and Penton-Voak, 2010). Thus, the purpose of the current study was to add to this small group of studies and consider further whether there is any link between masculinity and indices of health in Western men, as measured via a range of illnesses, using both prospective and retrospective methods, and with both rated and measured masculinity. We photographed a group of men at the start of their university careers and examined both contemporaneous and prospective health outcomes with follow-up data collected at the end of their first term. The choice of this particular time window was specifically to try and capture a period of time in which individuals may be exposed to a range of novel pathogens, and thus when variation between individuals might be most consequential, particularly given the fact that Western medicine may be a great “leveler” of health outcomes. Durham University students are rarely locals, and come from across the UK and the world. The start of a new university term provides an opportunity for the mixing of individuals from a wide geographical area carrying a range of pathogens. Thus, illness during this period could arguably be considered a window into how the immune system copes with new challenges.
Results
As shown in
Table 2, more masculine men were rated as looking more attractive (rated masculinity:
r = .49,
p < .01; morphometric masculinity:
r = .378,
p < .05) and more healthy (rated masculinity:
r = .43,
p < .05; morphometric masculinity:
r = .46,
p < .01). Similarly, rated attractiveness and rated health were strongly correlated (
r = .80,
p < 0.001). The relationship between rated and morphometric masculinity was positive but did not reach significance (
r = .25,
p = .10).
Correlations between facial features and actual past health showed relationships between attractiveness and illness such that more attractive individuals reported fewer instances of cold or flu in the last three years (
r = −.28,
p < .05) and a statistically non-significant tendency toward less use of antibiotics (
r = −.25,
p = .08). Healthier looking individuals also reported a statistically non-significant tendency toward fewer instances of antibiotic use in the last three years (
r = −.24,
p =.09). More masculine men reported fewer bouts of colds or flu (rated:
r = −.25,
p =.08; morphometric:
r = −.38,
p < .05) in the last three years. There were no other significant correlations (see
Table 2 for details).
Correlations between facial features and future health scores showed that individuals rated as more attractive or more healthy reported less use of antibiotics (attractiveness:
r = −.35,
p < .05; health:
r = −.36,
p < .05). There was a positive correlation between morphometric masculinity and reported number of days of flu (
r = .32,
p < .05). There were no other significant correlations (see
Table 2 for details).
Rated mood of faces did not correlate with any health or facial measure (all r < .19, all p > .1), and controlling for rated mood or participant age did not change the pattern of results.
Discussion
The aim of this study was to investigate whether masculine facial appearance and facial structure could predict not only past health experience and current health appearance but also future health outcomes in an environment in which individuals were likely to be exposed to novel pathogens.
With regard to masculinity, results were inconsistent across prospective, (rated) current, and retrospective measures. Men who looked more masculine or had more masculine facial proportions were rated as looking healthier and more attractive, and we replicated
Thornhill and Gangestad's (2006) relationship between anatomical masculinity and reported past health insofar as more masculine men reported fewer colds or flu bouts (but not antibiotics) in the last 3 years. Regarding future health, however, a more masculine facial structure was associated with
greater, not less, ill health in one domain (days suffering flu).
In contrast, rated facial health and attractiveness appeared to be valid modest cues to not only an individual's health history, but also their likely future good health on at least one measure. Men who looked healthier reported less use of antibiotics both prior to and during the university term, although rated health was not associated with reported rates of other illnesses.
Similarly, more attractive men looked healthier and reported fewer colds and less antibiotic use in the retrospective questionnaire, and fewer days using antibiotics during the prospective phase of the study. Given the sample sizes used in the present study (range = 44 to 57) we had more than adequate (96% to 99%) statistical power to detect associations with large (
r > 0.5) effect sizes (G*power 3: see
Faul, Erdfelder, Lang, and Buchner, 2007). However, power to detect medium-sized effects (
r > 0.3) was more limited (range = 53% to 64%). Consequently, further research with larger samples may be needed to establish with more certainty whether, for example, rated masculinity is associated with indicators of past and future health status. In contrast, even with a relatively small sample we were able to find a number of medium sized associations between morphometric masculinity and health measures.
Our mixed findings regarding the relationships between facial masculinity and measures of health and illness are consistent with more complex, nuanced perspectives on testosterone and immunity, as outlined in the introduction. Previously,
Muehlenbein and Bribiescas (2005) have argued strongly against treating immunocompetence as a unitary trait and it is notable that previous studies investigating relationships between testosterone and multiple immune parameters have tended to find a mixture of null and significant results. Thus, just as susceptibility to one form of illness may be unrelated to susceptibility to another across individuals, so too physical characteristics, such as masculinity, may be indicative of better resistance to some pathogens but not to others, and perhaps greater susceptibility to still more.
Mixed and inconsistent findings with respect to associations between facial appearance and measures of health and illness may also arise because relationships between sexually selected signals and viability may be more complex than is commonly assumed by evolutionary psychologists applying “handicap” principles to human mate choice. Condition-dependent trait expression and life-history trade-offs mean that even where male traits are considered to signal “good genes,” both positive and negative associations between signal quality and disease burden and/or survival are theoretically possible (
Getty, 2006;
Kokko, Brooks, McNamara, and Houston, 2002;
Kokko, Brooks, Jennions, and Morley, 2003), and indeed positive associations between signal quality and pathogen load are found in many comparative studies (
Getty, 2002).
A potential further, speculative, explanation for our findings is that the relationships between masculinity and health are variable across age groups. As noted in previous papers, masculinity is a cue to early developmental factors, and its association with health may therefore decrease, or even reverse, with age (
Scott et al., 2010;
Stephen et al., 2012). In our data, the relationship between health measures and masculinity was positive using past health measures, but negative using future health measures, suggesting a possible change of direction in the relationship between adolescence and adulthood. The limited time period and age range of our sample make it difficult to draw strong conclusions from these data, but our findings are broadly consistent with an account of testosterone in which high T levels are associated with earlier mating success but also earlier senescence (
Muehlenbein and Bribiescas, 2005; see
Scott et al., 2013 for review). Future studies could address this issue by examining links between masculinity/testosterone and disease across a wider age range, and we feel that this could be a productive avenue for further research.
More generally, our current data exemplify the problem posed by
Adamo and Spiteri (2009); to paraphrase, “he may not have a cold, but will he survive the plague?” Overall mortality rates in the face of disease are likely to be determined by a wide number of factors, of which each facet of immune function is just one, and which also include other elements of broader condition such as digestive and metabolic efficiency (i.e., capacity to make good use of food energy) and cardiovascular fitness, to name just two. What is perhaps required, therefore, is not only a broadening of our conception of “good genes” in general and “immunocompetence” in particular, but also empirical consideration of whether masculinity (or other traits of interest) are able to predict more extreme and selectively powerful health outcomes such as premature mortality or offspring mortality. Indeed, associations between paternal features and offspring mortality represent the critical test of the ICHH in regards to facial masculinity. Such studies cannot easily be carried out in western samples where, thankfully, modern medicine mitigates the morbidity associated with individual susceptibility to infectious diseases, and this is a significant limitation on both our own and previous studies of masculinity and health.
Aside from the nature of the population studied, the current research has some other significant limitations. The apparently contradictory associations between morphometric masculinity and decreased incidence of colds and flus prior to the start of the study but increased incidence of flu during the prospective phase suggest that it would have been desirable to separate these two illnesses in the past health questions. Flu is generally a considerably more severe illness than a cold (for instance, only flu carries a real mortality risk for groups such as the elderly and pregnant), despite both being respiratory infections, and indeed the correlation coefficient for future colds suggests that if anything, more masculine men suffer less with colds in the future, in contrast to their apparently greater susceptibility to flu.
Additionally, given the relatively small sample size, statistical power is only modest. Furthermore, although we sought to correct for one potential source of variance amongst the pictures (namely, expression), these images were used opportunistically, and more standardized images would be desirable where an opportunity arises to recruit a large number of men within the first 2 days of the term. Future research could attempt to reduce additional sources of variance, for example, by applying color calibration techniques (
Stephen et al., 2012). The fact that significant (or borderline significant) results were found despite these potential sources of noise, however, suggests that our paradigm could be usefully replicated with such a larger and more standardized sample.