Mapping EQ-5D utilities to GBD 2010 and GBD 2013 disability weights: results of two pilot studies in Belgium

Maertens de Noordhout, C.; Devleesschauwer, B.; Gielens, L.; Plasmans, M. H. D.; Haagsma, J. A.; Speybroeck, N.

doi:10.1186/s13690-017-0174-z

Research
Open access
Published: 06 February 2017

Mapping EQ-5D utilities to GBD 2010 and GBD 2013 disability weights: results of two pilot studies in Belgium

C. Maertens de Noordhout¹,
B. Devleesschauwer²,
L. Gielens¹,
M. H. D. Plasmans³,
J. A. Haagsma^4,5 &
…
N. Speybroeck¹

Archives of Public Health volume 75, Article number: 6 (2017) Cite this article

3388 Accesses
10 Citations
6 Altmetric
Metrics details

Abstract

Background

Utilities and disability weights (DWs) are metrics used for calculating Quality-Adjusted Life Years and Disability-Adjusted Life Years (DALYs), respectively. Utilities can be obtained with multi-attribute instruments such as the EuroQol 5 dimensions questionnaire (EQ-5D). In 2010 and 2013, Salomon et al. proposed a set of DWs for 220 and 183 health states, respectively. The objective of this study is to develop an approach for mapping EQ-5D utilities to existing GBD 2010 and GBD 2013 DWs, allowing to predict new GBD 2010/2013 DWs based on EQ-5D utilities.

Methods

We conducted two pilot studies including respectively four and twenty-seven health states selected from the 220 DWs of the GBD 2010 study. In the first study, each participant evaluated four health conditions using the standard written EQ-5D-5 L questionnaire. In the second study, each participant evaluated four health conditions randomly selected among the twenty-seven health states using a previously developed web-based EQ-5D-5 L questionnaire. The EQ-5D responses were translated into utilities using the model developed by Cleemput et al. A loess regression allowed to map EQ-5D utilities to logit transformed DWs.

Results

Overall, 81 and 393 respondents completed the first and the second survey, respectively. In the first study, a monotonic relationship between derived utilities and predicted GBD 2010/2013 DWs was observed, but not in the second study. There were some important differences in ranking of health states based on utilities versus GBD 2010/2013 DWs. The participants of the current study attributed a relatively higher severity level to musculoskeletal disorders such as ‘Amputation of both legs’ and a relatively lower severity level to non-functional disorders such as ‘Headache migraine’ compared to the participants of the GBD 2010/2013 studies.

Conclusion

This study suggests the possibility to translate any utility derived from EQ-5D scores into a DW, but also highlights important caveats. We observed a satisfactory result of this methodology when utilities were derived from a population of public health students, a written questionnaire and a small number of health states in the presence of a study leader. However the results were unsatisfactory when utilities were derived from a sample of the general population, using a web-based questionnaire. We recommend to repeat the study in a larger and more diverse sample to obtain a more representative distribution of educational level and age.

Peer Review reports

Background

Disability weights (DWs) and utilities (also known as index values, preference weights or QALY weights) are common metrics allowing a quantitative evaluation of health states related to diseases, injuries or risk factors. They reflect the preference of individuals for a certain health state and quantify the severity of health loss associated with a certain health state. They are also crucial components in health economic evaluations and in disease burden assessments.

Disease burden assessments using the Disability-Adjusted Life Year (DALY) metric play an increasingly important role in public health research. The disability weight (DW) is a crucial component in the DALY formula, as it allows combining morbidity and mortality [1–4, 5, 6]. The DW expresses the relative reduction in Health-Related Quality of Life (HRQoL), on a scale from zero (perfect health) to one (worst possible health state). By weighing the years lived with disease, these years are “translated” into fully lost life years. Indeed, living 10 years with a 10% reduction in HRQoL (i.e., DW = 0.10) would be equal to losing one full year of good health (e.g., by dying one year before the life expectancy). As a result, both morbidity and mortality are expressed in the same currency, namely the number of healthy life years lost. Initially, DWs were elicited from participants who did not necessarily suffer from the studied health state, using time trade-off or person trade-off methods. However other valuation techniques for eliciting DWs exist [7]. Ideally DWs should be developed from international studies hence ensuring internal comparability. However, several health states are not included in such international initiatives. Therefore resorting to specific elicitation studies is the only option. For instance, to estimate the YLDs of stage 0 (in situ) melanoma in Belgium during the two first month of treatment, Tromme et al. multiplied the number of patients with stage 0 melanoma in Belgium for the 2009–2011 period, i.e. 1772 patients by the derived DW for the two first month of treatment, i.e. 0.3345 and by the duration, i.e. 2 months [8] but the DWs obtained in these standalone studies are not comparable to an international set of disability weights.

As part of the Global Burden of Disease 2010 Study (GBD 2010), Salomon et al. proposed a wide set of DWs based on paired comparison questions for 220 health states in which respondents considered two health outcomes which were described briefly in lay language [1]. An update of these DWs was generated for the Global Burden of Disease 2013 Study (GBD 2013) by incorporating results of new surveys in four European countries (Hungary, Italy, the Netherlands, and Sweden). The two studies combined resulted in a set of DWs based on 60,890 participants [2, 3].

In health economic evaluations, health “utilities” are a crucial component of Quality-Adjusted Life Years (QALYs). A utility of 0 means that a health state is equivalent to death, whereas a utility of 1 means full health. These values can be derived using multi-attribute instruments such as the EuroQol 5 dimensions questionnaire (EQ-5D) [9]. The EQ-5D includes five dimensions (mobility, self-care, usual activities, pain/discomfort and anxiety/depression) and five severity levels in each dimension (EQ-5D-5 L). This self-completion questionnaire was developed by the EuroQol group and is applicable to a wide range of patient’s health states.

To derive utilities, individuals are asked to score their health state using EQ5D questionnaire. Their scores may then be converted to a utility using a set of regression coefficients (or tariffs). These tariffs are obtained using direct valuations of a subset of health states from the general public using visual analog scale or trade-off methods and are available for certain countries. Some countries have integrated EQ-5D in their national health surveys, resulting in population average utilities that can be used as an indicator of general population health [10–14].

Many studies calculated utilities using EQ-5D for a wide range of diseases such as cancer [8], cardiovascular diseases [15] or diabetes [16]. EQ-5D has also been applied in the context of DALY-based disease burden studies. In these instances, the EQ-5D utilities had to be “translated” to DWs. So far, authors have assumed the DW to be equal to one minus the utility, or equal to the age and sex specific population average utility minus the elicited utility [8, 17]. However, these approaches do not guarantee comparability with the GBD 2010/2013 DWs. Mapping is a better strategy to predict DWs from utilities when DWs are not available for burden estimations [18]. The objective of this study was to propose an approach for mapping EQ-5D utilities to existing GBD 2010 and GBD 2013 DWs, and to predict “new” GBD 2010/2013 DWs based on EQ-5D utilities, i.e., for health states currently not included in the GBD studies.

Methods

Study design and participants

First pilot study

Four health states representing the spread of the GBD 2010/2013 DW distribution were selected (Table 2). In other words, we selected four health states which values shared the GBD 2010/2013 DW distribution in three equal parts. Native speakers translated existing health state descriptions developed by Salomon et al. [1] into French and back-translated them into English (Additional file 1). A written version of the EQ-5D-5 L questionnaire excluding the visual analogue scale part was used and three different versions of the questionnaire were designed. As recommended by Euroquol group, we used the five level EQ-5D questionnaire (EQ-5D-5 L) because the measurement properties of EQ-5D-5 L are superior to the EQ-5D-3 L in terms of feasibility, ceiling effects, discriminatory power and convergent validity [9].

Each version was composed of the same set of health states but with a different order of questioning, to explore possible order effects. The first version presented health states in increasing order of GBD 2010/2013 health state severity, the second version in decreasing order of GBD 2010/2013 health state severity, and the third one did not present health states in a certain order of severity.

The questionnaire consisted of two main parts. The first part was the same in each of the three versions of the questionnaire and included questions about age, sex and marital status. The second part included evaluation of four health states. Participants were asked to imagine being in a certain health state described by the lay descriptions developed by Salomon el al. [3], but without knowing the name of the health state. They would then select the most appropriate level of functioning for each of the five dimensions: mobility, self-care, usual activities, pain/discomfort and anxiety/depression. The respondents were students of the Public Health faculty at the Université catholique de Louvain (UCL), Brussels, Belgium, and were recruited during two courses in June 2014. There was no time limitation to answer the questionnaire and the study leader was present during the study allowing participants to ask any question.

Second pilot study

Twenty-seven health states representing the spread of the GBD 2010/2013 DW distribution were selected (Table 2). Native speakers with knowledge of the topic at hand, translated existing health state descriptions developed by Salomon et al. [1] into French and then back-translated them into English (Additional file 1). Dutch translation of the thirty-two health states came for the Haagsma et al. study [2]. A web-based version of the EQ-5D questionnaire excluding the visual analogue scale part, was developed using the Qualtrics software. As for the first study, the first part of the questionnaire included socio-demographics questions, with additional questions on disease/injury experience (yes/no answers), income and educational level. Income categories were derived from Haagsma et al. [2]. The second part included evaluation of four health states. As in the first pilot study, participants were asked to imagine being in a certain health state described by the lay descriptions of Salomon et al. but without knowing the name of the health state [1]. Eight versions of the second part were developed. Each version included evaluation of four health states representing the spread of the GBD 2010/2013 DW distribution. The questionnaire was assigned to participants through a computer algorithm that attributed the questionnaire version with the lowest percentage of participants at the time of assignment. Participants had to answer all questions to validate their participation. The respondents were recruited by email among students of the Public Health faculty at the UCL and among principals of Brussels’s primary and secondary schools and using a social network through general and private messages on Facebook. Snowball sampling [19] was used for the recruitment strategy, i.e., existing study participants were asked to recruit future subjects among their contacts. The data were collected in June 2015. A reminder was sent three weeks after sending the first email to the students of UCL and the principals of Brussels’s schools. Through Facebook, a first request was published on the Facebook public wall of the two main leaders of the study on June 1 2015. Then, three recalls were sent through private messages to all friends of the two main leaders of the study on June 3, 16 and 22, 2015.

Data analysis

Two steps recommended by the EQ-5D-5 L user guide were followed to transform the EQ-5D-5 L scores reported by each participant into a utility [9]. A function previously developed by the EuroQoL group was used to translate the scores from a 5 L to a 3 L scale, because no Belgian tariffs exist for the 5 L version. The EQ-5D-3 L results were then converted to utilities using the tariff model developed by Cleemput et al. for the Belgian population [10].

Statistical analysis

For all EQ-5D utilities, overall means, medians, standard deviation and ranges were calculated. Utilities in this study were compared with GBD 2010 and GBD 2013 DWs using Spearman correlation coefficients. We used sample paired t-tests to test whether there was a difference of mean utilities between the pilot studies and the corresponding GBD 2010/2013 DWs.

A locally weighted scatterplot smoothing (loess) regression model was run between the utilities and the logit transformed corresponding GBD 2010/2013 DWs [1, 3]. We arbitrarily chose an anchor point that when DWs equated to 1, utility equated 0 and when DWs equated 0, utility equated 1. Logit transformed DWs were then predicted for each of the utility scores from the loess fit. To retransform the scale and obtain DW values that range from 0 to 1, an inverse logistic transformation was applied to these predicted values.

Analyses were done with R version 3.0.2 and JMP pro 12. R code and developed mapping functions are available through: https://github.com/brechtdv/eq5d-mapping.

Results

First pilot study

In total 81 students completed the first survey, 57 were female (70.4%) and 57 had never been married (70.4%). They were on average 29.6 years old (SD 8.4) (Table 1).

Table 1 Description of the study population N (%) - Mean (SD)

Full size table

Average utilities ranged from 0.667 (SD 0.131) for “Fractures, treated (long term)” to 0.289 (SD 0.203) for “Schizophrenia: acute state” (Table 2).

Table 2 Utilities by studies

Full size table

One participant attributed a negative utility (i.e., worse than death) to ‘schizophrenia, acute’ (−0.158) and three participants attributed a utility of one (i.e., perfect health) to ‘severe wasting’.

There was a monotonic relationship between derived utilities and predicted GBD 2010/2013 DWs, i.e., the lower the utility, the higher the predicted GBD DW (Fig. 1).

There was a negative correlation between utilities and GBD 2010 DWs (rho = −1, p =0.083) and GBD 2013 DWs (rho = −1, p = 0.083).

There was a significant difference between the complement of the utilities (1 – utility) and GBD 2010 DWs (p = 0.085) and between the complement of the utilities and GBD 2013 DWs (p = 0.099).

‘Fractures treated, long term’ was ranked first, i.e. with the lowest disability weight as in the GBD 2010 study (but ranked third in the GBD 2013 study) and ‘Schizophrenia, acute’ was ranked last, i.e. with the highest disability weight as in the GBD 2010/2013 studies (Table 3).

Table 3 Ranking of the health states by studies

Full size table

Second pilot study

393 respondents completed the second survey. The average age was 36.4 years (SD 12.5) and 26.3% of the respondents were male. 2.5% were younger than 20, 36% were between 20 and 29 years old, 26% were between 30 and 39 years old and only 4% were older than 60. The majority of the respondents were highly educated, i.e. 39.7% had a bachelor’s and 46.6% had a master’s degree. Most of the respondents had a medium income level, i.e. 54.8% had a household annual income ranging from 25,000 to 52,000 euros (~27,000 US$ - 56,500 US$). Of all respondents, 30.8% already experienced a disease or injury experience in the past and 44.3% had never been married (Table 1).

The median of the utility distribution of the 27 health states was 0.455 (range: 0.106 - 0.826). The median of the selected 27 GBD 2010 DWs was 0.114 (range: 0.003 – 0.756). The median of the 27 GBD 2013 DW was 0.117 (range: 0.003 – 0.778). According to the utilities, the top three less severe health states in pilot study 2 were ‘Distance vision, mild’, 0.826 (SD 0.154), ‘Anemia, mild’, 0.813 (SD 0.138) and ‘Asthma, controlled’,0.809 (SD 0.128). ‘Fracture of pelvis short term’, 0.106 (SD 0.203), ‘Multiple sclerosis’, 0.144 (SD 0.198) and ‘Gout, acute’, 0.193 (SD 0.212) were the top three more severe health states (Table 2 & 3, Fig. 2). Most of the utilities were between 0.5 and 0.6, and corresponded to the health states with a moderate severity level.

Overall 64 of the participants (16%) in the second pilot study, attributed negative utilities to at least one health state. We obtained negative utilities for 11 health states (41%) among the 27 health states included in the second pilot study (Fig. 2). In health states including negative values, the most common were ‘Fracture of pelvis short term’ (22%), ‘Traumatic brain injury (long-term consequences)’ (20%) and ‘Multiple sclerosis (severe)’ (20%). Conversely, 64 participants (16%) attributed a utility of 1 to at least one health state. The top three most common health states with a utility of 1 were ‘Distance vision, mild’ (25%), ‘Anemia, mild’ (22%) and ‘Asthma, controlled’ (16%).

The Spearman rank correlation coefficient between the utilities derived from pilot study 2 and the GBD 2010 DWs equaled −0.808 (p < 0.001). There was a significant difference between the complement of the utilities (1 – utility) and GBD 2010 DWs (p < 0.001). The differences between the complement of the utilities and the GBD 2010 DWs ranged from −0.097 (‘Schizophrenia, acute’) to 0.645 (‘Amputation of both legs (long-term with treatment’). The average difference between 1-utilities and the GBD 2010 DWs was 0.30 (SD 0.18). The three highest observed differences between 1-utilities and the GBD 2010 DWs were ‘Amputation of both legs (long-term with treatment)’ (0.645), ‘Headache tension type’ (0.610) and ‘Fracture tibia, patella (short term)’ (0.524). The three lowest observed differences were noticed for ‘Headache (migraine)’ (0.019), ‘Rectovaginal fistula’ (0.059) and ‘AIDS cases not received ARV treatment)’ (0.093) (Fig. 3).

The Spearman rank correlation coefficient between the derived utilities from pilot study 2 and the GBD 2013 equaled −0.801 (p < 0.001). There was a significant difference between the complement of the utilities (1 – utility) and the GBD 2013 DWs (p < 0.001). The differences ranged from −0.112 (‘Schizophrenia, acute’) and 0.613 (‘Headache tension type’). The average difference between 1-utilities and the GBD 2013 DWs was 0.30 (SD 0.19). The three highest observed differences between 1-utilities and DWs GBD 2013 were for ‘Fracture of pelvis short term’ (0.615), ‘Headache tension type’ (0.613) and ‘Amputation of both legs (long-term with treatment)’ (0.608). The three lowest differences ‘Headache (migraine)’ (0.011), ‘Rectovaginal fistula’ (0.05) and AIDS cases (not received ARV treatment)’ (0.06).

The relationship between the utilities and predicted DWs for both GBD 2010 and 2013 studies was not monotonic (Fig. 3), meaning that both utilities and predicted DWs do not increase/decrease concurrently, and there were major differences in the ranking of health states between the second pilot study and the GBD 2010 and 2013 studies (Table 3). Most of the differences were located in the middle of the distribution, i.e. for health states that were located around the median of the distribution. ‘Amputation of both legs’ was ranked 11^th and 12^th in the GBD 2010/2013 studies and fell to the 23^rd place in this study. ‘Headache tension type’ was ranked 10^th in the GBD 2010/2013 studies and 19^th in this study. On the other hand, participants of GBD 2010/2013 studies judged more severe ‘Headache migraine’ and ‘Rectovaginal fistula’ which held respectively the 21^st and 22^nd compared to pilot study 2 where they held the 11^th and 15^th rank.

Pilot study 1 versus pilot study 2

We also observed some differences of mean utility between study 1 and study 2. The differences ranged from −0.05 for ‘Schizophrenia, acute’ to 0.180 for ‘Headache tension type’ and we found higher agreement with health states located at the ends of the distribution, i.e. ‘Fractures, treated’ and ‘Schizophrenia’.

For ‘Severe wasting’, most of the participants of the second pilot study assigned a score of three (i.e., moderate) to all five EQ5D dimensions. Participants of the second pilot study evaluated that ‘Severe wasting’ caused more frequently none/moderate (Level 1 & 3) problems for mobility (Dimension 1), no problem of autonomy (D2) and weak (Level 2) pain/discomfort (D4). Participants of the first pilot study evaluated more frequently that ‘Severe wasting’ caused weak (Level 2) problems of mobility (D1), weak (Level 2) problem of autonomy (D2) and none (Level 1) pain/discomfort (D4).

For ‘Headache tension’ the participants of the second study attributed a higher level for three of the five EuroQol dimensions than participants of the first study. Participants of the second study evaluated more frequently that ‘Headache tension’ caused severe pain/discomfort (Level 4 – D4), severe problems to accomplish daily activities (Level 4 – D3) and moderate anxiety/depression (Level 3 – D5). Participants of the first study evaluated more frequently that ‘Headache tension’ caused moderate pain/discomfort (Level 3 – D4), moderate problems to accomplish daily activities (Level 3 – D3) and weak anxiety/depression (Level 2 – D5) (Table 4).

Table 4 Dimensions and levels frequency by study

Full size table

Overall, in the first pilot study the scores for mobility (D1) and autonomy (D2) dimensions were in average higher than in the second pilot study. In the second pilot study, the scores for activities (D3), pain/discomfort (D4) and anxiety/depression (D5) dimensions were in average higher than in the first pilot study (Table 4).

Discussion

DALY-based burden of disease studies play an increasingly important role in public health research [20]. As the required DWs for the health states under study may not be available, flexible and practical ways of eliciting DWs that are comparable with existing GBD DWs are therefore needed. We proposed a mapping approach based on a loess regression model to easily predict any GBD 2010/2013 DW from EQ-5D5L utilities. To our knowledge, the current study is the first to map EQ-5D utilities to GBD 2010/2013 DWs.

Recently, Burstein et al. used loess regression to map SF-12 utilities to GBD 2010 DWs [21]. They showed a weaker rank order correlation (−0.7) between GBD 2010 DWs and SF-12 scores than we found between DWs and utilities (> − 0.8). They also found that the relationship between GBD 2010 DWs and SF-12 scores was non-monotonic. The mapping function developed by Burstein et al. was less satisfactory than in the first pilot study but better than in the second pilot study. Burstein et al. selected 62 health states, collected data from a sample of 3791 respondents and corrected for outliers. Except that participants were enrolled in Seattle and during two GBD workshops, no information was available on age, educational or income level and disease experience status of their study population.

We observed major differences in the validity of the mapping function between both pilot studies, which could be influenced by the different study populations and questionnaire forms. Indeed, a higher prediction quality was observed when we derived utilities from a written version of the EQ-5D questionnaire compared to a web-based questionnaire. One explanation could be that during the first study, which included the written version of the EQ-5D questionnaire, one of the study leaders was present to answer the questions of the participants. This may have increased the understanding of the health state definitions. The wider standard deviations of the second pilot study underscore this hypothesis. Even though we used the updated version of the lay definitions developed by Salomon et al. [3], we still observed that descriptions were not straightforward to understand for lay population. In the second pilot study, we observed a weaker quality of the prediction for more severe disorders, for example ‘Schizophrenia, acute’, ‘Epilepsy severe’ or ‘Rectovaginal fistula’.

In addition, we observed some overlap between the health state descriptions developed by Salomon et al. [3] and the five dimensions included in the EQ-5D questionnaire that could have influenced the health state valuations. For example, the definition of ‘Fracture of pelvis: short term’ was, “…You have severe pain, and cannot walk or do daily activities” which provided information on the level of mobility, self-care, usual activities and pain/discomfort dimensions included in the EQ-5D5L questionnaire. We observed that utilities for health states with overlap in descriptions had lower variation. For example, in the second pilot study the lowest standard deviation was observed for ‘Amputation of toe’ (SD 0.127) and ‘Asthma, controlled’ (SD 0.128), both of them included information on pain and daily activities in their descriptions.

We also observed that there were some important differences of ranking between GBD 2010/2013 DWs and utilities derived in the second utilities study. Respondents of our study ranked amputation of both legs lower (more severe) and acute schizophrenia higher (less severe) than the respondents of the GBD 2010/2013 studies. In addition to the overlap between the health state descriptions described above, one explanation could be that for participants included in the pilot studies it was more difficult to imagine and evaluate functional limitations (D1 – D3) for psychiatric disabilities than for impaired mobility.

Several limitations may also have influenced the final results.

In the second pilot study, a sample of 27 GBD 2010 health states was used, and each health state was evaluated at least 30 times. Chuang et al. determined that each health state has to be at least evaluated 100 times to be representative of the population of responses [22]. We indeed observed better results in the first study, where each health state was at least evaluated 81 times.

The study population was not representative of the Belgian population. The first study population (n = 81) was composed of students in public health, mostly female and young adults. In the second study (n = 393), and despite the snowball strategy, most of the participants were between 20 and 39 years (72%), were highly educated. The web-based questionnaire as well as the age of two main coordinators of the study could be an explanation of the study population characteristics. Hopefully, Haagsma et al. demonstrated no significant effects of educational level on DWs for injury consequences in the Dutch population [17]. However other studies, which also used EQ-5D questionnaire, demonstrated that participants aged 18–59 evaluated health states less severely than those aged 60 and over and that older participants attributed less weight to morbidities and pain experience than younger [23, 24].

In addition, some authors reported that the judgment of people who are in a certain health state and health professionals differ significantly from judgment of healthy people [15]. Thirty-one percent of the respondents included in the second study had a disease experience and mostly were expert in public health, which might have influenced the results. Utilities could have been under or over estimated. However we do not recommend to restrict participants to healthy individuals because we believe that utilities have to represent the average ‘preference’ of health of the studied population.

We chose to include negative values of individual utility in the Cleemput’s model, indicating some health states to be worse than death for some participants. We obtained a negative value for one (25%) health state in the first pilot study and for 11 health states (41%) in the second pilot study. For a study including a larger population, it is recommended to constrain values between 0 and 1 to improve the prediction model [25] but in the practice, the issues raised by the negative values for EQ-5D health states are complex [26]. In addition, to perform the final mapping we arbitrarily chose that when DWs equated to 1, utility equated 0, indicating that it exists health state worse than death. This is one of the methodological and philosophical choice difference between the utilities derived from EQ-5D tool and DWs derived from pairwise comparison that could impact the predictions. This is also why assuming the DW to be equal to one minus the utility do not guarantee comparability with the GBD 2010/2013 DWs.

In addition, both ‘short’ and ‘long’ term health states were included in the study and the time framing was not explicitly defined. Some studies demonstrated that the duration of a health state has an impact on the health state valuations and that poor states of health became more intolerable the longer they last [17, 18, 23, 27]. However Salomon et al. showed that the framing of paired comparsion questions in terms of temporary or chronic outcomes in a pairwise comparison did not affect the valuation of the health states [3].

Finally, there are fundamental differences between the pairwise comparison (PC) and EQ-5D techniques.

First, GBD PC was anchored to Population Health Equivalence answers, while EQ5D Flemish tariffs [10] was anchored to Visual Analogue Scale. Although there are systematic differences between those methods, both are indirect and should not affect the mapping. Second, EQ5D questionnaire was designed to assess Quality of Life (QoL) of patients, with a given scenario of health states but DWs were designed to quantify the severity of a single health state. In other words, utilities are patient specific, whereas DWs are health state specific. In this study, we deviate from this original definition by defining health state specific utilities. Both methods also do not evaluate health states on the same dimensions of health [28]. With EQ-5D instrument participants have to evaluate health states on five dimensions of health, each of them including five levels of severity and with pairwise comparison, two health states are presented to healthy people and they have to decide which they regarded as being healthier based on their own judgment and experience. These fundamental differences of methodology can also explain the differences we observed between GBD 2010/2013 DWs and utilities.

Conclusion

This study suggests the possibility to translate any utility derived from EQ-5D scores into a DW, but also highlights important caveats. We observed a satisfactory result of this methodology when utilities were derived from a population of public health students, a written questionnaire and a small number of health states in the presence of a study leader. The results were however unsatisfactory when utilities were derived from a sample of the general population, using a web-based questionnaire. For the sake of validating the study, we recommend repeating the first study (i.e. using a printed version of the questionnaire coupled with an available support of the study leader and a small number of health states to evaluate) in a larger and more diverse sample to obtain a more representative distribution across educational levels and ages because DWs may vary between and within countries potentially impacting the mapping between utilities and GBD 2010/2013 DWs. Recommendations on future study designs in regards of an optimal sample size are not straightforward to provide, since knowledge on the variability of data in such studies is lacking. Moreover according to Brazier et al., the sample size (number of people giving responses) used in other mapping studies is very broad as it ranged from 68 to 23,647 [29]. We think that further studies using simulation process, could help to answer the sample size. However at least as important as the sample size, is the representativeness of the sample for the population at hand. To avoid selection basis a probability sample (e.g. random sample) of the participants is required.

References

Salomon JA, Vos T, Hogan DR, Gagnon M, Naghavi M, Mokdad A, Begum N, Shah R, Karyana M, Kosen S, et al. Common values in assessing health outcomes from disease and injury: disability weights measurement study for the global burden of disease study 2010. Lancet. 2012;380:2129–43.
Article PubMed Google Scholar
Haagsma JA, de Noordhout CM, Polinder S, Vos T, Havelaar AH, Cassini A, Salomon JA. Assessing disability weights based on the responses of 30,660 people from four European countries. Population health metrics. 2015;13(1):1.
Salomon JA, Haagsma JA, Davis A, Maertens de Noordhout C, Polinder S, Havelaar A, Cassini A, Devleesschauwer B, Kretschmar M, Speybroeck N, et al. Disability weights for the Global Burden of Disease 2013 study. Lancet Glob Health. 2015;3(11):e712–e723.
Salomon JA, Wang H, Freeman MK, Vos T, Flaxman AD, Lopez AD, Murray CJ. Healthy life expectancy for 187 countries, 1990–2010: a systematic analysis for the global burden disease study 2010. Lancet. 2012;380:2144–62.
Article PubMed Google Scholar
Murray CJ. Quantifying the burden of disease: the technical basis for disability-adjusted life years. Bull World Health Organ. 1994;72(3):429.
CAS PubMed PubMed Central Google Scholar
Devleesschauwer B, Havelaar AH, De Noordhout CM, Haagsma JA, Praet N, Dorny P, et al. DALY calculation in practice: a stepwise approach. Int J Public Health. 2014;59(3):571.
Article PubMed Google Scholar
Kretzschmar M, Mangen MJ, Pinheiro P, Jahn B, Fevre EM, Longhi S, Lai T, Havelaar AH, Stein C, Cassini A, et al. New methodology for estimating the burden of infectious diseases in Europe. PLoS Med. 2012;9:e1001205.
Article PubMed PubMed Central Google Scholar
Tromme I, Devleesschauwer B, Beutels P, Richez P, Leroy A, Baurain JF, Cornelis F, Bertrand C, Legrand N, Degueldre J, et al. Health-related quality of life in patients with melanoma expressed as utilities and disability weights. Br J Dermatol. 2014;171:1443–50.
Article CAS PubMed Google Scholar
http://www.euroqol.org/. Accessed 25 Apr 2016.
Cleemput I. A social preference valuations set for EQ-5D health states in Flanders, Belgium. Eur J Health Econ. 2010;11:205–13.
Article PubMed Google Scholar
Dolan P. Modeling valuations for EuroQol health states. Med Care. 1997;35:1095–108.
Article CAS PubMed Google Scholar
Greiner W, Claes C, Busschbach JJ, von der Schulenburg JM. Validating the EQ-5D with time trade off for the German population. Eur J Health Econ. 2005;6:124–30.
Article CAS PubMed Google Scholar
Lamers LM, McDonnell J, Stalmeier PF, Krabbe PF, Busschbach JJ. The Dutch tariff: results and arguments for an effective design for national EQ-5D valuation studies. Health Econ. 2006;15:1121–32.
Article CAS PubMed Google Scholar
Shaw JW, Johnson JA, Coons SJ. US valuation of the EQ-5D health states: development and testing of the D1 valuation model. Med Care. 2005;43:203–20.
Article PubMed Google Scholar
Dyer MT, Goldsmith KA, Sharples LS, Buxton MJ. A review of health utilities using the EQ-5D in studies of cardiovascular disease. Health Qual Life Outcomes. 2010;8:13.
Article PubMed PubMed Central Google Scholar
Pan CW, Sun HP, Wang X, Ma Q, Xu Y, Luo N, Wang P. The EQ-5D-5L index score is more discriminative than the EQ-5D-3L index score in diabetes patients. Qual Life Res. 2015;24:1767–74.
Article PubMed Google Scholar
Haagsma JA, van Beeck EF, Polinder S, Hoeymans N, Mulder S, Bonsel GJ. Novel empirical disability weights to assess the burden of non-fatal injury. Inj Prev. 2008;14:5–10.
Article CAS PubMed Google Scholar
Nord E. Methods for quality adjustment of life years. Soc Sci Med. 1992;34:559–69.
Article CAS PubMed Google Scholar
Browne K. Snowball sampling: using social networks to research non‐heterosexual women. Int J Soc Res Methodol. 2005;8(1):47–60.
Article Google Scholar
Devleesschauwer B, de Noordhout CM, Smit GS, Duchateau L, Dorny P, Stein C, Van Oyen H, Speybroeck N. Quantifying burden of disease to support public health policy in Belgium: opportunities and constraints. BMC Public Health. 2014;14:1196.
Article PubMed PubMed Central Google Scholar
Burstein R, Fleming T, Haagsma J, Salomon JA, Vos T, Murray CJ. Estimating distributions of health state severity for the global burden of disease study. Popul Health Metrics. 2015;13(1):1.
Article Google Scholar
Chuang LH, Kind P. The effect of health state selection on the valuation of EQ-5D. Med Decis Making. 2011;31:186–94.
Article PubMed Google Scholar
Dolan P. Effect of age on health state valuations. J Health Serv Res Policy. 2000;5(1):17–21.
Article CAS PubMed Google Scholar
Hofman CS, Makai P, Boter H, Buurman BM, de Craen AJ, Olde Rikkert MG, Donders R, Melis RJ. The influence of age on health valuations: the older olds prefer functional independence while the younger olds prefer less morbidity. Clin Interv Aging. 2015;10:1131–9.
Article PubMed PubMed Central Google Scholar
Chevalier J. Mesure de l’utilité attachée aux états de santé. Valorisation de l’index d’utilité EQ-5D et évolution de l’échelle actuelle en France. Université de Paris IX Dauphine; 2010.
Brooks R. The EuroQol group after 25 years. 2013.
Book Google Scholar
Dolan P. The effect of experience of illness on health state valuations. J Clin Epidemiol. 1996;49(5):551–64.
Article CAS PubMed Google Scholar
Grieve R, Grishchenko M, Cairns J. SF-6D versus EQ-5D: reasons for differences in utility scores and impact on reported cost-utility. Eur J Health Econ. 2009;10:15–23.
Article PubMed Google Scholar
Brazier JE, Yang Y, Tsuchiya A, Rowen DL. A review of studies mapping (or cross walking) non-preference based measures of health to generic preference-based measures. Eur J Health Econ. 2010;11:215–25. doi:10.1007/s10198-009-0168-z.
Article PubMed Google Scholar

Download references

Acknowledgments

We would like to thank prof. Arie H Havelaar (University of Florida) for his precious comments and advices and all the participants for their participation.

Funding

The Université catholique de Louvain (Brussels, Belgium) funded the study. All authors had full access to all study data, and the analysis, interpretation, and the decision to publish were solely the responsibility of the authors.

Availability of data and materials

R code and developed mapping functions are available through: https://github.com/brechtdv/eq5d-mapping. Developed questionnaire is available in Additional file 1.

Authors’ contributions

CMdN study design, data-collection, data analysis, drafting the manuscript; BD study design, data analysis, critically revising the manuscript; LG study design, data-collection, data analysis, drafting the manuscript; MP study design, critically revising the manuscript; JH study design, critically revising the manuscript; NS study design, critically revising the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interest.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information

Authors and Affiliations

Institute of Health and Society (IRSS), Université catholique de Louvain Clos Chapelle-aux-Champs, 30 bte B1.30.15, Brussels, 1200, Belgium
C. Maertens de Noordhout, L. Gielens & N. Speybroeck
Department of Public Health and Surveillance, Scientific Institute of Public Health, Rue Juliette Wytsman 14, 1050, Brussels, Belgium
B. Devleesschauwer
National Institute for Public Health and the Environment, Centre for Health and Society, P.O. Box 1, 3720, BA, Bilthoven, The Netherlands
M. H. D. Plasmans
Department of Public Health, Erasmus MC, Dr. Molewaterplein 50, 3015, GE, Rotterdam, The Netherlands
J. A. Haagsma
Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA, 98121, USA
J. A. Haagsma

Authors

C. Maertens de Noordhout
View author publications
You can also search for this author in PubMed Google Scholar
B. Devleesschauwer
View author publications
You can also search for this author in PubMed Google Scholar
L. Gielens
View author publications
You can also search for this author in PubMed Google Scholar
M. H. D. Plasmans
View author publications
You can also search for this author in PubMed Google Scholar
J. A. Haagsma
View author publications
You can also search for this author in PubMed Google Scholar
N. Speybroeck
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to C. Maertens de Noordhout.

Additional file

Additional file 1:

Section 1: Survey instrument: Example of one health state Web -questionnaire. (DOCX 30 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Maertens de Noordhout, C., Devleesschauwer, B., Gielens, L. et al. Mapping EQ-5D utilities to GBD 2010 and GBD 2013 disability weights: results of two pilot studies in Belgium. Arch Public Health 75, 6 (2017). https://doi.org/10.1186/s13690-017-0174-z

Download citation

Received: 12 October 2016
Accepted: 03 January 2017
Published: 06 February 2017
DOI: https://doi.org/10.1186/s13690-017-0174-z

Mapping EQ-5D utilities to GBD 2010 and GBD 2013 disability weights: results of two pilot studies in Belgium