Overweight and obesity are leading preventable causes of death worldwide (1). In 2008, over 2.8 million deaths were due to overweight or obesity (2, 3). Both conditions are associated with an increased risk of morbidity, and reduced life expectancy (4) and have been linked with increased healthcare use and medical costs over the past years (5). As prevalence and costs are expected to continue to rise, efforts to prevent overweight and obesity are warranted (6, 7).
Lifestyle interventions addressing modifiable risk factors such as physical activity and nutrition have shown to be promising methods for preventing overweight and its related morbidity (8, 9). For example, regular physical activity may reduce the risk of cardiovascular disease (CVD), including high blood pressure and diabetes (10, 11), as well as prevent weight gain (12, 13). A diet low in satured fat may reduce total cholesterol (14), while a diet high in fruits and vegetables may reduce the risk of CVD (15) and decrease body weight (16). However, a recent Cochrane review of interventions using counseling and education aimed at behavioral change showed no reduction in total or CVD mortality or clinical events in general populations (17).
Improving physical activity and dietary behavior through lifestyle interventions may not only be beneficial for health, but may also enhance quality of life, decrease healthcare costs, and increase productivity by decreasing illness and absenteeism (3). Occupational physicians (OP) have not conducted many lifestyle interventions although the occupational healthcare setting provides good opportunities to reach employees (18, 19). The Balance@Work project aims to evaluate the effectiveness of an occupational health practice guideline aimed at preventing weight gain among employees in the Netherlands compared to usual care. The intervention effects of guideline-based care on physical activity and dietary behavior after 6-months follow-up are the subject of another article (20). The aim of this study is to evaluate the effectiveness of guideline-based care during 18-months follow-up on body weight-related factors, CVD-risk factors, and quality of life.
Methods
The cluster randomized controlled trial (RCT) was conducted between 2009–2011 as part of the Balance@Work project. Details of the study design have been published elsewhere (20). The Ethics Committee of the VU University Medical Center approved the study protocol, and all participants signed informed consent. Relevant aspects of the study are reported following the Consort checklist for cluster RCT (21).
Study population
The Netherlands Society of Occupational Medicine, the professional association of OP, recruited the OP via a direct mailing to their member registry (>2100 OP). OP were asked to recruit ≥1 companies of medium or large size (>100 workers). Thereafter, OP recruited employees via a health risk appraisal consisting of anthropometric measurements and subsequent health advice. Employees were considered eligible to participate in the RCT when they: (i) had unhealthy levels of daily physical activity or dietary behavior (ie, not complying with public health physical activity or nutrition recommendations) (22–24) and/or were overweight (ie, waist circumference >80 cm for women and >94 cm for men); (ii) had completed a questionnaire in Dutch at baseline; (iii) were not on sick leave for >21 days; and (iv) were not pregnant or had a disease or condition that would make physical activity impossible.
Randomization, blinding, and sample size
Randomization to the intervention or control group was performed at the OP level. An independent researcher performed the randomization procedure using Random Allocation Software (version 1.0, Isfahan University of Medical Sciences, Iran). The intervention did not allow for blinding of participating OP who were asked not to reveal their group to employees or assistants performing measurements. An a priori power calculation to detect a difference in waist circumference of 1.5 [standard deviation (SD) 4.5) cm (25) with 80% power and an alpha of 5%] determined that 175 employees per group were needed at follow-up. Taking a loss to follow-up of 20–40% into account and possible dependency of employees within OP (regression analyses require independent observations) [intracluster correlation (ICC) of 0.20], a total of 600 employees among 20 OP were required at baseline (20).
Control group
OP in the control group were asked to provide care as usual, which generally consisted of a health risk appraisal with anthropometric measurements and subsequent health advice.
Intervention group
OP in the intervention group were additionally asked to provide guideline-based care. The development of the draft occupational health practice guideline has been described in detail elsewhere (20). Briefly, the guideline consists of three sections: (i) prevention at the environmental level (advice for the employer); (ii) prevention at the individual level (advice for the employee); and (iii) evaluation and maintenance.
For the first section, an environment scan was developed that provided an overview of environmental risk factors that could contribute to the prevention of weight gain (eg, availability of bike sheds and shower facilities, pricing strategies in cafeteria). Based on this overview, environmental goals could be prioritized, and feasibility and barriers for implementation could be discussed with the employer and the workers’ representative council at baseline and 6-months follow-up.
For the second section, prevention at the individual level, a minimal intervention strategy was developed for OP on how to promote the employee’s healthy lifestyle in five 20–30 minute counseling sessions during six months. For this purpose, OP were trained over two days in applying behavioral change counseling, an adapted form of motivational interviewing suitable for brief consultations in healthcare settings (20). After having discussed their risk profile and current health status, employees could choose the target behavior they would like to discuss (increasing physical activity, decreasing sedentary behavior, increasing fruit consumption, or reducing the energy intake derived from snacks) in the first counseling session. Next, ambivalence and motivation for change was assessed by discussing pros and cons of behavioral change, and willingness, importance, and perceived confidence to change behavior. OP then guided employees to make a decision on what behavior they needed to change and increased perceived behavioral control by asking employees to formulate a maximum of three implementation intentions. Last, employees set short- and long-term goals. In subsequent sessions, progress and barriers were discussed and short-term goals were adjusted if necessary. No specific weight loss advice was provided as the guideline aimed to prevent weight gain by improving employees’ physical activity and healthy dietary behavior. Moreover, obese employees could be referred to the Dutch guideline for treatment of obesity (26). To monitor their behavior, employees were provided with a toolkit containing a waist circumference measure tape, a pedometer, a diary, and leaflets on physical activity and nutrition from the Dutch Heart Foundations and the Netherlands Nutrition Centre.
Finally, the last section of the guideline considered the evaluation and maintenance of the previous sections.
Outcome measures
Outcome measures of this study were body weight-related outcomes [ie, waist circumference (cm), body weight (kg), and body mass index (BMI) (kg/m2)], biomedical risk factors [ie, systolic and diastolic blood pressure (in mmHG), total serum cholesterol (mmol/l)], and quality of life assessed at baseline and 6-, 12-, and 18-months follow-up. OP or their assistants performed the physical measurements. Questionnaires were used to assess quality of life.
Body weight-related outcomes
Waist circumference was measured as midway between the lower rib margin and the iliac crest to the nearest 0.1 cm. Participants were measured in standing position without heavy outer garments and with emptied pockets, breathing out gently (20). To standardize waist circumference measurement, OP or assistants were provided with a Seca 201 waist circumference measuring tape (Seca, Hamburg, Germany). As blinding of OP was not possible, independent researchers performed control measurements in a random sample of 141 employees during all measurements (8%). No differences were found between OP- and independent researcher-measured waist circumference [-0.4 (SD 4.5) cm; P=0.3], among and between intervention and control OP. Additionally, self-reported waist circumference was assessed from 1010 employees during all follow-up measurements (80%) using a non-stretchable paper measuring tape (range 0–130 cm) and written measurement instructions. Compared to OP, employees significantly under-reported their waist circumference by -1.4 (SD 3.9) cm (P<0.01). No difference was found between intervention or control participants. As employee measures tended to be less accurate, OP-measured waist circumference remains the best of the two options. Body weight and body height were measured with the participants standing without shoes and heavy outer garments. Participants were asked to push their heels softly to the wall, or the back of the stadiometer. BMI was calculated from measured height and weight as kg/m2.
CVD risk factors
Systolic and diastolic blood pressure (mmHG) were measured according to the standard Dutch protocol for blood pressure measurements (27) on employees in a seated position, after several minutes of rest. During the first consultation, both arms were measured twice. In follow-up consultations, the arm with the higher pressure was used if there was a difference of >10 mmHg between measurements. Otherwise, OP were advised to measure the same arm (preferably the left arm) across the remaining measurements in order to standardize measurements (28). As some OP performed blood pressure measurements once but others performed them twice, the first reading was used across all measurements (29). Readings of participants whose arms were measured interchangeably across measurements were excluded from analyses (N=13). The Reflotron or Accutrend finger capillary assay or lab assessments were used to assess total serum cholesterol (mmol/l). High density lipoprotein (HDL) and low density lipoprotein (LDL) cholesterol measurements were optional for OP, when feasible. Two intervention (N=76) and four control (N=69) OP were able to measure HDL and LDL at baseline. Therefore, these measurements were disregarded in this study.
Quality of life
We used the validated EQ-5D (30) to measure quality of life. Five questions were asked on self-reported mobility, self-care, activities of daily living, pain, and anxiety. The three answer categories were dichotomized into “no problems” versus “some problems” and “problems”, to address the fact that relatively few problems exist in general populations. Health status today was assessed using a visual analogue scale (VAS) ranging from 0 (worst imaginable health state) to 100 (best imaginable health state). Two additional questions were asked: “how would you rate your health in general? (“not good” versus “good”) and “compared to the last year, my health today is…” (“worse” versus “the same or better”).
Statistical analysis
Baseline differences between the intervention and control group were checked using t-tests for continuous variables and Chi-square tests for categorical variables using PASW software, version 18.0 (IBM, Armonk, NY, USA). To evaluate the intervention effects, multilevel regression analyses were performed for all outcome variables using MlwiN, version 2.18 (Center for Multilevel Modeling, University of Bristol, UK). Multilevel regression analyses are extended linear regression analyses that are most appropriate for analyzing longitudinal data sets containing correlated or clustered observations (31). In our dataset, a three-level data structure exists in which the four measurement occasions are clustered within employees, and employees are clustered within the 16 OP. To correct for this clustering, multilevel analyses adds a single variance parameter of the different regression lines of a level (for example for the 16 different OP) to the regression analyses. For each outcome variable, two multilevel regression analyses were performed. A crude analysis was performed to determine the differences between the intervention and control group at 6-, 12-, and 18-months follow-up reported using the regression coefficient (β) and 95% confidence interval (95% CI) for each outcome variable. Analyses were adjusted for the corresponding baseline outcome variable. Next, an adjusted analysis was performed to account for potential confounders (gender, age, irregular work hours).
Confounding was assigned when >10% change occurred in the regression coefficient. Effect modification was considered for age, gender, BMI, and the 10-year risk of fatal CVD measured at baseline using P<0.10 of the interaction term to indicate effect modification. The 10-year risk of fatal CVD was estimated using the European Systematic Coronary Risk Evaluation instrument (SCORE), using gender, age, smoking status, total cholesterol and systolic blood pressure (32). To be able to better discriminate between subjects’ risk, age was extrapolated to 60 years. This follows the method proposed by Lakerveld et al (33) to address the feature of the SCORE instrument that subjects <60 years have almost no 10-year risk of fatal CVD. The continuous SCORE variable was dichotomized at a minimum risk of 10% for CVD to assess the risk status of our population.
Next, using the output from MlwiN, the dependency of observations [intracluster correlation (ICC)] of employees within OP was assessed for each outcome measure. The ICC was calculated by dividing the between OP variance by the total variance, which is calculated by summation of the between and the within OP variances (31). ICC >0.10 indicate a moderate-to-high dependency of observations, in which case multilevel analyses are appropriate as they adjust for this dependency of observations (31). Finally, to test the robustness of our results, we performed sensitivity analyses using imputed data and complete cases data (31, 34). P<0.05 were considered to be significant.
Results
Participants
After recruitment, 38 OP expressed an interest to participate. Of these, 28 OP were eligible and randomized to either the intervention or control group. The Balance@Work project started with 7 intervention and 9 control OP because 12 OP withdrew after randomization (figure 1). No significant differences were found between OP who completed the study and OP who withdrew between randomization and baseline on demographic-, behavior-, or job-related characteristics of their worker population. Moreover, intervention OP did not differ significantly from control OP at baseline. During the 6-month intervention period, none of the OP were lost-to-follow-up. After 6 months, three OP were lost to follow-up due to ending of their contracts and a reorganization, as a result of which the Balance@Work team took over the follow-up measurements. Moreover, six OP perceived difficulties in collecting data during the follow-up measurements due to pregnancy, sick leave, resistance of an employer, switching jobs, and time constraints; one OP passed away. Therefore, several follow-up measurements were discontinued (figure 1). OP recruited 524 participants between March 2009 and March 2010. All employees met the inclusion criteria. One underweight subject was excluded from analyses because of having a weight gain goal. The baseline characteristics of the two study groups are described in table 1. Intervention employees were significantly different from control employees on age and irregular work hours at baseline. Intervention subjects were younger (46 versus 48 years) and worked less irregular work hours (19% versus 29%). After 18 months, 71 intervention (26%) and 57 control (23%) employees were lost-to-follow-up. These subjects were significantly younger, more often female, and had a lower income than those who completed the study, but this did not differ significantly between the intervention or control group.
Intervention effects
Body weight-related outcomes
Table 2 presents the values for waist circumference, body weight, and BMI for baseline and 6-, 12-, and 18-months follow-up per study group, as well as the results of the multilevel linear regression analyses. In general, no statistically significant intervention effects were found for waist circumference, body weight, and BMI. The dependency of employees within OP (ICC) was 0.15, 0.15, and 0.13, respectively. Contrary to what could be expected based on the group means, the multilevel analyses at 12-months follow-up showed a significant increase in waist circumference among the intervention group (β 2.7 cm, 95% CI 0.8–4.6). This difference disappeared in the longitudinal analyses using all follow-up measurements. Moreover, gender and BMI modified the intervention effect on waist circumference. Waist circumference increased among men and women in the intervention group compared to the control group (β 2.5 cm, 95% CI 0.5–4.5; and β 0.4 cm, 95% CI -1.4–2.0, respectively). These differences were significant among men. Also, waist circumference increased among normal weight, overweight and obese participants in the intervention group compared to controls (β 0.1 cm, 95% CI -1.9–2.1; β 0.6 cm, 95% CI -1.1–2.4, and β 2.7 cm, 95% CI 0.6–4.7, respectively). These differences were significant among obese intervention compared to obese control participants.
CVD risk factors
No significant intervention effects were found for systolic and diastolic blood pressure (table 2). Also, no effects were found on total serum cholesterol. The dependency of employees within OP was 0.12, 0.12, and 0.08, respectively. Although all values slightly decreased in both groups at 6-months follow-up, all values regained at 12- and 18-months follow-up.
Quality of life
The intervention did not result in significant effects on quality of life indicators (table 3). A significant increase among the intervention group was found on health status assessed by VAS scale at 18-months follow-up (β 3.0, 95% CI 0.5–5.5), but this difference disappeared in the longitudinal analyses using all follow-up measurements. Values for the dependency of employees within OP were not applicable.
Discussion
The aim of this study was to evaluate the effectiveness of a draft occupational health guideline aimed at preventing weight gain among employees in the Netherlands. No significant effects were found on body weight-related outcomes, CVD risk factors, or quality of life. Stratified analyses showed an increase in waist circumference among men and obese intervention participants.
The results of this study on body weight-related outcomes contrasts two meta-analyses of workplace physical activity and nutrition interventions that found moderate evidence for a net decrease in body weight of -1.3 kg and -1.2 kg, and BMI of -0.5 kg/m2 and -0.3 kg/m2, respectively (13, 35). Contrary to our study, the majority of the studies in the meta-analyses aimed to reduce CVD risk or improve physical fitness, via programs that often included exercise schemes and that were generally more intensive than ours. A more intensive intervention might thus be needed produce better effects. However, the programs used in many of the trials far exceed what may be feasible in routine clinical practice (13, 17). The significant increase in waist circumference among the intervention group at 12-months follow-up may be the result of non-random missing data that occurred due to time constraints at one intervention company. Although multilevel analyses are stated to be appropriate for analyzing data sets with missing data (31), our results suggest our analyses were influenced by non-random missings at the 12-months measurement. Ignoring these problems may lead to an over- or underestimation of results and significance. The sensitivity analyses revealed that effects on body weight-and health-related outcomes differed slightly between imputed data, multilevel data, and complete cases (data not shown). Nevertheless, it seems unlikely that our conclusions on the effectiveness of the guideline would change if complete data was present, considering the similar lack of results across analyses. Moreover, multilevel analyses are justified as adjustment for the dependency of observations was necessary (ICC>0.10) (31).
Remarkably, however, obese and male intervention participants increased in waist circumference. A possible explanation may be that the guideline was not suitable for obese employees. Attendance rates among obese participants were significantly lower compared to normal or overweight participants (data not shown). Moreover, others have shown that more intensive interventions may be necessary for obese workers, including guided dieting and physical activity, psychological interventions, and when necessary medication or surgery (26). Obese employees may therefore best be referred to the Dutch guideline for treatment of obesity (26). The increase in waist circumference among male intervention participants was not influenced by attendance rates, but, based on additional interviews, we found that dissatisfied participants under one intervention OP significantly increased waist circumference and body weight (data not shown). Nevertheless, the value of these results could be argued as, although significant, the increase in waist circumference may not have been clinically relevant (36, 37).
The short-term evaluation of the guideline (38) showed that directly after the 6-months intervention period, guideline-based care resulted in a more favorable sedentary behavior at work (β -28 minutes per day, 95% CI -2–-54) and increased fruit intake (β 2.1 pieces per week; 95% CI 0.6–3.6), but did not improve employees’ physical activity, snack intake, or body weight-related outcomes. After 18-months follow-up, only the effect on increased fruit intake (β 1.9 pieces per week; 95% CI 0.4–3.4) was sustained (39).
The overall lack of effectiveness of the guideline may be due to several factors. First, the guideline may have been poorly implemented, limiting the ability to detect effects. Although attention to both environmental and individual influences was incorporated in the current draft guideline (40), intervention OP did not perform the environmental component and counseling to the full extent necessary (41). Process data indicated that improvements could be made on both the content and performance of the guideline, such as better practical guidance or materials for OP, better training on relapse prevention and physician–employer communication skills, and more feedback on changing routines (35). Moreover, one control OP applied co-intervention. Also, control employees received four health risk appraisal with feedback for evaluation purposes as well, which in itself may have motivated control participants to change their behavior (the so-called “Hawthorne effect”) (42). These findings suggest that the contrast between the intervention and control group may have been too small. The question remains if the guideline could be effective in case of optimal implementation. Secondary analyses among intervention participants at 6-months follow-up suggested greater results can be achieved on waist circumference and body weight among those with higher attendance (5 versus <5 counseling sessions) and satisfaction scores (8 versus <8 on a scale 1–10) (41). These differences, however, were not sustained at 18-months follow-up. Moreover, due to a lack of effect on performance indicators, no one part of the guideline could be identified as being more effective.
Second, the evaluation of the guideline among a general workforce may have provided little room for improvement in the outcome measures. Two systematic reviews recently described that interventions using counseling and education aimed at behavior change may not reduce CVD morbidity or mortality in general populations (17, 43, 44). Comparable programs among high-risk populations indeed found better results, such as modest reductions in blood pressure, cholesterol, and weight, possibly because high risk participants are more likely to achieve measurable changes in behavior (18, 44, 45). The high risk approach however, does not solve the origin of the problem (46). It may be worthwhile to evaluate an adapted form of the guideline among high risk groups, such as populations at risk for CVD, hypertension, or diabetes. To achieve a meaningful degree of prevention and protection at the workplace, ultimately a combination of primary, secondary, and tertiary interventions may be needed (46). Also, there is ongoing debate what approach may be most suitable for obesity prevention. Although single and multiple factor interventions have shown positive effects, maintenance of effects is usually problematic (47). In our study, we chose to include both individual and environmental strategies as this combination was suggested as a promising strategy to facilitate change in complex health behaviors among relatively healthy people (48). The feature of our healthcare delivery system to divide complex health problems into smaller, more understandable parts, functions well for acute and simple diseases such as acute appendicitis. For chronic and complex diseases such as obesity, however, multiple factors are often responsible for the disease development or presentation (49). Whether a system that accounts for these multiple factors provides better and more effective care remains to be explored (49).
A final point that should be considered is the quality of the measurement instruments. Despite the use of standardized protocols, the fact that our study was performed in daily practice made it impossible to standardize blood pressure and cholesterol measurements. This is important for evaluative research, as variations in blood pressure and cholesterol measurements may lead to different results (29, 50). Nevertheless, we do not feel that these variations affected our study greatly considering our lack of results. As for measuring quality of life, the generic EQ-5D – a standardized health-related quality of life questionnaire developed to provide a simple, generic measure of health for clinical and economic appraisal (30) – may not have been appropriate in our population due to ceiling effects (51). However, among populations with a high gross domestic product (GDP) per capita (eg, the Netherlands) high mean VAS ratings are found (51). Moreover, general populations report few problems on quality of life domains (30). Thus, the assessment of quality of life for evaluation purposes may have limited value in intervention studies using general populations, such as ours.
The strengths of this study include the practice-based nature of the guideline and the minimized risk of contamination due to randomization at the OP level. There were some limitations as well. Due to randomization at the OP level, intervention and control employees were significantly different at baseline with respect to age and irregular work hours. We attempted to resolve this selection bias by controlling for these variables in all analyses. Also, as only a small group of all OP in the Netherlands (2%) participated in our study, therefore the generalizability of our results may be limited towards the total population of Dutch OP. Our study could thus be considered a pragmatic trial (ie, a trial reflects the heterogeneity of patients in actual clinical practice, minimizes exclusion criteria, and allows for variation in context, diagnosis and treatment), as opposed to an efficacy trial (ie, a trial that determines if an intervention produces the expected result under ideal circumstances) or an effectiveness trial (ie, a trial that measures the degree of beneficial effect under “real world” clinical settings) (36). Moreover, smoking behavior was not included in the evaluation of our study while smoking behavior may influence weight and CVD risk factors (52). As smoking behavior was not a goal of our intervention and it did not differ at baseline between groups, we did not expect to find effects. Nevertheless, because we did not assess this information, it is neither possible for us to check for changes in smoking behavior between the intervention and control condition nor to assess the influence of (changes in) smoking behavior on weight and CVD risk factors. Moreover, the occurrence of non-random missing data at the third measurement is probably due to time constraints at one intervention company; indicating attrition is a major problem. We encourage future statistical research to examine the issue of non-random missing data in multilevel analyses. Finally, many OP perceived difficulties collecting data for the follow-up measurements, indicating supporting structures for OP, such as a linkage group, may be necessary for long-term performance of the guideline (41).
Concluding remarks
The draft occupational health guideline was not effective in preventing weight gain, reducing CVD risk factors, or improving quality of life during 18-months follow-up among the intervention compared to the control group. Thus, we cannot recommend implementation of the guideline in its current form. It may be worth evaluating an adapted, more intensive form of the guideline among high risk groups. Also, more attention could be paid to maximizing attendance and satisfaction rates, as this may favorably affect results. Finally, future research should determine feasible ways to prevent weight gain effectively via occupational health services among general populations in the workplace.