Since the first discovery of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) at the end of 2019 (1), the incidence of COVID-19 rapidly increased with a substantial number of infected people across all countries around the world. Direct from the beginning of the pandemic, governments implemented far-reaching measures to change population behaviors in order to limit social contacts outside the household, and large scale testing for infections (2). This was extended with mass vaccination programs, starting with healthcare workers in January 2021 followed by other workers from February onwards in The Netherlands.
Even though the pandemic had a high impact on the entire working population, the risk of exposure to Sars-Cov-2 at work differed across occupations. During the strict lockdown in the first wave, exposure to Sars-Cov-2 was highest in essential occupations, as these occupations demand on-site work and involve close proximity with colleagues or the general public without the availability of personal protective equipment (3). Examples of occupations at increased risk of infection or death from COVID-19 were healthcare workers (4–11), workers in the transport sector (4, 7, 8, 11–13) and cleaners (5, 9). The risk of exposure to Sars-Cov-2 differed not only across occupations but also across pandemic waves. Whereas health care workers had the highest infection rates during the first wave (14), other workers were at higher risk (eg, cooks, bartenders) in later phases of the pandemic due to reopening of sectors while having less access or usage of face coverings and other preventive measures (10, 11).
Even though previous studies have shown that the workplace appears to be one of the key-settings for Sars-Cov-2 exposure, the most important setting in the spread of Sars-Cov-2 remained the household (15). The transmission within households even increased over time due to relaxation of restrictions (16). Also, an increase in transmissibility was noticed between children and adults after reopening of schools and because children became more susceptible to newer variants such as Delta and Omicron (17, 18). Outside the household, living in urban areas with higher population density and increased use of public transport is also a potential risk factor for the spread of Sars-Cov-2 (19, 20). Accordingly, the extent to which occupational exposures are associated with a COVID-19 infection, other settings and factors, such as household composition and living areas, should also be considered.
As the COVID-19 pandemic progressed, the testing facilities and guidelines changed in The Netherlands and testing increased both for people with and without symptoms. From June 2020 onwards polymerase chain reaction (PCR) tests and rapid antigen COVID-19 tests became widely available in The Netherlands. Widespread testing enables researchers to obtain a better understanding of risk factors for infections in the COVID-19 pandemic by using a test-negative design. In this design, people who are tested positive are defined as ‘cases’, while others – symptomatic or asymptomatic – who are tested negative are ‘controls’ (21). The test-negative design can be used to detect differences in risk factors between symptomatic persons who have COVID-19 (test-positives) and those who have other respiratory infections or no infection at all (test-negatives) (22). This design has quickly gained popularity recently to study risk infections or effect of vaccinations because it attempts to reduce confounding bias due to health-care-seeking behavior and severity of the symptoms (22). Direct comparisons of test-positives to test-negatives by applying this design might be helpful to gain insight into specific risk factors for becoming infected and symptomatic, and to unravel the effects of occupational exposure during the pandemic.
The aim of the current study was to systematically quantify the associations between occupational exposure and the risk of a positive test during the pandemic, while correcting for previous positive test result(s), personal factors, household composition and residence area for the entire pandemic period and across three different pandemic waves by applying the test-negative design.
Methods
Data
Different data sources from Statistics Netherlands and Municipal Health Services were used which are linked by a unique personal identification number. Firstly, the Dutch Labour Force Survey (DLFS) is an annual rotating panel in which a representative group of people aged ≥15 years who live in The Netherlands received five questionnaires over a period of 12 months on a variety of topics, including sociodemographic factors and occupation (23). People participating in the DLFS over the period 2010–2021 were included in the current study.
Secondly, the personal records database was added to retrieve data on education and household composition.
Thirdly, information regarding whether and when a participant died was also included.
Fourthly, DLFS data were enriched with the social statistical database (SSB) over the same period (24). This dataset contains monthly objective information on main income components, social benefit pensions and gross wages derived from the Dutch tax.
Fifthly, data on the residence address was used to define the province of residence and urbanity of residence area.
Lastly, these data from Statistics Netherlands were matched with the CoronIT database from the Municipal Health Services in The Netherlands which consist of Sars-Cov-2 test results for the period 1 June 2020 to 31 August 2021. In this period, testing was available for everyone experiencing symptoms compatible with COVID-19 as well as for contact tracing. This database includes all test results from residents in The Netherlands who applied for a test at the Municipal Health Services, but does not contain test data from hospitals and commercial test centers.
Study population
Participants in paid employment and with information on their job title were selected from the DLFS (N=1 356 982). Participants were only included when they were aged 18–74 years and still employed in June 2020 (ie, working population). Data from these participants were linked to the personal records database (N=1 279 943). Only participants who were tested for COVID-19 during the entire study period were included (N=616 855). Based on the SSB, individuals were also excluded if they changed their job or employment status (ie, self-employment, receiving benefits) between their participation in the DLFS and June 2020 (N=409 821). The final sample consisted of 207 034 workers. When the participant reached the age of 75 years old, changed jobs, or left paid employment during the test data period, the test results were excluded from that specific time point onwards.
COVID-19 test data
The primary outcome was the binary serological status (positive versus negative) for SARS-CoV-2—a marker of natural infection systematically recorded by the Municipal Health Services and retrieved from the CoronaIT dataset. The database consists of information on test date and test result for each test at individual level in the period 1 June 2020 to 31 August 2021. In this period, the Alpha variant was dominant up to June 2021 followed by the Delta variant in July and August 2021. We removed any positive tests following within 8 weeks of the first positive test. A positive test result at a previous time was included in the analyses as covariate. A previous positive test results was related with a lower risk to be infected again, but is not equally distributed across jobs. It was not possible to differentiate between the two tests (PCR and rapid antigen test) within the database.
Occupation
Self-reported on occupation were retrieved from the DFLS and coded by Statistics Netherlands into four-digit job codes according to the International Standard Classification of Occupations 2008 (ISCO-08) (25).
Occupational exposure to Sars-Cov-2
Exposure to Sars-Cov-2 at work at level of occupation was determined by an expert COVID-19 job exposure matrix (COVID-19-JEM; 26) The COVID-19-JEM exists of eight dimensions: four dimensions on transmission risk (number of contacts, nature of contacts, contaminated workspaces (eg, sharing surfaces, tools, and equipment) and location); two dimensions on mitigation measures (social distance and face covering); and two dimensions indicating precarious work (proportion of workers per occupation with income insecurity and proportion of migrant workers). The COVID-19-JEM was developed as a basic state of the pandemic, meaning that general measures are taken (social distancing, washing hands, face covering in public places, working from home as possible) but without closure of sectors. For each dimension, all 436 occupations within the ISCO-08 were assigned an exposure risk ranging from 0–3 (no, low, intermediate, and high probability). For the current study, the Dutch version of the COVID-19-JEM was used, which has been described extensively elsewhere (26).
Other variables
Age, gender, educational level, ethnic background, household position, having children living at home, province of the residence area and the urbanity of the residence area were also included. For each variable, information of the participant was set at time of test date.
Age, ethnicity and gender were retrieved from the DFLS. Ethnic background was defined as the country of origin and categorized into a Dutch background, a Western background and a non-Western background. Education was based on the personal records database and defined as the highest level of education completed, and categorized into low, intermediate and high.
Household position is obtained from the personal records database and categorized into four categories: single household, single parent, part of a couple, and other. Having children living at home was also included as variable. Children living at home was categorized into (i) having children until the age of 12 years (ii), having children of ≥12 years or older (iii), having children both < and ≥12 years of age, and (iv) not having children living at home.
Province and urbanity of residence area were based on the personal records database. Province of residence area consisted of the 12 provinces in The Netherlands. The urbanity of the residence area was defined as the number of addresses per square kilometer and was categorized as non-urban (<500), mildly (≥500 and <1000), moderately (≥1000 and <1500), highly (≥1500 and <2500), and very highly (≥2500).
Statistical analyses
A test-negative design was applied in which the risk for a positive test was analyzed in a conditional logit model with test date as strata, controlling for age, gender and previous positive test result. This is essentially a case–control type of analysis – where cases test positive and controls test negative – intended to remove or minimize selection biases that can arise when some subgroups of the population get tested more than others (27). As a first step, minimally adjusted models were estimated for each JEM dimension and the overall JEM score as well as for each potential confounder. Secondly, these minimally adjusted models for the JEM dimensions were each additionally adjusted for all other potential confounders (ie, previous positive test, educational level, ethnical background, household position, children living at home, province and urbanicity). Thirdly, all JEM dimensions were modeled with the same confounder adjustment. For the full multiple regression model, the predicted values were extracted per occupation at the 3-digit ISCO 2008 aggregate level. The conditional logit models were estimated with the “clogit” function from survival package using the Efron approximation method to estimate the likelihoods (28) in R statistical software version 4.1.3 (29).
All analyses were conducted for the overall test period and for each wave separately. We defined three different time periods based on the lowest infection rates in The Netherlands (30) – wave 1: 1 June 2020 to 8 February 2021; wave 2: 9 February to 28 June 2021; and wave 3: 29 June to 31 August 2021.
Results
The final study population consisted of 207 034 workers with a mean age of 45.2 [standard deviation (SD) 12.7] years, and half of them were male (49%; table 1). Of all participants, 12% was low educated, 42% was intermediate educated, and 46% was higher educated. The majority are of Dutch origin (86%), live in household as part of a couple (76%) and have children living at home (62%). Just over half the participants live in areas with high or very high population densities (53%). The risk to be exposed to Sars-Cov-2 at work, as defined by the eight dimensions of the COVID-19-JEM, can be found in the supplementary material (www.sjweh.fi/article/4086), table S1. The characteristics of the study population are in general similar over all pandemic waves. Over the entire study period, 9% of all tests were positive, and the percentage was slightly higher in the first wave (ie, 10%) as compared to waves 2 and 3 (both 9%). Of all workers, 19% of the people had at least one positive test (wave 1: 15%, wave 2: 13% and wave 3:10%).
Table 1
a Wave 1: June 2020 to February 8 2021, Wave 2: February 9 to June 28 2021, Wave 3 June 29 to August 31 2021.
Table 2 shows the results of minimally adjusted analyses, showing odds ratios (OR) for a positive test for all factors included in the analyses. The results of these analyses show that for all dimensions of occupational exposure increased odds of a positive test were observed for the higher risk categories compared to no risk category. The only exception was for the high risk category for the nature of contacts during wave 2.
Table 2
a All minimally adjusted analyses are adjusted for gender and age. b Wave 1: June 2020 to 8 February 2021, Wave 2: 9 February to 28 June 2021, Wave 3: 29 June to 31 August 2021.
A previous positive test decreased the odds of a positive test. Personal factors, such as low and intermediate educational levels, a non-western ethnic background, and being male, as well as living with older children increased the odds of a positive test (table 2). Significant differences in risk of infection between provinces were observed, with the highest infection rates generally found in the Southern provinces of The Netherlands (eg, Limburg, Brabant, Zuid-Holland). A very high urban area decreased the odds of a positive test for the entire study period, while no clear pattern between urbanity and the odds of a positive test was found (table 2).
After adjustment for a previous positive test, personal factors, household compositions and residence, for most of the dimensions of the COVID-19 JEM (eg, working at home/alone) increased odds for positive test were observed for the higher risk categories, compared to the lowest risk category (table 3, partially adjusted models). Over the entire study period, the odds for a positive test ranged from 1.03 (95% CI 1.00–1.07) for low income security to 1.13 for an elevated risk from contaminated workspaces (95% CI 1.09–1.16) and face covering (95% CI 1.10–1.16), and the highest proportion of workers with high job insecurity (95% CI 1.09–1.17). The partially adjusted odds showed similar patterns across the three pandemic waves, except for job insecurity and migrant workers. For high job insecurity, no significant increased odds for a positive test was observed at wave 1, in contrast to wave 2 and wave 3. A high risk due to the proportion of migrants was significant related to a positive test in wave 2 [OR 1.16 (95% CI 1.04–1.29)], but not for the other pandemic waves.
Table 3
a Partially adjusted analyses are corrected for a previous positive test, all personal characteristics and residence area (supplementary file 3). b Fully analyses are corrected for a previous positive test, personal characteristics, residence area and occupational exposure (supplementary file 3). c Wave 1: June 2020 to 8 February 2021, Wave 2: 9 February to 28 June 2021, Wave 3: 29 June to 31 August 2021. * Due to collinearity, number of contacts was not included in the fully adjusted models.
In the fully adjusted multiple regression models, all models were adjusted for a previous positive test, personal factors, household composition, residence area and all other dimensions of occupational exposure (table 3). We had to exclude number of contacts as a risk factor because of multicollinearity with nature of contacts and contaminated workspaces (correlation >0.9, supplementary table S2). A higher risk of exposure due to contaminated work spaces or low levels of use of face coverings resulted in higher odds of a positive test for the entire study period [OR 1.32 (95% CI 1.20–1.44) and OR 1.96 (95% CI 1.36–2.82), respectively] and across the first two pandemic waves [OR 1.27 (95% CI 1.12–1.43) and OR 1.87 (95% CI 1.15–3.03)]. Exposure to income insecurity resulted in higher odds of a positive test in the entire study period [up to: OR 1.11 (95% CI 1.05-1.17)) and in wave 1 (up to: OR 1.09 (95% CI 1.01–1.17)] and wave 3 [up to: OR 1.31 (95% CI 1.12–1.54)]. After adjusting for all other factors, exposure risks due to lack of social distance actually resulted in lower odds of a positive test in the overall period [OR 0.56 (95% CI 0.37–0.83)] and wave 1 [OR 0.58 (95% CI 0.34–0.98)]. The associations disappeared for nature of contacts, location and migrants workers for the entire study period and all three pandemic waves. Supplementary table S3 shows the OR and 95% CI for all variables included in the partially and fully adjusted models.
Figure 1–4 Figure 2 Figure 3 Figure 4 shows the 25 jobs with the highest predicted risk of a positive test for the entire pandemic period and across the three different pandemic waves. Regarding the entire study period, cleaners, refuse workers and machine operators had the highest predictive risk of a positive test. The three pandemic waves showed differences, refuse workers and machine operators have the highest risk in wave 1, mining and construction labors, cleaning workers and machine operators in wave 2, and food preparation assistants, cooks, other elementary workers, waiters and bartenders in wave 3.
Discussion
Nine percent of tests were positive during the entire study period and being exposed at work was related to a higher risk of a positive test in the period July 2020 to August 2021. The fully adjusted models suggest that contaminated workspaces, (lack of) face covering and income insecurity are the most important work-related risk factors for a positive test. Some differences were found over the study pandemic waves; contaminated workspaces and face covering were relevant in the first two pandemic waves, whereas income insecurity was especially important in the third wave. The risk to have a positive test differed across occupations and over time, whereby cleaning workers, refuse workers and machine operators had the highest predicted risk of a positive COVID-19 test for the entire pandemic period.
The minimally adjusted analyses showed that occupational exposures are associated with a positive test, which is in line with previous research on occupational exposure and COVID-19 infections (31–33). After correcting for other factors including children living at home and residence area, occupational exposure to Sars-Cov-2 still elevated the risk to be infected but became smaller for all dimensions and across all pandemic waves. Similarly, Rhodes et al (33) showed that the clear exposure–response relationship for the COVID-19 JEM dimensions on transmission risk and mitigation measures were reduced and even disappeared over time (33). The reason why the current study did not observe a complete disappearance could be explained by that fact that the current study covered a shorter period with still many restrictions and no roll out of the booster vaccination program yet.
After adjusting for all dimensions, contaminated workspaces, social distancing, face covering and job insecurity remained significantly associated with a higher risk of a positive test. For income insecurity, the odds for a positive test was highest in wave 3 (June–August 2021), probably due to the relaxation of measures whereby workers in occupations with a high income insecurity (eg, restaurant, retailers) were allowed to work. Due to collinearity between the dimensions, we could not draw any conclusion on which specific occupational exposure was the most important to drive the observed differences in infections. For example, even though it is unknown to what extent surface contamination contribute to outbreaks, the current study showed clear associations with a positive test. It might be that surface contamination also covers social distancing and nature of contacts. Furthermore, scientific evidence has shown that airborne transmission more often occurs in enclosed environments with poor ventilation (34), and poor ventilation is thereby also a risk factor in the work setting (35). The COVID-19-JEM did not include exposure to poor ventilated workplaces. Future research on occupational exposure and COVID-19 is needed to investigate the importance of each transmission risk including ventilation and vaccinations, which ask for an updated version of the COVID-19-JEM.
The risk to be infected differed across occupations and over time, with the highest predicted risk of a positive COVID-19 test for cleaning workers, refuse workers and machine operators over the entire period coved in this study. These results are in line with other studies identifying groups with a higher risk of a positive COVID-19 test (36), hospital admission (5, 37, 38) or even mortality (9–11). Occupations with the highest predictive risk varied over time, with cleaners, refuse workers and operators at the top during the first wave, operators and other high skilled workers during the second wave and food preparation assistants, waiters, bartenders, and cooks in the last wave. The latter is not surprising due to the reopening of restaurants and bars. Contrary to previous research (10, 11, 33, 38), it should be noticed that healthcare workers were not amongst the group of occupations with the highest risk in the current study. This can be explained by the absence of data of the first months of the COVID-19 pandemic in The Netherlands and the absence of test data from test centers located within hospitals. This probably resulted in an underestimation of the risks for infection at work in healthcare workers.
Strengths of the current study include its large study population, administrative data on all variables, except occupation, and distinguishing three different waves. Another major strength was the use of the test-negative design to avoid selection bias, where participants with a positive test were compared to those who were tested negative (27). As noted above, the test-negative design is intended to address biases arising from differences in the propensity to be tested. Whether someone was actually tested was a personal choice, influenced by symptoms, availability of healthcare, health seeking behavior, etc. This is exactly the situation that the test-negative design is intended to address, since it provides adjustment for these personal differences in propensity to be tested (22). However, two potential limitations of the test-negative design should be acknowledged. Firstly, the test-negative design will not provide valid effect estimates for factors that also affect the risks of other infections (22); we consider it unlikely that the occupations we have considered have increased risks for other infections during the pandemic period of the current study, but of course this cannot be ruled out. Secondly, it would have been desirable to also adjust for the reason for testing (39), but unfortunately this information was not available. This would be likely to lead to an underestimation of the OR for occupations which are regularly tested, in comparison with occupations that only involve symptomatic testing. Once again, we do not consider that the main occupations we have focused on would have substantial differences in testing policy, but this cannot be ruled out. In this context, we note that for almost all of the variables considered, we have found similar findings across the three waves, when different testing policies were operating; this would not be the case if the above-mentioned biases were having a significant effect on our findings. Some other limitations should be noted as well. Firstly, the sample size in the current study is not representative for the general working population as it does not include self-employed people and underrepresent workers that just entered the labor force. In addition, a relatively large number of workers were excluded from the dataset as they left paid employment between their DLFS. Due to selection within the DLFS, workers were more often higher educated (46%), with a Dutch background (86%) and female (51%) than in the Dutch working population (42%, 75% and 47%, respectively) (40). Moreover, it is likely that certain occupations will be underrepresented in the DFLS survey, such as those with a high proportion of low educated workers or non-white ethnic workers. Secondly, the CoronaIT database on Sars-Cov-2 infections has several limitations such as the time period (eg, missing the first months of the pandemic and no access from August 2021 onwards) and the absence of test results of some test centers (ie, hospitals and commercial test centers). Thirdly, as no information is available on the vaccination status of workers, we were not able to take this into account as variable which might differ across occupations (41). Fourth, the COVID-19-JEM was developed for a basic state, meaning that all sectors are (partly) open. However, different governmental measures were announced for specific months, meaning that some sectors were closed for a specific period, while the JEM indicate the jobs in these sectors are at elevated or high risk. As measures quickly changed over time, we were not able to adjust the JEM to all different measures in the current study.
Concluding remarks
Occupational exposures are associated with a higher risk of a positive COVID-19 test among workers. Even though adjusting for other factors strongly reduced the odds of being infected, most occupational dimensions remained at elevated risk. The multiple regression models showed that contaminated workspaces, (lack of) face covering and income insecurity are the most important risk factors for a positive test. Some occupations were at higher risk of being infected at the workplace than others, and variations in occupations with the highest risk exist over time. These findings provide insights for interventions among workers for future pandemic waves of COVID-19 or other respiratory epidemics.