AUTHORS: Kiran Pohar Manhas, Shawn X Dodd, Stacey Page, Nicole Letourneau, Xinjie Cui, and Suzanne Tough
Many funders, institutions and journals now insist that researchers share data from completed studies so that new research questions can be answered, especially if the research was publicly funded [1-5]. Numerous data platforms have emerged locally and worldwide to facilitate this sharing, such as biobanks for precision health and data repositories for reuse of clinical specimens, research and/or administrative data. SAGE, short for Secondary Analysis to Generate Evidence, is one such data platform.
While sharing data is understood to offer great opportunities, it also presents the following concerns:
To adequately recognize and address these concerns, SAGE commissioned our research team to consult with stakeholders to better understand how to ensure that the benefits of data sharing outweigh the concerns. Key stakeholders include the data donors (research participants).
During SAGE development, two Albertan longitudinal birth cohort studies collected rich datasets that would be deposited for sharing and reuse: All Our Families (AOF) and Alberta Pregnancy Outcomes and Nutrition (APrON). These cohort studies recruited pregnant women beginning at 14 weeks' gestation in 2008, and have continued with 9 collection time points over the subsequent years. Together, cohort participants (approximately 6400 people) provided information on their demographics, lifestyle, mental, psychosocial and physical health, pregnancy history, health service utilization, quality of life, and breastfeeding. Detailed information on AOF and APrON (both of which have now deposited data in SAGE) is available elsewhere [6-8].
We wanted to understand the perspectives of parents who participated in these cohorts, especially how they viewed privacy and governance issues related to data sharing and reuse. We were particularly interested in how parents felt about sharing their own and their child’s non-biological research data. The health- and development-related data collected during a cohort study can be quite sensitive, but it is distinct from biological and tissue data, which most of the current literature on stakeholder engagement in data sharing emphasizes. Biological and non-biological data diverge in their nature, collection, storage, research potential and implications [9, 10].
We used a web-based survey, sent by personalized email to consenting AOF and APrON parents. They had 14 days to complete the survey if they wished to participate, and they received reminder emails on days 3 and 11 (AOF parents also got a phone call reminder on day 7 of the 14-day window). Interested parents who completed the survey could share their email address to enter a draw for an iPod Touch.
In September 2014, 346 parents completed the survey (a response rate of 60.8%). Here are some of the highlights:
Parents considered pediatric data more sensitive than adult data and expressed significantly more reluctance towards sharing their child's identifiers than their own. In summary, parents stressed the importance of the processes and procedures in place (i.e. governance strategies) to sustain long-term, appropriate and secure access to valuable data assets. This aligns with previous research findings that governance, not privacy or consent, is the central issue in developing sustainable and trustworthy data-sharing platforms [9, 11, 12].
We presented these research findings at DATA 2017, the 6th International Conference on Data Science, Technology and Applications, in Madrid, Spain (July 24-26, 2017). Our full research paper is available online, as are all of the research papers presented at this data-focused conference.