Por favor, use este identificador para citar o enlazar este ítem: http://conacyt.repositorioinstitucional.mx/jspui/handle/1000/8066
From Collection to Analysis: A Comparison of GISAID and the Covid-19 Data Portal
Nathanael Sheehan
Sabina Leonelli
Federico Botta
Acceso Abierto
Atribución-NoComercial-SinDerivadas
https://doi.org/10.1101/2023.05.13.540634
https://www.biorxiv.org/content/10.1101/2023.05.13.540634v1
Abstract We analyse ongoing efforts to share genomic data about SARS-COV-2 through a comparison of the characteristics of the Global Initiative on Sharing All Influenza Data and the Covid-19 Data Portal with respect to the representativeness and governance of the research data therein. We focus on data and metadata on genetic sequences posted on the two infrastructures in the period between January 2020 and January 2023, thus capturing a period of acute response to the COVID-19 pandemic. Through a variety of data science methods, we compare the extent to which the two portals succeeded in attracting data submissions from different countries around the globe and look at the ways in which submission rates varied over time. We go on to analyse the structure and underlying architecture of the infrastructures, reviewing how they organise data access and use, the types of metadata and version tracking they provide. Finally, we explore usage patterns of each infrastructure based on publications that mention the data to understand how data reuse can facilitate forms of diversity between institutions, cities, countries, and funding groups. Our findings reveal disparities in representation between the two infrastructures and differing practices in data governance and architecture. We conclude that both infrastructures offer useful lessons, with GISAID demonstrating the importance of expanding data submissions and representation, while the COVID-19 data portal offers insights into how to enhance data usability.
bioRxiv
13-05-2023
Preimpreso
Inglés
Público en general
VIRUS RESPIRATORIOS
Aparece en las colecciones: Materiales de Consulta y Comunicados Técnicos

Cargar archivos: