Por favor, use este identificador para citar o enlazar este ítem:
http://conacyt.repositorioinstitucional.mx/jspui/handle/1000/8855
Hidden Challenges in Evaluating Spillover Risk of Zoonotic Viruses using Machine Learning Models | |
Junna Kawasaki Tadaki Suzuki Michiaki Hamada | |
Acceso Abierto | |
Atribución-NoComercial-SinDerivadas | |
https://doi.org/10.1101/2024.04.25.591033 | |
https://www.biorxiv.org/content/10.1101/2024.04.25.591033v1 | |
Machine learning models have been deployed to assess the zoonotic spillover risk of viruses by identifying their human infectivity potential. However, the scarcity of comprehensive datasets poses a major challenge, limiting the predictable range of viruses. Our study addressed this limitation through two key strategies: constructing expansive datasets across 26 viral families and developing new models leveraging large language models pre-trained on extensive nucleotide sequences. Our approaches substantially boosted our model performance. This enhancement was particularly notable in segmented RNA viruses, which are involved with severe zoonoses but have been overlooked due to limited data availability. Furthermore, models trained on data up to 2018 displayed strong generalization capability for viruses emerging post-2018. Nonetheless, we also found remaining challenges in alerting the zoonotic potential of specific viral lineages, including SARS-CoV-2. Our study elaborates on the models and datasets for predicting viral infectivity and highlights the unresolved issues to fully exploit machine learning in preparing for future zoonotic threats. | |
bioRxiv | |
29-04-2024 | |
Preimpreso | |
Inglés | |
Público en general | |
VIRUS RESPIRATORIOS | |
Aparece en las colecciones: | Materiales de Consulta y Comunicados Técnicos |
Cargar archivos:
Fichero | Tamaño | Formato | |
---|---|---|---|
Hidden Challenges in Evaluating Spillover Risk of Zoonotic Viruses using Machine Learning Models.pdf | 22.68 MB | Adobe PDF | Visualizar/Abrir |