Abstract
The increased availability of biomedical data, particularly in the public domain, offers the opportunity to better understand human health and to develop effective therapeutics for a wide range of unmet medical needs. However, data scientists remain stymied by the fact that data remain hard to find and to productively reuse because data and their metadata i) are wholly inaccessible, ii) are in non-standard or incompatible representations, iii) do not conform to community standards, and iv) have unclear or highly restricted terms and conditions that preclude legitimate reuse. These limitations require a rethink on how data can be made machine and AI-ready- the key motivation behind the FAIR Guiding Principles. Concurrently, while recent efforts have explored the use of deep learning to fuse disparate data into predictive models for a wide range of biomedical applications, these models often fail even when the correct answer is already known, and fail to explain individual predictions in terms that data scientists can appreciate. These limitations suggest that new methods to produce practical artificial intelligence are still needed.
In this talk, I will discuss our work in (1) building an integrative knowledge infrastructure to prepare FAIR and ”AI-ready” data and services along with (2) neurosymbolic AI methods to improve the quality of predictions and to generate plausible explanations. Attention is given to standards, platforms, and methods to wrangle knowledge into simple, but effective semantic and latent representations, and to make these available into standards-compliant and discoverable interfaces that can be used in model building, validation, and explanation. Our work, and those of others in the field, create a baseline for building trustworthy and easy-to-deploy AI models in biomedicine.
In this talk, I will discuss our work in (1) building an integrative knowledge infrastructure to prepare FAIR and ”AI-ready” data and services along with (2) neurosymbolic AI methods to improve the quality of predictions and to generate plausible explanations. Attention is given to standards, platforms, and methods to wrangle knowledge into simple, but effective semantic and latent representations, and to make these available into standards-compliant and discoverable interfaces that can be used in model building, validation, and explanation. Our work, and those of others in the field, create a baseline for building trustworthy and easy-to-deploy AI models in biomedicine.
Original language | English |
---|---|
Title of host publication | Proceedings of the 7th Workshop on Semantic Web solutions for large-scale biomedical data analytics co-located with The ESWC 2024: Extended Semantic Web Conference (ESWC 2024) |
Publisher | CEUR Workshop Proceedings |
Volume | 3726 |
Publication status | Published - 2024 |
Event | 7th Workshop on Semantic Web Solutions for Large-Scale Biomedical Data Analytics, SeWeBMeDa 2024 - Hersonissos, Greece Duration: 26 May 2024 → 26 May 2024 https://sites.google.com/view/sewebmeda-2024/ |
Publication series
Series | CEUR Workshop Proceedings |
---|---|
Volume | 3726 |
ISSN | 1613-0073 |
Workshop
Workshop | 7th Workshop on Semantic Web Solutions for Large-Scale Biomedical Data Analytics, SeWeBMeDa 2024 |
---|---|
Abbreviated title | SeWeBMeDa 2024 |
Country/Territory | Greece |
City | Hersonissos |
Period | 26/05/24 → 26/05/24 |
Internet address |