Back to Search Start Over

Data journeys: Explaining AI workflows through abstraction.

Authors :
Daga, Enrico
Groth, Paul
Source :
Semantic Web (1570-0844); 2024, Vol. 15 Issue 4, p1057-1083, 27p
Publication Year :
2024

Abstract

Artificial intelligence systems are not simply built on a single dataset or trained model. Instead, they are made by complex data science workflows involving multiple datasets, models, preparation scripts, and algorithms. Given this complexity, in order to understand these AI systems, we need to provide explanations of their functioning at higher levels of abstraction. To tackle this problem, we focus on the extraction and representation of data journeys from these workflows. A data journey is a multi-layered semantic representation of data processing activity linked to data science code and assets. We propose an ontology to capture the essential elements of a data journey and an approach to extract such data journeys. Using a corpus of Python notebooks from Kaggle, we show that we are able to capture high-level semantic data flow that is more compact than using the code structure itself. Furthermore, we show that introducing an intermediate knowledge graph representation outperforms models that rely only on the code itself. Finally, we report on a user survey to reflect on the challenges and opportunities presented by computational data journeys for explainable AI. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15700844
Volume :
15
Issue :
4
Database :
Complementary Index
Journal :
Semantic Web (1570-0844)
Publication Type :
Academic Journal
Accession number :
180592015
Full Text :
https://doi.org/10.3233/SW-233407