1. ISACA
Auditing Artificial Intelligence
2023: In the section on "Data Governance
" the guide states
"Data lineage and provenance should be established to trace data back to its source
providing transparency and accountability for the data used in AI systems." (p. 21). This highlights lineage as a core mechanism for validating data governance.
2. National Institute of Standards and Technology (NIST)
AI Risk Management Framework (AI RMF 1.0)
January 2023: The framework's "Govern" function emphasizes the need for data governance. Under section 4.3.1 (Data)
it notes
"Data sourcing
quality
and any pre-processing or labeling steps should be documented to enable traceability and reproducibility
" which is the essence of assessing data lineage.
3. Ko
H.
Lee
S.
& Lee
K. (2022). A Study on the Data Governance Framework for Artificial Intelligence. Journal of Theoretical and Applied Information Technology
100(15)
5099-5111: This academic paper discusses that "Data lineage is essential for tracking the origins and transformations of data
which is a cornerstone for ensuring data quality and integrity within an AI data governance framework." (Section 3.2).