Skip to main content
ScriptAI Infrastructurev1.0

Data Lineage Tracker

by OpenLineage · open-source · Last verified 2026-03-17

Instruments ETL and ML pipelines with OpenLineage events, shipping dataset-level provenance metadata to a Marquez or Apache Atlas backend. Generates interactive lineage DAGs showing data transformations from source to model artifact, supporting impact analysis and audit trails.

https://github.com/OpenLineage/OpenLineage
C+
C+Average
Adoption: C+Quality: AFreshness: ACitations: CEngagement: F

Specifications

License
Apache-2.0
Pricing
open-source
Capabilities
openlineage-events, dag-visualization, impact-analysis, audit-trail
Integrations
openlineage, marquez, apache-atlas, airflow, dbt
Use Cases
regulatory-compliance, data-breach-tracing, ml-reproducibility
API Available
Yes
Language
python
Dependencies
openlineage-python, requests, apache-airflow, pydantic
Environment
Python 3.10+
Est. Runtime
Negligible overhead on pipelines
Tags
data-lineage, openlineage, marquez, provenance, data-governance
Added
2026-03-17
Completeness
100%

Index Score

50.4
Adoption
55
Quality
82
Freshness
85
Citations
48
Engagement
0

Explore the full AI ecosystem on Agents as a Service