Data Lineage Tracker
by OpenLineage · open-source · Last verified 2026-03-17
Instruments ETL and ML pipelines with OpenLineage events, shipping dataset-level provenance metadata to a Marquez or Apache Atlas backend. Generates interactive lineage DAGs showing data transformations from source to model artifact, supporting impact analysis and audit trails.
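As a rough illustration of the dataset-level provenance events described above, the sketch below builds a pair of OpenLineage-style run events (START and COMPLETE) as plain JSON using only the standard library. The field names follow the OpenLineage run event model (`eventType`, `eventTime`, `run`, `job`, `inputs`, `outputs`, `producer`, `schemaURL`). The helper `build_run_event`, the producer URL, the schema version in `SCHEMA_URL`, and all namespaces, job names, and dataset names are assumptions made for the example, not values from this listing.

```python
import json
import uuid
from datetime import datetime, timezone

# Assumed placeholder values; replace with your own producer URI and the
# spec version you actually target.
PRODUCER = "https://example.com/my-pipeline"
SCHEMA_URL = "https://openlineage.io/spec/2-0-2/OpenLineage.json#/$defs/RunEvent"


def build_run_event(event_type, run_id, job_namespace, job_name, inputs, outputs):
    """Build one OpenLineage-style run event as a plain dict.

    `inputs` and `outputs` are lists of (namespace, name) tuples.
    This is a hypothetical helper, not part of any OpenLineage library.
    """
    def as_dataset(pair):
        namespace, name = pair
        return {"namespace": namespace, "name": name}

    return {
        "eventType": event_type,  # e.g. "START", "COMPLETE", "FAIL"
        "eventTime": datetime.now(timezone.utc).isoformat(),
        "run": {"runId": run_id},
        "job": {"namespace": job_namespace, "name": job_name},
        "inputs": [as_dataset(p) for p in inputs],
        "outputs": [as_dataset(p) for p in outputs],
        "producer": PRODUCER,
        "schemaURL": SCHEMA_URL,
    }


if __name__ == "__main__":
    # The same runId ties the START and COMPLETE events to one run.
    run_id = str(uuid.uuid4())
    common = dict(
        run_id=run_id,
        job_namespace="example_etl",          # hypothetical namespace
        job_name="clean_orders",              # hypothetical job name
        inputs=[("postgres://warehouse", "raw.orders")],
        outputs=[("postgres://warehouse", "staging.orders_clean")],
    )
    for event_type in ("START", "COMPLETE"):
        print(json.dumps(build_run_event(event_type, **common), indent=2))
```

With a Marquez backend, events of this shape are typically sent as an HTTP POST to its lineage endpoint (`/api/v1/lineage`). In practice the `openlineage-python` client listed under Dependencies would normally handle event construction and transport instead of hand-built dicts.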
https://github.com/OpenLineage/OpenLineage
Overall grade: C+ (Average)
Adoption: C+ · Quality: A · Freshness: A · Citations: C · Engagement: F
Specifications
- License: Apache-2.0
- Pricing: open-source
- Capabilities: openlineage-events, dag-visualization, impact-analysis, audit-trail
- Integrations: openlineage, marquez, apache-atlas, airflow, dbt
- Use Cases: regulatory-compliance, data-breach-tracing, ml-reproducibility
- API Available: Yes
- Language: python
- Dependencies: openlineage-python, requests, apache-airflow, pydantic
- Environment: Python 3.10+
- Est. Runtime: Negligible overhead on pipelines
- Tags: data-lineage, openlineage, marquez, provenance, data-governance
- Added: 2026-03-17
- Completeness: 100%
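The impact-analysis capability listed above amounts to asking which downstream datasets are affected when a given source changes. A minimal sketch of that idea, assuming lineage has already been reduced to a list of (upstream, downstream) dataset edges, is a breadth-first traversal. The function `downstream_of` and every dataset name here are hypothetical and are not taken from the tool itself.

```python
from collections import defaultdict, deque


def downstream_of(edges, start):
    """Return every dataset reachable from `start`, in breadth-first order.

    `edges` is a list of (upstream, downstream) pairs. `start` itself is
    not included in the result.
    """
    children = defaultdict(list)
    for upstream, downstream in edges:
        children[upstream].append(downstream)

    seen = {start}
    queue = deque([start])
    affected = []
    while queue:
        node = queue.popleft()
        for child in children[node]:
            if child not in seen:  # guard against revisiting shared nodes
                seen.add(child)
                affected.append(child)
                queue.append(child)
    return affected


# Hypothetical lineage from raw sources to a model artifact.
EDGES = [
    ("raw.orders", "staging.orders_clean"),
    ("raw.customers", "staging.orders_clean"),
    ("staging.orders_clean", "features.order_stats"),
    ("features.order_stats", "models.churn_v1"),
    ("raw.customers", "reports.customer_summary"),
]

if __name__ == "__main__":
    for name in downstream_of(EDGES, "raw.customers"):
        print(name)
```

The same traversal run over reversed edges gives the upstream view, which is the direction an audit trail or breach trace would usually follow.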
Index Score: 50.4
- Adoption: 55
- Quality: 82
- Freshness: 85
- Citations: 48
- Engagement: 0