PII Redaction Pipeline
by Microsoft · open-source · Last verified 2026-03-17
Detects and redacts personally identifiable information from text and structured data using Microsoft Presidio with configurable entity recognizers for GDPR and HIPAA compliance. Supports reversible pseudonymization with a secure vault for re-identification by authorized parties.
https://github.com/microsoft/presidio ↗B
B—Above Average
Adoption: B+Quality: AFreshness: ACitations: BEngagement: F
Specifications
- License
- MIT
- Pricing
- open-source
- Capabilities
- entity-recognition, redaction, pseudonymization, reversible-vault
- Integrations
- presidio, spacy, fastapi, postgresql
- Use Cases
- training-data-anonymization, customer-data-compliance, healthcare-nlp
- API Available
- Yes
- Language
- python
- Dependencies
- presidio-analyzer, presidio-anonymizer, spacy, fastapi, psycopg2
- Environment
- Python 3.10+
- Est. Runtime
- 1-5 minutes per 100k records
- Tags
- pii, redaction, presidio, privacy, gdpr, hipaa
- Added
- 2026-03-17
- Completeness
- 100%
Index Score
61.7Adoption
72
Quality
87
Freshness
88
Citations
62
Engagement
0