De-identify clinical notes without breaking analytics. Joinable tokens + round-trip re-ID + cryptographic audit trails.
Three core differentiators that no competitor offers at our price point.
The same patient across 1,000 documents always maps to the same token, enabling longitudinal analysis without exposing PHI.
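One common way to get joinable tokens is a keyed hash, so the same normalized value always yields the same token. The sketch below illustrates that idea only; it is not RedactiPHI's actual algorithm, and `SECRET_KEY`, `joinable_token`, and the 6-character suffix are assumptions chosen for illustration.

```python
import hashlib
import hmac

# Hypothetical per-tenant secret; a real deployment would load it from a key manager.
SECRET_KEY = b"example-tenant-secret"


def joinable_token(value: str, kind: str = "NAM") -> str:
    """Map a PHI value to a stable token: same input, same token."""
    # Normalize case and whitespace so trivial variants join to one token.
    normalized = " ".join(value.lower().split())
    message = f"{kind}:{normalized}".encode("utf-8")
    digest = hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()
    # Truncated for readability; a short suffix raises the collision risk.
    return f"[{kind}_{digest[:6]}]"


token_doc_a = joinable_token("John Smith")
token_doc_b = joinable_token("john  smith")
print(token_doc_a == token_doc_b)  # True: both documents join on the same token
```

Because the hash is keyed, someone without the secret cannot rebuild the token from a guessed name, while analysts can still group records by token.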
De-ID → Send to LLM → Get response → Re-ID. Critical for AI scribes and clinical documentation AI.
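The round trip can be sketched locally to show where the token map lives. This is a toy model under stated assumptions: `deidentify`, `reidentify`, and `fake_llm` are hypothetical helpers, the name list is supplied by hand rather than detected, and the token format is illustrative.

```python
def deidentify(text: str, names: list[str]) -> tuple[str, dict[str, str]]:
    """Replace each known name with a token and remember the mapping."""
    token_map: dict[str, str] = {}
    for index, name in enumerate(names, start=1):
        token = f"[NAM_{index:06d}]"
        token_map[token] = name
        text = text.replace(name, token)
    return text, token_map


def reidentify(text: str, token_map: dict[str, str]) -> str:
    """Swap tokens back to the original values."""
    for token, original in token_map.items():
        text = text.replace(token, original)
    return text


def fake_llm(prompt: str) -> str:
    """Stand-in for a real LLM call; it only ever sees tokens."""
    return f"Summary: {prompt}"


note = "John Smith reports chest pain."
safe_note, token_map = deidentify(note, ["John Smith"])
reply = fake_llm(safe_note)

print(safe_note)                      # [NAM_000001] reports chest pain.
print(reidentify(reply, token_map))   # Summary: John Smith reports chest pain.
```

The key design point is that the token map never leaves the trusted side; only the tokenized text crosses the boundary to the model.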
Every operation produces a signed receipt proving what went in, what came out, and what was found.
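A signed receipt of this kind can be approximated by hashing the input and output and signing the result. The sketch below uses an HMAC for brevity; it is an assumption, not the service's documented receipt format, and `SIGNING_KEY`, `make_receipt`, and `verify_receipt` are hypothetical names. A production system might prefer an asymmetric signature so third parties can verify without holding the secret.

```python
import hashlib
import hmac
import json

SIGNING_KEY = b"example-signing-key"  # hypothetical; never hard-code real keys


def sha256_hex(text: str) -> str:
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def _sign(body: dict) -> str:
    # Canonical JSON so the same body always produces the same bytes.
    payload = json.dumps(body, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()


def make_receipt(input_text: str, output_text: str, phi_types: dict) -> dict:
    """Record what went in, what came out, and what was found, then sign it."""
    body = {
        "input_sha256": sha256_hex(input_text),
        "output_sha256": sha256_hex(output_text),
        "phi_types": phi_types,
    }
    return {**body, "signature": _sign(body)}


def verify_receipt(receipt: dict) -> bool:
    """Recompute the signature over everything except the signature itself."""
    body = {key: value for key, value in receipt.items() if key != "signature"}
    return hmac.compare_digest(_sign(body), receipt["signature"])


receipt = make_receipt(
    "Patient John Smith, DOB 01/15/1980",
    "Patient [NAM_abc123], DOB 02/02/1980",
    {"PATIENT_NAME": 1, "DOB": 1},
)
print(verify_receipt(receipt))  # True
```

Note that the receipt stores only hashes, so it can be retained for audits without retaining PHI.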
Detects all 18 HIPAA identifiers plus clinical extensions. Ages over 89 are automatically generalized.
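Under HIPAA Safe Harbor, ages over 89 are aggregated into a single "90 or older" category. A minimal sketch of that rule follows; the `90+` label, the regular expression, and the function names are illustrative assumptions, and real clinical text expresses ages in many more ways than this pattern covers.

```python
import re

# Matches the number in phrases like "92-year-old" or "92 year old".
AGE_PATTERN = re.compile(r"\b(\d{1,3})(?=[- ]year[- ]old\b)")


def generalize_age(age: int) -> str:
    """Collapse ages over 89 into one bucket; leave other ages unchanged."""
    if age < 0:
        raise ValueError("age cannot be negative")
    return "90+" if age > 89 else str(age)


def generalize_ages_in_text(text: str) -> str:
    """Apply the age rule to every '<n>-year-old' mention in the text."""
    return AGE_PATTERN.sub(lambda match: generalize_age(int(match.group(1))), text)


print(generalize_ages_in_text("A 92-year-old man and his 45-year-old son."))
# A 90+-year-old man and his 45-year-old son.
```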
Process PDF, Word, TXT, JSON, HTML, RTF, Markdown, plus native FHIR R4 and HL7v2 parsing.
Process 100+ documents per second. Pattern engine runs on CPU, no GPU required.
Upload clinical documents in any format. We extract and de-identify automatically.
PDF: Scanned or digital PDFs with text extraction
Word: Microsoft Word documents with formatting preserved
Text and Markdown: .txt, .text, and .md files
JSON: Structured data with automatic text extraction
HTML: .html and .htm web pages with tag stripping
RTF: Rich Text Format documents
FHIR R4: Native FHIR Bundle and Resource parsing
HL7v2: ADT, ORU, and ORM message parsing
95.4% F1 score. 80% cheaper than AWS Comprehend Medical. 5-minute setup.
| Feature | Open Source (Presidio, Philter) | Cloud APIs (AWS, Azure) | Enterprise (Private AI, JSL) | RedactiPHI |
|---|---|---|---|---|
| F1 Score | ~70-75%* | 83-91% | 96-98% | 95.4% |
| Precision | Varies widely | 85-95% | 97%+ | 95.7% |
| Recall | 53-65%* | 80-88% | 93-99% | 95.2% |
| HIPAA Compliant | You're responsible | With BAA | Yes | Yes + BAA |
| Starting Price | Free + DevOps | ~$1/GB inspect | $10k+/yr | $0 (25 docs free) |
| 5,000 docs/month | Free + your infra | ~$1,000/mo | $5,000+/mo | $199/mo |
| Setup Time | Days to weeks | Hours | Weeks to months | 5 minutes |
| Infrastructure | Self-managed | Cloud-only | On-premise required | Fully managed API |
| Developer Dashboard | None | Basic console | None | Full dashboard + analytics |
| SDKs & Libraries | DIY integration | Vendor SDKs | Contact sales | Python, Node, cURL ready |
| Re-identification | Build your own | Not available | Limited | One-click API |
| Audit Receipts | Not included | CloudTrail logs | Enterprise only | Cryptographic proof |
| Webhooks | Not included | SNS/EventBridge | Custom integration | Built-in |
Start free, scale as you grow. No hidden fees.
For testing
For indie devs
For teams
For production
For healthcare orgs
Security and compliance are foundational, not afterthoughts.
In progress. Expected Q2 2025.
BAA available for all paid plans.
PHI never stored. Memory only.
TLS 1.3 + AES-256-GCM.
High-value integrations we're actively building.
HIPAA-compliant AI in one line of code. Drop-in replacement for OpenAI/Anthropic APIs - change your base URL and we handle the rest.
Bulk de-identification for clinical trials and research. Our joinable tokenization lets you link patient data across sites while maintaining privacy.
Partnership-ready PHI layer for ambient clinical documentation. We handle HIPAA compliance so AI scribes can focus on their AI.
Native integrations where clinicians already work. SMART on FHIR apps for Epic, Cerner, Athenahealth, and other major EHRs.
Want early access to the LLM Proxy?
Join the Waitlist
One endpoint. JSON in, JSON out. Start in minutes.
```shell
# De-identify clinical text
curl -X POST https://api.redact.health/api/v1/deidentify \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "text": "Patient John Smith, DOB 01/15/1980",
    "policy": "safe_harbor"
  }'
```

Response:

```json
{
  "text": "Patient [NAM_abc123], DOB 02/02/1980",
  "document_id": "doc-xyz789",
  "phi_found": 2,
  "phi_types": {"PATIENT_NAME": 1, "DOB": 1}
}
```
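The same request can be built in Python with only the standard library. This is a sketch that mirrors the curl call above rather than an official SDK: the endpoint, headers, and JSON fields come from that example, while `build_request` and `deidentify` are hypothetical helper names and the 30-second timeout is an arbitrary choice.

```python
import json
import urllib.request

API_URL = "https://api.redact.health/api/v1/deidentify"


def build_request(text: str, api_key: str, policy: str = "safe_harbor") -> urllib.request.Request:
    """Build the POST request without sending it."""
    payload = json.dumps({"text": text, "policy": policy}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def deidentify(text: str, api_key: str) -> dict:
    """Send the request and decode the JSON response."""
    request = build_request(text, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


# Usage (requires a valid key and network access):
# result = deidentify("Patient John Smith, DOB 01/15/1980", "YOUR_API_KEY")
# print(result["text"], result["phi_found"])
```

Separating request construction from sending keeps the payload easy to inspect or unit-test without touching the network.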