
Building OpenRedaction: A Regex-First Open Source Story

By Sam Pettiford, December 4, 2025

Sam Pettiford

Founder of OpenRedaction, focused on building practical privacy infrastructure for developers shipping AI and data-heavy applications.


Most open-source stories start with a vague idea and end with a maintainer's backlog.

OpenRedaction started with something much more practical: a need for deterministic, self-hosted PII redaction that developers could trust in production. Not a black box. Not a model that behaves differently depending on prompt wording. Not a hosted service that quietly moves sensitive text through infrastructure you do not control. Just a library that takes text in, finds sensitive spans, and redacts them in a way you can inspect, test, and deploy on your own terms.

That sounds simple, and in many ways it is. But the moment you start applying it to real-world text, the simplicity gets tested. Production text is messy, jurisdiction-specific, full of edge cases, and often far more ambiguous than the clean examples that make it into docs. That tension, between a clean deterministic core and the messy reality of developer workflows, is where OpenRedaction became a real project.

What follows is the story of why we built it this way, what broke when we tried to apply it at scale, and why the combination of regex-first detection, validators, presets, and aggressive testing turned out to be the right foundation.

The starting point

The original problem was straightforward: teams needed a way to strip names, emails, phone numbers, addresses, account references, and other identifiers out of text before that text hit logs, exports, analytics pipelines, or external systems.

At first glance, this looks like a solved problem. In reality, most teams end up with one of three weak patterns:

  • They rely on manual review, which does not scale.
  • They use a third-party API, which introduces privacy, residency, and procurement friction.
  • They bolt together a few regexes, which works until the first false positive, formatting variation, or new jurisdiction-specific identifier.

The design goal for OpenRedaction was to avoid all three failure modes. We wanted something you could run locally, inside your own infrastructure, without shipping raw text to a vendor. We wanted deterministic output so the same input would always produce the same redacted result. And we wanted the codebase to be readable enough that a security engineer, privacy lead, or skeptical platform team could inspect the patterns and understand exactly what the system would do.

That is why the project is regex-first. Regex gives you a predictable detection layer. It is not magical, but it is auditable, fast, and easy to reason about. In security-adjacent tooling, that matters more than many people admit.

Why regex-first won

Regex is often dismissed as basic, but for PII detection it is the right primitive for a large part of the problem space.

Structured identifiers tend to have structure for a reason. Email addresses, phone numbers, bank identifiers, tax numbers, card formats, and many national ID types are not arbitrary free text. They follow patterns, include delimiters, and often have checksum or format constraints that can be validated without machine learning. That makes them ideal for rule-based detection.

The key advantage is not just accuracy. It is predictability.

If a pattern matches, it matches the same way every time. There is no model drift, no hidden inference layer, no surprise dependence on prompt formatting, and no need to explain why one deployment redacted a value while another did not. That matters in compliance reviews, incident response, audit trails, and internal architecture discussions. It also matters to developers who just need a tool they can trust under load.
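To make that predictability concrete, here is a minimal sketch of what a deterministic, regex-first detection pass can look like. The pattern set, the span shape, and the function names are illustrative assumptions for this post, not OpenRedaction's actual API:

```javascript
// Minimal sketch of a deterministic, regex-first detector.
// PATTERNS, detect(), and the span shape are illustrative assumptions,
// not OpenRedaction's actual internals.
const PATTERNS = [
  { type: "email", regex: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g },
  { type: "uk_phone", regex: /\b(?:\+44\s?7\d{3}|07\d{3})\s?\d{3}\s?\d{3}\b/g },
];

function detect(text) {
  const spans = [];
  for (const { type, regex } of PATTERNS) {
    regex.lastIndex = 0; // reset sticky state so repeated calls stay deterministic
    let m;
    while ((m = regex.exec(text)) !== null) {
      spans.push({ type, start: m.index, end: m.index + m[0].length, value: m[0] });
    }
  }
  // Sort by position so the output ordering is stable across runs.
  return spans.sort((a, b) => a.start - b.start);
}
```

The same input always yields the same spans in the same order, which is exactly the property that makes the output easy to test and audit.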

That said, regex alone is not enough if you want the tool to survive in production. Real text does not arrive as tidy examples.

What production text actually looks like

Once OpenRedaction moved beyond simple demo cases, the edge cases showed up immediately.

Support tickets include broken formatting, copied signatures, accidental JSON fragments, concatenated messages, and quoted history. Logs contain escaped characters, stack traces, query strings, and partial payloads. CSV exports often blur together user input, internal metadata, and cells that are technically text but semantically sensitive. Chat transcripts can contain repeated turns, nested quotes, pasted documents, and partial redactions from upstream systems that need to be recognized rather than treated as fresh text.

This is where "just write a regex" stops being a complete answer.

The real work is in the layers around the patterns:

  • Pattern coverage. You need a broad, maintained library of patterns across multiple jurisdictions and data types, not a single mega-regex that claims to solve everything.
  • Validation. Some identifiers need checksum checks, context checks, or format-aware rules to avoid false positives.
  • Priority ordering. When two patterns overlap, the engine has to know which one wins.
  • Redaction modes. Teams need different output styles depending on whether they are masking for internal use, sanitizing logs, or preparing data for external models.
  • Test coverage. The test suite is not a side effect of the product. It is part of the product.

That last point became one of the strongest lessons from building the library. In privacy tooling, the test suite is the contract.
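Of those layers, priority ordering is the easiest to get subtly wrong. One rule that keeps it deterministic: when detected spans overlap, the higher-priority pattern wins, with ties broken by the longer match. A sketch, where the priority values and span shape are assumptions for illustration:

```javascript
// Sketch of deterministic overlap resolution: when spans overlap,
// keep the higher-priority one; break ties by preferring the longer match.
// Priority values and the span shape are illustrative assumptions.
function resolveOverlaps(spans) {
  // Order candidates: higher priority first, then longer, then earlier.
  const ordered = [...spans].sort((a, b) =>
    b.priority - a.priority ||
    (b.end - b.start) - (a.end - a.start) ||
    a.start - b.start
  );
  const kept = [];
  for (const span of ordered) {
    const clashes = kept.some(k => span.start < k.end && k.start < span.end);
    if (!clashes) kept.push(span);
  }
  // Return in document order for downstream redaction.
  return kept.sort((a, b) => a.start - b.start);
}
```

Because the tie-breaking rules are total, two deployments given the same spans will always keep the same winners.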

What we engineered around

The project matured by focusing on the boring things that make a privacy library usable in the real world.

Pattern breadth

Instead of relying on a handful of broad patterns, we expanded the library into a large set of maintained, categorized patterns. That gives teams coverage across common identifiers while still letting them tune what gets detected in their environment.

This matters because privacy use cases are rarely uniform. A support system in the UK may care about phone numbers, emails, addresses, NHS-style identifiers, and payment references. A healthcare workflow may care more about medical context and patient identifiers. A SaaS product processing enterprise documents may need a very different default profile. A practical redaction engine needs to support those differences without forcing one global opinion.

Context and validators

Pure pattern matching can overmatch. A number looks like an identifier until it is actually an invoice line item, an internal reference, or part of a benign code sample. That is why we invested in validators and contextual rules that reduce false positives without weakening the detection layer too much.
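A concrete example of this kind of validator is a checksum check: a 16-digit string only deserves to be treated as a card number if it passes Luhn. The sketch below shows the standard Luhn algorithm, not OpenRedaction's actual validator code:

```javascript
// Luhn checksum: cheaply rejects most random digit strings that merely
// look like payment card numbers. Standard algorithm, shown here as an
// illustrative sketch rather than the library's actual validator.
function luhnValid(candidate) {
  const digits = candidate.replace(/[\s-]/g, "");
  if (!/^\d{12,19}$/.test(digits)) return false;
  let sum = 0;
  // Walk right-to-left, doubling every second digit.
  for (let i = 0; i < digits.length; i++) {
    let d = Number(digits[digits.length - 1 - i]);
    if (i % 2 === 1) {
      d *= 2;
      if (d > 9) d -= 9;
    }
    sum += d;
  }
  return sum % 10 === 0;
}
```

A detector can then require both the pattern match and the checksum before flagging a span, which cuts false positives on invoice numbers and internal references without loosening the pattern itself.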

This is especially important in developer tools, where false positives can break workflows and cause people to distrust the product. If a privacy tool redacts too aggressively, teams start disabling it. If it redacts too little, they stop relying on it. The balance is hard, but it is the difference between a library that gets installed and one that gets adopted.

Presets and modes

Another design decision was to make the tool feel deployable rather than theoretical. Teams do not want to assemble a privacy policy from scratch every time. They want sensible defaults they can understand and override.

That is why presets matter. Different redaction modes are useful in different environments: stricter modes for external outputs, more permissive masking for internal debugging, and compliance-oriented bundles for regulated workflows. The goal is to reduce decision fatigue while still allowing teams to adapt the engine to their risk model.
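In code, a preset is little more than a named bundle of detectors plus an output mode. The names and shapes below are illustrative assumptions, not the library's API:

```javascript
// Sketch of how presets might bundle detectors with a redaction mode.
// Preset names, detector names, and modes are illustrative assumptions,
// not OpenRedaction's actual configuration surface.
const PRESETS = {
  // Strict: for text leaving your infrastructure (exports, external models).
  external_strict: {
    detectors: ["email", "phone", "card", "national_id", "address"],
    mode: "replace",
  },
  // Permissive: keep enough shape for internal debugging.
  internal_debug: {
    detectors: ["email", "card", "national_id"],
    mode: "mask_partial",
  },
};

function applyMode(value, type, mode) {
  if (mode === "replace") return `[${type.toUpperCase()}]`;
  // mask_partial: keep the first character, mask the rest.
  return value[0] + "*".repeat(Math.max(value.length - 1, 0));
}
```

Teams start from a preset that matches their risk model and override individual detectors or modes, instead of assembling a policy from scratch.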

Testing as infrastructure

We also treated tests as part of the runtime architecture, not just as a CI checkbox.

If a pattern changes, the relevant test should fail immediately. If a new jurisdiction-specific format is added, the suite should show whether it creates regressions elsewhere. If a validator improves recall but hurts precision, the tradeoff should be visible in code review. This is how a redaction engine stays honest over time.

In a project like this, tests are not only about correctness. They are about preserving trust.

Why open source mattered

Open source was not a branding choice. It was a trust choice.

Privacy tools are evaluated differently from generic developer libraries. People want to know where the code runs, how the patterns behave, and whether the system introduces a hidden dependency on some external inference service. They want to read the implementation, inspect the rules, and reproduce the output locally.

MIT licensing, public tests, and visible documentation all support that expectation. If someone doubts a pattern, they can read it. If someone wants to contribute a new detector for a format in their region, they can submit it. If a team wants to run the library in a locked-down environment, they can do that without negotiating a vendor contract.

That visibility is not just philosophically nice. It is operationally useful. Trust scales when behavior is inspectable.

Developer experience became the product

As the project matured, it became obvious that correctness alone would not be enough. The library had to be easy to adopt.

That meant documentation that answered real implementation questions instead of only describing the API surface. It meant examples that showed how the library fits into Node apps, middleware, ingestion jobs, and data pipelines. It meant a browser playground where people could test the behavior instantly without creating an account or uploading data into a system they did not trust.

Developer tools are rarely won by raw capability alone. They are won by reducing friction at the exact moment someone is deciding whether to try them.

We also found that the more boring the setup looked, the more serious teams engaged with it. A library that can run locally, has clear usage examples, and exposes deterministic behavior is easier to defend internally than a tool that promises magic.

The enterprise path without breaking the core

One of the hardest balancing acts in open source is deciding where the project ends and the support layer begins.

The answer for OpenRedaction was to keep the core open and self-hostable, while making it possible for teams to get help when they need it. Some teams just want the library. Others need review support, rollout guidance, or help integrating it into larger workflows. The important thing is that the open core remains useful on its own.

That separation protects the integrity of the project. It also keeps the main promise intact: use it locally, understand it, and keep control of your data path.

What we learned

A few lessons became very clear along the way.

First, transparency beats hype. Security-adjacent libraries do not win by sounding clever. They win by being understandable.

Second, pattern depth becomes a moat only if you keep pruning false positives. Coverage without maintenance is just technical debt.

Third, documentation is part of the sales process whether you want it to be or not. If a tool is hard to evaluate, serious teams move on.

Fourth, limits matter. No pattern-based system will ever guarantee 100 percent detection of every possible PII instance. Saying that clearly builds more trust than claiming perfection.

Finally, maintenance is the product. Formats drift, regulations change, new identifier types emerge, and customers keep finding new ways to paste sensitive data into places they should not. A redaction engine has to keep pace with that reality.

Where it stands now

OpenRedaction today is a regex-first, open-source PII detection and redaction library built for developers who want deterministic, auditable, self-hostable behavior. It is designed to run locally, fit into real applications, and provide enough visibility that teams can explain and defend how it works.

That is the real story.

Not that PII is easy. Not that regex solves everything. But that a disciplined, inspectable, local-first approach can get you much farther than most people expect when they first start building privacy tooling.

If you are building developer infrastructure in the same lane, the lessons are probably similar: ship a deterministic core, invest early in tests and docs, be honest about limits, and let the code earn trust.

OpenRedaction is still evolving, but the principle has not changed. Make the safe path obvious. Make the behavior inspectable. And make privacy a property of the system, not a promise in the README.