// the find
nucleuscloud/neosync
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
Neosync is a data anonymization and synthetic data pipeline tool — you define jobs that pull from production databases, run transformers over PII columns, and push sanitized data to dev/staging environments. It's for teams who need to test against realistic data without exposing real user records. Note: the project was acquired by Grow Therapy and is no longer actively maintained.
Referential integrity handling is the genuinely hard part here — anonymizing a users table while keeping foreign keys consistent across orders, sessions, etc. is where most homegrown scripts fall apart, and Neosync solves it. The transformer model is well-designed: built-in types for common PII (email, phone, SSN) plus escape hatches via JavaScript or LLM for anything custom. Benthos under the hood gives you a mature async pipeline with retries and backpressure that would take months to build yourself. The full Docker Compose setup with pre-seeded connections and jobs means you can actually see it working in under 10 minutes.
Abandoned mid-growth is a real risk — the acquisition notice says it's no longer maintained, so you're adopting a dead upstream. The dependency surface is large (Go backend, Next.js frontend, Temporal for orchestration, Postgres for state) which makes self-hosting non-trivial; this is not a single-binary tool. MSSQL support exists but feels like an afterthought compared to the Postgres path — the schema introspection code is shallower. Custom JavaScript transformers running server-side with user-provided code is a sandbox problem the docs don't address clearly.