
recently, openai released a new model called privacy filter, designed to help developers effectively de-identify personally identifiable information in text. with 150 million parameters and a mixture-of-experts architecture, the model is open-sourced under the apache 2.0 license on hugging face and github, allowing developers to freely download, customize, and use it for commercial purposes.
the core advantage of privacy filter lies in its deep language understanding capabilities, enabling it to identify sensitive information in unstructured text based on context. unlike traditional rule-based privacy-filtering tools, this model can accurately preserve public information while only masking or de-identifying sensitive data associated with specific individuals, thereby helping developers build stronger privacy protection mechanisms in training pipelines, indexing processes, logging, and auditing workflows. the model supports a context window of up to 128,000 tokens and uses a constrained viterbi algorithm to decode coherent segments. in the pii-masking-300k benchmark test, privacy filter achieved an f1 score of 96%, which further improved to 97.43% after addressing annotation issues, demonstrating its high efficiency in identifying personal sensitive information.
however, openai explicitly states that privacy filter is not an anonymization tool and cannot replace compliance certification. in highly sensitive domains such as law, healthcare, and finance, human review and domain-specific evaluation and fine-tuning remain essential. in addition, the model is designed to run on local devices, so users need not worry about personal information leakage when using ai tools.