PIIFILL CLI v0.2.0
Secure your data. Mask sensitive information. Stay fully offline.
Overview
PIIFILL is a high-performance command-line utility designed to automatically detect and redact Personally Identifiable Information (PII) from your documents, datasets, and images.
Whether you are a developer sanitizing logs, a researcher preparing datasets, or a professional sharing sensitive files, PIIFILL provides a "security-first" workflow to ensure privacy compliance.
Advanced Recognition: Detects 50+ sensitive data types including names, SSNs, and financial records with high precision.
Local Integrity: A zero-telemetry policy keeps all operations on-device, which helps support GDPR and HIPAA compliance.
IMPORTANT: 100% Offline Processing
PIIFILL is built for privacy. All detection, masking, and OCR processing happen locally on your machine. Your data is never uploaded to any cloud service.
Industry research shows that automated PII masking can reduce accidental data exposure risks by over 99% in AI workflows.
Every byte is processed locally. With no cloud dependency, PIIFILL fits workflows that must comply with GDPR, HIPAA, and CCPA.
The detection models cover 50+ entity types, including SSNs, financial records, and medical identifiers.
What PII can it detect?
Phone numbers and email addresses.
SSNs and Aadhaar card details.
Credit/Debit card numbers.
Physical addresses and ZIP codes.
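To give a feel for what detecting these entity types involves, here is a minimal Python sketch. It is not PIIFILL's detection logic: the regular expressions, the `PATTERNS` table, and the `find_pii` helper are all hypothetical, and the Luhn checksum is simply a common way to filter out digit runs that cannot be card numbers.

```python
import re

# Illustrative patterns only (assumptions, not PIIFILL's actual rules).
PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "CARD": re.compile(r"\b(?:\d[ -]?){12,18}\d\b"),
}


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(c) for c in number if c.isdigit()]
    total = 0
    # Walk from the rightmost digit, doubling every second one.
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return len(digits) >= 13 and total % 10 == 0


def find_pii(text: str):
    """Return (label, matched_text) pairs for every candidate found."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            # Drop card-like digit runs that fail the checksum.
            if label == "CARD" and not luhn_valid(match.group()):
                continue
            hits.append((label, match.group()))
    return hits


sample = ("Contact jane@example.com, SSN 123-45-6789, "
          "card 4111 1111 1111 1111, ref 1234 5678 9012 3456.")
for label, value in find_pii(sample):
    print(label, value)
```

Pattern matching of this kind is only a first pass; names and physical addresses generally need context-aware models rather than regular expressions.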
Quick Start
Installation
Ensure you have Python 3.8+ installed. You can install PIIFILL directly via pip:
pip install piifill-cli
Usage Guide
PIIFILL follows a simple two-phase workflow: scan (to identify PII) and mask (to protect it).
1. Identify Privacy Risks (scan)
Use the scan command to audit your files. This is a read-only operation that provides a detailed report of potential PII without modifying your source files.
piifill scan sensitive_data.pdf
piifill scan ./private_docs/ --recursive
2. Protect Your Files (mask)
Once you have reviewed the scan report, use mask to generate sanitized versions of your files. By default, it creates an out/ directory with the protected copies.
piifill mask user_records.csv
piifill mask ./raw_logs/ -o ./sanitized_logs/
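If you want to script the two-phase workflow, one option is a small Python wrapper around the CLI. The sketch below uses only the commands and flags shown above (`scan`, `--recursive`, `mask`, `-o`). The helper names `build_commands` and `scan_then_mask` are hypothetical, and the sketch assumes that `piifill` exits with a non-zero status on failure, which this README does not confirm.

```python
import shutil
import subprocess


def build_commands(source: str, out_dir: str):
    """Build the scan and mask invocations as argument lists."""
    scan_cmd = ["piifill", "scan", source, "--recursive"]
    mask_cmd = ["piifill", "mask", source, "-o", out_dir]
    return scan_cmd, mask_cmd


def scan_then_mask(source: str, out_dir: str) -> None:
    """Run scan first, then mask; stop if either step fails."""
    if shutil.which("piifill") is None:
        raise RuntimeError("piifill is not on PATH; install it with pip first")
    scan_cmd, mask_cmd = build_commands(source, out_dir)
    # check=True raises CalledProcessError on a non-zero exit status
    # (assumes the CLI reports failures that way).
    subprocess.run(scan_cmd, check=True)
    subprocess.run(mask_cmd, check=True)
```

Calling `scan_then_mask("./raw_logs/", "./sanitized_logs/")` mirrors the two example commands above. Passing arguments as a list avoids shell quoting problems with unusual file names.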
Command Reference
| Command | Description | Key Options |
|---|---|---|
| scan | Detects PII and generates a risk report. | --recursive, --format |
| mask | Redacts PII and creates safe file copies. | -o (output), --mode |
| config | Displays current PIIFILL configuration. | N/A |
| version | Displays version and environment info. | N/A |
Masking Modes
Customize how PII is hidden using the --mode flag:
Replaces data with descriptive placeholders (e.g., [REDACTED]).
Completely removes the sensitive data from the file.
Replaces data with unique, trackable tokens (e.g., <EMAIL_123>).
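The difference between the three behaviors is easiest to see side by side. The Python sketch below applies each strategy to email addresses only. It is an illustration, not PIIFILL's implementation: the function names, the email pattern, and the token numbering scheme are assumptions, and they are not the values accepted by --mode.

```python
import re
from itertools import count

# Simplified email pattern, for illustration only.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def mask_placeholder(text: str) -> str:
    """Replace each match with a fixed descriptive placeholder."""
    return EMAIL.sub("[REDACTED]", text)


def mask_remove(text: str) -> str:
    """Delete each match outright."""
    return EMAIL.sub("", text)


def mask_tokens(text: str):
    """Replace each distinct value with a stable numbered token.

    Returns the masked text and the value-to-token mapping, so the
    same address always maps to the same token within one run.
    """
    tokens = {}
    counter = count(1)

    def replace(match):
        value = match.group()
        if value not in tokens:
            tokens[value] = f"<EMAIL_{next(counter)}>"
        return tokens[value]

    return EMAIL.sub(replace, text), tokens


sample = "Mail a@x.org or b@y.org, then a@x.org again."
print(mask_placeholder(sample))
print(mask_remove(sample))
print(mask_tokens(sample)[0])
```

Tokens keep repeated values linkable across a file, which placeholders and removal do not. The mapping itself is sensitive and should be stored as carefully as the original data.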
Security & Risk Analytics
PIIFILL doesn't just hide data—it helps you understand your privacy posture through integrated analytics:
Security Grade: Standardized rating (A-F) based on PII density.
Risk Score: Quantitative metric of data exposure severity.
Frequency: Detailed breakdown of detected entities.
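This README does not give the formulas behind these metrics. As a rough illustration of how such numbers could be derived from a list of detections, here is a Python sketch. Everything specific in it is an assumption: the severity weights, the density thresholds, the A/B/C/D/F letter scale, and the `summarize` helper are hypothetical and will not match PIIFILL's actual scoring.

```python
from collections import Counter

# Hypothetical severity weights per entity type (assumed values).
SEVERITY = {"SSN": 10, "CARD": 8, "EMAIL": 3}
DEFAULT_SEVERITY = 1

# Hypothetical upper bounds on PII density for each grade (assumed values).
GRADE_LIMITS = [(0.001, "A"), (0.005, "B"), (0.02, "C"), (0.05, "D")]


def grade_for(density: float) -> str:
    """Map PII density (detections per token) to a letter grade."""
    for limit, letter in GRADE_LIMITS:
        if density <= limit:
            return letter
    return "F"


def summarize(entities, total_tokens: int) -> dict:
    """Compute frequency, risk score, and grade for detected entities.

    `entities` is a list of (label, value) pairs.
    """
    frequency = Counter(label for label, _ in entities)
    density = len(entities) / total_tokens if total_tokens else 0.0
    risk_score = sum(
        SEVERITY.get(label, DEFAULT_SEVERITY) * n
        for label, n in frequency.items()
    )
    return {
        "frequency": dict(frequency),
        "risk_score": risk_score,
        "grade": grade_for(density),
    }


report = summarize(
    [("EMAIL", "a@x.org"), ("EMAIL", "b@y.org"), ("SSN", "123-45-6789")],
    total_tokens=1000,
)
print(report)
```

Separating the per-type frequency from the weighted score makes it possible to see both how much PII a file contains and how severe it is.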
Supported File Formats
| Category | Extensions |
|---|---|
| Structured Data | .csv, .json, .sql, .xlsx |
| Documents | .txt, .pdf, .docx |
| Images (OCR) | .png, .jpg, .jpeg |
TIP: Image Detection
PIIFILL uses built-in OCR capabilities to detect and mask text hidden inside screenshots and scanned documents.