Spreadsheets Are Not Documents
A Word file is a text stream. An Excel file is something else. Cells point to other cells. Formulas run on ranges. Pivot tables group named data. Macros move through the whole workbook. Most redaction tools treat Excel like a text document. That is the wrong model.
Here is a simple example. Column A has customer names. Column D has this formula: =VLOOKUP(A2, CustomerTable, 5, FALSE). This formula looks up the account balance by name. You replace the name in column A. You do not update the formula or the lookup table. The formula still returns the real balance for the original name. The file looks clean. It is not.
This is common in enterprise Excel files. Data lives in relationships — not just in cells. Replacing cell values without tracing those relationships leaves PII exposed.
GDPR Article 28 and External Sharing
GDPR Article 28 covers data sharing with processors. If you send personal data to a consultant, a vendor, or an auditor, you need technical safeguards in place.
Say you need to share a 50,000-row customer file with an analytics vendor. A PDF export strips formulas. It also breaks large files with complex formatting. A CSV strips formulas and pivot tables too. Neither gives the vendor a usable dataset.
The only option that works: anonymize inside the native Excel format. Replace the identifying values. Keep the structure. The vendor gets a working file. You meet the GDPR safeguard requirement.
Air-Gapped Environments
67% of government and defense procurement RFPs cite air-gapped environment requirements (DISA 2024). Defense contractors handle personnel data, logistics records, and procurement files in Excel. They cannot use cloud tools. The data cannot leave the controlled network.
The Desktop App solves this. It processes Excel files on the local machine. No network calls happen during processing. The output file never leaves the air-gapped environment. Internal teams can share clean files within the controlled network.
This meets the technical profile required for government contract compliance.
Three Levels of Cell Intelligence
Good Excel anonymization works at three levels at once.
Value level: Find and replace PII in individual cells. Names, emails, phone numbers, and national IDs are flagged using the same detection engine as document processing.
Formula level: Find cells whose formulas reference PII cells. Update those references to point to the anonymized values. Or replace the formula with its result to stop formula-based PII exposure.
Structure level: Clear pivot table data caches. Process hidden rows and columns. Handle VBA macro code that uses specific cell addresses or values.
All three levels must run together. Fixing values without fixing formulas leaves PII in place. Fixing formulas without clearing caches does the same.
This challenge spans every file format. See how format fragmentation affects PII detection for a broader view.
For teams working with structured data at API level, see GDPR data minimization in real-time APIs.
If your team runs large DSAR exports, see GDPR DSAR batch processing at scale for workflow patterns that apply here.