Doc Scrubber Tutorial: Remove Metadata and Hidden Content in Minutes
What Doc Scrubber does
Doc Scrubber removes hidden metadata and embedded content from Office documents and PDFs so you can share files without exposing author names, revision history, comments, hidden text, embedded objects, or tracked changes.
Before you start
- Backup: Save a copy of the original file.
- Install: Ensure Doc Scrubber (or similar tool) is installed and updated.
- File types: This tutorial assumes Word (.docx), Excel (.xlsx), PowerPoint (.pptx), and PDF files.
Quick step-by-step (under 5 minutes)
- Open Doc Scrubber. Launch the app on your computer.
- Load file. Click “Open” or drag the document into the window.
- Select scan scope. Choose the types of hidden data to check (metadata, comments, tracked changes, hidden text, embedded files, macros, document properties, personal info).
- Run scan. Click “Scan” or “Analyze” — the tool will list findings grouped by category.
- Review results. Quickly inspect flagged items. Common entries:
  - Author, last modified by
  - Comments and annotations
  - Tracked changes and revision marks
  - Hidden text and white-on-white text
  - Embedded objects (templates, OLE objects)
  - Custom document properties
  - Macros and VBA code
- Remove or keep. Use the checkboxes next to each finding to remove all items or only selected ones. When unsure, remove metadata and comments and keep visible content.
- Save cleaned file. Use “Save As” to create a scrubbed copy (recommended filename: filename_cleaned.ext).
- Verify. Re-run the scan on the cleaned copy to confirm no hidden items remain.
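For an independent spot check of the "Verify" step, the author fields can also be read straight from the file. The sketch below is not part of Doc Scrubber; it assumes an Office Open XML file (.docx, .xlsx, .pptx), which is a ZIP package that normally stores these fields in docProps/core.xml. The function name read_core_properties is made up for illustration.

```python
import zipfile
import xml.etree.ElementTree as ET

# Namespaces used inside docProps/core.xml of an Office Open XML package.
NS = {
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def read_core_properties(path):
    """Return any author-related core properties still present in the file.

    `path` may be a filename or a binary file-like object.
    An empty dict means neither field was found (or the part is missing).
    """
    with zipfile.ZipFile(path) as zf:
        if "docProps/core.xml" not in zf.namelist():
            return {}
        root = ET.fromstring(zf.read("docProps/core.xml"))

    found = {}
    for label, tag in (("creator", "dc:creator"),
                       ("last_modified_by", "cp:lastModifiedBy")):
        element = root.find(tag, NS)
        if element is not None and element.text:
            found[label] = element.text
    return found
```

Calling read_core_properties("report_cleaned.docx") on a scrubbed copy should return an empty dict if the author fields were cleared. It only covers these two properties, so it complements a re-scan rather than replacing it.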
Options for advanced users
- Batch processing: Use the batch mode to scrub multiple files at once (ideal for folders of documents).
- Automations: Configure command-line options or watch-folders to auto-clean new files.
- Custom rules: Create profiles that always remove specific properties or preserve certain metadata fields.
- PDF specifics: Flatten annotations, sanitize XMP metadata, and remove embedded fonts only if the document still renders correctly without them.
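If your version has no batch mode, a small driver script can approximate it. The following is a rough sketch, not Doc Scrubber's own interface: batch_scrub and cleaned_name are hypothetical names, and scrub_one stands in for whatever cleaning step you actually use (for example, a wrapper around a command-line call). It follows the filename_cleaned.ext naming used earlier and never overwrites an existing file.

```python
from pathlib import Path

# File types covered by this tutorial.
SUPPORTED = {".docx", ".xlsx", ".pptx", ".pdf"}


def cleaned_name(path):
    """Map report.docx to report_cleaned.docx in the same folder."""
    return path.with_name(f"{path.stem}_cleaned{path.suffix}")


def batch_scrub(folder, scrub_one):
    """Call scrub_one(src, dst) for each supported file in `folder`.

    `scrub_one` is a placeholder for the real cleaning step.
    Returns the list of cleaned paths that were written.
    """
    written = []
    # sorted() takes a snapshot, so files created during the loop are not revisited.
    for src in sorted(Path(folder).iterdir()):
        if not src.is_file() or src.suffix.lower() not in SUPPORTED:
            continue
        if src.stem.endswith("_cleaned"):
            continue  # output of an earlier run
        dst = cleaned_name(src)
        if dst.exists():
            continue  # do not overwrite an existing cleaned copy
        scrub_one(src, dst)
        written.append(dst)
    return written
```

Running this on a schedule gives a simple stand-in for a watch-folder, since files that already have a cleaned copy are skipped on later runs.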
Best practices
- Always keep an original backup.
- Maintain an audit log of scrubbed files (filename, date, who scrubbed).
- For legal or compliance-sensitive documents, consult legal counsel before removing revision history.
- Use password-protected archives when sharing cleaned files for additional security.
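The audit log mentioned above can be as simple as a CSV file. Here is a minimal sketch assuming one row per scrubbed file with the filename, a UTC timestamp, and the user. The SHA-256 column is an extra added for illustration, so the logged copy can be matched to the shared file later. The function name log_scrub and the column names are assumptions, not anything Doc Scrubber produces.

```python
import csv
import getpass
import hashlib
from datetime import datetime, timezone
from pathlib import Path

HEADER = ["filename", "scrubbed_at_utc", "scrubbed_by", "sha256"]


def log_scrub(log_path, cleaned_file, user=None):
    """Append one audit row for `cleaned_file` and return that row."""
    cleaned_file = Path(cleaned_file)
    log_path = Path(log_path)
    row = [
        cleaned_file.name,
        datetime.now(timezone.utc).isoformat(timespec="seconds"),
        user or getpass.getuser(),  # fall back to the logged-in account
        hashlib.sha256(cleaned_file.read_bytes()).hexdigest(),
    ]
    is_new = not log_path.exists()
    with log_path.open("a", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        if is_new:
            writer.writerow(HEADER)  # write the header only once
        writer.writerow(row)
    return row
```

A call such as log_scrub("scrub_audit.csv", "report_cleaned.docx") after each save keeps the log in step with the files you share.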
Troubleshooting
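- If tracked changes reappear: make sure you accepted or rejected all changes before scrubbing, or use the tool’s “accept changes” option.
- If macros are not removed: remove the VBA project manually, or save the file in a macro-free format (.docx, .xlsx, .pptx). Disabling macros in the application settings only stops them from running; it does not remove them from the file.
- If large files hang during the scan: try processing smaller batches or increasing the app's memory limit.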
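When it is unclear whether tracked changes or macros really survived, a quick look inside the file can help. The sketch below assumes a Word file (.docx or .docm) in the common "transitional" Office Open XML format. It counts only tracked insertions and deletions (w:ins and w:del) and checks for a comments part and a VBA project. The function name leftover_items is made up for illustration, and other revision types, such as moves and formatting changes, are not covered.

```python
import zipfile
import xml.etree.ElementTree as ET

# Main WordprocessingML namespace (transitional OOXML).
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def leftover_items(path):
    """Report tracked insertions/deletions, a comments part, and a VBA project.

    `path` may be a filename or a binary file-like object.
    """
    findings = {"tracked_changes": 0, "comments_part": False, "vba_project": False}
    with zipfile.ZipFile(path) as zf:
        names = zf.namelist()
        findings["comments_part"] = "word/comments.xml" in names
        findings["vba_project"] = any(
            name.lower().endswith("vbaproject.bin") for name in names
        )
        if "word/document.xml" in names:
            root = ET.fromstring(zf.read("word/document.xml"))
            for tag in ("ins", "del"):
                # Each w:ins / w:del element marks one tracked revision run.
                findings["tracked_changes"] += sum(
                    1 for _ in root.iter(f"{{{W}}}{tag}")
                )
    return findings
```

A result with a non-zero tracked_changes count or vba_project set to True suggests the corresponding troubleshooting step above is still needed.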
Quick checklist
- Backup original — done
- Scan for metadata — done
- Remove comments/tracked changes — done
- Remove embedded objects/macros — done
- Save cleaned copy — done
- Re-scan to verify — done
Follow these steps and you can remove metadata and hidden content from most documents in minutes, reducing risk when sharing files.