| description | argument-hint | allowed-tools |
|---|---|---|
| Audit samples/ PDFs for page counts, text-layer quality, images, and OCR candidates. | | |
/sample-audit
Arguments
The user invoked this command with: $ARGUMENTS
Workflow
- Read `AGENTS.md`, `PLAN.md`, `PROGRESS.md`, and `docs/CONVERSION_POLICY.md`.
- Use PyMuPDF from `.\venv` to inspect matching `samples/*.pdf` files.
- Report page count, first-page text length, image counts, suspected scan/OCR pages, Korean filename coverage, and obvious layout risks.
- If the user asks to write metadata, create or update `samples/metadata.json`; otherwise only report.
- Update `PROGRESS.md` when files are changed.
Output
- Corpus Summary
- Per-PDF Traits
- OCR Candidates
- Test Implications
- Recommended Metadata Changes
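
When the user does ask for metadata to be written, the recommended changes could be applied with a small merge step like the sketch below. The layout of `samples/metadata.json` is not specified here, so the shape used (one object keyed by PDF filename) and the `update_metadata` helper are assumptions for illustration only.

```python
import json
from pathlib import Path


def update_metadata(metadata_path: Path, changes: dict) -> dict:
    """Merge per-PDF field changes into the metadata file, creating it if missing.

    Assumes a hypothetical layout: {"<filename>.pdf": {"field": value, ...}, ...}.
    """
    if metadata_path.exists():
        data = json.loads(metadata_path.read_text(encoding="utf-8"))
    else:
        data = {}

    for filename, fields in changes.items():
        # Keep existing fields for a file and overwrite only the ones that changed.
        data.setdefault(filename, {}).update(fields)

    # ensure_ascii=False keeps Korean filenames readable in the written file.
    metadata_path.write_text(
        json.dumps(data, ensure_ascii=False, indent=2, sort_keys=True) + "\n",
        encoding="utf-8",
    )
    return data
```

Merging rather than overwriting keeps any hand-edited fields intact, which matters if the metadata file is also maintained manually.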