Files
PDFToMD/.codex/commands/sample-audit.md
T
김경종 7e985ae94a add files
2026-04-30 17:05:19 +09:00

857 B

description, argument-hint, allowed-tools
description argument-hint allowed-tools
Audit samples/ PDFs for page counts, text-layer quality, images, and OCR candidates.
pdf-glob-or-empty
Read
Glob
Bash
Write
Edit

/sample-audit

Arguments

The user invoked this command with: $ARGUMENTS

Workflow

  1. Read AGENTS.md, PLAN.md, PROGRESS.md, and docs/CONVERSION_POLICY.md.
  2. Use PyMuPDF from .\venv to inspect matching samples/*.pdf files.
  3. Report page count, first-page text length, image counts, suspected scan/OCR pages, Korean filename coverage, and obvious layout risks.
  4. If the user asks to write metadata, create or update samples/metadata.json; otherwise only report.
  5. Update PROGRESS.md when files are changed.

Output

  • Corpus Summary
  • Per-PDF Traits
  • OCR Candidates
  • Test Implications
  • Recommended Metadata Changes