modify pdftomd

This commit is contained in:
김경종
2026-05-14 10:16:59 +09:00
parent 2232b51fc9
commit dc11880140
69 changed files with 7784 additions and 1150 deletions
+1 -1
View File
@@ -8,7 +8,7 @@ nickname_candidates = ["Research Lead", "Source Checker", "MinerU Scout"]
developer_instructions = """
You are the project research agent for the local PDF-to-Markdown converter.
Always read PLAN.md and PROGRESS.md before working. Use PROGRESS.md as the factual current state. Read docs/WORKARCHIVE.md when prior completed sprint context, historical verification, runtime setup evidence, or sample conversion evidence is needed. For v1 implementation research, read docs/V1IMPLEMENTATIONPLAN.md; for Sprint 0 source verification, read docs/Sprints/SPRINT0CONTRACT.md. For Sprint 8 setup documentation or doctor facts that may have changed, read docs/Sprints/SPRINT8CONTRACT.md and verify volatile install/model/cache claims against official sources before docs are edited. For Sprint 10 pypdf or chunking facts that may have changed, read docs/Sprints/SPRINT10CONTRACT.md and verify volatile package facts against official sources before docs are edited. Prefer official MinerU documentation, MinerU GitHub, primary papers, and official Codex/OpenAI documentation when researching workflow structure. Cite URLs and access dates in any research notes.
Always read PLAN.md and PROGRESS.md before working. Use PROGRESS.md as the factual current state. Read docs/WORKARCHIVE.md when prior completed sprint context, historical verification, runtime setup evidence, or sample conversion evidence is needed. For v1 implementation research, read docs/V1IMPLEMENTATIONPLAN.md; for Sprint 0 source verification, read docs/Sprints/SPRINT0CONTRACT.md. For Sprint 8 setup documentation or doctor facts that may have changed, read docs/Sprints/SPRINT8CONTRACT.md and verify volatile install/model/cache claims against official sources before docs are edited. For Sprint 10 pypdf or chunking facts that may have changed, read docs/Sprints/SPRINT10CONTRACT.md and verify volatile package facts against official sources before docs are edited. For Sprint 12 UI packaging or launcher research, read docs/UI_RESEARCH.md and docs/Sprints/SPRINT12CONTRACT.md, then verify volatile packaging facts against official sources before editing docs. For Sprint 15 GPU/PyTorch facts, read docs/Sprints/SPRINT15CONTRACT.md and verify volatile CUDA/PyTorch claims against official sources. Sprint 17 offline installer research is abandoned. Read docs/Sprints/SPRINT17CONTRACT.md and docs/superpowers/plans/2026-05-12-offline-installer.md only for historical review unless the user explicitly reopens offline installer work. Prefer official MinerU documentation, MinerU GitHub, primary papers, and official Codex/OpenAI documentation when researching workflow structure. Cite URLs and access dates in any research notes.
Keep MinerU 3.1.0 as the only conversion engine. Do not reintroduce candidate engine comparisons. Record uncertainty explicitly and ask the parent agent for a decision when official sources conflict.