modify documents
This commit is contained in:
@@ -4,7 +4,7 @@ This file is the shared work plan for agents. Read it before starting work, then
|
||||
|
||||
## Current Goal
|
||||
|
||||
CUDA-enabled PyTorch and MinerU 3.1.0 runtime setup is complete in the project `.venv`. Sprint 10 pre-conversion PDF chunking is implemented; next work is optional real local sample validation only if requested.
|
||||
Completed work history is archived in `docs/WORKARCHIVE.md`. CUDA-enabled PyTorch and MinerU 3.1.0 runtime setup is complete in the project `.venv`; Sprint 10 pre-conversion PDF chunking is implemented; next work is optional real local sample validation only if requested.
|
||||
|
||||
## Active Constraints
|
||||
|
||||
@@ -34,6 +34,7 @@ CUDA-enabled PyTorch and MinerU 3.1.0 runtime setup is complete in the project `
|
||||
11. Use `evaluation-agent` as the independent contract reviewer and QA evaluator before and after each implementation chunk.
|
||||
12. Follow `docs/V1IMPLEMENTATIONPLAN.md` for the v1 implementation sprint sequence.
|
||||
13. Use `docs/Sprints/SPRINT10CONTRACT.md` for the implemented long-PDF pre-conversion chunking sprint.
|
||||
14. Use `docs/WORKARCHIVE.md` for completed sprint history, prior verification, runtime setup evidence, and sample conversion evidence.
|
||||
|
||||
## Open Questions
|
||||
|
||||
@@ -43,6 +44,7 @@ CUDA-enabled PyTorch and MinerU 3.1.0 runtime setup is complete in the project `
|
||||
|
||||
- Use `PLAN.md` for intended work and ownership.
|
||||
- Use `PROGRESS.md` for completed work, current status, blockers, and next actions.
|
||||
- Use `docs/WORKARCHIVE.md` for archived completed work and historical handoff details.
|
||||
- MinerU default local CLI execution is the only v1 execution mode.
|
||||
- MinerU 3.1.0 may launch a temporary local `mineru-api` internally when `mineru` CLI runs without `--api-url`.
|
||||
- Strict-local mode forbids `--api-url`, remote APIs, router mode, HTTP client backends, and remote OpenAI-compatible backends.
|
||||
@@ -81,7 +83,6 @@ CUDA-enabled PyTorch and MinerU 3.1.0 runtime setup is complete in the project `
|
||||
- The MinerU adapter maps CUDA device requests to local subprocess environment variables instead of adding speculative MinerU CLI flags.
|
||||
- GTX 1070 Ti local runtime uses PyTorch `2.6.0+cu126` and `torchvision 0.21.0+cu126` installed after `uv sync`, followed by `mineru[core]==3.1.0`.
|
||||
- MinerU models are downloaded with `mineru-models-download -s huggingface -m all`, and runtime model loading uses `MINERU_MODEL_SOURCE=local`.
|
||||
- Sprint 10 should use `pypdf` for local 20-page PDF chunk creation if implementation is approved.
|
||||
- Sprint 10 uses `pypdf` for local PDF page chunk planning and temporary chunk PDF writing.
|
||||
- Sprint 10 converts chunk PDFs independently and does not merge generated Markdown outputs.
|
||||
- Chunking is opt-in through `--chunk-pages`; if the option is present without a value, the CLI uses 20 pages per chunk.
|
||||
|
||||
Reference in New Issue
Block a user