Files
PDFToMD/PROGRESS.md
T
2026-05-14 10:16:59 +09:00

57 lines
5.0 KiB
Markdown

# PROGRESS.md
This file records current progress for agents. Read it before starting work, then update it after meaningful changes. Completed historical work is archived in `docs/WORKARCHIVE.md`.
## Current Status
- Project direction is documented in `PRD.md`, `ARCHITECTURE.md`, `AGENTS.md`, and `docs/KNOWLEDGEBASE.md`.
- MinerU 3.1.0 is fixed as the only conversion engine.
- The converter currently includes path planning, project-owned records, internal provenance, direct local MinerU adapter boundary, Obsidian Markdown normalization, local quality checks, report rendering, conversion orchestration, simplified output layout, `pdf2md convert`, legacy `pdf2md recheck`, `pdf2md doctor`, local MathJax render checking, conservative MathJax warning mitigation, release-gate tests, opt-in grouped page conversion, a minimal Windows UI launcher with direct-folder PDF batch conversion, pypdf-based text layer fidelity diagnostics, NVIDIA GPU inventory, optional `--gpu auto`, and MinerU profile tuning.
- `docs/V1IMPLEMENTATIONPLAN.md` now tracks current v1 state and open future decisions; completed implementation details are archived in `docs/WORKARCHIVE.md`.
- `docs/Sprints/` contains completed sprint contracts through Sprint 16 and the abandoned Sprint 17 offline installer contract.
- `docs/superpowers/specs/2026-05-13-ui-folder-batch-conversion-design.md` and `docs/superpowers/plans/2026-05-13-ui-folder-batch-conversion.md` record the completed UI direct-folder batch work.
- `docs/WORKARCHIVE.md` contains completed sprint history, historical verification results, runtime setup notes, sample conversion evidence, archived UI work, and abandoned Sprint 17 planning context.
- `samples/` exists locally as fixture context.
- `outputs/`, `build/`, and `dist/` are local generated artifact locations and must stay out of commits.
## Environment Notes
- OS/workspace: Windows PowerShell in `D:\Work\Repos\AICoding\ConvertPDFToMD`.
- Python target: 3.12.
- Local project Python observed: 3.12.7 through `uv run pdf2md doctor` on 2026-05-11.
- `uv` is installed per-user at `C:\Users\user\.local\bin`.
- Target GPU documented for this project setup: NVIDIA GTX 1070 Ti 8GB.
- Current PC GPU observed by `doctor`: NVIDIA GeForce GTX 1070 Ti 8GB.
- Default conversion device: `cuda:0`.
- Default MinerU profile: `auto`.
- MinerU execution mode: direct local `mineru` CLI only.
- Strict-local allows MinerU 3.1.0's CLI-internal temporary local `mineru-api` when the CLI runs without `--api-url`.
- Strict-local prohibits `--api-url`, remote APIs, router mode, HTTP client backends, and remote OpenAI-compatible backends.
- Current `.venv` has project fast-test dependencies, CUDA-enabled PyTorch `2.6.0+cu126`, `torchvision 0.21.0+cu126`, `mineru[core]==3.1.0`, local MathJax npm dependencies, and local MinerU models.
- Current `pdf2md doctor` status is WARN because GTX 1070 Ti is Pascal/pre-Turing; MinerU, CUDA PyTorch visibility, local model config, MathJax, and strict-local checks otherwise pass. Doctor selects `cuda:0` for `--gpu auto` on this machine and recommends MinerU profile `safe`.
- MinerU models were downloaded from Hugging Face by explicit setup command. Runtime model loading uses `MINERU_MODEL_SOURCE=local`.
## Recent Completed Work
- Archived completed coordination details from `PLAN.md`, `PROGRESS.md`, and `docs/V1IMPLEMENTATIONPLAN.md` into `docs/WORKARCHIVE.md`.
- Refreshed current docs so abandoned Sprint 17 offline installer planning, completed UI direct-folder batch conversion, simplified output layout, legacy-only `recheck`, and no-public-metadata behavior are consistently referenced.
- Updated project agent/source-document references so future document reviews and implementation work can find Sprint 15/16 contracts, abandoned Sprint 17 context, and the UI folder batch design/plan.
- Abandoned Sprint 17 offline installer planning at the user's request. The contract and plan remain as historical records only.
## In Progress
- No active implementation sprint.
## Blockers
- No active blocker.
- Residual risk: direct CLI conversion smokes for `samples\FourNodeQuadrilateralShellElementMITC4.pdf` exceeded the 15-minute timeout on 2026-05-11 and stalled on source page 2 with Sprint 14 `--chunk-pages` on 2026-05-12, so hands-on UI conversion smoke remains pending.
- Residual risk: conversion can still be impractically slow or stall on GTX 1070 Ti 8GB for some source pages even when Sprint 14 sends one source page to MinerU at a time.
## Next Actions
1. Run hands-on UI smoke when practical: launch `dist\pdf2md-ui.exe`, click Doctor, then run one small local conversion to ignored `outputs/`.
2. Preserve strict-local runtime behavior: use local model paths, direct CLI execution, and no user-specified API or remote backend.
3. Decide in a future sprint whether simplified outputs need metadata-free `pdf2md recheck`; current behavior intentionally remains legacy-only.
4. On a stronger NVIDIA GPU PC, run `uv run pdf2md doctor` and an optional local conversion with `--gpu auto --mineru-profile auto` to validate the auto profile against ignored `outputs/`.