claw-code

mirror of https://github.com/instructkr/claw-code.git synced 2026-06-10 08:22:14 +08:00

Author	SHA1	Message	Date
Yeachan-Heo	2e34949507	Keep latest-session timestamps increasing under tight loops The next repo-local sweep target was ROADMAP #73: repeated backlog sweeps exposed that session writes could share the same wall-clock millisecond, which made semantic recency fragile and forced the resume-latest regression to sleep between saves. The fix makes session timestamps monotonic within the process and removes the timing hack from the test so latest-session selection stays stable under tight loops. Constraint: Preserve the existing session file format while changing only the timestamp source semantics Rejected: Keep the sleep-based test workaround \| hides the real ordering hazard instead of fixing timestamp generation Confidence: high Scope-risk: narrow Reversibility: clean Directive: Any future session-recency logic must keep `current_time_millis`, ordering tests, and latest-session expectations aligned Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-process monotonicity when multiple binaries write sessions concurrently	2026-04-12 10:51:19 +00:00
Yeachan-Heo	8f53524bd3	Make backlog-scan lanes say what they actually selected The next repo-local sweep target was ROADMAP #65: backlog-scanning lanes could stop with prose-only summaries naming roadmap items, but there was no machine-readable record of which items were chosen, which were skipped, or whether the lane intended to execute, review, or no-op. The fix teaches completed lane persistence to extract a structured selection outcome while preserving the existing quality- floor and review-verdict behavior for other lanes. Constraint: Keep selection-outcome extraction on the existing `lane.finished` metadata path instead of inventing a separate event stream Rejected: Add a dedicated selection event type first \| unnecessary for this focused closeout because `lane.finished` already persists structured data downstream can read Confidence: high Scope-risk: narrow Reversibility: clean Directive: If backlog-scan summary conventions change later, update `extract_selection_outcome`, its regression test, and the ROADMAP closeout wording together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE after roadmap closeout update Not-tested: Downstream consumers that may still ignore `lane.finished.data.selectionOutcome`	2026-04-12 09:54:37 +00:00
Yeachan-Heo	b5e30e2975	Make completed review lanes emit machine-readable verdicts The next repo-local sweep target was ROADMAP #67: scoped review lanes could stop with prose-only output, leaving downstream consumers to infer approval or rejection from later chatter. The fix teaches completed lane persistence to recognize review-style `APPROVE`/`REJECT`/`BLOCKED` results, attach structured verdict metadata to `lane.finished`, and keep ordinary non-review lanes on the existing quality-floor path. Constraint: Preserve the existing non-review lane summary path while enriching only review-style completions Rejected: Add a brand-new lane event type just for review results \| unnecessary when `lane.finished` already carries structured metadata and downstream consumers can read it there Confidence: high Scope-risk: narrow Reversibility: clean Directive: If review verdict parsing changes later, update `extract_review_outcome`, the finished-event payload fields, and the review-lane regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External consumers that may still ignore `lane.finished.data.reviewVerdict`	2026-04-12 08:49:40 +00:00
Yeachan-Heo	dbc2824a3e	Keep latest session selection tied to real session recency The next repo-local sweep target was ROADMAP #72: the `latest` managed-session alias could depend on filesystem mtime before the session's own persisted recency markers, which made the selection path vulnerable to coarse or misleading file timestamps. The fix promotes `updated_at_ms` into the summary/order path, keeps CLI wrappers in sync, and locks the mtime-vs-session-recency case with regression coverage. Constraint: Preserve existing managed-session storage layout while changing only the ordering signal Rejected: Keep sorting by filesystem mtime and just sleep longer in tests \| hides the semantic ordering bug instead of fixing it Confidence: high Scope-risk: narrow Reversibility: clean Directive: Any future managed-session ordering change must keep runtime and CLI summary structs aligned on the same recency fields Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-filesystem behavior where persisted session JSON cannot be read and fallback ordering uses mtime only	2026-04-12 07:49:32 +00:00
Yeachan-Heo	f309ff8642	Stop repo lanes from executing the wrong task payload The next repo-local sweep target was ROADMAP #71: a claw-code lane accepted an unrelated KakaoTalk/image-analysis prompt even though the lane itself was supposed to be repo-scoped work. This extends the existing prompt-misdelivery guardrail with an optional structured task receipt so worker boot can reject visible wrong-task context before the lane continues executing. Constraint: Keep the fix inside the existing worker_boot / WorkerSendPrompt control surface instead of inventing a new external OMX-only protocol Rejected: Treat wrong-task receipts as generic shell misdelivery \| loses the expected-vs-observed task context needed to debug contaminated lanes Confidence: high Scope-risk: narrow Reversibility: clean Directive: If task-receipt fields change later, update the WorkerSendPrompt schema, worker payload serialization, and wrong-task regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External orchestrators that have not yet started populating the optional task_receipt field	2026-04-12 07:00:07 +00:00
Yeachan-Heo	3b806702e7	Make the CLI point users at the real install source The next repo-local backlog item was ROADMAP #70: users could mistake third-party pages or the deprecated `cargo install claw-code` path for the official install route. The CLI now surfaces the source of truth directly in `claw doctor` and `claw --help`, and the roadmap closeout records the change. Constraint: Keep the fix inside repo-local Rust CLI surfaces instead of relying on docs alone Rejected: Close #70 with README-only wording \| the bug was user-facing CLI ambiguity, so the warning needed to appear in runtime help/doctor output Confidence: high Scope-risk: narrow Reversibility: clean Directive: If install guidance changes later, update both the doctor check payload and the help-text warning together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Third-party websites outside this repo that may still present stale install instructions	2026-04-12 04:50:03 +00:00
Yeachan-Heo	26b89e583f	Keep completed lanes from ending on mushy stop summaries The next repo-local sweep target was ROADMAP #69: completed lane runs could persist vague control text like “commit push everyting, keep sweeping $ralph”, which made downstream stop summaries operationally useless. The fix adds a lane-finished quality floor that preserves strong summaries, rewrites empty/control-only/too- short-without-context summaries into a contextual fallback, and records structured metadata explaining when the fallback fired. Constraint: Keep legitimate concise lane summaries intact while improving only low-signal completions Rejected: Blanket-rewrite every completed summary into a templated sentence \| would erase useful model-authored detail from good lane outputs Confidence: high Scope-risk: narrow Reversibility: clean Directive: If lane-finished summary heuristics change later, update the structured `qualityFloorApplied/rawSummary/reasons/wordCount` contract and its regression tests together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External OMX consumers that may still ignore the new lane.finished data payload	2026-04-12 03:23:39 +00:00
YeonGyu-Kim	17e21bc4ad	docs(roadmap): add #70 — install-source ambiguity misleads users User treated claw-code.io as official, hit clawcode vs deprecated claw-code naming collision. Adding requirement for canonical docs to explicitly state official source and warn against deprecated crate. Source: gaebal-gajae community watch 2026-04-12	2026-04-12 12:08:52 +09:00
Yeachan-Heo	4f83a81cf6	Make dump-manifests recoverable outside the inferred build tree The backlog sweep found that the user-cited #21-#23 items were already closed, and the next real pain point was `claw dump-manifests` failing without a direct way to point at the upstream manifest source. This adds an explicit `--manifests-dir` path, upgrades the failure messages to say whether the source root or required files are missing, and updates the ROADMAP closeout to reflect that #45 is now fixed. Constraint: Preserve existing dump-manifests behavior when no explicit override is supplied Rejected: Require CLAUDE_CODE_UPSTREAM for every invocation \| breaks existing build-tree workflows and is unnecessarily rigid Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep manifest-source override guidance centralized so future error-path edits do not drift Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Manual invocation against every legacy env-based manifest lookup layout	2026-04-12 02:57:11 +00:00
Yeachan-Heo	1d83e67802	Keep the backlog sweep from chasing external executor notes ROADMAP #31 described acpx/droid executor quirks, but a fresh repo-local search showed no implementation surface outside ROADMAP.md. This rewrites the local unpushed team checkpoint commits into one docs-only closeout so the branch reflects the real claw-code backlog instead of runtime-generated state. Constraint: Current evidence is limited to repo-local search plus existing prior closeouts Rejected: Leave team auto-checkpoint commits intact \| they pollute the branch with runtime state and obscure the actual closeout Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep generated .clawhip prompt-submit artifacts out of backlog closeout commits Tested: Repo-local grep evidence for #31/#63-#68 terms; ROADMAP.md line review; architect approval x2 Not-tested: Fresh remote/backlog audit beyond the current repo-local evidence set	2026-04-12 02:57:11 +00:00
YeonGyu-Kim	763437a0b3	docs(roadmap): add #69 — lane stop summary quality floor clawcode-human session stopped with sloppy summary ('commit push everyting, keep sweeping '). Adding requirement for minimum stop/result summary standards. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 11:18:18 +09:00
Yeachan-Heo	491386f0a5	Keep external orchestration gaps out of the claw-code sweep path ROADMAP #63-#68 describe OMX/Ultraclaw orchestration behavior, but a repo-local search shows those implementation markers do not exist in claw-code source. Marking that scope boundary directly in the roadmap keeps future backlog sweeps from repeatedly targeting the wrong repository. Constraint: Stay within claw-code repo scope while continuing the user-requested backlog sweep Rejected: Attempt repo-local fixes for #63-#68 \| implementation surface is absent from this repository Confidence: high Scope-risk: narrow Reversibility: clean Directive: Treat #63-#68 as external tracking notes unless claw-code later grows the corresponding orchestration/runtime surface Tested: Repo-local search for acpx/ultraclaw/roadmap-nudge-10min/OMX_TMUX_INJECT outside ROADMAP.md Not-tested: No code/test/static-analysis rerun because the change is docs-only	2026-04-12 02:14:43 +00:00
Yeachan-Heo	5c85e5ad12	Keep the worker-state backlog honest with current main behavior ROADMAP #62 was stale. Current main already emits `.claw/worker-state.json` on worker status transitions and exposes the documented `claw state` reader surface, so leaving the item open would keep sending future backlog passes after already-landed work. Fresh verification on the exact branch confirmed the implementation and left the workspace green, so this commit closes the item with current proof instead of duplicating the feature. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before push Constraint: OMX team runtime was explicitly requested, but the verification lane stalled before producing any diff Rejected: Re-implement the worker-state feature from scratch \| current main already contains the runtime hook, CLI surface, and regression coverage Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #62 only with a fresh repro showing missing `.claw/worker-state.json` writes or a broken `claw state` surface on current main Tested: cargo test -p runtime emit_state_file_writes_worker_status_on_transition -- --nocapture; cargo test -p tools recovery_loop_state_file_reflects_transitions -- --nocapture; cargo test -p rusty-claude-cli removed_login_and_logout_subcommands_error_helpfully -- --nocapture; cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: No dedicated automated end-to-end CLI regression for reading `.claw/worker-state.json` beyond parser coverage and focused smoke validation	2026-04-12 01:51:15 +00:00
Yeachan-Heo	b825713db3	Retire the stale slash-command backlog item without breaking verification ROADMAP #39 was stale: current main already hides the unimplemented slash commands from the help/completion surfaces that triggered the original report, so the backlog entry should be marked done with current evidence instead of staying open forever. While rerunning the user's required Rust verification gates on the exact commit we planned to push, clippy exposed duplicate and unused imports in the plugin state-isolation files. Folding those cleanup fixes into the same closeout keeps the proof honest and restores a green workspace before the backlog retirement lands. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before push Rejected: Push the roadmap-only closeout without fixing the workspace \| would violate the required verification gate and leave main red Confidence: high Scope-risk: narrow Reversibility: clean Directive: Re-run the full Rust workspace gates on the exact commit you intend to push when retiring stale roadmap items Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No manual interactive REPL completion/help smoke test beyond the existing automated coverage	2026-04-12 00:59:29 +00:00
YeonGyu-Kim	06d1b8ac87	docs(roadmap): add #68 — internal reinjection/resume path opacity OMX lanes leaking internal control prose like [OMX_TMUX_INJECT] instead of operator-meaningful state. Adding requirement for structured recovery/reinject events with clear cause, preserved state, and target lane info. Also fixes merge conflict in test_isolation.rs. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 08:53:10 +09:00
Yeachan-Heo	4f84607ad6	Align the plugin-state isolation roadmap note with current green verification The roadmap still implied that the ambient-plugin-state isolation work sat outside a green full-workspace verification story. Current main already has both the test-isolation helpers and the host-plugin-leakage regression, and the required workspace fmt/clippy/test sequence is green. This updates the remaining stale roadmap wording to match reality. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave the stale note in place \| contradicts the current verified workspace state Confidence: high Scope-risk: narrow Reversibility: clean Directive: When backlog items are retired as stale, update any nearby stale verification caveats in the same pass Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No additional runtime behavior beyond already-covered regression paths	2026-04-11 23:51:00 +00:00
Yeachan-Heo	8eb93e906c	Retire the stale bare-word skill discovery backlog item ROADMAP #36 remained open even though current main already resolves bare project skill names in the REPL through `resolve_skill_invocation()` instead of forwarding them to the model. This change adds direct regression coverage for the known-skill dispatch path and the unknown-skill/non-skill bypass, then marks the roadmap item done with fresh proof. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #36 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #36 only with a fresh repro showing a listed project skill still falls through to plain prompt handling on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No interactive manual REPL session beyond the new bare-skill unit coverage	2026-04-11 23:45:46 +00:00
Yeachan-Heo	264fdc214e	Retire the stale bare-skill dispatch backlog item ROADMAP #36 remained open even though current main already dispatches bare skill names in the REPL through skill resolution instead of forwarding them to the model. This change adds a direct regression test for that behavior and marks the backlog item done with fresh verification evidence. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #36 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #36 only with a fresh repro showing a listed project skill still falls through to plain prompt handling on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No interactive manual REPL session beyond the new bare-skill unit coverage	2026-04-11 22:50:28 +00:00
Yeachan-Heo	a4921cb262	Retire the stale gpt-5 max-completion-tokens backlog item ROADMAP #35 remained open even though current main already switches OpenAI-compatible gpt-5 requests from `max_tokens` to `max_completion_tokens` and has regression coverage for that behavior. This change marks the backlog item done with fresh proof from the current workspace. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #35 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #35 only with a fresh repro showing gpt-5 requests emit max_tokens instead of max_completion_tokens on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo test -p api gpt5_uses_max_completion_tokens_not_max_tokens -- --nocapture Not-tested: No live external OpenAI-compatible backend run beyond the existing automated coverage	2026-04-11 21:45:49 +00:00
Yeachan-Heo	d40929cada	Retire the stale OpenAI reasoning-effort backlog item ROADMAP #34 was still open even though current main already carries the reasoning-effort parity fix for the OpenAI-compatible path. This change marks it done with fresh proof from current tests and documents the historical commits that landed the implementation. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #34 open because implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #34 only with a fresh repro that OpenAI-compatible reasoning-effort is absent on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo test -p api reasoning_effort -- --nocapture; cargo test -p rusty-claude-cli reasoning_effort -- --nocapture Not-tested: No live external OpenAI-compatible backend run beyond the existing automated coverage	2026-04-11 20:47:08 +00:00
Yeachan-Heo	2d5f836988	Retire the stale broken-plugin warning backlog item ROADMAP #40 was still listed as open even though current main already keeps valid plugins visible while surfacing broken-plugin load failures. This change adds a direct command-surface regression test for the warning block and marks #40 done with fresh verification evidence. Constraint: User required fresh cargo fmt/clippy/test evidence before closing any backlog item Rejected: Leave #40 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #40 only with a fresh repro showing broken installed plugins are hidden or warning-free on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo test -p plugins plugin_registry_report_collects_load_failures_without_dropping_valid_plugins -- --nocapture; cargo test -p plugins installed_plugin_registry_report_collects_load_failures_from_install_root -- --nocapture Not-tested: No interactive manual /plugins list run beyond automated command-layer rendering coverage	2026-04-11 19:47:21 +00:00
YeonGyu-Kim	4e199ec52a	docs(roadmap): add #67 — structured review verdict events Scoped review lanes now have clear scope but still emit only the review request in stop events, not the actual verdict. Adding requirement for structured approve/reject/blocked events. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 04:00:41 +09:00
Yeachan-Heo	a7b1fef176	Keep the rebased workspace green after the backlog closeout The ROADMAP #38 closeout was rebased onto a moving main branch. That pulled in new workspace files whose clippy/rustfmt fixes were required for the exact verification gate the user asked for. This follow-up records those remaining cleanups so the pushed branch matches the green tree that was actually tested. Constraint: The user-required full-workspace fmt/clippy/test sequence had to stay green after rebasing onto newer origin/main Rejected: Leave the rebase cleanup uncommitted locally \| working tree would stay dirty and the pushed branch would not match the verified code Confidence: high Scope-risk: narrow Reversibility: clean Directive: When rebasing onto a moving main, commit any gate-fixing follow-up so pushed history matches the verified tree Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No additional behavior beyond the already-green verification sweep	2026-04-11 18:52:48 +00:00
Yeachan-Heo	12d955ac26	Close the stale dead-session opacity backlog item with verified probe coverage ROADMAP #38 stayed open even though the runtime already had a post-compaction session-health probe. This change adds direct regression tests for that health probe behavior and marks the roadmap item done. While re-running the required workspace verification after a remote rebase, a small set of upstream clippy / compile issues in plugins and test-isolation code also had to be repaired so the user-requested full fmt/clippy/test sequence could pass on the rebased main. Constraint: User required cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before commit/push Constraint: Remote main advanced during execution, so the change had to be rebased and re-verified before push Rejected: Leave #38 open because the implementation pre-existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: moderate Reversibility: clean Directive: Reopen #38 only with a fresh compaction-vs-broken-surface repro on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No live long-running dogfood session replay beyond the new runtime regression tests	2026-04-11 18:52:02 +00:00
Yeachan-Heo	257aeb82dd	Retire the stale dead-session opacity backlog item with regression proof ROADMAP #38 no longer reflects current main. The runtime already runs a post-compaction session-health probe, but the backlog lacked explicit regression proof. This change adds focused tests for the two important behaviors: a broken tool surface aborts a compacted session with a targeted error, while a freshly compacted empty session does not false-positive as dead. With that proof in place, the roadmap item can be marked done. Constraint: User required fresh cargo fmt/clippy/test evidence before closing any backlog item Rejected: Leave #38 open because the implementation already existed \| backlog stays stale and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #38 only with a fresh same-turn repro that bypasses the current health-probe gate Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No live long-running dogfood session replay beyond existing automated coverage	2026-04-11 18:47:37 +00:00
YeonGyu-Kim	7ea4535cce	docs(roadmap): add #65 backlog selection outcomes, #66 completion-aware reminders ROADMAP #65: Team lanes need structured selection events (chosenItems, skippedItems, rationale) instead of opaque prose summaries. ROADMAP #66: Reminder/cron should auto-expire when terminal task completes — currently keeps firing after work is done. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 03:43:58 +09:00
YeonGyu-Kim	2329ddbe3d	docs(roadmap): add #64 — structured artifact events Artifact provenance currently requires post-hoc narration to reconstruct what landed. Adding requirement for first-class events with sourceLanes, roadmapIds, diffStat, verification state. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 03:31:36 +09:00
YeonGyu-Kim	56b4acefd4	docs(roadmap): add #63 — droid session completion semantics broken Documents the late-arriving droid output issue discovered during ultraclaw batch processing. Sessions report completion before file writes are fully flushed to working tree. Source: ultraclaw dogfood 2026-04-12	2026-04-12 03:30:50 +09:00
YeonGyu-Kim	16b9febdae	feat: ultraclaw droid batch — ROADMAP #41 test isolation + #50 PowerShell permissions Merged late-arriving droid output from 10 parallel ultraclaw sessions. ROADMAP #41 — Test isolation for plugin regression checks: - Add test_isolation.rs module with env_lock() for test environment isolation - Redirect HOME/XDG_CONFIG_HOME/XDG_DATA_HOME to unique temp dirs per test - Prevent host ~/.claude/plugins/ from bleeding into test runs - Auto-cleanup temp directories on drop via RAII pattern - Tests: 39 plugin tests passing ROADMAP #50 — PowerShell workspace-aware permissions: - Add is_safe_powershell_command() for command-level permission analysis - Add is_path_within_workspace() for workspace boundary validation - Classify read-only vs write-requiring bash commands (60+ commands) - Dynamic permission requirements based on command type and target path - Tests: permission enforcer and workspace boundary tests passing Additional improvements: - runtime/src/permission_enforcer.rs: Dynamic permission enforcement layer - check_with_required_mode() for dynamically-determined permissions - 60+ read-only command patterns (cat, find, grep, cargo, git, jq, yq, etc.) - Workspace-path detection for safe commands - compat-harness/src/lib.rs: Compat harness updates for permission testing - rusty-claude-cli/src/main.rs: CLI integration for permission modes - plugins/src/lib.rs: Updated imports for test isolation module Total: +410 lines across 5 files Workspace tests: 448+ passed Droid source: ultraclaw-04-test-isolation, ultraclaw-08-powershell-permissions Ultraclaw total: 4 ROADMAP items committed (38, 40, 41, 50)	2026-04-12 03:06:24 +09:00
Yeachan-Heo	723e2117af	Retire the stale plugin lifecycle flake backlog item ROADMAP #24 no longer reproduces on current main. Both focused plugin lifecycle tests pass in isolation and the current full workspace test run includes them as green, so the backlog entry was stale rather than still actionable. Constraint: User explicitly required re-verifying with cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #24 open without a fresh repro \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #24 only with a fresh parallel-execution repro on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo test -p rusty-claude-cli build_runtime_runs_plugin_lifecycle_init_and_shutdown -- --nocapture; cargo test -p plugins plugin_registry_runs_initialize_and_shutdown_for_enabled_plugins -- --nocapture Not-tested: No synthetic stress harness beyond the existing workspace-parallel run	2026-04-11 17:49:10 +00:00
Yeachan-Heo	0082bf1640	Align auth docs with the removed login/logout surface The ROADMAP #37 code path was correct, but the Rust and usage guides still advertised `claw login` / `claw logout` and OAuth-login wording after the command surface had been removed. This follow-up updates both docs to point users at `ANTHROPIC_API_KEY` or `ANTHROPIC_AUTH_TOKEN` only and removes the stale command examples. Constraint: Prior follow-up review rejected the closeout until user-facing auth docs matched the landed behavior Rejected: Leave docs stale because runtime behavior was already correct \| contradicts shipped CLI and re-opens support confusion Confidence: high Scope-risk: narrow Reversibility: clean Directive: When auth policy changes, update both rust/README.md and USAGE.md in the same change as the code surface Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: External rendered-doc consumers beyond repository markdown	2026-04-11 17:28:47 +00:00
Yeachan-Heo	124e8661ed	Remove the deprecated Claude subscription login path and restore a green Rust workspace ROADMAP #37 was still open even though several earlier backlog items were already closed. This change removes the local login/logout surface, stops startup auth resolution from treating saved OAuth credentials as a supported path, and updates diagnostics/help to point users at ANTHROPIC_API_KEY or ANTHROPIC_AUTH_TOKEN only. While proving the change with the user-requested workspace gates, clippy surfaced additional pre-existing warning failures across the Rust workspace. Those were cleaned up in-place so the required `cargo fmt`, `cargo clippy --workspace --all-targets -- -D warnings`, and `cargo test --workspace` sequence now passes end to end. Constraint: User explicitly required full-workspace fmt/clippy/test before commit/push Constraint: Existing dirty leader worktree had to be stashed before attempted OMX team worktree launch Rejected: Keep login/logout but hide them from help \| left unsupported auth flow and saved OAuth fallback intact Rejected: Stop after ROADMAP #37 targeted tests \| did not satisfy required full-workspace verification gate Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Do not reintroduce saved OAuth as a silent Anthropic startup fallback without an explicit supported auth policy Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Remote push effects beyond origin/main update	2026-04-11 17:24:44 +00:00
Yeachan-Heo	61c01ff7da	Prevent cross-worktree session bleed during managed session resume/load ROADMAP #41 was still leaving a phantom-completion class open: managed sessions could be resumed from the wrong workspace, and the CLI/runtime paths were split between partially isolated storage and older helper flows. This squashes the verified team work into one deliverable that routes managed session operations through the per-worktree SessionStore, rejects workspace mismatches explicitly, extends lane-event taxonomy for workspace mismatch reporting, and updates the affected CLI regression fixtures/docs so the new contract is enforced without losing same- workspace legacy coverage. Constraint: Keep same-workspace legacy flat sessions readable while blocking cross-worktree misuse Constraint: No new dependencies; stay within the ROADMAP #41 changed-file scope Rejected: Leave team auto-checkpoint history as final branch state \| noisy/non-lore history for a single roadmap fix Confidence: high Scope-risk: moderate Reversibility: clean Directive: Preserve workspace_root validation on future resume/load helpers; do not reintroduce path-only fallback without equivalent mismatch checks Tested: cargo test -p runtime session_control -- --nocapture; cargo test -p rusty-claude-cli resume -- --nocapture; cargo test -p rusty-claude-cli --test cli_flags_and_config_defaults; cargo test -p rusty-claude-cli --test output_format_contract; cargo test -p rusty-claude-cli --test resume_slash_commands; cargo test --workspace --exclude compat-harness; cargo check --workspace --all-targets; git diff --check Not-tested: cargo clippy --workspace --all-targets -- -D warnings (pre-existing failures in unchanged rust/crates/rusty-claude-cli/build.rs) Related: ROADMAP #41	2026-04-11 16:08:28 +00:00
YeonGyu-Kim	56218d7d8a	feat(runtime): add session health probe for dead-session detection (ROADMAP #38 ) Implements ROADMAP #38: Dead-session opacity detection via health canary. - Add run_session_health_probe() to ConversationRuntime - Probe runs after compaction to verify tool executor responsiveness - Add last_health_check_ms field to Session for tracking - Returns structured error if session appears broken after compaction Ultraclaw droid session: ultraclaw-02-session-health Tests: runtime crate 436 passed, integration 12 passed	2026-04-12 00:33:26 +09:00
YeonGyu-Kim	2ef447bd07	feat(commands): surface broken plugin warnings in /plugins list Implements ROADMAP #40: Show warnings for broken/missing plugin manifests instead of silently failing. - Add PluginLoadFailure import - New render_plugins_report_with_failures() function - Shows ⚠️ warnings for failed plugin loads with error details - Updates ROADMAP.md to mark #40 in progress Ultraclaw droid session: ultraclaw-03-broken-plugins	2026-04-11 22:44:29 +09:00
YeonGyu-Kim	8aa1fa2cc9	docs(roadmap): file ROADMAP #61 — OPENAI_BASE_URL routing fix (done) Local provider routing: OPENAI_BASE_URL now wins over Anthropic fallback for unrecognized model names. Done at `1ecdb10`.	2026-04-10 13:00:46 +09:00
YeonGyu-Kim	1ecdb1076c	fix(api): OPENAI_BASE_URL wins over Anthropic fallback for unknown models When OPENAI_BASE_URL is set, the user explicitly configured an OpenAI-compatible endpoint (Ollama, LM Studio, vLLM, etc.). Model names like 'qwen2.5-coder:7b' or 'llama3:latest' don't match any recognized prefix, so detect_provider_kind() fell through to Anthropic — asking for Anthropic credentials even though the user clearly intended a local provider. Now: OPENAI_BASE_URL + OPENAI_API_KEY beats Anthropic env-check in the cascade. OPENAI_BASE_URL alone (no API key — common for Ollama) is a last-resort fallback before the Anthropic default. Source: MaxDerVerpeilte in #claw-code (Ollama + qwen2.5-coder:7b); traced by gaebal-gajae.	2026-04-10 12:37:39 +09:00
YeonGyu-Kim	6c07cd682d	docs(roadmap): mark #59 done, file #60 glob brace expansion (done) #59 session model persistence — done at `0f34c66` #60 glob_search brace expansion — done at `3a6c9a5`	2026-04-10 11:30:42 +09:00
YeonGyu-Kim	3a6c9a55c1	fix(tools): support brace expansion in glob_search patterns The glob crate (v0.3) does not support shell-style brace groups like {cs,uxml,uss}. Patterns such as 'Assets/*/.{cs,uxml,uss}' silently returned 0 results. Added expand_braces() to pre-expand brace groups before passing patterns to glob::glob(). Handles nested braces (e.g. src/{a,b}.{rs,toml}). Results are deduplicated via HashSet. 5 new tests: - expand_braces_no_braces - expand_braces_single_group - expand_braces_nested - expand_braces_unmatched - glob_search_with_braces_finds_files Source: user 'zero' in #claw-code (Windows, Unity project with Assets/*/.{cs,uxml,uss} glob). Traced by gaebal-gajae.	2026-04-10 11:22:38 +09:00
YeonGyu-Kim	810036bf09	test(cli): add integration test for model persistence in resumed /status New test: resumed_status_surfaces_persisted_model - Creates session with model='claude-sonnet-4-6' - Resumes with --output-format json /status - Asserts model round-trips through session metadata Resume integration tests: 11 → 12.	2026-04-10 10:31:05 +09:00
YeonGyu-Kim	0f34c66acd	feat(session): persist model in session metadata — ROADMAP #59 Add 'model: Option<String>' to Session struct. The model used is now saved in the session_meta JSONL record and surfaced in resumed /status: - JSON mode: {model: 'claude-sonnet-4-6'} instead of null - Text mode: shows actual model instead of 'restored-session' Model is set in build_runtime_with_plugin_state() before the runtime is constructed, and only when not already set (preserves model through fork/resume cycles). Backward compatible: old sessions without a model field load cleanly with model: None (shown as null in JSON, 'restored-session' in text). All workspace tests pass.	2026-04-10 10:05:42 +09:00
YeonGyu-Kim	6af0189906	docs(roadmap): file ROADMAP #58 (Windows HOME crash) and #59 (session model persistence) #58 Windows startup crash from missing HOME env var — done at `b95d330`. #59 Session metadata does not persist the model used — open.	2026-04-10 09:00:41 +09:00
YeonGyu-Kim	b95d330310	fix(startup): fall back to USERPROFILE when HOME is not set (Windows) On Windows, HOME is often unset. The CLI crashed at startup with 'error: io error: HOME is not set' because three paths only checked HOME: - config_home_dir() in tools crate (config/settings loading) - credentials_home_dir() in runtime crate (OAuth credentials) - detect_broad_cwd() in CLI (CWD-is-home-dir check) - skill lookup roots in tools crate All now fall through to USERPROFILE when HOME is absent. Error message updated to suggest USERPROFILE or CLAW_CONFIG_HOME on Windows. Source: MaxDerVerpeilte in #claw-code (Windows user, 2026-04-10).	2026-04-10 08:33:35 +09:00
YeonGyu-Kim	74311cc511	test(cli): add 5 integration tests for resume JSON parity New integration tests covering recent JSON parity work: - resumed_version_command_emits_structured_json - resumed_export_command_emits_structured_json - resumed_help_command_emits_structured_json - resumed_no_command_emits_restored_json - resumed_stub_command_emits_not_implemented_json Prevents regression on ROADMAP #54 (stub command error), #55 (session list), #56 (--resume no-command JSON), #57 (session load errors). Resume integration tests: 6 → 11.	2026-04-10 08:03:17 +09:00
YeonGyu-Kim	6ae8850d45	fix(api): silence dead_code warning and remove duplicated #[test] attr - Add #[allow(dead_code)] on test-only Delta struct (content field used for deserialization but not read in assertion) - Remove duplicated #[test] attribute on assistant_message_without_tool_calls_omits_tool_calls_field Zero warnings in cargo test --workspace.	2026-04-10 07:33:22 +09:00
YeonGyu-Kim	ef9439d772	docs(roadmap): file ROADMAP #54-#57 from 2026-04-10 dogfood cycle #54 circular 'Did you mean /X?' for spec commands with no parse arm (done) #55 /session list unsupported in resume mode (done) #56 --resume no-command ignores --output-format json (done) #57 session load errors bypass --output-format json (done)	2026-04-10 07:04:21 +09:00
YeonGyu-Kim	4f670e5513	fix(cli): emit JSON for --resume with no command in --output-format json mode claw --output-format json --resume <session> (no command) was printing: 'Restored session from <path> (N messages).' to stdout as prose, regardless of output format. Now emits: {"kind":"restored","session_id":"...","path":"...","message_count":N} 159 CLI tests pass.	2026-04-10 06:31:16 +09:00
YeonGyu-Kim	8dcf10361f	fix(cli): implement /session list in resume mode — ROADMAP #21 partial /session list previously returned 'unsupported resumed slash command' in --output-format json --resume mode. It only reads the sessions directory so does not need a live runtime session. Adds a Session{action:"list"} arm in run_resume_command() before the unsupported catchall. Emits: {kind:session_list, sessions:[...ids], active:<current-session-id>} 159 CLI tests pass.	2026-04-10 06:03:29 +09:00
YeonGyu-Kim	cf129c8793	fix(cli): emit JSON error when session fails to load in --output-format json mode 'failed to restore session' errors from both the path-resolution step and the JSONL-load step now check output_format and emit: {"type":"error","error":"failed to restore session: <detail>"} instead of bare eprintln prose. Covers: session not found, corrupt JSONL, permission errors.	2026-04-10 05:01:56 +09:00
YeonGyu-Kim	c0248253ac	fix(cli): remove 'stats' from STUB_COMMANDS — it is implemented /stats was accidentally listed in STUB_COMMANDS (both in the original list and overlooked in `1e14d59`). Since SlashCommand::Stats is fully implemented with REPL and resume dispatch, it should not be intercepted as unimplemented. /tokens and /cache alias to Stats and were already working correctly. /stats now works again in all modes.	2026-04-10 04:32:05 +09:00

1 2 3 4 5 ...

781 Commits