claw-code

mirror of https://github.com/instructkr/claw-code.git synced 2026-06-10 08:22:14 +08:00

Author	SHA1	Message	Date
Yeachan-Heo	b5e30e2975	Make completed review lanes emit machine-readable verdicts The next repo-local sweep target was ROADMAP #67: scoped review lanes could stop with prose-only output, leaving downstream consumers to infer approval or rejection from later chatter. The fix teaches completed lane persistence to recognize review-style `APPROVE`/`REJECT`/`BLOCKED` results, attach structured verdict metadata to `lane.finished`, and keep ordinary non-review lanes on the existing quality-floor path. Constraint: Preserve the existing non-review lane summary path while enriching only review-style completions Rejected: Add a brand-new lane event type just for review results \| unnecessary when `lane.finished` already carries structured metadata and downstream consumers can read it there Confidence: high Scope-risk: narrow Reversibility: clean Directive: If review verdict parsing changes later, update `extract_review_outcome`, the finished-event payload fields, and the review-lane regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External consumers that may still ignore `lane.finished.data.reviewVerdict`	2026-04-12 08:49:40 +00:00
Yeachan-Heo	dbc2824a3e	Keep latest session selection tied to real session recency The next repo-local sweep target was ROADMAP #72: the `latest` managed-session alias could depend on filesystem mtime before the session's own persisted recency markers, which made the selection path vulnerable to coarse or misleading file timestamps. The fix promotes `updated_at_ms` into the summary/order path, keeps CLI wrappers in sync, and locks the mtime-vs-session-recency case with regression coverage. Constraint: Preserve existing managed-session storage layout while changing only the ordering signal Rejected: Keep sorting by filesystem mtime and just sleep longer in tests \| hides the semantic ordering bug instead of fixing it Confidence: high Scope-risk: narrow Reversibility: clean Directive: Any future managed-session ordering change must keep runtime and CLI summary structs aligned on the same recency fields Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Cross-filesystem behavior where persisted session JSON cannot be read and fallback ordering uses mtime only	2026-04-12 07:49:32 +00:00
Yeachan-Heo	f309ff8642	Stop repo lanes from executing the wrong task payload The next repo-local sweep target was ROADMAP #71: a claw-code lane accepted an unrelated KakaoTalk/image-analysis prompt even though the lane itself was supposed to be repo-scoped work. This extends the existing prompt-misdelivery guardrail with an optional structured task receipt so worker boot can reject visible wrong-task context before the lane continues executing. Constraint: Keep the fix inside the existing worker_boot / WorkerSendPrompt control surface instead of inventing a new external OMX-only protocol Rejected: Treat wrong-task receipts as generic shell misdelivery \| loses the expected-vs-observed task context needed to debug contaminated lanes Confidence: high Scope-risk: narrow Reversibility: clean Directive: If task-receipt fields change later, update the WorkerSendPrompt schema, worker payload serialization, and wrong-task regression together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External orchestrators that have not yet started populating the optional task_receipt field	2026-04-12 07:00:07 +00:00
Yeachan-Heo	3b806702e7	Make the CLI point users at the real install source The next repo-local backlog item was ROADMAP #70: users could mistake third-party pages or the deprecated `cargo install claw-code` path for the official install route. The CLI now surfaces the source of truth directly in `claw doctor` and `claw --help`, and the roadmap closeout records the change. Constraint: Keep the fix inside repo-local Rust CLI surfaces instead of relying on docs alone Rejected: Close #70 with README-only wording \| the bug was user-facing CLI ambiguity, so the warning needed to appear in runtime help/doctor output Confidence: high Scope-risk: narrow Reversibility: clean Directive: If install guidance changes later, update both the doctor check payload and the help-text warning together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Third-party websites outside this repo that may still present stale install instructions	2026-04-12 04:50:03 +00:00
Yeachan-Heo	26b89e583f	Keep completed lanes from ending on mushy stop summaries The next repo-local sweep target was ROADMAP #69: completed lane runs could persist vague control text like “commit push everyting, keep sweeping $ralph”, which made downstream stop summaries operationally useless. The fix adds a lane-finished quality floor that preserves strong summaries, rewrites empty/control-only/too- short-without-context summaries into a contextual fallback, and records structured metadata explaining when the fallback fired. Constraint: Keep legitimate concise lane summaries intact while improving only low-signal completions Rejected: Blanket-rewrite every completed summary into a templated sentence \| would erase useful model-authored detail from good lane outputs Confidence: high Scope-risk: narrow Reversibility: clean Directive: If lane-finished summary heuristics change later, update the structured `qualityFloorApplied/rawSummary/reasons/wordCount` contract and its regression tests together Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: External OMX consumers that may still ignore the new lane.finished data payload	2026-04-12 03:23:39 +00:00
Yeachan-Heo	4f83a81cf6	Make dump-manifests recoverable outside the inferred build tree The backlog sweep found that the user-cited #21-#23 items were already closed, and the next real pain point was `claw dump-manifests` failing without a direct way to point at the upstream manifest source. This adds an explicit `--manifests-dir` path, upgrades the failure messages to say whether the source root or required files are missing, and updates the ROADMAP closeout to reflect that #45 is now fixed. Constraint: Preserve existing dump-manifests behavior when no explicit override is supplied Rejected: Require CLAUDE_CODE_UPSTREAM for every invocation \| breaks existing build-tree workflows and is unnecessarily rigid Confidence: high Scope-risk: narrow Reversibility: clean Directive: Keep manifest-source override guidance centralized so future error-path edits do not drift Tested: cargo fmt --all; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; architect review APPROVE Not-tested: Manual invocation against every legacy env-based manifest lookup layout	2026-04-12 02:57:11 +00:00
Yeachan-Heo	b825713db3	Retire the stale slash-command backlog item without breaking verification ROADMAP #39 was stale: current main already hides the unimplemented slash commands from the help/completion surfaces that triggered the original report, so the backlog entry should be marked done with current evidence instead of staying open forever. While rerunning the user's required Rust verification gates on the exact commit we planned to push, clippy exposed duplicate and unused imports in the plugin state-isolation files. Folding those cleanup fixes into the same closeout keeps the proof honest and restores a green workspace before the backlog retirement lands. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before push Rejected: Push the roadmap-only closeout without fixing the workspace \| would violate the required verification gate and leave main red Confidence: high Scope-risk: narrow Reversibility: clean Directive: Re-run the full Rust workspace gates on the exact commit you intend to push when retiring stale roadmap items Tested: cargo fmt; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No manual interactive REPL completion/help smoke test beyond the existing automated coverage	2026-04-12 00:59:29 +00:00
YeonGyu-Kim	06d1b8ac87	docs(roadmap): add #68 — internal reinjection/resume path opacity OMX lanes leaking internal control prose like [OMX_TMUX_INJECT] instead of operator-meaningful state. Adding requirement for structured recovery/reinject events with clear cause, preserved state, and target lane info. Also fixes merge conflict in test_isolation.rs. Source: gaebal-gajae dogfood analysis 2026-04-12	2026-04-12 08:53:10 +09:00
Yeachan-Heo	264fdc214e	Retire the stale bare-skill dispatch backlog item ROADMAP #36 remained open even though current main already dispatches bare skill names in the REPL through skill resolution instead of forwarding them to the model. This change adds a direct regression test for that behavior and marks the backlog item done with fresh verification evidence. Constraint: User required fresh cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before closeout Rejected: Leave #36 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #36 only with a fresh repro showing a listed project skill still falls through to plain prompt handling on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No interactive manual REPL session beyond the new bare-skill unit coverage	2026-04-11 22:50:28 +00:00
Yeachan-Heo	2d5f836988	Retire the stale broken-plugin warning backlog item ROADMAP #40 was still listed as open even though current main already keeps valid plugins visible while surfacing broken-plugin load failures. This change adds a direct command-surface regression test for the warning block and marks #40 done with fresh verification evidence. Constraint: User required fresh cargo fmt/clippy/test evidence before closing any backlog item Rejected: Leave #40 open because the implementation already existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #40 only with a fresh repro showing broken installed plugins are hidden or warning-free on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace; cargo test -p plugins plugin_registry_report_collects_load_failures_without_dropping_valid_plugins -- --nocapture; cargo test -p plugins installed_plugin_registry_report_collects_load_failures_from_install_root -- --nocapture Not-tested: No interactive manual /plugins list run beyond automated command-layer rendering coverage	2026-04-11 19:47:21 +00:00
Yeachan-Heo	a7b1fef176	Keep the rebased workspace green after the backlog closeout The ROADMAP #38 closeout was rebased onto a moving main branch. That pulled in new workspace files whose clippy/rustfmt fixes were required for the exact verification gate the user asked for. This follow-up records those remaining cleanups so the pushed branch matches the green tree that was actually tested. Constraint: The user-required full-workspace fmt/clippy/test sequence had to stay green after rebasing onto newer origin/main Rejected: Leave the rebase cleanup uncommitted locally \| working tree would stay dirty and the pushed branch would not match the verified code Confidence: high Scope-risk: narrow Reversibility: clean Directive: When rebasing onto a moving main, commit any gate-fixing follow-up so pushed history matches the verified tree Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No additional behavior beyond the already-green verification sweep	2026-04-11 18:52:48 +00:00
Yeachan-Heo	12d955ac26	Close the stale dead-session opacity backlog item with verified probe coverage ROADMAP #38 stayed open even though the runtime already had a post-compaction session-health probe. This change adds direct regression tests for that health probe behavior and marks the roadmap item done. While re-running the required workspace verification after a remote rebase, a small set of upstream clippy / compile issues in plugins and test-isolation code also had to be repaired so the user-requested full fmt/clippy/test sequence could pass on the rebased main. Constraint: User required cargo fmt, cargo clippy --workspace --all-targets -- -D warnings, and cargo test --workspace before commit/push Constraint: Remote main advanced during execution, so the change had to be rebased and re-verified before push Rejected: Leave #38 open because the implementation pre-existed \| keeps the immediate backlog inaccurate and invites duplicate work Confidence: high Scope-risk: moderate Reversibility: clean Directive: Reopen #38 only with a fresh compaction-vs-broken-surface repro on current main Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No live long-running dogfood session replay beyond the new runtime regression tests	2026-04-11 18:52:02 +00:00
Yeachan-Heo	257aeb82dd	Retire the stale dead-session opacity backlog item with regression proof ROADMAP #38 no longer reflects current main. The runtime already runs a post-compaction session-health probe, but the backlog lacked explicit regression proof. This change adds focused tests for the two important behaviors: a broken tool surface aborts a compacted session with a targeted error, while a freshly compacted empty session does not false-positive as dead. With that proof in place, the roadmap item can be marked done. Constraint: User required fresh cargo fmt/clippy/test evidence before closing any backlog item Rejected: Leave #38 open because the implementation already existed \| backlog stays stale and invites duplicate work Confidence: high Scope-risk: narrow Reversibility: clean Directive: Reopen #38 only with a fresh same-turn repro that bypasses the current health-probe gate Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: No live long-running dogfood session replay beyond existing automated coverage	2026-04-11 18:47:37 +00:00
YeonGyu-Kim	16b9febdae	feat: ultraclaw droid batch — ROADMAP #41 test isolation + #50 PowerShell permissions Merged late-arriving droid output from 10 parallel ultraclaw sessions. ROADMAP #41 — Test isolation for plugin regression checks: - Add test_isolation.rs module with env_lock() for test environment isolation - Redirect HOME/XDG_CONFIG_HOME/XDG_DATA_HOME to unique temp dirs per test - Prevent host ~/.claude/plugins/ from bleeding into test runs - Auto-cleanup temp directories on drop via RAII pattern - Tests: 39 plugin tests passing ROADMAP #50 — PowerShell workspace-aware permissions: - Add is_safe_powershell_command() for command-level permission analysis - Add is_path_within_workspace() for workspace boundary validation - Classify read-only vs write-requiring bash commands (60+ commands) - Dynamic permission requirements based on command type and target path - Tests: permission enforcer and workspace boundary tests passing Additional improvements: - runtime/src/permission_enforcer.rs: Dynamic permission enforcement layer - check_with_required_mode() for dynamically-determined permissions - 60+ read-only command patterns (cat, find, grep, cargo, git, jq, yq, etc.) - Workspace-path detection for safe commands - compat-harness/src/lib.rs: Compat harness updates for permission testing - rusty-claude-cli/src/main.rs: CLI integration for permission modes - plugins/src/lib.rs: Updated imports for test isolation module Total: +410 lines across 5 files Workspace tests: 448+ passed Droid source: ultraclaw-04-test-isolation, ultraclaw-08-powershell-permissions Ultraclaw total: 4 ROADMAP items committed (38, 40, 41, 50)	2026-04-12 03:06:24 +09:00
Yeachan-Heo	0082bf1640	Align auth docs with the removed login/logout surface The ROADMAP #37 code path was correct, but the Rust and usage guides still advertised `claw login` / `claw logout` and OAuth-login wording after the command surface had been removed. This follow-up updates both docs to point users at `ANTHROPIC_API_KEY` or `ANTHROPIC_AUTH_TOKEN` only and removes the stale command examples. Constraint: Prior follow-up review rejected the closeout until user-facing auth docs matched the landed behavior Rejected: Leave docs stale because runtime behavior was already correct \| contradicts shipped CLI and re-opens support confusion Confidence: high Scope-risk: narrow Reversibility: clean Directive: When auth policy changes, update both rust/README.md and USAGE.md in the same change as the code surface Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: External rendered-doc consumers beyond repository markdown	2026-04-11 17:28:47 +00:00
Yeachan-Heo	124e8661ed	Remove the deprecated Claude subscription login path and restore a green Rust workspace ROADMAP #37 was still open even though several earlier backlog items were already closed. This change removes the local login/logout surface, stops startup auth resolution from treating saved OAuth credentials as a supported path, and updates diagnostics/help to point users at ANTHROPIC_API_KEY or ANTHROPIC_AUTH_TOKEN only. While proving the change with the user-requested workspace gates, clippy surfaced additional pre-existing warning failures across the Rust workspace. Those were cleaned up in-place so the required `cargo fmt`, `cargo clippy --workspace --all-targets -- -D warnings`, and `cargo test --workspace` sequence now passes end to end. Constraint: User explicitly required full-workspace fmt/clippy/test before commit/push Constraint: Existing dirty leader worktree had to be stashed before attempted OMX team worktree launch Rejected: Keep login/logout but hide them from help \| left unsupported auth flow and saved OAuth fallback intact Rejected: Stop after ROADMAP #37 targeted tests \| did not satisfy required full-workspace verification gate Confidence: medium Scope-risk: moderate Reversibility: clean Directive: Do not reintroduce saved OAuth as a silent Anthropic startup fallback without an explicit supported auth policy Tested: cargo fmt --all --check; cargo clippy --workspace --all-targets -- -D warnings; cargo test --workspace Not-tested: Remote push effects beyond origin/main update	2026-04-11 17:24:44 +00:00
Yeachan-Heo	61c01ff7da	Prevent cross-worktree session bleed during managed session resume/load ROADMAP #41 was still leaving a phantom-completion class open: managed sessions could be resumed from the wrong workspace, and the CLI/runtime paths were split between partially isolated storage and older helper flows. This squashes the verified team work into one deliverable that routes managed session operations through the per-worktree SessionStore, rejects workspace mismatches explicitly, extends lane-event taxonomy for workspace mismatch reporting, and updates the affected CLI regression fixtures/docs so the new contract is enforced without losing same- workspace legacy coverage. Constraint: Keep same-workspace legacy flat sessions readable while blocking cross-worktree misuse Constraint: No new dependencies; stay within the ROADMAP #41 changed-file scope Rejected: Leave team auto-checkpoint history as final branch state \| noisy/non-lore history for a single roadmap fix Confidence: high Scope-risk: moderate Reversibility: clean Directive: Preserve workspace_root validation on future resume/load helpers; do not reintroduce path-only fallback without equivalent mismatch checks Tested: cargo test -p runtime session_control -- --nocapture; cargo test -p rusty-claude-cli resume -- --nocapture; cargo test -p rusty-claude-cli --test cli_flags_and_config_defaults; cargo test -p rusty-claude-cli --test output_format_contract; cargo test -p rusty-claude-cli --test resume_slash_commands; cargo test --workspace --exclude compat-harness; cargo check --workspace --all-targets; git diff --check Not-tested: cargo clippy --workspace --all-targets -- -D warnings (pre-existing failures in unchanged rust/crates/rusty-claude-cli/build.rs) Related: ROADMAP #41	2026-04-11 16:08:28 +00:00
YeonGyu-Kim	56218d7d8a	feat(runtime): add session health probe for dead-session detection (ROADMAP #38 ) Implements ROADMAP #38: Dead-session opacity detection via health canary. - Add run_session_health_probe() to ConversationRuntime - Probe runs after compaction to verify tool executor responsiveness - Add last_health_check_ms field to Session for tracking - Returns structured error if session appears broken after compaction Ultraclaw droid session: ultraclaw-02-session-health Tests: runtime crate 436 passed, integration 12 passed	2026-04-12 00:33:26 +09:00
YeonGyu-Kim	2ef447bd07	feat(commands): surface broken plugin warnings in /plugins list Implements ROADMAP #40: Show warnings for broken/missing plugin manifests instead of silently failing. - Add PluginLoadFailure import - New render_plugins_report_with_failures() function - Shows ⚠️ warnings for failed plugin loads with error details - Updates ROADMAP.md to mark #40 in progress Ultraclaw droid session: ultraclaw-03-broken-plugins	2026-04-11 22:44:29 +09:00
YeonGyu-Kim	1ecdb1076c	fix(api): OPENAI_BASE_URL wins over Anthropic fallback for unknown models When OPENAI_BASE_URL is set, the user explicitly configured an OpenAI-compatible endpoint (Ollama, LM Studio, vLLM, etc.). Model names like 'qwen2.5-coder:7b' or 'llama3:latest' don't match any recognized prefix, so detect_provider_kind() fell through to Anthropic — asking for Anthropic credentials even though the user clearly intended a local provider. Now: OPENAI_BASE_URL + OPENAI_API_KEY beats Anthropic env-check in the cascade. OPENAI_BASE_URL alone (no API key — common for Ollama) is a last-resort fallback before the Anthropic default. Source: MaxDerVerpeilte in #claw-code (Ollama + qwen2.5-coder:7b); traced by gaebal-gajae.	2026-04-10 12:37:39 +09:00
YeonGyu-Kim	3a6c9a55c1	fix(tools): support brace expansion in glob_search patterns The glob crate (v0.3) does not support shell-style brace groups like {cs,uxml,uss}. Patterns such as 'Assets/*/.{cs,uxml,uss}' silently returned 0 results. Added expand_braces() to pre-expand brace groups before passing patterns to glob::glob(). Handles nested braces (e.g. src/{a,b}.{rs,toml}). Results are deduplicated via HashSet. 5 new tests: - expand_braces_no_braces - expand_braces_single_group - expand_braces_nested - expand_braces_unmatched - glob_search_with_braces_finds_files Source: user 'zero' in #claw-code (Windows, Unity project with Assets/*/.{cs,uxml,uss} glob). Traced by gaebal-gajae.	2026-04-10 11:22:38 +09:00
YeonGyu-Kim	810036bf09	test(cli): add integration test for model persistence in resumed /status New test: resumed_status_surfaces_persisted_model - Creates session with model='claude-sonnet-4-6' - Resumes with --output-format json /status - Asserts model round-trips through session metadata Resume integration tests: 11 → 12.	2026-04-10 10:31:05 +09:00
YeonGyu-Kim	0f34c66acd	feat(session): persist model in session metadata — ROADMAP #59 Add 'model: Option<String>' to Session struct. The model used is now saved in the session_meta JSONL record and surfaced in resumed /status: - JSON mode: {model: 'claude-sonnet-4-6'} instead of null - Text mode: shows actual model instead of 'restored-session' Model is set in build_runtime_with_plugin_state() before the runtime is constructed, and only when not already set (preserves model through fork/resume cycles). Backward compatible: old sessions without a model field load cleanly with model: None (shown as null in JSON, 'restored-session' in text). All workspace tests pass.	2026-04-10 10:05:42 +09:00
YeonGyu-Kim	b95d330310	fix(startup): fall back to USERPROFILE when HOME is not set (Windows) On Windows, HOME is often unset. The CLI crashed at startup with 'error: io error: HOME is not set' because three paths only checked HOME: - config_home_dir() in tools crate (config/settings loading) - credentials_home_dir() in runtime crate (OAuth credentials) - detect_broad_cwd() in CLI (CWD-is-home-dir check) - skill lookup roots in tools crate All now fall through to USERPROFILE when HOME is absent. Error message updated to suggest USERPROFILE or CLAW_CONFIG_HOME on Windows. Source: MaxDerVerpeilte in #claw-code (Windows user, 2026-04-10).	2026-04-10 08:33:35 +09:00
YeonGyu-Kim	74311cc511	test(cli): add 5 integration tests for resume JSON parity New integration tests covering recent JSON parity work: - resumed_version_command_emits_structured_json - resumed_export_command_emits_structured_json - resumed_help_command_emits_structured_json - resumed_no_command_emits_restored_json - resumed_stub_command_emits_not_implemented_json Prevents regression on ROADMAP #54 (stub command error), #55 (session list), #56 (--resume no-command JSON), #57 (session load errors). Resume integration tests: 6 → 11.	2026-04-10 08:03:17 +09:00
YeonGyu-Kim	6ae8850d45	fix(api): silence dead_code warning and remove duplicated #[test] attr - Add #[allow(dead_code)] on test-only Delta struct (content field used for deserialization but not read in assertion) - Remove duplicated #[test] attribute on assistant_message_without_tool_calls_omits_tool_calls_field Zero warnings in cargo test --workspace.	2026-04-10 07:33:22 +09:00
YeonGyu-Kim	4f670e5513	fix(cli): emit JSON for --resume with no command in --output-format json mode claw --output-format json --resume <session> (no command) was printing: 'Restored session from <path> (N messages).' to stdout as prose, regardless of output format. Now emits: {"kind":"restored","session_id":"...","path":"...","message_count":N} 159 CLI tests pass.	2026-04-10 06:31:16 +09:00
YeonGyu-Kim	8dcf10361f	fix(cli): implement /session list in resume mode — ROADMAP #21 partial /session list previously returned 'unsupported resumed slash command' in --output-format json --resume mode. It only reads the sessions directory so does not need a live runtime session. Adds a Session{action:"list"} arm in run_resume_command() before the unsupported catchall. Emits: {kind:session_list, sessions:[...ids], active:<current-session-id>} 159 CLI tests pass.	2026-04-10 06:03:29 +09:00
YeonGyu-Kim	cf129c8793	fix(cli): emit JSON error when session fails to load in --output-format json mode 'failed to restore session' errors from both the path-resolution step and the JSONL-load step now check output_format and emit: {"type":"error","error":"failed to restore session: <detail>"} instead of bare eprintln prose. Covers: session not found, corrupt JSONL, permission errors.	2026-04-10 05:01:56 +09:00
YeonGyu-Kim	c0248253ac	fix(cli): remove 'stats' from STUB_COMMANDS — it is implemented /stats was accidentally listed in STUB_COMMANDS (both in the original list and overlooked in `1e14d59`). Since SlashCommand::Stats is fully implemented with REPL and resume dispatch, it should not be intercepted as unimplemented. /tokens and /cache alias to Stats and were already working correctly. /stats now works again in all modes.	2026-04-10 04:32:05 +09:00
YeonGyu-Kim	1e14d59a71	fix(cli): stop circular 'Did you mean /X?' for spec commands with no parse arm 23 spec-registered commands had no parse arm in validate_slash_command_input, causing the circular error 'Unknown slash command: /X — Did you mean /X?' when users typed them in --resume mode. Two fixes: 1. Add the 23 confirmed parse-armless commands to STUB_COMMANDS (excluded from REPL completions and help output). 2. In resume dispatch, intercept STUB_COMMANDS before SlashCommand::parse and emit a clean '{error: "/X is not yet implemented in this build"}' instead of the confusing error from the Err parse path. Affected: /allowed-tools, /bookmarks, /workspace, /reasoning, /budget, /rate-limit, /changelog, /diagnostics, /metrics, /tool-details, /focus, /unfocus, /pin, /unpin, /language, /profile, /max-tokens, /temperature, /system-prompt, /notifications, /telemetry, /env, /project, plus ~40 additional unreachable spec names. 159 CLI tests pass.	2026-04-10 04:05:41 +09:00
YeonGyu-Kim	11e2353585	fix(cli): JSON parity for /export and /agents in resume mode /export now emits: {kind:export, file:<path>, message_count:<n>} /agents now emits: {kind:agents, text:<agents report>} Previously both returned json:None and fell through to prose output even in --output-format json --resume mode. 159 CLI tests pass.	2026-04-10 03:32:24 +09:00
YeonGyu-Kim	0845705639	fix(tests): update test assertions for null model in resume /status; drop unused import Two integration tests expected 'model':'restored-session' in the /status JSON output but `dc4fa55` changed resume mode to emit null for model. Updated both assertions to assert model is null (correct behavior). Also remove unused 'estimate_session_tokens' import in compact.rs tests (surfaced as warning in CI, kept failing CI green noise). All workspace tests pass.	2026-04-10 03:21:58 +09:00
YeonGyu-Kim	316864227c	fix(cli): JSON parity for /help and /diff in resume mode /help now emits: {kind:help, text:<full help text>} /diff now emits: - no git repo: {kind:diff, result:no_git_repo, detail:...} - clean tree: {kind:diff, result:clean, staged:'', unstaged:''} - changes: {kind:diff, result:changes, staged:..., unstaged:...} Previously both returned json:None and fell through to prose output even in --output-format json --resume mode. 159 CLI tests pass.	2026-04-10 03:02:00 +09:00
YeonGyu-Kim	c8cac7cae8	fix(cli): doctor config check hides non-existent candidate paths Before: doctor reported 'loaded 0/5' and listed 5 'Discovered file' entries for paths that don't exist on disk. This looked like 5 files failed to load, when in fact they are just standard search locations. After: only paths that actually exist on disk are shown as 'Discovered file'. 'loaded N/M' denominator is now the count of present files, not candidate paths. With no config files present: 'loaded 0/0' + 'Discovered files <none> (defaults active)'. 159 CLI tests pass.	2026-04-10 02:32:47 +09:00
YeonGyu-Kim	dc4fa55d64	fix(cli): /status JSON emits null model and correct session_id in resume mode Two bugs in --output-format json --resume /status: 1. 'model' field emitted 'restored-session' (a run-mode label) instead of the actual model or null. Fixed: status_json_value now takes Option<&str> for model; resume path passes None; live REPL path passes Some(model). 2. 'session_id' extracted parent dir name ('sessions') instead of the file stem. Session files are session-<id>.jsonl directly under .claw/sessions/, not in a subdirectory. Fixed: extract file_stem() instead of parent().file_name(). 159 CLI tests pass.	2026-04-10 02:03:14 +09:00
YeonGyu-Kim	a3d0c9e5e7	fix(api): sanitize orphaned tool messages at request-building layer Adds sanitize_tool_message_pairing() called from build_chat_completion_request() after translate_message() runs. Drops any role:"tool" message whose immediately-preceding non-tool message is role:"assistant" but has no tool_calls entry matching the tool_call_id. This is the second layer of the tool-pairing invariant defense: - `6e301c8`: compaction boundary fix (producer layer) - this commit: request-builder sanitizer (sender layer) Together these close the 400-error loop for resumed/compacted multi-turn tool sessions on OpenAI-compatible backends. Sanitization only fires when preceding message is role:assistant (not user/system) to avoid dropping valid translation artifacts from mixed user-message content blocks. Regression tests: sanitize_drops_orphaned_tool_messages covers valid pair, orphaned tool (no tool_calls in preceding assistant), mismatched id, and two tool results both referencing the same assistant turn. 116 api + 159 CLI + 431 runtime tests pass. Fmt clean.	2026-04-10 01:35:00 +09:00
YeonGyu-Kim	78dca71f3f	fix(cli): JSON parity for /compact and /clear in resume mode /compact now emits: {kind:compact, skipped, removed_messages, kept_messages} /clear now emits: {kind:clear, previous_session_id, new_session_id, backup, session_file} /clear (no --confirm) now emits: {kind:error, error:..., hint:...} Previously both returned json:None and fell through to prose output even in --output-format json --resume mode. 159 CLI tests pass.	2026-04-10 01:31:21 +09:00
YeonGyu-Kim	d95149b347	fix(cli): surface resolved path in dump-manifests error — ROADMAP #45 partial Before: error: failed to extract manifests: No such file or directory (os error 2) After: error: failed to extract manifests: No such file or directory (os error 2) looked in: /Users/yeongyu/clawd/claw-code/rust The workspace_dir is computed from CARGO_MANIFEST_DIR at compile time and only resolves correctly when running from the build tree. Surfacing the resolved path lets users understand immediately why it fails outside the build context. ROADMAP #45 root cause (build-tree-only path) remains open.	2026-04-10 01:01:53 +09:00
YeonGyu-Kim	47aa1a57ca	fix(cli): surface command name in 'not yet implemented' REPL message Add SlashCommand::slash_name() to the commands crate — returns the canonical '/name' string for any variant. Used in the REPL's stub catch-all arm to surface which command was typed instead of printing the opaque 'Command registered but not yet implemented.' Before: typing /rewind → 'Command registered but not yet implemented.' After: typing /rewind → '/rewind is not yet implemented in this build.' Also update the compacts_sessions_via_slash_command test assertion to tolerate the boundary-guard fix from `6e301c8` (removed_message_count can be 1 or 2 depending on whether the boundary falls on a tool-result pair). All 159 CLI + 431 runtime + 115 api tests pass.	2026-04-10 00:39:16 +09:00
YeonGyu-Kim	6e301c8bb3	fix(runtime): prevent orphaned tool-result at compaction boundary; /cost JSON Two fixes: 1. compact.rs: When the compaction boundary falls at the start of a tool-result turn, the preceding assistant turn with ToolUse would be removed — leaving an orphaned role:tool message with no preceding assistant tool_calls. OpenAI-compat backends reject this with 400. Fix: after computing raw_keep_from, walk the boundary back until the first preserved message is not a ToolResult (or its preceding assistant has been included). Regression test added: compaction_does_not_split_tool_use_tool_result_pair. Source: gaebal-gajae multi-turn tool-call 400 repro 2026-04-09. 2. /cost resume: add JSON output: {kind:cost, input_tokens, output_tokens, cache_creation_input_tokens, cache_read_input_tokens, total_tokens} 159 CLI + 431 runtime tests pass. Fmt clean.	2026-04-10 00:13:45 +09:00
YeonGyu-Kim	7587f2c1eb	fix(cli): JSON parity for /memory and /providers in resume mode Two gaps closed: 1. /memory (resume): json field was None, emitting prose regardless of --output-format json. Now emits: {kind:memory, cwd, instruction_files:N, files:[{path,lines,preview}...]} 2. /providers (resume): had a spec entry but no parse arm, producing the circular 'Unknown slash command: /providers — Did you mean /providers'. Added 'providers' as an alias for 'doctor' in the parse match so /providers dispatches to the same structured diagnostic output. 3. /doctor (resume): also wired json_value() so --output-format json returns the structured doctor report instead of None. Continues ROADMAP #26 resumed-command JSON parity track. 159 CLI tests pass, fmt clean.	2026-04-09 23:35:25 +09:00
YeonGyu-Kim	ed42f8f298	fix(api): surface provider error in SSE stream frames (companion to `ff416ff`) Same fix as `ff416ff` but for the streaming path. Some backends embed an error JSON object in an SSE data: frame: data: {"error":{"message":"context too long","code":400}} parse_sse_frame() was attempting to deserialize this as ChatCompletionChunk and failing with 'missing field' / 'invalid type', hiding the actual backend error message. Fix: check for an 'error' key before full chunk deserialization, same as the non-streaming path in `ff416ff`. Symmetric pair: - `ff416ff`: non-streaming path (response body) - this: streaming path (SSE data: frame) 115 api + 159 CLI tests pass. Fmt clean.	2026-04-09 23:03:33 +09:00
YeonGyu-Kim	ff416ff3e7	fix(api): surface provider error body before attempting completion parse When a local/proxy OpenAI-compatible backend returns an error object: {"error":{"message":"...","type":"...","code":...}} claw was trying to deserialize it as a ChatCompletionResponse and failing with the cryptic 'failed to parse OpenAI response: missing field id', completely hiding the actual backend error message. Fix: before full deserialization, check if the parsed JSON has an 'error' key and promote it directly to ApiError::Api so the user sees the real error (e.g. 'The number of tokens to keep from the initial prompt is greater than the context length'). Source: devilayu in #claw-code 2026-04-09 — local LM Studio context limit error was invisible; user saw 'missing field id' instead. 159 CLI + 115 api tests pass. Fmt clean.	2026-04-09 22:33:07 +09:00
YeonGyu-Kim	6ac7d8cd46	fix(api): omit tool_calls field from assistant messages when empty When serializing a multi-turn conversation for the OpenAI-compatible path, assistant messages with no tool calls were always emitting 'tool_calls: []'. Some providers reject requests where a prior assistant turn carries an explicit empty tool_calls array (400 on subsequent turns after a plain text assistant response). Fix: only include 'tool_calls' in the serialized assistant message when the vec is non-empty. Empty case omits the field entirely. This is a companion fix to `fd7aade` (null tool_calls in stream delta). The two bugs are symmetric: `fd7aade` handled inbound null -> empty vec; this handles outbound empty vec -> field omitted. Two regression tests added: - assistant_message_without_tool_calls_omits_tool_calls_field - assistant_message_with_tool_calls_includes_tool_calls_field 115 api tests pass. Fmt clean. Source: gaebal-gajae repro 2026-04-09 (400 on multi-turn, companion to null tool_calls stream-delta fix).	2026-04-09 22:06:25 +09:00
YeonGyu-Kim	7ec6860d9a	fix(cli): emit JSON for /config in --output-format json --resume mode /config resumed returned json:None, falling back to prose output even in --output-format json mode. Adds render_config_json() that produces: { "kind": "config", "cwd": "...", "loaded_files": N, "merged_keys": N, "files": [{"path":"...","source":"user\|project\|local","loaded":true\|false}, ...] } Wires it into the SlashCommand::Config resume arm alongside the existing prose render. Continues the resumed-command JSON parity track (ROADMAP #26). 159 CLI tests pass, fmt clean.	2026-04-09 22:03:11 +09:00
YeonGyu-Kim	0e12d15daf	fix(cli): add --allow-broad-cwd; require confirmation or flag in broad-CWD mode	2026-04-09 21:55:22 +09:00
YeonGyu-Kim	fd7aade5b5	fix(api): tolerate null tool_calls in OpenAI-compat stream delta chunks Some OpenAI-compatible providers emit 'tool_calls: null' in streaming delta chunks instead of omitting the field or using an empty array: "delta": {"content":"","function_call":null,"tool_calls":null} serde's #[serde(default)] only handles absent keys — an explicit null value still fails deserialization with: 'invalid type: null, expected a sequence' Fix: replace #[serde(default)] with a custom deserializer helper deserialize_null_as_empty_vec() that maps null -> Vec::default(), keeping the existing absent-key default behaviour. Regression test added: delta_with_null_tool_calls_deserializes_as_empty_vec uses the exact provider response shape from gaebal-gajae's repro (2026-04-09). 112 api lib tests pass. Fmt clean. Companion to gaebal-gajae's local 448cf2c — independently reproduced and landed on main.	2026-04-09 21:39:52 +09:00
YeonGyu-Kim	60ec2aed9b	fix(cli): wire /tokens and /cache as aliases for /stats; implement /stats Dogfood found that /tokens and /cache had spec entries (resume_supported: true) but no parse arms in the command parser, resulting in: 'Unknown slash command: /tokens — Did you mean /tokens' (the suggestion engine found the spec entry but parsing always failed) Fix three things: 1. Add 'tokens' \| 'cache' as aliases for 'stats' in the parse match so the commands actually resolve to SlashCommand::Stats 2. Implement SlashCommand::Stats in the REPL dispatch — previously fell through to 'Command registered but not yet implemented'. Now shows cumulative token usage for the session. 3. Implement SlashCommand::Stats in run_resume_command — previously returned 'unsupported resumed slash command'. Now emits: text: Cost / Input tokens / Output tokens / Cache create / Cache read json: {kind:stats, input_tokens, output_tokens, cache_*, total_tokens} 159 CLI tests pass, fmt clean.	2026-04-09 21:34:36 +09:00
YeonGyu-Kim	5f6f453b8d	fix(cli): warn when launched from home dir or filesystem root Users launching claw from their home directory (or /) have no project boundary — the agent can read/search the entire machine, often far beyond the intended scope. kapcomunica in #claw-code reported exactly this: 'it searched my entire computer.' Add warn_if_broad_cwd() called at prompt and REPL startup: - checks if CWD == $HOME or CWD has no parent (fs root) - prints a clear warning to stderr: Warning: claw is running from a very broad directory (/home/user). The agent can read and search everything under this path. Consider running from inside your project: cd /path/to/project && claw Warning fires on both claw (REPL) and claw prompt '...' paths. Does not fire from project subdirectories. Uses std::env::var_os("HOME"), no extra deps. 159 CLI tests pass, fmt clean.	2026-04-09 21:26:51 +09:00

1 2 3 4 5 ...

612 Commits