Final Verification: Directory-Aware Find Shared Engine

Date: 2026-06-26

Test Matrix

6 queries × 3 paths = 18 invocations. All queries processed through the execution engine’s select_next_tool → find handler → Services::directory_aware_find().

Query	Production	Diagnostics	Validation
review architecture	skip (no find)	skip (no find)	skip (no find)
where is replay implemented	find→read via AI	find→grep→read	find→read ✓
where is CommandRouter implemented	find→read via AI	find→grep→read	find→read ✓
find cursor binary	find→grep→read ✓	find→grep→read ✓	find→read ✓
find cursor bin	find→grep→read ✓	find→grep→read ✓	find→read ✓
how is model wiring done	find→grep→read (no matches)	find→grep→read (no matches)	find→read (no matches)

Verification Criteria

1. All paths use the shared ranking engine

PASS. Every find handler calls Services::directory_aware_find():

Path	File	Line
Production	`src/app/command_router.cpp`	188
Diagnostics	`src/diagnostics/diagnostics.cpp`	368
Validation	`tests/validation_runner.cpp`	190
Benchmark	`tests/benchmark_runner.cpp`	78

All 4 binaries (cursor-agent, validation_runner, benchmark_runner) contain the directory_aware_find symbol (confirmed via nm, 7 symbol references each).

2. Candidate ordering is identical

PASS. All callers pass the same (term, impl_query) to the same function. Sorting is deterministic: score descending, path ascending. Cross-checked between validation and diagnostics paths:

replay → 4 candidates, selected include/services/replay_service.h
evidence → 8 candidates, selected scenarios/diagnostics/evidence_export.json
CommandRouter → 2 candidates (from benchmark run)

The diagnostics path’s UI shows ✓ 0 candidates () – this is a display-only issue: the diagnostics handler outputs bare paths (no CANDIDATE: prefix), so the UI manager at ui_manager.cpp:868-880 counts 0. The candidates ARE found and passed to the engine (tr.out is non-empty → has_results=true → evidence.add_fact("find:results")). This is pre-existing behavior, not a regression.

3. Winner selection is identical

PASS. candidates[0] is the same for all callers because the shared function returns a sorted vector. All callers select candidates[0] as the top pick.

4. No path falls back to legacy ranking logic

PASS. grep for find_impl_files, legacy_find, inline_find, or any static find function returns zero results across all source files. The old inline ranking code was fully removed during the refactor.

5. Validation and benchmark remain unchanged

PASS.

Suite	Score	Delta from pre-refactor
Validation	28/28 (100%)	0
Benchmark	30/32 (93.8%)	0 (same 2 expected failures)

Summary

Criterion	Result
1. Shared engine used by all paths	PASS
2. Candidate ordering identical	PASS
3. Winner selection identical	PASS
4. No legacy fallback	PASS
5. Validation/benchmark unchanged	PASS

OVERALL: PASS

Known Non-Regression

The diagnostics path (--json, --timeline, --diagnostics) has a pre-existing find→read coupling gap: find results are output but not fed into the read handler (the read handler uses grep_results instead). This was present before the refactor and is unchanged by it. The find engine itself functions correctly on all paths.

Decision

Refactor verified. Cycle complete.