gitlore

Author	SHA1	Message	Date
Taylor Eernisse	0034958faf	fix(who): match old_path in detail/overlap queries, include closed MRs Fix two asymmetries between the main expert scoring query (build_expert_sql_v2) and the auxiliary queries (query_expert_details and query_overlap): 1. Missing old_path matching: The main scoring query correctly searches both position_new_path AND position_old_path (via UNION ALL branches) for DiffNotes, and both new_path AND old_path for mr_file_changes. However, query_expert_details and query_overlap only checked position_new_path / fc.new_path. This caused --detail mode and overlap mode to silently drop activity on renamed files that the main scoring correctly captured — the score and detail table wouldn't match. 2. State filter mismatch: The main query uses m.state IN ('opened','merged','closed') with a closed_mr_multiplier to downweight closed MRs. The detail and overlap queries used m.state IN ('opened','merged'), completely excluding closed MRs. This meant detail couldn't fully explain an expert's score. Fix: Add OR clauses for old_path in all signal branches of both queries, and include 'closed' in the state filter. The INDEXED BY hints are removed from the auxiliary queries (they use OR across path columns), which is acceptable since these run once per command. Also imports build_path_query, normalize_repo_path, and PathQuery from the new core::path_resolver module, removing the previously duplicated private functions (~261 lines deleted). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 12:37:21 -05:00
Taylor Eernisse	7e0e6a91f2	refactor: extract unit tests into separate _tests.rs files Move inline #[cfg(test)] mod tests { ... } blocks from 22 source files into dedicated _tests.rs companion files, wired via: #[cfg(test)] #[path = "module_tests.rs"] mod tests; This keeps implementation-focused source files leaner and more scannable while preserving full access to private items through `use super::*;`. Modules extracted: core: db, note_parser, payloads, project, references, sync_run, timeline_collect, timeline_expand, timeline_seed cli: list (55 tests), who (75 tests) documents: extractor (43 tests), regenerator embedding: change_detector, chunking gitlab: graphql (wiremock async tests), transformers/issue ingestion: dirty_tracker, discussions, issues, mr_diffs Also adds conflicts_with("explain_score") to the --detail flag in the who command to prevent mutually exclusive flags from being combined. All 629 unit tests pass. No behavior changes. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-13 10:54:02 -05:00
teernisse	94c8613420	feat(bd-226s): implement time-decay expert scoring model Replace flat-weight expertise scoring with exponential half-life decay, split reviewer signals (participated vs assigned-only), dual-path rename awareness, and new CLI flags (--as-of, --explain-score, --include-bots, --all-history). Changes: - ScoringConfig: 8 new fields with validation (config.rs) - half_life_decay() and normalize_query_path() pure functions (who.rs) - CTE-based SQL with dual-path matching, mr_activity, reviewer_participation (who.rs) - Rust-side decay aggregation with deterministic f64 ordering (who.rs) - Path resolution probes check old_path columns (who.rs) - Migration 026: 5 new indexes for dual-path and reviewer participation - Default --since changed from 6m to 24m - 31 new tests (example-based + invariant), 621 total who tests passing - Autocorrect registry updated with new flags Closes: bd-226s, bd-2w1p, bd-1soz, bd-18dn, bd-2ao4, bd-2yu5, bd-1b50, bd-1hoq, bd-1h3f, bd-13q8, bd-11mg, bd-1vti, bd-1j5o	2026-02-12 15:44:55 -05:00
teernisse	acc5e12e3d	perf: force partial index for DiffNote queries, batch stats counts Query optimizer fixes for the `who` and `stats` commands based on a systematic performance audit of the SQLite query plans. who command (expert/reviews/detail modes): - Add INDEXED BY idx_notes_diffnote_path_created hints to all DiffNote queries. SQLite's planner was selecting idx_notes_system (38% of rows) over the far more selective partial index (9.3% of rows). Measured 50-133x speedup on expert queries, 26x on reviews queries. - Reorder JOIN clauses in detail mode's MR-author sub-select to match the index scan direction (notes -> discussions -> merge_requests). stats command: - Replace 12+ sequential COUNT() queries with conditional aggregates (COALESCE + SUM + CASE). Documents, dirty_sources, pending_discussion_ fetches, and pending_dependent_fetches tables each scanned once instead of 2-3 times. Measured 1.7x speedup (109ms -> 65ms warm cache). - Switch FTS document count from COUNT() on the virtual table to COUNT(*) on documents_fts_docsize shadow table (B-tree scan vs FTS5 virtual table overhead). Measured 19x speedup for that single query. Database: 61652 docs, 282K notes, 211K discussions, 1.5GB.	2026-02-12 11:21:00 -05:00
Taylor Eernisse	d9f99ef21d	feat(cli): status display/filtering, expanded --fields, and robot-docs --brief Work item status integration across all CLI output: Issue listing (lore list issues): - New Status column appears when any issue has status data, with hex-color rendering using ANSI 256-color approximation - New --status flag for case-insensitive filtering (OR logic for multiple values): lore issues --status "In progress" --status "To do" - Status fields (name, category, color, icon_name, synced_at) in issue list query and JSON output with conditional serialization Issue detail (lore show issue): - Displays "Status: In progress (in_progress)" with color-coded output using ANSI 256-color approximation from hex color values - Status fields included in robot mode JSON with ISO timestamps - IssueRow, IssueDetail, IssueDetailJson all carry status columns Robot mode field selection expanded to new commands: - search: --fields with "minimal" preset (document_id, title, source_type, score) - timeline: --fields with "minimal" preset (timestamp, type, entity_iid, detail) - who: --fields with per-mode presets (expert_minimal, workload_minimal, etc.) - robot-docs: new --brief flag strips response_schema from output (~60% smaller) - strip_schemas() utility in robot.rs for --brief mode - expand_fields_preset() extended for search, timeline, and all who modes Robot-docs manifest updated with --status flag documentation, --fields flags for search/timeline/who, fields_presets sections, and corrected search response schema field names. Note: replaces empty commit `dcfd449` which lost staging during hook execution. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 08:13:37 -05:00
Taylor Eernisse	41504b4941	feat(who): configurable scoring weights, MR refs, detail mode, and suffix path resolution Expert mode now surfaces the specific MR references (project/path!iid) that contributed to each expert's score, capped at 50 per user. A new --detail flag adds per-MR breakdowns showing role (Author/Reviewer/both), note count, and last activity timestamp. Scoring weights (author_weight, reviewer_weight, note_bonus) are now configurable via the config file's `scoring` section with validation that rejects negative values. Defaults shift to author_weight=25, reviewer_weight=10, note_bonus=1 — better reflecting that code authorship is a stronger expertise signal than review assignment alone. Path resolution gains suffix matching: typing "login.rs" auto-resolves to "src/auth/login.rs" when unambiguous, with clear disambiguation errors when multiple paths match. Project-scoping (-p) narrows the candidate set. The MAX_MR_REFS_PER_USER constant is promoted to module scope for reuse across expert and overlap modes. Human output shows MR refs inline and detail sub-rows when requested. Robot JSON includes mr_refs, mr_refs_total, mr_refs_truncated, and optional details array. Includes comprehensive tests for suffix resolution, scoring weight configurability, MR ref aggregation across projects, and detail mode. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 10:15:15 -05:00
Taylor Eernisse	c54a969269	fix(who): exclude self-assigned reviewers from file-change reviewer signal Signal 4 (mr_reviewers + mr_file_changes) was missing the self-review exclusion that signal 1 (DiffNote reviewer) already had. An MR author listed as their own reviewer would be double-counted as both author and reviewer, inflating their score. Also removes redundant SELECT DISTINCT from signal 2 (GROUP BY already ensures uniqueness). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 13:42:40 -05:00
Taylor Eernisse	95b7183add	feat(who): expand expert + overlap queries with mr_file_changes and mr_reviewers Chain: bd-jec (config flag) -> bd-2yo (fetch MR diffs) -> bd-3qn6 (rewrite who queries) - Add fetch_mr_file_changes config option and --no-file-changes CLI flag - Add GitLab MR diffs API fetch pipeline with watermark-based sync - Create migration 020 for diffs_synced_for_updated_at watermark column - Rewrite query_expert() and query_overlap() to use 4-signal UNION ALL: DiffNote reviewers, DiffNote MR authors, file-change authors, file-change reviewers - Deduplicate across signal types via COUNT(DISTINCT CASE WHEN ... THEN mr_id END) - Add insert_file_change test helper, 8 new who tests, all 397 tests pass - Also includes: list performance migration 019, autocorrect module, README updates Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 13:35:14 -05:00
Taylor Eernisse	cc11d3e5a0	fix: peer review — 5 correctness bugs across who, db, lock, embedding, main Comprehensive peer code review identified and fixed the following: 1. who.rs: @-prefixed path routing used `target` (with @) instead of `clean` (stripped) when checking for '/' and passing to Expert mode, causing `lore who @src/auth/` to silently return zero results because the SQL LIKE matched against `@src/auth/%` which never exists. 2. db.rs: After ROLLBACK TO savepoint on migration failure, the savepoint was never RELEASEd, leaving it active on the connection. Fixed in both run_migrations() and run_migrations_from_dir(). 3. lock.rs: Multiple acquire() calls (e.g. re-acquiring a stale lock) replaced the heartbeat_handle without stopping the old thread, causing two concurrent heartbeat writers competing on the same lock row. Now signals the old thread to stop and joins it before spawning a new one. 4. chunk_ids.rs: encode_rowid() had no guard for chunk_index >= 1000 (CHUNK_ROWID_MULTIPLIER), which would cause rowid collisions between adjacent documents. Added range assertion [0, 1000). 5. main.rs: Fallback JSON error formatting in handle_auth_test interpolated LoreError Display output without escaping quotes or backslashes, potentially producing malformed JSON for robot-mode consumers. Now escapes both characters before interpolation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 08:07:59 -05:00
Taylor Eernisse	e6b880cbcb	fix: prevent panics in robot-mode JSON output and arithmetic paths Peer code review found multiple panic-reachable paths: 1. serde_json::to_string().unwrap() in 4 robot-mode output functions (who.rs, main.rs x3). If serialization ever failed (e.g., NaN from edge-case division), the CLI would panic with an unhelpful stack trace. Replaced with unwrap_or_else that emits a structured JSON error fallback. 2. encode_rowid() in chunk_ids.rs used unchecked multiplication (document_id * 1000). On extreme document IDs this could silently wrap in release mode, causing embedding rowid collisions. Now uses checked_mul + checked_add with a diagnostic panic message. 3. HTTP response body truncation at byte index 500 in client.rs could split a multi-byte UTF-8 character, causing a panic. Now uses floor_char_boundary(500) for safe truncation. 4. who.rs reviews mode: SQL used `m.author_username != ?1` which silently dropped MRs with NULL author_username (SQL NULL != anything = NULL). Changed to `(m.author_username IS NULL OR m.author_username != ?1)` to match the pattern already used in expert mode. 5. handle_auth_test hardcoded exit code 5 for all errors regardless of type. Config not found (20), token not set (4), and network errors (8) all incorrectly returned 5. Now uses e.exit_code() from the actual LoreError, with proper suggestion hints in human mode. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-08 07:55:20 -05:00
Taylor Eernisse	f267578aab	feat: implement lore who — people intelligence commands (5 modes) Add `lore who` command with 5 query modes answering collaboration questions using existing DB data (280K notes, 210K discussions, 33K DiffNotes): - Expert: who knows about a file/directory (DiffNote path analysis + MR breadth scoring) - Workload: what is a person working on (assigned issues, authored/reviewing MRs, discussions) - Active: what discussions need attention (unresolved resolvable, global/project-scoped) - Overlap: who else is touching these files (dual author+reviewer role tracking) - Reviews: what review patterns does a person have (prefix-based category extraction) Includes migration 017 (5 composite indexes), CLI skeleton with clap conflicts_with validation, robot JSON output with input+resolved_input reproducibility, human terminal output, and 20 unit tests. All quality gates pass. Closes: bd-1q8z, bd-34rr, bd-2rk9, bd-2ldg, bd-zqpf, bd-s3rc, bd-m7k1, bd-b51e, bd-2711, bd-1rdi, bd-3mj2, bd-tfh3, bd-zibc, bd-g0d5 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-07 23:11:14 -05:00

11 Commits