trueref-legacy

Author	SHA1	Message	Date
Giancarmine Salucci	6297edf109	chore(TRUEREF-0022): fix lint errors and update architecture docs - Fix 15 ESLint errors across pipeline workers, SSE endpoints, and UI - Replace explicit any with proper entity types in worker entries - Remove unused imports and variables (basename, SSEEvent, getBroadcasterFn, seedRules) - Use empty catch clauses instead of unused error variables - Use SvelteSet for reactive Set state in repository page - Fix operator precedence in nullish coalescing expression - Replace $state+$effect with $derived for concurrency input - Use resolve() directly in href for navigation lint rule - Update ARCHITECTURE.md and FINDINGS.md for worker-thread architecture	2026-03-30 17:28:38 +02:00
Giancarmine Salucci	7630740403	feat(TRUEREF-0022): complete iteration 0 — worker-thread indexing, parallel jobs, SSE progress - Move IndexingPipeline.run() into Worker Threads via WorkerPool - Add dedicated embedding worker thread with single model instance - Add stage/stageDetail columns to indexing_jobs schema - Create ProgressBroadcaster for SSE channel management - Add SSE endpoints: GET /api/v1/jobs/:id/stream, GET /api/v1/jobs/stream - Replace UI polling with EventSource on repo detail and admin pages - Add concurrency settings UI and API endpoint - Build worker entries separately via esbuild	2026-03-30 17:08:23 +02:00
Giancarmine Salucci	09c6f9f7c1	fix(MULTIVERSION-0001): eliminate NULL-row contamination in getRules When a versioned query is made, getRules() now returns only the version-specific repository_configs row. The NULL (HEAD/repo-wide) row is no longer merged in, preventing v4 rules from bleeding into v1/v2/v3 versioned context responses. Tests updated to assert the isolation: versioned queries return only their own rules row; a new test verifies that a version with no config row returns an empty rules array even when a NULL row exists. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 11:47:31 +02:00
Giancarmine Salucci	666ec7d55f	feat(MULTIVERSION-0001): wire trueref.json into pipeline + per-version rules - Add migration 0003: recreate repository_configs with nullable version_id column and two partial unique indexes (repo-wide: version_id IS NULL, per-version: (repository_id, version_id) WHERE version_id IS NOT NULL) - Update schema.ts to reflect the new composite structure with uniqueIndex partial constraints via drizzle-orm sql helper - IndexingPipeline: parse trueref.json / context7.json after crawl, apply excludeFiles filter before diff computation, update totalFiles accordingly - IndexingPipeline: persist repo-wide rules (version_id=null) and version-specific rules (when versionId set) via upsertRepoConfig helper - Add matchesExcludePattern static helper supporting plain filename, glob prefix (docs/legacy*), and exact path patterns - context endpoint: split getRules into repo-wide + version-specific lookup with dedup merge; pass versionId at call site - Update test DB loaders to include migration 0003 - Add pipeline tests for excludeFiles, repo-wide rules persistence, and per-version rules persistence - Add integration tests for merged rules, repo-only rules, and dedup logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-28 10:44:30 +01:00
Giancarmine Salucci	255838dcc0	fix(MULTIVERSION-0001): fix version isolation, 404 on unknown version, commit-hash lookup, and searchModeUsed Bug 1: Thread version tag from run() into crawl() via getVersionTag() helper so LocalCrawler and GithubCrawler receive the correct ref when indexing a named version instead of always crawling HEAD. Bug 2: Return HTTP 404 with code VERSION_NOT_FOUND when a requested version tag is not found in repository_versions, instead of silently falling back to a cross-version mixed result set. Bug 4: Before returning 404, attempt a commit_hash prefix match (min 7 chars) so callers can request a version by full or short SHA. Bug 3: Change HybridSearchService.search() to return { results, searchModeUsed } and propagate searchModeUsed through ContextResponseMetadata and ContextJsonResponseDto so callers can see which strategy (keyword / semantic / hybrid / keyword_fallback) was actually used. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-28 10:31:15 +01:00
Giancarmine Salucci	1c5b634ea4	fix(MULTIVERSION-0001): fix multi-version indexing — jobs never created or triggered for secondary versions Two bugs prevented secondary versions from ever being indexed: 1. JobQueue.enqueue() and RepositoryService.createIndexingJob() deduplication only checked repository_id, so a queued default-branch job blocked all version-specific jobs for the same repo. Fix: include version_id in the WHERE clause so only exact (repository_id, version_id) pairs are deduped. 2. POST /api/v1/libs/:id/versions used repoService.createIndexingJob() which inserts a job record but never triggers queue processing. Fix: use queue.enqueue() (same fallback pattern as the libs endpoint) so setImmediate fires processNext() after the job is inserted. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-28 09:32:27 +01:00
Giancarmine Salucci	781d224adc	feat(EMBEDDINGS-0001): enable local embedder by default and overhaul settings page - Wire local embedding provider as the default on startup when no profile is configured - Refactor embedding settings into dedicated service, DTOs, mappers and models - Rebuild settings page with profile management UI and live test feedback - Expose index summary (indexed versions + embedding count) on repo endpoints - Harden indexing pipeline and context search with additional test coverage Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-28 09:28:01 +01:00
Giancarmine Salucci	d1381f7fc0	fix(ROUTING-0001): repair repo routing and isolate MCP filtering	2026-03-27 19:01:47 +01:00
Giancarmine Salucci	5a3c27224d	chore(FEEDBACK-0001): linting	2026-03-27 02:23:01 +01:00
Giancarmine Salucci	16436bfab2	fix(FEEDBACK-0001): complete iteration 0 - harden context search	2026-03-27 01:25:46 +01:00
Giancarmine Salucci	fef6f66930	wip(TRUEREF-0018): commit version-scoped indexing work	2026-03-25 19:03:22 +01:00
Giancarmine Salucci	215cadf070	refactor: introduce domain model classes and mapper layer Replace ad-hoc inline row casting (snake_case → camelCase) spread across services, routes, and the indexing pipeline with explicit model classes (Repository, IndexingJob, RepositoryVersion, Snippet, SearchResult) and dedicated mapper classes that own the DB → domain conversion. - Add src/lib/server/models/ with typed model classes for all domain entities - Add src/lib/server/mappers/ with mapper classes per entity - Remove duplicated RawRow interfaces and inline map functions from job-queue, repository.service, indexing.pipeline, and all API routes - Add dtoJsonResponse helper to standardise JSON responses via SvelteKit json() - Add api-contract.integration.test.ts as a regression baseline Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 14:29:49 +01:00

12 Commits