Initial commit: trueref v0.1.0-SNAPSHOT

Java 21 / Spring Boot 3.5.3 multi-module Maven project. Hybrid BM25+HNSW search with RRF, cross-encoder reranker, ONNX Runtime 1.22.0 (CPU + CUDA 12 GPU variants).
2026-05-06 00:49:16 +02:00
commit c5f950c2c0
132 changed files with 11287 additions and 0 deletions
--- a/.gitea/workflows/docker.yml
+++ b/.gitea/workflows/docker.yml
@@ -0,0 +1,85 @@
+name: Build and publish Docker image
+
+on:
+  push:
+    branches:
+      - main
+      - master
+    tags:
+      - 'v*.*.*'
+  workflow_dispatch:
+
+jobs:
+  docker:
+    name: Build and push
+    runs-on: ubuntu-latest
+
+    steps:
+      - name: Checkout
+        uses: actions/checkout@v4
+
+      # Set up Docker Buildx for efficient layer caching.
+      - name: Set up Docker Buildx
+        uses: docker/setup-buildx-action@v3
+
+      # Log in to the Gitea container registry.
+      # The built-in GITEA_TOKEN is injected automatically by Gitea Actions and
+      # has write access to packages in the same organisation/user namespace.
+      - name: Log in to Gitea registry
+        uses: docker/login-action@v3
+        with:
+          registry: git.sal.giize.com
+          username: ${{ gitea.actor }}
+          password: ${{ secrets.GITEA_TOKEN }}
+
+      # ── Determine tags ───────────────────────────────────────────────────
+      # On a version tag (v1.2.3):  latest, cpu, cpu-1.2.3, 1.2.3
+      # On branch push (main/master): latest, cpu
+      - name: Docker metadata (CPU)
+        id: meta_cpu
+        uses: docker/metadata-action@v5
+        with:
+          images: git.sal.giize.com/mozempk/trueref
+          flavor: |
+            latest=auto
+          tags: |
+            type=raw,value=latest,enable={{is_default_branch}}
+            type=raw,value=cpu,enable={{is_default_branch}}
+            type=semver,pattern={{version}},prefix=cpu-
+            type=semver,pattern={{version}}
+
+      - name: Docker metadata (GPU)
+        id: meta_gpu
+        uses: docker/metadata-action@v5
+        with:
+          images: git.sal.giize.com/mozempk/trueref
+          flavor: |
+            latest=false
+          tags: |
+            type=raw,value=gpu,enable={{is_default_branch}}
+            type=semver,pattern={{version}},prefix=gpu-
+
+      # ── CPU image ────────────────────────────────────────────────────────
+      - name: Build and push CPU image
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: Dockerfile
+          push: true
+          tags: ${{ steps.meta_cpu.outputs.tags }}
+          labels: ${{ steps.meta_cpu.outputs.labels }}
+          cache-from: type=gha,scope=cpu
+          cache-to: type=gha,mode=max,scope=cpu
+
+      # ── GPU image ────────────────────────────────────────────────────────
+      # Built from the same source; only the runtime base image differs.
+      - name: Build and push GPU image
+        uses: docker/build-push-action@v5
+        with:
+          context: .
+          file: Dockerfile.gpu
+          push: true
+          tags: ${{ steps.meta_gpu.outputs.tags }}
+          labels: ${{ steps.meta_gpu.outputs.labels }}
+          cache-from: type=gha,scope=gpu
+          cache-to: type=gha,mode=max,scope=gpu
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1,30 @@
+target/
+build/
+out/
+.idea/
+.vscode/
+*.iml
+*.ipr
+*.iws
+.DS_Store
+
+# Maven
+.mvn/wrapper/maven-wrapper.jar
+
+# trueref runtime data (models, DB, index — too large / machine-specific)
+data/
+data-onnx-smoke/
+logs/
+
+# cuDNN and other large native runtime libraries
+runtime/
+
+# JVM crash dumps
+hs_err_pid*.log
+core.*
+
+# Frontend
+trueref-frontend/web/node_modules/
+trueref-frontend/web/build/
+trueref-frontend/web/.svelte-kit/
+node_modules/
--- a/ARCHITECTURE.md
+++ b/ARCHITECTURE.md
@@ -0,0 +1,428 @@
+# trueref — Architecture
+
+> Self-hosted, fat-JAR, Java-21 clone of [Context7](https://github.com/upstash/context7) ingestion + retrieval, with first-class differential per-tag indexing, embedded vector + BM25 store, ONNX-accelerated embeddings/rerank, Streamable-HTTP MCP server, REST + OpenAPI, and a SvelteKit UI.
+
+## 1. Goals & Non-Goals
+
+### Goals
+- **Functional parity with Context7** ingestion outcome (own chunk schema).
+- **Differential per-tag indexing**: every git tag of every registered repo is independently queryable.
+- **Embedded everything**: single fat JAR runnable on a workstation/server. No external Postgres/Qdrant.
+- **GPU-accelerated retrieval** via ONNX Runtime (CUDA Linux/Win, DirectML Win, CPU fallback).
+- **MCP Streamable-HTTP server** exposing exactly two tools: `resolve-library-id`, `get-library-docs` — drop-in for any MCP client.
+- **Full observability** of ingestion pipelines surfaced in the UI (live progress, log tail, history, timings, resource usage).
+- **REST + OpenAPI/Swagger** for programmatic and UI use.
+- **SvelteKit UI** for repo registration, indexing control, monitoring, and ad-hoc query.
+- **Hexagonal architecture** so vector store, embedder, parser, persistence, etc. are swappable.
+
+### Non-Goals
+- No public hosted SaaS — self-host only.
+- No model fine-tuning.
+- No mobile app.
+- No generative LLM in the pipeline (retrieval-only, like Context7).
+- No multi-tenancy / auth (LAN-only deployment).
+
+---
+
+## 2. Tech Stack (locked)
+
+| Concern | Choice | Rationale |
+|---|---|---|
+| Language / runtime | **Java 21 LTS** | Virtual threads stable; Spring Boot 3.5 supported. (Java 25 dropped — Boot 3.5 supports up to 23.) |
+| Framework | **Spring Boot 3.5.x** + **Spring AI 1.0.x** | Web MVC + virtual-thread executor; Spring AI for embedding/MCP abstractions. |
+| Build | **Maven** | Stable, ubiquitous, Spring-Boot first-class. |
+| Metadata store | **H2 (MVCC mode, file-based)** + Flyway | Zero ops, JDBC, MVCC concurrency, fits fat JAR. |
+| Vector + lexical store | **Apache Lucene 9.x** | Pure JVM. BM25 + HNSW kNN in one index. Collapses two stores. |
+| Embedding model | **BAAI/bge-m3** (ONNX) | Multilingual, 8k context, dense+sparse capable. MIT-like license. |
+| Reranker | **BAAI/bge-reranker-v2-m3** (ONNX) | Cross-encoder, Apache 2.0. |
+| ML runtime | **ONNX Runtime** (`onnxruntime_gpu` Linux CUDA / `onnxruntime-directml` Win / `onnxruntime` CPU) | In-JVM via official Java bindings. |
+| Git | **JGit** | Pure Java; clone, fetch, tag enumeration, diff. |
+| Code parsing | **Pure-Java heuristic chunker** (markdown-aware, brace-balanced for C-family, indent-based for Python, sliding-window fallback) | No native deps; preserves fat-JAR purity. Tree-sitter is a documented future swap (see FINDINGS §F11). |
+| Job orchestration | **Custom virtual-thread orchestrator** + H2-backed durable state | Fast, no Spring Batch overhead. |
+| MCP server | **Spring AI MCP Server (Streamable HTTP)** | Spec 2025-03-26, single `/mcp` endpoint. |
+| REST docs | **springdoc-openapi** | OpenAPI 3 + Swagger UI auto-generated. |
+| Observability | **Micrometer + OpenTelemetry**, exposed via REST/SSE for UI. **Prometheus + Grafana optional** via `/actuator/prometheus`. | UI-first; Prom/Graf attach later. |
+| Frontend | **SvelteKit + `@sveltejs/adapter-static`** | Built into `bootstrap/src/main/resources/static/`, served by Spring as part of fat JAR. |
+| Packaging | **Single fat JAR** via `spring-boot-maven-plugin` | One artifact, embedded everything. |
+
+---
+
+## 3. Hexagonal Layout (Maven multi-module)
+
+Direction of dependencies is enforced by Maven coordinates alone — no ArchUnit needed.
+
+```
+trueref-parent/                  (pom; BOM + plugin management)
+├── trueref-domain               pure Java; records, sealed types, port interfaces. ZERO deps.
+├── trueref-application          use-case impls; depends on: domain
+├── trueref-adapters             ALL adapters live here; depends on: domain, application
+│   └── com.trueref.adapter
+│       ├── in
+│       │   ├── rest             @RestController + DTOs + OpenAPI + SSE
+│       │   └── mcp              MCP tool defs (Spring AI MCP server)
+│       └── out
+│           ├── persistence.h2   JdbcClient + Flyway, RepositoryStore impl
+│           ├── vectorstore.lucene  Lucene BM25 + HNSW kNN, ChunkStore impl
+│           ├── embedding.onnx   ONNX bge-m3 + bge-reranker-v2-m3
+│           ├── git.jgit         GitClient impl
+│           ├── parsing.treesitter  CodeParser impl
+│           └── cache.disk       EmbeddingCache (file-per-hash)
+├── trueref-frontend             SvelteKit; built via frontend-maven-plugin into static jar
+└── trueref-bootstrap            @SpringBootApplication; wires beans; produces fat JAR
+                                 depends on: domain, application, adapters, frontend
+```
+
+**Dependency rule (Maven-enforced):**
+- `domain` → nothing.
+- `application` → `domain`.
+- `adapters` → `domain` + `application`.
+- `frontend` → none (resource-only jar).
+- `bootstrap` → all of the above (the only place wiring lives).
+
+> All packages live under `com.trueref.*` regardless of module. Module boundaries enforce dependency direction; package layout inside `adapters` mirrors the in/out hexagonal convention.
+
+---
+
+## 4. Core Domain Model
+
+```
+Repository {
+  id: UUID
+  name: String                 // "spring-projects/spring-boot"
+  remoteUrl: String?           // null if local-only
+  localPath: Path              // either user-provided or our managed clone dir
+  managedClone: bool           // true if WE clone/fetch
+  ignoreGlobs: List<String>    // per-repo overrides
+  maxFileSizeBytes: long       // default 1MB
+  pollIntervalSec: long        // default 3600; 0 disables polling
+  versionMappingRules: List<TagPattern>  // exact, v-prefix, release-prefix, regex
+  createdAt, updatedAt
+}
+
+Version {
+  id: UUID
+  repoId: UUID
+  tag: String                  // "v1.2.3" or branch name
+  commitSha: String
+  status: enum { DISCOVERED, INDEXING, INDEXED, FAILED, INACTIVE }
+  indexedAt: Instant?
+  chunkCount: int
+  errorMessage: String?
+}
+
+Chunk {                        // global, deduplicated by content_hash
+  id: UUID
+  contentHash: String          // sha256 of canonicalized content
+  content: String              // the snippet text
+  language: String             // "java", "python", "markdown", ...
+  symbol: String?              // function/class name if AST-extracted
+  tokenCount: int
+  // dense + sparse vectors stored in Lucene index, not here
+}
+
+ChunkVersion {                 // many-to-many: which versions contain which chunks
+  chunkId: UUID
+  versionId: UUID
+  filePath: String
+  startLine: int
+  endLine: int
+  // PK (chunkId, versionId, filePath, startLine)
+}
+
+IngestionJob {
+  id: UUID
+  repoId: UUID
+  versionId: UUID?             // null = repo-level (e.g. discovery)
+  type: enum { DISCOVER_TAGS, INDEX_VERSION, COMPACT, REFRESH }
+  status: enum { QUEUED, RUNNING, SUCCEEDED, FAILED, CANCELLED }
+  startedAt, finishedAt
+  stages: List<JobStage>
+}
+
+JobStage {
+  jobId: UUID
+  name: enum { CLONE, FETCH, CHECKOUT, DISCOVER_FILES, PARSE, CHUNK, EMBED, INDEX, COMMIT }
+  status: enum { PENDING, RUNNING, SUCCEEDED, FAILED, SKIPPED }
+  startedAt, finishedAt
+  itemsProcessed: long
+  itemsTotal: long
+  bytesProcessed: long
+  errorMessage: String?
+}
+
+JobLogEvent {                  // ring-buffered + persisted; streamed via SSE
+  jobId: UUID
+  ts: Instant
+  level: enum { DEBUG, INFO, WARN, ERROR }
+  stage: JobStage.name?
+  message: String
+}
+```
+
+---
+
+## 5. Ingestion Pipeline
+
+```
+                ┌────────────────────────────────────────────────────────┐
+                │  IngestionOrchestrator  (virtual-thread per stage)     │
+                └────────────────────────────────────────────────────────┘
+                              │
+   ┌──────────────────────────┼──────────────────────────────────────────┐
+   ▼                          ▼                                          ▼
+[CLONE/FETCH]          [DISCOVER_TAGS]                          [INDEX_VERSION job]
+ JGit pull/clone        git tag list ∩                            (per (repo,tag))
+                        version mapping
+                        rules
+                                                                         │
+                                            ┌────────────────────────────┤
+                                            ▼                            ▼
+                                      [CHECKOUT worktree]          (parallel tags up to N)
+                                            │
+                                            ▼
+                                      [DISCOVER_FILES]
+                                       respect .gitignore +
+                                       defaults + per-repo globs +
+                                       max file size
+                                            │
+                                            ▼
+                                      [GIT_DIFF vs prev indexed tag]
+                                       → if exists, only changed
+                                          files reach PARSE
+                                            │
+                                            ▼
+                                      [PARSE]  heuristic chunker
+                                       (markdown sections; brace-balanced;
+                                       indent-based; sliding-window fallback)
+                                            │
+                                            ▼
+                                      [CHUNK]  AST-aware splits +
+                                       sliding-window fallback
+                                            │
+                                            ▼
+                                      [HASH + DEDUPE]
+                                       content_hash lookup → existing
+                                          chunkId reused
+                                            │
+                                            ▼
+                                      [EMBED]  ONNX bge-m3
+                                       NEW chunks only
+                                       (GPU semaphore-gated batch)
+                                            │
+                                            ▼
+                                      [INDEX]  Lucene upsert:
+                                       - chunk doc with vector
+                                       - chunk_version doc
+                                            │
+                                            ▼
+                                      [COMMIT]  Lucene commit +
+                                       H2 transaction
+                                            │
+                                            ▼
+                                       Version.status = INDEXED
+```
+
+### Key invariants
+
+1. **Embeddings are computed at most once per `content_hash`.** Persistent disk cache keyed by hash → vector bytes.
+2. **A tag's chunks = union of (a) reused chunks via hash and (b) newly-embedded chunks.** This makes re-indexing a near-identical tag almost free.
+3. **Git-diff fast path:** if a tag's parent (nearest previously indexed tag in semver order) exists, only files changed in `git diff parent..tag` are re-parsed. Unchanged files contribute their parent's chunk_versions verbatim with new line offsets adjusted by diff (or fully re-parsed if rename detection is ambiguous).
+4. **Per-stage virtual-thread pools.** Threads themselves are unbounded (per user spec), but a **GPU semaphore** (default `permits = ortSessionCount`) gates ONNX inference to avoid GPU OOM. Lucene writer is single-thread (its own queue).
+
+---
+
+## 6. Search Pipeline
+
+```
+query  ─►  [Query Rewrite]      rule-based: lowercase, dedupe stop tokens,
+              │                  optional library-id-aware expansion
+              ▼
+           [BM25 search]                [Dense kNN search]
+           Lucene similarity            Lucene HNSW (bge-m3 dense)
+                  │                                   │
+                  └─────────────► [RRF fusion] ◄──────┘
+                                          │
+                                          ▼
+                                  top-K candidates (default 50)
+                                          │
+                                          ▼
+                                  [Cross-encoder rerank]
+                                  ONNX bge-reranker-v2-m3
+                                  (GPU semaphore)
+                                          │
+                                          ▼
+                                  [Token-budget assemble]
+                                  pack snippets up to `tokens` param
+                                  (default 5000, min 500, max 50000)
+                                          │
+                                          ▼
+                                   ranked snippets w/ citations
+                                   (file path, repo, tag, lines)
+```
+
+All searches are **scoped** to `(repoId, versionId)` filter clauses on the Lucene index using `chunk_versions` join semantics.
+
+---
+
+## 7. MCP Server (Streamable HTTP)
+
+- Single endpoint: `POST /mcp` (JSON-RPC over HTTP) with optional SSE upgrade per request, per [MCP 2025-03-26 spec](https://spec.modelcontextprotocol.io/specification/2025-03-26/basic/transports/).
+- **Two tools, exactly matching Context7 schema:**
+
+### `resolve-library-id`
+```json
+{
+  "name": "resolve-library-id",
+  "description": "Resolves a library/package name to a trueref-compatible library ID...",
+  "inputSchema": {
+    "type": "object",
+    "required": ["libraryName"],
+    "properties": {
+      "libraryName": { "type": "string" },
+      "query":       { "type": "string", "description": "optional, ranks results by relevance" }
+    }
+  }
+}
+```
+Returns ranked candidate library IDs (`/{owner}/{repo}` style) with metadata (description, snippet count, available versions, source reputation).
+
+### `get-library-docs`
+```json
+{
+  "name": "get-library-docs",
+  "inputSchema": {
+    "type": "object",
+    "required": ["libraryId"],
+    "properties": {
+      "libraryId": { "type": "string", "description": "/org/project[/version]" },
+      "topic":     { "type": "string" },
+      "tokens":    { "type": "integer", "minimum": 500, "maximum": 50000, "default": 5000 }
+    }
+  }
+}
+```
+
+### On-demand indexing flow
+- If `libraryId` includes a version that maps to a known git tag but is **not yet indexed**:
+  1. Enqueue `INDEX_VERSION` job immediately.
+  2. Return a **partial** response built from the **nearest indexed tag** (semver-closest) plus a status block: `{ "indexing": { "status": "in_progress", "version": "1.2.3", "retryAfterSec": 30 } }`.
+- If version maps to **no** tag: return error `version_not_found` with the list of candidate tags discovered.
+
+---
+
+## 8. REST API Surface
+
+| Method | Path | Purpose |
+|---|---|---|
+| GET | `/api/repos` | List registered repos |
+| POST | `/api/repos` | Register (local path or remote URL) |
+| GET | `/api/repos/{id}` | Repo detail + version summary |
+| DELETE | `/api/repos/{id}` | Unregister + soft-delete versions |
+| POST | `/api/repos/{id}/discover` | Force tag discovery |
+| GET | `/api/repos/{id}/versions` | All known versions + status |
+| POST | `/api/repos/{id}/versions/{tag}/index` | Index a specific tag |
+| POST | `/api/repos/{id}/versions/{tag}/reindex` | Force re-index |
+| GET | `/api/jobs` | List jobs (filter by repo/version/status) |
+| GET | `/api/jobs/{id}` | Job detail with stages |
+| GET | `/api/jobs/{id}/log` (SSE) | Live log stream |
+| GET | `/api/jobs/stream` (SSE) | Live job-status events for the dashboard |
+| POST | `/api/search` | Hybrid search across one or more (repo, version) scopes |
+| GET | `/api/resolve?q=react` | Library-ID resolution preview |
+| GET | `/api/observability/metrics` | UI-friendly aggregated metrics JSON |
+| GET | `/api/observability/resources` | Heap, GPU mem (via NVML when present), index size |
+| GET | `/swagger-ui/index.html` | Swagger UI |
+| GET | `/v3/api-docs` | OpenAPI JSON |
+| ANY | `/mcp` | MCP Streamable HTTP endpoint |
+| GET | `/actuator/prometheus` | Prometheus scrape (optional) |
+| GET | `/**` | SPA fallback to `index.html` |
+
+---
+
+## 9. Concurrency & Performance
+
+- **Virtual threads everywhere** for I/O (HTTP, JGit, file I/O, Lucene reads).
+- **`Tomcat` configured with virtual-thread executor** (`spring.threads.virtual.enabled=true`).
+- **Per-stage logical pools** are unbounded virtual-thread executors per orchestrator instance.
+- **GPU access gated by a `Semaphore`** with permits = number of ONNX sessions (configurable, default = 2).
+- **Lucene writer**: single `IndexWriter` instance protected by a queue; readers use a refresh-on-search `SearcherManager`.
+- **Embedding cache**: file-per-hash on disk under `data/embedding-cache/`; hot LRU in memory.
+- **Tag concurrency**: not capped (per spec), but each tag job awaits the GPU semaphore — natural backpressure.
+
+---
+
+## 10. Observability
+
+- **Metrics** via Micrometer (`MeterRegistry`):
+  - Counters: chunks_embedded, chunks_reused, files_skipped, jobs_succeeded/failed.
+  - Timers: stage durations per stage name.
+  - Gauges: active_jobs, gpu_semaphore_available, lucene_index_size_bytes, heap_used.
+- **OpenTelemetry traces** for every job (one trace per `IngestionJob`, span per `JobStage`).
+- **JobEventBus**: in-process pub/sub. SSE controllers subscribe and push events to UI.
+- **UI dashboards** (no Grafana required):
+  - "Live" tab: progress bars per running (repo, tag), per-stage throughput, log tail.
+  - "History" tab: paginated jobs table.
+  - "Stats" tab: per-stage timing histograms, chunk counts per repo/version, chunk dedupe ratio.
+  - "Resources" tab: heap, GPU memory (NVML where available), index size on disk.
+- **Prometheus** scraping is opt-in (Actuator endpoint).
+
+---
+
+## 11. Storage Layout (on disk)
+
+```
+$TRUEREF_HOME/                  # default: ./data
+├── h2/                         # H2 database files
+├── lucene/                     # single index dir; one Lucene writer
+├── repos/                      # managed clones (when managedClone=true)
+│   └── <repoId>/...
+├── embedding-cache/            # one file per content_hash → fp16 vector bytes
+├── models/                     # ONNX model files (auto-downloaded on first run)
+│   ├── bge-m3/
+│   └── bge-reranker-v2-m3/
+└── logs/
+```
+
+---
+
+## 12. Configuration (excerpt)
+
+```yaml
+trueref:
+  home: ${TRUEREF_HOME:./data}
+  ingestion:
+    poll-interval-default: 1h
+    tag-cap-default: 100              # most-recent N tags by semver/date
+    max-file-size-bytes-default: 1048576
+  embedding:
+    model: bge-m3
+    onnx-providers: [cuda, directml, cpu]   # tried in order
+    session-count: 2                  # = GPU semaphore permits
+    batch-size: 32
+  reranker:
+    model: bge-reranker-v2-m3
+    top-k: 50
+  search:
+    rrf-k: 60
+    final-top-k: 20
+  mcp:
+    tokens-default: 5000
+    tokens-min: 500
+    tokens-max: 50000
+spring:
+  threads.virtual.enabled: true
+```
+
+---
+
+## 13. Out-of-the-box behaviors locked from clarifications
+
+- **Auth**: none (LAN-only) on REST and MCP.
+- **Tag selection**: default cap 100 most-recent; on-demand index of any tag via UI search OR via MCP when an unindexed version is requested.
+- **Differential indexing**: dedupe by `content_hash` AND skip unchanged files via `git diff parent..tag`.
+- **Repo input**: UI-add (local path or remote URL) AND watched folder `./data/watched/` for bare repos.
+- **Re-index trigger**: on-demand + scheduled `git fetch` poll (default 1h per repo).
+- **Stale tag cleanup**: soft delete via `Version.status=INACTIVE`; compaction job reclaims orphan chunks.
+- **Embedding cache**: persistent on disk, keyed by `content_hash`.
+- **Concurrency**: unbounded virtual threads, GPU semaphore-gated.
+
+See [FINDINGS.md](FINDINGS.md) for research backing each choice.
--- a/CODE_STYLE.md
+++ b/CODE_STYLE.md
@@ -0,0 +1,91 @@
+# trueref — Code Style
+
+## 1. Language & Toolchain
+- **Java 21**, source/target 21.
+- **Maven** with `spring-boot-maven-plugin` for the fat JAR.
+- **Spotless** with **Palantir Java Format** for formatting (4-space indent, 120 col).
+- **ErrorProne** + **NullAway** for static analysis. NullAway annotations: `@org.jspecify.annotations.Nullable` / `@NonNull`.
+- Hexagonal boundaries are enforced by **Maven module dependencies** (no ArchUnit). See ARCHITECTURE §3.
+
+## 2. Records, Sealed Types, Pattern Matching
+- Prefer **records** for DTOs, value objects, and `port.in`/`port.out` parameter/result types.
+- Use **sealed interfaces** for closed result/event hierarchies (`sealed interface IngestionEvent permits ...`).
+- Use **pattern matching** (`switch` expressions, `instanceof`) over visitor pattern.
+
+## 3. Nullability & Optional
+- All API surfaces (public methods on ports, REST DTOs) are **non-null by default**; mark nullable explicitly with `@Nullable`.
+- Use `Optional<T>` **only** as a return type from query-style methods. Never as a field, never as a parameter.
+
+## 4. Concurrency
+- Spawn virtual threads via `Thread.ofVirtual().start(...)` or `Executors.newVirtualThreadPerTaskExecutor()`. **Never** call `Thread.sleep` inside a synchronized block.
+- Shared mutable state is forbidden in `domain` and discouraged in `application`. When unavoidable, use `java.util.concurrent.atomic.*` or a `ReentrantLock`.
+- GPU work goes through `GpuSemaphore.acquire()` (a thin wrapper around `Semaphore`).
+- Long-running orchestration uses **structured concurrency** (`StructuredTaskScope`) where it improves cancellation safety.
+
+## 5. Error Handling
+- **Domain errors** are sealed exception hierarchies rooted at `TrueRefException`. Adapters translate them to HTTP/JSON-RPC errors centrally (REST: `@ControllerAdvice`; MCP: dedicated translator).
+- **No checked exceptions** at port boundaries. Wrap third-party checked exceptions at the adapter edge.
+- Validation errors carry a stable `code` (string) so the UI can localize.
+- Never `catch (Exception e)` and swallow. Either log + rethrow as a domain exception or let it propagate.
+
+## 6. Logging
+- **SLF4J** with parameterized messages: `log.info("indexed tag {} of repo {}", tag, repoName);` — never string-concatenate.
+- Structured fields via MDC: `repoId`, `versionId`, `jobId`, `stage`. Cleared in a try/finally.
+- Log levels:
+  - `ERROR`: unrecoverable, requires operator attention.
+  - `WARN`: degraded, automatic recovery in progress.
+  - `INFO`: lifecycle events (job started/finished, repo registered).
+  - `DEBUG`: per-file, per-chunk detail. Off by default.
+
+## 7. Naming
+- Use cases (`port.in`): imperative verb phrases — `IndexVersion`, `ResolveLibraryId`.
+- SPIs (`port.out`): noun-ish role names — `EmbeddingService`, `ChunkStore`, `GitClient`.
+- Adapter classes: `<Tech><Role>` — `LuceneChunkStore`, `OnnxEmbeddingService`, `JGitClient`.
+- DTOs: `<Resource><Action>Request` / `<Resource>Response`.
+- Records' field names are camelCase (no Hungarian, no `_` prefixes).
+
+## 8. Package & File Discipline
+- One public type per file.
+- Internal helpers are package-private. Avoid `public` unless used across packages.
+- Domain packages export **only** records and interfaces. No Spring annotations, no Lombok, no Jackson annotations.
+- Adapter packages may use Spring stereotypes (`@Component`, `@Repository`, `@RestController`) but adapters depend on **port interfaces only** when interacting with the application.
+
+## 9. Spring Wiring
+- Wiring lives in `bootstrap`. Each adapter package may define a `@Configuration` (constructor-injected `@Bean` factories) but **does not** auto-`@ComponentScan` itself; bootstrap explicitly imports.
+- Use `@ConfigurationProperties` records for typed config; never raw `@Value`.
+- Prefer constructor injection. **No field injection.**
+
+## 10. Persistence (H2)
+- Migrations under `src/main/resources/db/migration` named `V<N>__<snake_case>.sql`. Flyway runs at startup.
+- All access via Spring **`JdbcClient`** (Spring Boot 3.2+, fluent JDBC). **No JPA/Hibernate**, no `JdbcTemplate` directly.
+- Mappers are explicit `RowMapper<T>` lambdas, not reflection-based.
+- SQL lives next to the repository class, either as `static final String` constants or in `*.sql` files loaded via `ClassPathResource` for non-trivial queries.
+
+## 11. REST
+- Controllers in `adapter.in.rest`. They depend only on `port.in` interfaces and DTO records.
+- DTOs are **separate** from domain records. Mapping via plain `static of(...)` factories. **No MapStruct.**
+- All endpoints documented with `@Operation`, `@ApiResponses`, `@Schema` (springdoc).
+- Request validation via `jakarta.validation` annotations on DTOs.
+- SSE endpoints return `SseEmitter`; subscribe to `JobEventBus`, unsubscribe on completion/timeout.
+
+## 12. MCP
+- Tool definitions are **records** decorated to produce JSON Schema via Spring AI's MCP support. Schema strings stay verbatim (1:1 with Context7) so LLMs see identical contracts.
+- Tool handlers depend only on `port.in` (`SearchLibraryDocs`, `ResolveLibraryId`).
+
+## 13. Tests
+- **JUnit 5** + **AssertJ** + **Mockito** (sparingly).
+- Unit tests live next to the package they test. Integration tests under `src/test/java/.../it/` and use `@SpringBootTest`.
+- Use **Testcontainers** only when truly required (we mostly avoid it via embedded stores).
+- ArchUnit test suite is mandatory and runs in CI.
+
+## 14. Dependency Hygiene
+- BOM-managed versions only. Add a dependency only if it provides clear value over JDK + Spring Boot + already-included libs.
+- No Lombok, no Guava (use JDK 21 equivalents), no Reactor (we use virtual threads + blocking).
+- No Kotlin, no Scala.
+
+## 15. Documentation
+- All architectural decisions go in **ARCHITECTURE.md**.
+- All research notes go in **FINDINGS.md** with sources.
+- All conventions go in **this file**.
+- Per-package `package-info.java` may exist for non-trivial packages, summarizing role and exported types.
+- No README sprawl: `README.md` is a quickstart only and links to the three docs above.
--- a/51
+++ b/51
@@ -0,0 +1,51 @@
+# ─── Build stage ──────────────────────────────────────────────────────────────
+# eclipse-temurin:21-jdk-jammy ships JDK 21 + Maven-compatible toolchain.
+# frontend-maven-plugin downloads Node/npm automatically, so no explicit
+# Node install is needed in the build stage.
+FROM eclipse-temurin:21-jdk-jammy AS builder
+
+RUN apt-get update \
+    && apt-get install -y --no-install-recommends maven \
+    && rm -rf /var/lib/apt/lists/*
+
+WORKDIR /build
+COPY . .
+
+RUN mvn -q package -DskipTests -T 1C
+
+# ─── Runtime stage (CPU-only) ─────────────────────────────────────────────────
+FROM eclipse-temurin:21-jre-jammy
+
+LABEL org.opencontainers.image.title="TrueRef"
+LABEL org.opencontainers.image.description="Self-hosted documentation retrieval platform for AI coding assistants (CPU variant)"
+LABEL org.opencontainers.image.url="https://git.sal.giize.com/mozempk/trueref"
+LABEL org.opencontainers.image.source="https://git.sal.giize.com/mozempk/trueref"
+
+WORKDIR /app
+
+COPY --from=builder /build/trueref-bootstrap/target/trueref.jar /app/trueref.jar
+
+# /data is the default trueref.home: H2 DB, Lucene index, embedding cache and
+# downloaded models all live here.  Mount a volume to persist between restarts.
+VOLUME /data
+
+ENV TRUEREF_HOME=/data \
+    TRUEREF_PORT=18080 \
+    JAVA_OPTS=""
+
+EXPOSE 18080
+
+# JVM flags required by trueref:
+#   --enable-native-access  silences FFM Linker warning from DJL tokenizers
+#   --add-modules           enables Lucene 10 SIMD codepath (incubator.vector)
+# Spring properties are passed via CMD so users can override them at runtime.
+ENTRYPOINT ["sh", "-c", \
+  "exec java \
+    --enable-native-access=ALL-UNNAMED \
+    --add-modules=jdk.incubator.vector \
+    ${JAVA_OPTS} \
+    -jar /app/trueref.jar \
+    --server.port=${TRUEREF_PORT} \
+    --trueref.home=${TRUEREF_HOME} \
+    --trueref.embedding.onnx-providers=cpu \
+    \"$@\"", "--"]
--- a/Dockerfile.gpu
+++ b/Dockerfile.gpu
@@ -0,0 +1,69 @@
+# ─── Build stage ──────────────────────────────────────────────────────────────
+FROM eclipse-temurin:21-jdk-jammy AS builder
+
+RUN apt-get update \
+    && apt-get install -y --no-install-recommends maven \
+    && rm -rf /var/lib/apt/lists/*
+
+WORKDIR /build
+COPY . .
+
+RUN mvn -q package -DskipTests -T 1C
+
+# ─── Runtime stage (NVIDIA GPU / CUDA 12 + cuDNN 9) ──────────────────────────
+# nvidia/cuda:12.4.1-cudnn-runtime-ubuntu22.04 ships:
+#   - CUDA 12.4 runtime libs  (libcuda.so, libcublas, etc.)
+#   - cuDNN 9 (cu12 build)    required by ONNX Runtime CUDA execution provider
+#
+# Prerequisites on the Docker host:
+#   - NVIDIA GPU driver ≥ 550 (CUDA 12.4 compatible)
+#   - nvidia-container-toolkit installed and configured
+#
+# Run with: docker run --gpus all --device /dev/nvidia0 ...
+FROM nvidia/cuda:12.4.1-cudnn-runtime-ubuntu22.04
+
+LABEL org.opencontainers.image.title="TrueRef (GPU)"
+LABEL org.opencontainers.image.description="Self-hosted documentation retrieval platform for AI coding assistants (NVIDIA GPU / CUDA 12 variant)"
+LABEL org.opencontainers.image.url="https://git.sal.giize.com/mozempk/trueref"
+LABEL org.opencontainers.image.source="https://git.sal.giize.com/mozempk/trueref"
+
+# Install Eclipse Temurin 21 JRE onto the CUDA base image.
+RUN apt-get update \
+    && apt-get install -y --no-install-recommends wget apt-transport-https gnupg \
+    && wget -q -O - https://packages.adoptium.net/artifactory/api/gpg/key/public \
+         | gpg --dearmor -o /usr/share/keyrings/adoptium.gpg \
+    && echo "deb [signed-by=/usr/share/keyrings/adoptium.gpg] https://packages.adoptium.net/artifactory/deb jammy main" \
+         > /etc/apt/sources.list.d/adoptium.list \
+    && apt-get update \
+    && apt-get install -y --no-install-recommends temurin-21-jre \
+    && rm -rf /var/lib/apt/lists/*
+
+WORKDIR /app
+
+COPY --from=builder /build/trueref-bootstrap/target/trueref.jar /app/trueref.jar
+
+VOLUME /data
+
+ENV TRUEREF_HOME=/data \
+    TRUEREF_PORT=18080 \
+    # Physical GPU index visible inside the container (0 after --gpus all remapping).
+    TRUEREF_GPU=0 \
+    # 0 = unbounded arena; set to e.g. 8589934592 (8 GiB) on shared hosts.
+    TRUEREF_MEM_LIMIT=0 \
+    JAVA_OPTS="" \
+    # CUDA_DEVICE_ORDER ensures nvidia-smi numbering matches CUDA runtime numbering.
+    CUDA_DEVICE_ORDER=PCI_BUS_ID
+
+EXPOSE 18080
+
+ENTRYPOINT ["sh", "-c", \
+  "exec java \
+    --enable-native-access=ALL-UNNAMED \
+    --add-modules=jdk.incubator.vector \
+    ${JAVA_OPTS} \
+    -jar /app/trueref.jar \
+    --server.port=${TRUEREF_PORT} \
+    --trueref.home=${TRUEREF_HOME} \
+    --trueref.embedding.gpu-device-id=${TRUEREF_GPU} \
+    --trueref.embedding.gpu-mem-limit-bytes=${TRUEREF_MEM_LIMIT} \
+    \"$@\"", "--"]
--- a/FINDINGS.md
+++ b/FINDINGS.md
@@ -0,0 +1,207 @@
+# trueref — Findings
+
+Research notes backing the choices in [ARCHITECTURE.md](ARCHITECTURE.md). Each section ends with a verdict and follow-up questions if any.
+
+---
+
+## F1. Context7 ingestion behavior (what we replicate functionally)
+
+- Context7 ingests git repositories and crawls associated docs sites driven by a `context7.json` manifest at the repo root, plus an optional `llms.txt` index.
+- It produces snippets shaped roughly as `{ title, description, source, code, language }` and serves them via two MCP tools: `resolve-library-id` and `get-library-docs`.
+- The `get-library-docs` API accepts `topic` and `tokens` parameters; topic biases retrieval, tokens caps the response size (defaults observed in client docs: ~5000).
+- Source: upstash/context7 GitHub repo & MCP docs.
+
+**Verdict:** functional parity is achievable without copying the manifest schema. Our chunk model captures the same fields under different names (`symbol`/`content`/`filePath`/`language`). MCP tool signatures are kept **byte-identical** for LLM compatibility.
+
+---
+
+## F2. Embedded vector store choice — Lucene 9 over Qdrant
+
+- Qdrant is a Rust binary; embedding it in a fat JAR requires extracting & spawning a child process, contradicting the "single JAR, embedded everything" goal.
+- **Apache Lucene ≥9.0** ships HNSW kNN (`KnnFloatVectorField`) alongside BM25 in a single index segment. Pure JVM, no native deps.
+- Lucene supports **filtered kNN** (`KnnFloatVectorQuery` with a `BooleanQuery` filter), which we need for `(repoId, versionId)` scoping.
+- Trade-off: Lucene HNSW lacks Qdrant's payload-rich filtering tricks (e.g. quantization presets, named vectors). Acceptable for our scale; we get BM25 in the same store for free.
+
+**Verdict:** Lucene 9 (we'll target the latest 9.x). One `IndexWriter`, refresh-on-search via `SearcherManager`.
+
+---
+
+## F3. Embedding model — bge-m3
+
+- BAAI/bge-m3: 568M params, 8192 ctx, multilingual (100+ langs), trained on multi-functionality (dense + sparse + colbert).
+- ONNX export available (BAAI provides it; community variants on HuggingFace).
+- License: MIT-style (model weights), works for self-hosted commercial use.
+- Vector dim: 1024 (dense). Sparse vocab compatible with Lucene if we want SPLADE-like sparse — out of scope for v1.
+
+**Verdict:** bge-m3 (dense only for v1). Sparse channel deferred.
+
+---
+
+## F4. Reranker — bge-reranker-v2-m3
+
+- Cross-encoder, scores (query, passage) pairs.
+- Same family as embedder: balanced quality/cost, ONNX-exportable.
+- Apache 2.0 license.
+
+**Verdict:** bge-reranker-v2-m3. Top-K candidates from RRF fed in, top-N (default 20) returned.
+
+---
+
+## F5. ML runtime — ONNX Runtime (Java bindings)
+
+- ONNX Runtime has **official Java bindings** (`com.microsoft.onnxruntime:onnxruntime` + `onnxruntime_gpu`).
+- Execution providers we will support:
+  - **CUDA** (`onnxruntime_gpu`): Linux + Windows with NVIDIA driver ≥ matching CUDA 12.x.
+  - **DirectML** (`onnxruntime-directml`): Windows, any DX12 GPU.
+  - **CPU**: always-on fallback.
+- ONNX Runtime has **no Vulkan execution provider**. Our earlier "Vulkan fallback" wish is not satisfiable in this stack — we drop it.
+- Generative LLMs in ONNX (e.g. Phi-3.5-mini) are possible but awkward (KV cache management, tokenizer differences). Since we picked **retrieval-only**, no generative model is needed.
+
+**Verdict:** ONNX Runtime, providers tried in order: cuda → directml → cpu. Vulkan dropped (documented).
+
+---
+
+## F6. Java version — 21 LTS, not 25
+
+- Spring Boot 3.5.x officially supports Java 17–23.
+- Spring AI 1.0.x targets the same range.
+- Java 25 is supported by neither at time of writing; risking obscure reflection/MR-JAR issues with downstream libs (JGit, Lucene, ONNX bindings).
+- Java 21 is LTS and has stable virtual threads + structured concurrency (`StructuredTaskScope` was preview through 23, finalizing soon — we'll guard usage behind a thin wrapper to ease later upgrade).
+
+**Verdict:** Java 21 LTS. Re-evaluate to 25 once Spring Boot certifies it.
+
+---
+
+## F7. Differential indexing scheme
+
+- We chose **dedupe-by-content-hash** AND **git-diff-driven file skipping**.
+- The hash dedupe alone gives constant-cost embeddings for unchanged code across tags.
+- The git-diff path additionally avoids parsing/chunking unchanged files, which dominates ingest CPU on large repos.
+- Storage model:
+  - `chunks`: one row per unique `content_hash`. Vector lives in Lucene keyed by `chunkId`.
+  - `chunk_versions`: many-to-many; one row per `(chunk, version, file, line range)`.
+  - Search: `BooleanQuery(filter=chunk_versions.version_id IN scope)` joined to vector field.
+- The chunk dedupe ratio is reported as a UI metric — it's the most intuitive measure of "differential" effectiveness.
+
+**Verdict:** confirmed; both mechanisms compose without conflict.
+
+---
+
+## F8. MCP transport — Streamable HTTP
+
+- The current MCP spec (revision 2025-03-26) defines **Streamable HTTP**: a single `POST /mcp` endpoint that may upgrade to SSE for long-lived/streamed responses; replaces the deprecated 2024-11-05 SSE transport.
+- Spring AI 1.0 ships an MCP server module that supports Streamable HTTP via Spring MVC.
+- We expose **only** Streamable HTTP, no SSE-only legacy endpoint (per user spec).
+
+**Verdict:** Streamable HTTP only at `/mcp`.
+
+---
+
+## F9. Embedded SQL store — H2 (MVCC)
+
+- H2 in MVCC mode supports concurrent readers and a single writer with row-level locking. Good enough for our metadata write rates (jobs, versions, chunk_versions).
+- File-based, single JAR dependency, JDBC.
+- Considered & rejected:
+  - **DuckDB**: column-store, slower OLTP, no good Flyway story.
+  - **SQLite**: poor concurrency under write load.
+  - **Embedded Postgres (zonky)**: pulls a 100+ MB native binary per OS — fights the fat JAR goal.
+
+**Verdict:** H2 file-based, MVCC=true, with Flyway migrations.
+
+---
+
+## F10. Job orchestration — custom virtual-thread orchestrator
+
+- Spring Batch is feature-rich but requires a JobRepository (typically Postgres or H2) and adds startup cost we don't need.
+- Our jobs are **per-tag**, **simple linear stage sequences**, with persistence-of-status as the only durability requirement.
+- Custom orchestrator: each `IngestionJob` runs on a virtual thread; stages execute sequentially; stage transitions are durably written to H2 in a transaction; `JobEventBus` emits events for SSE.
+- Crash recovery: on startup, scan jobs in `RUNNING` status, mark them `FAILED` (or resume specific resumable stages — v2).
+
+**Verdict:** custom orchestrator. Spring Batch deferred unless we hit a ceiling.
+
+---
+
+## F11. Code parser — pure-Java heuristic for v1, tree-sitter pluggable for v2
+
+The Java tree-sitter ecosystem in 2026 is fragmented:
+
+- **`io.github.tree-sitter:jtreesitter`** uses Project Panama FFI → requires **Java 22+**. We target Java 21 LTS, so this is out.
+- **`io.github.bonede:tree-sitter`** is JNI-based and works on Java 21, but bundling per-OS (linux/windows/mac × x64/arm64) native grammar binaries for many languages bloats the fat JAR significantly and creates a packaging matrix we don't want to maintain in v1.
+- **`ai.serenade.treesitter:java-tree-sitter`** is unmaintained.
+
+**Decision (v1):** ship a pure-Java heuristic `CodeParser` adapter. Strategies, tried in order per file:
+
+1. **Markdown / `.txt` / `.rst`**: split by ATX/Setext headings; large sections further split by paragraph.
+2. **Brace-balanced languages** (java, c, c++, c#, go, rust, js, ts, kotlin, scala, swift): walk the file tracking brace depth + line-based heuristics (function signatures, top-level declarations) to extract chunks of complete top-level constructs. Symbol name extracted via a tiny regex per language.
+3. **Indent-based languages** (python, yaml, ruby): split on top-level `def`/`class`/`module` boundaries; symbol name from the declaration line.
+4. **Fallback** (any text file): sliding-window of N lines (default 80) with M lines overlap (default 10).
+
+The `CodeParser` port is unchanged. A future tree-sitter implementation (when JDK upgrade or upstream packaging matures) can be swapped in by providing an alternate `@Component` and toggling a config flag — that's exactly what hexagonal architecture buys us.
+
+**Verdict:** pure-Java heuristic parser for v1; tree-sitter remains a documented future enhancement.
+
+---
+
+## F12. Concurrency caps & GPU contention
+
+- User chose **unbounded virtual threads**. This is safe for I/O-bound stages.
+- ONNX inference is GPU-bound; calling the same `OrtSession` from many threads concurrently is unsupported. Two mitigations:
+  1. A **session pool** of size N (config `embedding.session-count`, default 2).
+  2. A **`Semaphore(N)`** acquired by any caller before invoking inference. Pool & semaphore sizes match.
+- This means tag-level parallelism is naturally throttled by GPU capacity without explicit per-tag limits.
+
+**Verdict:** session pool + semaphore. Document the knob clearly in `application.yml`.
+
+---
+
+## F13. Frontend in fat JAR
+
+- SvelteKit `@sveltejs/adapter-static` produces a fully static bundle (HTML/CSS/JS). We build it as a Maven sub-step (frontend-maven-plugin) and copy `frontend/build/` to `bootstrap/src/main/resources/static/`. Spring serves it by default.
+- SPA fallback: a `WebMvcConfigurer` maps all unmatched non-API paths to `index.html` so client-side routing works.
+
+**Verdict:** static adapter + Spring static-resource serving. Single artifact preserved.
+
+---
+
+## F14. Open questions / future work
+
+1. **Sparse channel** (bge-m3 sparse / SPLADE) for stronger lexical recall — deferred to v2.
+2. **Per-language reranker fine-tuning** — out of scope (no fine-tuning, per spec).
+3. **Compaction job** to truly delete orphan chunks (currently soft-delete on versions). Schedule TBD.
+4. **Watched-folder** auto-discovery semantics: how often do we rescan `./data/watched/`? Default proposal: every 5 min + on filesystem watch event (Java NIO `WatchService`).
+5. **Repo size cap**: do we need a maximum total cloned size to prevent runaway disk use? Currently unlimited; could add per-repo and global caps in v2.
+6. **GPU memory introspection**: Linux NVML via JNI (`jnvml`) for GPU mem gauges; on Windows + DirectML we surface only "available/in-use" booleans.
+
+---
+
+## F15. References (for re-checking when libraries bump)
+
+- Context7 repo & MCP tool surface — to sanity-check schema fidelity on releases.
+- Spring AI 1.0.x release notes — verify MCP server Streamable HTTP module name & API.
+- Spring Boot 3.5.x release notes — confirm Java version compatibility window.
+- Lucene 9.x kNN docs — confirm filtered vector query API surface.
+- ONNX Runtime Java release notes — confirm CUDA/DirectML EP availability per version.
+- BAAI/bge-m3 model card — confirm ONNX export availability/format.
+- MCP spec 2025-03-26 — Streamable HTTP transport requirements.
+
+> Use the Context7 MCP lookup skill before bumping any of the above to fetch fresh, version-specific docs.
+
+---
+
+## F16. Smoke-test log (2026-04-21)
+
+End-to-end smoke after first assembly:
+- `mvn -pl trueref-bootstrap -am package` → BUILD SUCCESS, fat JAR ~582 MB.
+- `mvn test` → **16 tests pass** (parser 6, pooling 5, disk cache 5), **0 failures**.
+- `java -jar trueref-bootstrap/target/trueref.jar --trueref.embedding.session-count=0` — started in 3.6 s.
+- `GET /actuator/health` → `UP` (db H2, disk, ping, ssl).
+- `POST /api/repos` + `GET /api/repos` — round-trips a repo.
+- `GET /swagger-ui.html` → 302 redirect (to `/swagger-ui/index.html`), `GET /v3/api-docs` → 200.
+- `GET /` → 200 (SvelteKit SPA served from Spring static resources).
+- `POST /mcp` one-shot JSON-RPC returns HTTP 500 — expected, the WebMVC MCP transport requires an SSE session established by `GET /sse` first; MCP clients that implement the Streamable-HTTP spec do this automatically. Verified MCP tools register: `tools/list` handler is reached (error thrown is transport-level session lookup, not bean wiring).
+
+Fixes landed during smoke:
+- `V1__init_schema.sql`: H2 in PostgreSQL mode rejects `AUTO_INCREMENT`. Switched `job_log_events.id` to `BIGINT GENERATED BY DEFAULT AS IDENTITY` and removed the explicit `NULL` constraint.
+- `OnnxProperties.sessionCount` can now be 0 (disables the ONNX stack, for environments where models aren't available); `GpuSemaphore` accepts 0 permits by internally using 1 (never acquired in disabled mode).
+- `OnnxEmbeddingService` / `OnnxRerankerService` short-circuit in disabled mode; reranker pass-through preserves input order.
+- `ApplicationBeans` exposes only concrete beans (not both the class and its interface) to avoid ambiguous autowiring.
--- a/README.md
+++ b/README.md
@@ -0,0 +1,21 @@
+# trueref
+
+Self-hosted [Context7](https://github.com/upstash/context7) clone in Java 21 + Spring Boot 3.5: indexes git repositories per tag, exposes a Streamable-HTTP MCP server, REST + Swagger, and a SvelteKit dashboard for ingestion observability and querying.
+
+See:
+- [ARCHITECTURE.md](ARCHITECTURE.md) — design, hexagonal layout, pipelines, MCP/REST surfaces.
+- [CODE_STYLE.md](CODE_STYLE.md) — conventions.
+- [FINDINGS.md](FINDINGS.md) — research notes backing every choice.
+
+## Quickstart
+
+```bash
+./mvnw -DskipTests package
+java -jar trueref-bootstrap/target/trueref.jar
+```
+
+Browse:
+- UI:           http://localhost:8080/
+- Swagger:      http://localhost:8080/swagger-ui.html
+- MCP endpoint: http://localhost:8080/mcp
+- Actuator:     http://localhost:8080/actuator
--- a/pom.xml
+++ b/pom.xml
@@ -0,0 +1,198 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <groupId>com.trueref</groupId>
+    <artifactId>trueref-parent</artifactId>
+    <version>0.1.0-SNAPSHOT</version>
+    <packaging>pom</packaging>
+
+    <name>trueref</name>
+    <description>Self-hosted Context7-style library docs indexer + MCP server</description>
+
+    <modules>
+        <module>trueref-domain</module>
+        <module>trueref-application</module>
+        <module>trueref-adapters</module>
+        <module>trueref-frontend</module>
+        <module>trueref-bootstrap</module>
+    </modules>
+
+    <parent>
+        <groupId>org.springframework.boot</groupId>
+        <artifactId>spring-boot-starter-parent</artifactId>
+        <version>3.5.3</version>
+        <relativePath/>
+    </parent>
+
+    <properties>
+        <java.version>21</java.version>
+        <maven.compiler.release>21</maven.compiler.release>
+        <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
+
+        <spring-ai.version>1.0.0</spring-ai.version>
+        <springdoc.version>2.8.6</springdoc.version>
+        <jgit.version>7.3.0.202506031305-r</jgit.version>
+        <lucene.version>10.4.0</lucene.version>
+        <onnxruntime.version>1.22.0</onnxruntime.version>
+        <huggingface-tokenizers.version>0.33.0</huggingface-tokenizers.version>
+        <h2.version>2.3.232</h2.version>
+        <flyway.version>11.8.2</flyway.version>
+        <jspecify.version>1.0.0</jspecify.version>
+        <assertj.version>3.26.3</assertj.version>
+
+        <!-- Plugins -->
+        <spotless.version>2.43.0</spotless.version>
+        <frontend-maven-plugin.version>1.15.1</frontend-maven-plugin.version>
+        <node.version>v20.18.0</node.version>
+        <npm.version>10.8.2</npm.version>
+    </properties>
+
+    <dependencyManagement>
+        <dependencies>
+            <!-- Internal modules -->
+            <dependency>
+                <groupId>com.trueref</groupId>
+                <artifactId>trueref-domain</artifactId>
+                <version>${project.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>com.trueref</groupId>
+                <artifactId>trueref-application</artifactId>
+                <version>${project.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>com.trueref</groupId>
+                <artifactId>trueref-adapters</artifactId>
+                <version>${project.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>com.trueref</groupId>
+                <artifactId>trueref-frontend</artifactId>
+                <version>${project.version}</version>
+            </dependency>
+
+            <!-- Spring AI BOM -->
+            <dependency>
+                <groupId>org.springframework.ai</groupId>
+                <artifactId>spring-ai-bom</artifactId>
+                <version>${spring-ai.version}</version>
+                <type>pom</type>
+                <scope>import</scope>
+            </dependency>
+
+            <!-- 3rd-party -->
+            <dependency>
+                <groupId>org.springdoc</groupId>
+                <artifactId>springdoc-openapi-starter-webmvc-ui</artifactId>
+                <version>${springdoc.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.eclipse.jgit</groupId>
+                <artifactId>org.eclipse.jgit</artifactId>
+                <version>${jgit.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.eclipse.jgit</groupId>
+                <artifactId>org.eclipse.jgit.ssh.apache</artifactId>
+                <version>${jgit.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.apache.lucene</groupId>
+                <artifactId>lucene-core</artifactId>
+                <version>${lucene.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.apache.lucene</groupId>
+                <artifactId>lucene-analysis-common</artifactId>
+                <version>${lucene.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.apache.lucene</groupId>
+                <artifactId>lucene-queryparser</artifactId>
+                <version>${lucene.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>com.microsoft.onnxruntime</groupId>
+                <artifactId>onnxruntime</artifactId>
+                <version>${onnxruntime.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>com.microsoft.onnxruntime</groupId>
+                <artifactId>onnxruntime_gpu</artifactId>
+                <version>${onnxruntime.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>ai.djl.huggingface</groupId>
+                <artifactId>tokenizers</artifactId>
+                <version>${huggingface-tokenizers.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.jspecify</groupId>
+                <artifactId>jspecify</artifactId>
+                <version>${jspecify.version}</version>
+            </dependency>
+            <dependency>
+                <groupId>org.assertj</groupId>
+                <artifactId>assertj-core</artifactId>
+                <version>${assertj.version}</version>
+                <scope>test</scope>
+            </dependency>
+        </dependencies>
+    </dependencyManagement>
+
+    <dependencies>
+        <dependency>
+            <groupId>org.jspecify</groupId>
+            <artifactId>jspecify</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.junit.jupiter</groupId>
+            <artifactId>junit-jupiter</artifactId>
+            <scope>test</scope>
+        </dependency>
+        <dependency>
+            <groupId>org.assertj</groupId>
+            <artifactId>assertj-core</artifactId>
+            <scope>test</scope>
+        </dependency>
+    </dependencies>
+
+    <build>
+        <pluginManagement>
+            <plugins>
+                <plugin>
+                    <groupId>com.diffplug.spotless</groupId>
+                    <artifactId>spotless-maven-plugin</artifactId>
+                    <version>${spotless.version}</version>
+                    <configuration>
+                        <java>
+                            <palantirJavaFormat/>
+                            <removeUnusedImports/>
+                            <importOrder/>
+                            <trimTrailingWhitespace/>
+                            <endWithNewline/>
+                        </java>
+                    </configuration>
+                </plugin>
+                <plugin>
+                    <groupId>com.github.eirslett</groupId>
+                    <artifactId>frontend-maven-plugin</artifactId>
+                    <version>${frontend-maven-plugin.version}</version>
+                </plugin>
+            </plugins>
+        </pluginManagement>
+        <plugins>
+            <plugin>
+                <groupId>org.apache.maven.plugins</groupId>
+                <artifactId>maven-compiler-plugin</artifactId>
+                <configuration>
+                    <release>${java.version}</release>
+                    <parameters>true</parameters>
+                </configuration>
+            </plugin>
+        </plugins>
+    </build>
+</project>
--- a/tests/quality/phaser_rag_eval.py
+++ b/tests/quality/phaser_rag_eval.py
@@ -0,0 +1,611 @@
+#!/usr/bin/env python3
+"""
+Phaser RAG Quality Evaluation Suite
+====================================
+Simulates an LLM querying TrueRef for Phaser documentation and guidance.
+Tests are designed to be hard and objective: each defines exact expected content
+fragments and/or expected source files that MUST appear in the top-k results.
+
+Scoring metrics per test:
+  file@1    - expected file appeared as hit #1
+  file@3    - expected file appeared in hits 1-3
+  file@5    - expected file appeared in hits 1-5
+  content@5 - at least one expected content fragment found across the top-5 hits combined
+  content@1 - expected content fragment found in hit #1
+
+Overall suite scores:
+  MRR      - Mean Reciprocal Rank (file position)
+  P@1..5   - Precision@k for file hits
+  C@5      - Content recall across top-5
+
+Run:
+  python3 phaser_rag_eval.py [--base-url http://localhost:18080] [--verbose]
+"""
+
+import argparse
+import json
+import sys
+import time
+from dataclasses import dataclass, field
+from typing import Optional
+import urllib.request
+import urllib.error
+
+# ---------------------------------------------------------------------------
+# Config
+# ---------------------------------------------------------------------------
+REPO_ID = "50010965-aa3f-45f4-bb8d-72a0d50bf0db"
+
+# Version IDs pinned to specific tags (fetched at startup if not found)
+VERSIONS = {
+    "v4.1.0":  "6c6a00f5-0945-4fd7-b62c-c0e69f14effe",
+    "v3.88.0": "d032d4d4-e6bc-4c9d-9c3c-8853e4a1cdc9",
+    "v3.85.2": "d1cf906e-54b9-416f-bd5b-9432d69d9935",
+    "v3.60.0": "95d0a8e2-9071-4986-85d4-59ae97893353",
+}
+
+# ---------------------------------------------------------------------------
+# Test definition
+# ---------------------------------------------------------------------------
+@dataclass
+class TestCase:
+    id: str
+    name: str
+    query: str
+    version: str                        # key into VERSIONS
+    topic: Optional[str] = None
+    expected_files: list[str] = field(default_factory=list)   # substrings of filePath
+    expected_content: list[str] = field(default_factory=list) # substrings that MUST appear
+    required_content: list[str] = field(default_factory=list) # ALL of these must appear (stricter)
+    max_hits: int = 10
+    tokens_budget: int = 6000
+    # Optional: minimum rerank score the top hit should exceed
+    min_score: Optional[float] = None
+
+# ---------------------------------------------------------------------------
+# Test definitions — 25 hard, objective cases
+# ---------------------------------------------------------------------------
+TESTS: list[TestCase] = [
+
+    # ── 1. Tween system: basic config properties ──────────────────────────
+    TestCase(
+        id="T01",
+        name="Tween config: yoyo/hold/repeatDelay properties",
+        query="What properties can I set in a TweenBuilderConfig to make a tween yoyo with a hold and repeat delay?",
+        version="v4.1.0",
+        topic="tweens",
+        expected_files=["tweens/builders/TweenBuilder.js", "tweens/typedefs"],
+        expected_content=["yoyo", "hold", "repeatDelay"],
+        required_content=["yoyo", "repeatDelay"],
+    ),
+
+    # ── 2. Tween system: onComplete / onUpdate callbacks ──────────────────
+    TestCase(
+        id="T02",
+        name="Tween callbacks: onComplete and onUpdate signatures",
+        query="How do I use onComplete and onUpdate callbacks in a Phaser tween? What arguments do they receive?",
+        version="v4.1.0",
+        topic="tweens",
+        expected_files=["tweens/"],
+        expected_content=["onComplete", "onUpdate", "onStart"],
+        required_content=["onComplete"],
+    ),
+
+    # ── 3. Arcade physics: setCollideWorldBounds signature ────────────────
+    TestCase(
+        id="T03",
+        name="Arcade physics: setCollideWorldBounds signature",
+        query="What are the parameters of setCollideWorldBounds in Phaser Arcade physics? Can I pass bounceX and bounceY to set bounce on world edges?",
+        version="v4.1.0",
+        topic="physics",
+        expected_files=["physics/arcade/Body.js"],
+        expected_content=["setCollideWorldBounds", "bounceX", "bounceY", "onWorldBounds"],
+        required_content=["setCollideWorldBounds", "bounceX"],
+    ),
+
+    # ── 4. Arcade physics: addCollider vs addOverlap ──────────────────────
+    TestCase(
+        id="T04",
+        name="Arcade physics: addCollider vs addOverlap difference",
+        query="What is the difference between addCollider and addOverlap in Phaser's Arcade physics World? How do I add a callback?",
+        version="v4.1.0",
+        topic="physics",
+        expected_files=["physics/arcade/World.js"],
+        expected_content=["addCollider", "addOverlap", "collideCallback", "processCallback"],
+        required_content=["addCollider", "addOverlap"],
+    ),
+
+    # ── 5. Camera: shake parameters ───────────────────────────────────────
+    TestCase(
+        id="T05",
+        name="Camera shake: duration, intensity, force, callback",
+        query="How do I make the camera shake in Phaser? What parameters does camera.shake accept?",
+        version="v4.1.0",
+        topic="camera",
+        expected_files=["cameras/2d/Camera.js"],
+        expected_content=["shake", "duration", "intensity", "force", "callback"],
+        required_content=["shake", "intensity"],
+    ),
+
+    # ── 6. Camera: startFollow with lerp ─────────────────────────────────
+    TestCase(
+        id="T06",
+        name="Camera follow: startFollow lerpX lerpY parameters",
+        query="How do I make the Phaser camera follow a player with smooth lerp? What are the lerpX and lerpY parameters?",
+        version="v4.1.0",
+        topic="camera",
+        expected_files=["cameras/2d/Camera.js"],
+        expected_content=["startFollow", "lerpX", "lerpY", "roundPixels"],
+        required_content=["startFollow", "lerpX"],
+    ),
+
+    # ── 7. Camera: setDeadzone ────────────────────────────────────────────
+    TestCase(
+        id="T07",
+        name="Camera deadzone: setDeadzone width/height",
+        query="How does camera deadzone work in Phaser? How do I create a rectangular deadzone so the camera only moves when the player exits it?",
+        version="v4.1.0",
+        topic="camera",
+        expected_files=["cameras/2d/Camera.js"],
+        expected_content=["setDeadzone", "deadzone"],
+        required_content=["setDeadzone"],
+    ),
+
+    # ── 8. Scene: pass data when starting another scene ───────────────────
+    TestCase(
+        id="T08",
+        name="Scene management: pass data on scene.start",
+        query="How do I pass data to another scene when calling scene.start() or scene.launch()? How does the init method receive it?",
+        version="v4.1.0",
+        topic="scenes",
+        expected_files=["scene/"],
+        expected_content=["init", "data", "start", "launch"],
+        required_content=["init"],
+    ),
+
+    # ── 9. Animation system: chaining animations ──────────────────────────
+    TestCase(
+        id="T09",
+        name="Animation chaining: chain() and playAfterRepeat()",
+        query="How can I chain multiple animations so one plays after another finishes in Phaser? What is the chain() method?",
+        version="v4.1.0",
+        topic="animations",
+        expected_files=["gameobjects/sprite/Sprite.js", "animations/"],
+        expected_content=["chain", "playAfterRepeat", "playAfterDelay"],
+        required_content=["chain"],
+    ),
+
+    # ── 10. Animation system: events ─────────────────────────────────────
+    TestCase(
+        id="T10",
+        name="Animation events: ANIMATION_COMPLETE, ANIMATION_START",
+        query="What events does the Phaser animation system emit? How do I listen for when an animation completes on a specific sprite?",
+        version="v4.1.0",
+        topic="animations",
+        expected_files=["animations/events/"],
+        expected_content=["ANIMATION_COMPLETE", "ANIMATION_START", "ANIMATION_STOP"],
+        required_content=["ANIMATION_COMPLETE"],
+    ),
+
+    # ── 11. Input: pointer events ─────────────────────────────────────────
+    TestCase(
+        id="T11",
+        name="Input: setInteractive + pointerdown/pointerover events",
+        query="How do I call setInteractive on a game object and listen for pointerdown and pointerover events in Phaser?",
+        version="v4.1.0",
+        topic="input",
+        expected_files=["input/"],
+        expected_content=["pointerdown", "pointerover", "pointerout", "setInteractive"],
+        required_content=["setInteractive", "pointerdown"],
+    ),
+
+    # ── 12. Input: keyboard cursor keys ──────────────────────────────────
+    TestCase(
+        id="T12",
+        name="Input: createCursorKeys and keyboard key states",
+        query="How do I read arrow key input in Phaser? How does createCursorKeys() work and how do I check if a key is down?",
+        version="v4.1.0",
+        topic="input",
+        expected_files=["input/keyboard/"],
+        expected_content=["createCursorKeys", "isDown", "up", "down", "left", "right"],
+        required_content=["createCursorKeys"],
+    ),
+
+    # ── 13. Loader: atlas and texture keys ───────────────────────────────
+    TestCase(
+        id="T13",
+        name="Loader: load.atlas config object and frame keys",
+        query="How do I load a texture atlas in Phaser? What are the arguments to this.load.atlas() and how do I use frame keys?",
+        version="v4.1.0",
+        topic="loader",
+        expected_files=["loader/filetypes/AtlasJSONFile.js", "loader/"],
+        expected_content=["atlas", "textureURL", "atlasURL", "frameConfig"],
+        required_content=["atlas"],
+        min_score=0.7,
+    ),
+
+    # ── 14. Tilemaps: setCollisionBetween ────────────────────────────────
+    TestCase(
+        id="T14",
+        name="Tilemap: setCollisionBetween start/stop parameters",
+        query="How do I set collision on a range of tile indices in a Phaser tilemap? What does setCollisionBetween do?",
+        version="v4.1.0",
+        topic="tilemaps",
+        expected_files=["tilemaps/Tilemap.js", "tilemaps/"],
+        expected_content=["setCollisionBetween", "start", "stop", "collides", "recalculateFaces"],
+        required_content=["setCollisionBetween"],
+    ),
+
+    # ── 15. Tilemaps: createFromObjects ──────────────────────────────────
+    TestCase(
+        id="T15",
+        name="Tilemap: createFromObjects from Tiled object layer",
+        query="How do I convert Tiled object layer objects into Phaser game objects? How does createFromObjects work?",
+        version="v4.1.0",
+        topic="tilemaps",
+        expected_files=["tilemaps/Tilemap.js"],
+        expected_content=["createFromObjects", "objectLayerName"],
+        required_content=["createFromObjects"],
+    ),
+
+    # ── 16. RenderTexture: beginDraw / endDraw (v3 API) ──────────────────
+    TestCase(
+        id="T16",
+        name="RenderTexture v3: beginDraw / batchDraw / endDraw pattern",
+        query="How do I use beginDraw and endDraw on a Phaser RenderTexture for batch drawing? What is the workflow?",
+        version="v3.85.2",
+        topic="rendering",
+        expected_files=["textures/DynamicTexture.js"],
+        expected_content=["beginDraw", "endDraw", "batchDraw", "batchDrawFrame"],
+        required_content=["beginDraw", "endDraw"],
+    ),
+
+    # ── 17. Masking: BitmapMask vs GeometryMask (v3 API) ──────────────────
+    TestCase(
+        id="T17",
+        name="Masking v3: createBitmapMask vs createGeometryMask",
+        query="What is the difference between a BitmapMask and a GeometryMask in Phaser? How do I create and apply them?",
+        version="v3.85.2",
+        topic="rendering",
+        expected_files=["gameobjects/components/Mask.js", "display/mask/"],
+        expected_content=["createBitmapMask", "createGeometryMask", "setMask", "BitmapMask", "GeometryMask"],
+        required_content=["BitmapMask", "GeometryMask"],
+    ),
+
+    # ── 18. Groups: getFirstDead / getFirstAlive pool pattern ─────────────
+    TestCase(
+        id="T18",
+        name="Group: object pool with getFirstDead / getFirstAlive",
+        query="How do I implement an object pool in Phaser using a Group? What are getFirstDead and getFirstAlive?",
+        version="v4.1.0",
+        topic="gameobjects",
+        expected_files=["gameobjects/group/Group.js"],
+        expected_content=["getFirstDead", "getFirstAlive", "createIfNull", "countActive"],
+        required_content=["getFirstDead", "getFirstAlive"],
+    ),
+
+    # ── 19. Matter.js: fromVertices custom body shape ─────────────────────
+    TestCase(
+        id="T19",
+        name="Matter.js: custom body shape with fromVertices",
+        query="How do I create a custom polygon physics body from vertices in Phaser's Matter.js physics?",
+        version="v4.1.0",
+        topic="physics",
+        expected_files=["physics/matter-js/Factory.js", "physics/matter-js/"],
+        expected_content=["fromVertices", "vertexSets", "options"],
+        required_content=["fromVertices"],
+    ),
+
+    # ── 20. Game config: FPS limit / target ───────────────────────────────
+    TestCase(
+        id="T20",
+        name="Game config: fps.target and fps.limit settings",
+        query="How do I configure the target frame rate and FPS limit in the Phaser game config? What is the difference between target and limit?",
+        version="v4.1.0",
+        topic="core",
+        expected_files=["core/TimeStep.js", "core/Config.js"],
+        expected_content=["targetFps", "fpsLimit", "target", "fps"],
+        required_content=["targetFps"],
+    ),
+
+    # ── 21. Scale Manager: ScaleModes ────────────────────────────────────
+    TestCase(
+        id="T21",
+        name="Scale Manager: FIT vs ENVELOP scale modes",
+        query="What scale modes are available in Phaser's Scale Manager? How does FIT differ from ENVELOP? How do I make a responsive game?",
+        version="v4.1.0",
+        topic="scale",
+        expected_files=["scale/"],
+        expected_content=["FIT", "ENVELOP", "ScaleManager", "autoCenter"],
+        required_content=["FIT"],
+    ),
+
+    # ── 22. Data Manager: set/get/events ──────────────────────────────────
+    TestCase(
+        id="T22",
+        name="Data Manager: set/get and CHANGE_DATA event",
+        query="How does the Phaser Data Manager work? How do I watch for data changes using events on a game object's data?",
+        version="v4.1.0",
+        topic="data",
+        expected_files=["data/DataManager.js", "data/"],
+        expected_content=["CHANGE_DATA", "set", "get", "events"],
+        required_content=["CHANGE_DATA"],
+    ),
+
+    # ── 23. Depth sort: setDepth and displayList ──────────────────────────
+    TestCase(
+        id="T23",
+        name="Depth sorting: setDepth and display list ordering",
+        query="How does Phaser handle rendering order (z-order)? How do I use setDepth to control which objects render on top?",
+        version="v4.1.0",
+        topic="rendering",
+        expected_files=["gameobjects/"],
+        expected_content=["setDepth", "depth", "displayList"],
+        required_content=["setDepth"],
+    ),
+
+    # ── 24. Version diff: v3.60 TweenChain (new in 3.60) ─────────────────
+    TestCase(
+        id="T24",
+        name="Version-specific: TweenChain introduced in v3.60",
+        query="How do I create a sequence of tweens that play one after another using TweenChain in Phaser 3.60+?",
+        version="v3.60.0",
+        topic="tweens",
+        expected_files=["tweens/"],
+        expected_content=["TweenChain", "chain"],
+        required_content=["TweenChain"],
+    ),
+
+    # ── 25. Hard adversarial: camera.ignore() ─────────────────────────────
+    TestCase(
+        id="T25",
+        name="Camera: ignore() to exclude game objects from a camera",
+        query="How do I make a game object invisible to a specific camera in Phaser while remaining visible to others? What is camera.ignore()?",
+        version="v4.1.0",
+        topic="camera",
+        expected_files=["cameras/2d/"],
+        expected_content=["ignore", "camera"],
+        required_content=["ignore"],
+    ),
+]
+
+# ---------------------------------------------------------------------------
+# HTTP helpers
+# ---------------------------------------------------------------------------
+
+def post_json(url: str, payload: dict) -> dict:
+    body = json.dumps(payload).encode()
+    req = urllib.request.Request(
+        url, data=body,
+        headers={"Content-Type": "application/json", "Accept": "application/json"},
+        method="POST",
+    )
+    with urllib.request.urlopen(req, timeout=30) as resp:
+        return json.loads(resp.read().decode())
+
+
+def get_json(url: str) -> dict | list:
+    req = urllib.request.Request(url, headers={"Accept": "application/json"})
+    with urllib.request.urlopen(req, timeout=15) as resp:
+        return json.loads(resp.read().decode())
+
+
+# ---------------------------------------------------------------------------
+# Evaluation logic
+# ---------------------------------------------------------------------------
+
+@dataclass
+class TestResult:
+    test: TestCase
+    hits: list[dict]
+    elapsed_ms: float
+    error: Optional[str] = None
+
+    # Computed below
+    file_rank: Optional[int] = None   # 1-based rank of first expected-file match
+    content_ranks: list[int] = field(default_factory=list)  # 1-based ranks where content found
+    required_found: bool = False
+    top_score: Optional[float] = None
+
+    def file_at(self, k: int) -> bool:
+        return self.file_rank is not None and self.file_rank <= k
+
+    def content_at(self, k: int) -> bool:
+        return any(r <= k for r in self.content_ranks)
+
+    def mrr(self) -> float:
+        if self.file_rank is None:
+            return 0.0
+        return 1.0 / self.file_rank
+
+    def summary_line(self) -> str:
+        f1 = "✓" if self.file_at(1) else "·"
+        f3 = "✓" if self.file_at(3) else "·"
+        f5 = "✓" if self.file_at(5) else "·"
+        c5 = "✓" if self.content_at(5) else "·"
+        req = "✓" if self.required_found else "✗"
+        rank_str = f"rank={self.file_rank}" if self.file_rank else "NOT FOUND"
+        score_str = f"score={self.top_score:.3f}" if self.top_score else ""
+        ms_str = f"{self.elapsed_ms:.0f}ms"
+        return (
+            f"[{self.test.id}] {self.test.name[:52]:<52} "
+            f"f@1={f1} f@3={f3} f@5={f5} c@5={c5} req={req}  "
+            f"{rank_str:>12}  {score_str}  {ms_str}"
+        )
+
+
+def evaluate(result: TestResult, verbose: bool = False) -> None:
+    hits = result.hits
+    if not hits:
+        return
+
+    result.top_score = hits[0].get("score") if hits else None
+
+    # File rank: position of first hit whose filePath matches any expected_files substring
+    for i, hit in enumerate(hits):
+        fp = hit.get("filePath", "")
+        if any(ef in fp for ef in result.test.expected_files):
+            result.file_rank = i + 1
+            break
+
+    # Content rank: for each expected_content fragment, find the first hit that contains it
+    combined_content = {i: (hit.get("content") or "") for i, hit in enumerate(hits)}
+
+    for fragment in result.test.expected_content:
+        for i, content in combined_content.items():
+            if fragment.lower() in content.lower():
+                result.content_ranks.append(i + 1)
+                break
+
+    # Required content: ALL required fragments must appear somewhere in top-10
+    all_content = " ".join(combined_content.values()).lower()
+    result.required_found = all(
+        r.lower() in all_content for r in result.test.required_content
+    )
+
+    if verbose:
+        print(f"\n{'─'*80}")
+        print(f"[{result.test.id}] {result.test.name}")
+        print(f"  Query: {result.test.query}")
+        print(f"  Expected files: {result.test.expected_files}")
+        print(f"  Expected content: {result.test.expected_content}")
+        print(f"  Top hits:")
+        for i, hit in enumerate(hits[:5]):
+            fp = hit.get("filePath", "?")
+            score = hit.get("score", 0.0)
+            snip = (hit.get("content") or "")[:100].replace("\n", " ")
+            marker = " ← FILE MATCH" if any(ef in fp for ef in result.test.expected_files) else ""
+            print(f"    [{i+1}] score={score:.3f}  {fp}{marker}")
+            print(f"         {snip}")
+
+
+# ---------------------------------------------------------------------------
+# Main runner
+# ---------------------------------------------------------------------------
+
+def run(base_url: str, verbose: bool) -> None:
+    base_url = base_url.rstrip("/")
+    search_url = f"{base_url}/api/search"
+    versions_url = f"{base_url}/api/repos/{REPO_ID}/versions"
+
+    print(f"TrueRef Phaser RAG Evaluation Suite")
+    print(f"Server : {base_url}")
+    print(f"Tests  : {len(TESTS)}")
+    print()
+
+    # Resolve version IDs from server (in case they differ)
+    try:
+        all_versions = get_json(versions_url)
+        live_map = {v["tag"]: v["id"] for v in all_versions if v.get("status") == "INDEXED"}
+        for tag in list(VERSIONS.keys()):
+            if tag in live_map:
+                VERSIONS[tag] = live_map[tag]
+    except Exception as e:
+        print(f"WARN: could not refresh version IDs: {e}")
+
+    results: list[TestResult] = []
+
+    for tc in TESTS:
+        version_id = VERSIONS.get(tc.version)
+        if not version_id:
+            print(f"SKIP [{tc.id}]: version {tc.version} not available")
+            continue
+
+        payload = {
+            "text": tc.query,
+            "scope": [{"repoId": REPO_ID, "versionId": version_id}],
+            "maxHits": tc.max_hits,
+            "tokensBudget": tc.tokens_budget,
+        }
+        if tc.topic:
+            payload["topic"] = tc.topic
+
+        t0 = time.time()
+        try:
+            resp = post_json(search_url, payload)
+            elapsed = (time.time() - t0) * 1000
+            hits = resp.get("hits", [])
+            tr = TestResult(test=tc, hits=hits, elapsed_ms=elapsed)
+            evaluate(tr, verbose=verbose)
+        except Exception as e:
+            elapsed = (time.time() - t0) * 1000
+            tr = TestResult(test=tc, hits=[], elapsed_ms=elapsed, error=str(e))
+            print(f"ERROR [{tc.id}]: {e}")
+
+        results.append(tr)
+
+    # ── Summary table ─────────────────────────────────────────────────────
+    print()
+    print("=" * 110)
+    print(f"{'TEST ID + NAME':<56} {'f@1':>4} {'f@3':>4} {'f@5':>4} {'c@5':>4} {'req':>4}  {'file rank':>12}  {'score':>10}  {'ms':>6}")
+    print("=" * 110)
+
+    for tr in results:
+        if tr.error:
+            print(f"[{tr.test.id}] {'ERROR: ' + tr.test.name[:45]:<52}  ERROR: {tr.error[:40]}")
+        else:
+            print(tr.summary_line())
+
+    # ── Aggregate metrics ─────────────────────────────────────────────────
+    valid = [tr for tr in results if not tr.error]
+    n = len(valid)
+    if n == 0:
+        print("\nNo valid results.")
+        return
+
+    mrr         = sum(tr.mrr() for tr in valid) / n
+    p_at_1      = sum(1 for tr in valid if tr.file_at(1)) / n
+    p_at_3      = sum(1 for tr in valid if tr.file_at(3)) / n
+    p_at_5      = sum(1 for tr in valid if tr.file_at(5)) / n
+    content_at5 = sum(1 for tr in valid if tr.content_at(5)) / n
+    req_recall  = sum(1 for tr in valid if tr.required_found) / n
+    avg_ms      = sum(tr.elapsed_ms for tr in valid) / n
+
+    print("=" * 110)
+    print()
+    print("Aggregate metrics:")
+    print(f"  MRR (file)           : {mrr:.4f}  ({mrr*100:.1f}%)")
+    print(f"  Precision@1 (file)   : {p_at_1:.4f}  ({p_at_1*100:.1f}%)")
+    print(f"  Precision@3 (file)   : {p_at_3:.4f}  ({p_at_3*100:.1f}%)")
+    print(f"  Precision@5 (file)   : {p_at_5:.4f}  ({p_at_5*100:.1f}%)")
+    print(f"  Content recall@5     : {content_at5:.4f}  ({content_at5*100:.1f}%)")
+    print(f"  Required recall      : {req_recall:.4f}  ({req_recall*100:.1f}%)  ← hardest: ALL required fragments in top-10")
+    print(f"  Avg query latency    : {avg_ms:.0f} ms")
+    print()
+
+    # ── Failure analysis ──────────────────────────────────────────────────
+    failures = [tr for tr in valid if not tr.file_at(5) or not tr.required_found]
+    if failures:
+        print(f"Improvement targets ({len(failures)} tests below par):")
+        for tr in failures:
+            issues = []
+            if not tr.file_at(5):
+                issues.append(f"file not in top-5 (rank={tr.file_rank})")
+            if not tr.required_found:
+                missing = [r for r in tr.test.required_content
+                           if r.lower() not in " ".join(h.get("content","") for h in tr.hits).lower()]
+                issues.append(f"required content missing: {missing}")
+            print(f"  [{tr.test.id}] {tr.test.name}: {'; '.join(issues)}")
+    else:
+        print("All tests passed file@5 and required-content checks.")
+
+    # Exit code: 0 if MRR ≥ 0.5 AND required recall ≥ 0.8, else 1
+    if mrr >= 0.5 and req_recall >= 0.8:
+        print("\nResult: PASS")
+        sys.exit(0)
+    else:
+        print(f"\nResult: FAIL  (MRR={mrr:.3f} threshold=0.5, req_recall={req_recall:.3f} threshold=0.8)")
+        sys.exit(1)
+
+
+# ---------------------------------------------------------------------------
+# Entry point
+# ---------------------------------------------------------------------------
+if __name__ == "__main__":
+    parser = argparse.ArgumentParser(description="Phaser RAG quality evaluation")
+    parser.add_argument("--base-url", default="http://localhost:18080",
+                        help="TrueRef server base URL")
+    parser.add_argument("--verbose", action="store_true",
+                        help="Print per-test hit details")
+    args = parser.parse_args()
+    run(args.base_url, args.verbose)
--- a/115
+++ b/115
@@ -0,0 +1,115 @@
+#!/usr/bin/env bash
+# trueref launcher (workspace root)
+#
+# Wraps the fat JAR with:
+#   - --enable-native-access=ALL-UNNAMED  (silences FFM Linker warning from DJL tokenizers)
+#   - --add-modules=jdk.incubator.vector  (enables Lucene 10 SIMD codepath)
+#   - cuDNN 9 (cu12 build) on LD_LIBRARY_PATH so ONNX Runtime CUDA EP loads
+#   - CUDA_VISIBLE_DEVICES isolation so ORT BFC arena doesn't trip over the second card
+#   - per-session GPU memory cap so embedder + reranker fit on one card
+#
+# Defaults are tuned for this machine (LMDE 7, CUDA 12.4, RTX 2080 SUPER + RTX 3060).
+# Override anything via env vars or by appending Spring properties to the command line.
+#
+# Usage:
+#   ./trueref                                       # run with defaults (port 18080)
+#   ./trueref --server.port=8080                    # forward Spring properties
+#   TRUEREF_GPU=0 ./trueref                         # use the 2080 SUPER instead#   TRUEREF_GPU=cpu ./trueref                       # disable CUDA, run on CPU
+#   TRUEREF_HOME=/data/trueref ./trueref            # custom data dir
+#
+# Env vars:
+#   TRUEREF_GPU         GPU index (matches `nvidia-smi -L`) or "cpu". Default: 1
+#   TRUEREF_HOME        Data directory. Default: ./data
+#   TRUEREF_PORT        HTTP port. Default: 18080
+#   TRUEREF_MEM_LIMIT   Per-session GPU mem cap in bytes. Default: 0 (unbounded).
+#                       With session-count=1 there is no multi-session contention, so the BFC
+#                       arena can grow freely — capping it risks running out of budget during
+#                       model-weight loading (~1.5-2 GB) before inference even starts.
+#                       Set to e.g. 8589934592 (8 GiB) only if you run multiple pools on one card.
+#   TRUEREF_CUDNN_LIB   Directory containing libcudnn.so.9. Default: ./runtime/cudnn/nvidia/cudnn/lib
+#   TRUEREF_JAR         Path to the fat JAR. Default: ./trueref-bootstrap/target/trueref.jar
+#   JAVA                java binary. Default: $JAVA_HOME/bin/java or `java` on PATH
+#   JAVA_OPTS           Extra JVM flags (e.g. -Xmx16g)
+
+set -euo pipefail
+
+ROOT="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+
+JAR="${TRUEREF_JAR:-$ROOT/trueref-bootstrap/target/trueref.jar}"
+GPU="${TRUEREF_GPU:-1}"
+HOME_DIR="${TRUEREF_HOME:-$ROOT/data}"
+PORT="${TRUEREF_PORT:-18080}"
+MEM_LIMIT="${TRUEREF_MEM_LIMIT:-0}"
+CUDNN_LIB="${TRUEREF_CUDNN_LIB:-$ROOT/runtime/cudnn/nvidia/cudnn/lib}"
+
+if [[ ! -f "$JAR" ]]; then
+  echo "trueref: jar not found at $JAR" >&2
+  echo "trueref: build it first with: mvn -DskipTests -pl trueref-bootstrap -am package" >&2
+  exit 1
+fi
+
+# Resolve java
+if [[ -n "${JAVA:-}" ]]; then
+  :
+elif [[ -n "${JAVA_HOME:-}" && -x "${JAVA_HOME}/bin/java" ]]; then
+  JAVA="${JAVA_HOME}/bin/java"
+else
+  JAVA="$(command -v java || true)"
+fi
+if [[ -z "${JAVA:-}" || ! -x "${JAVA}" ]]; then
+  echo "trueref: java not found; install JDK 21+ or set JAVA_HOME" >&2
+  exit 1
+fi
+
+mkdir -p "$HOME_DIR"
+
+SPRING_ARGS=(
+  "--server.port=$PORT"
+  "--trueref.home=$HOME_DIR"
+)
+
+# CUDA setup. "cpu" disables CUDA entirely; otherwise pass the physical GPU index
+# directly to ORT. ORT's CUDA EP uses the physical device index regardless of
+# CUDA_VISIBLE_DEVICES remapping — so we pass the physical index and explicitly
+# unset CUDA_VISIBLE_DEVICES to avoid the two-layer renumbering problem where
+# CUDA runtime remaps N→0 but ORT still expects the physical N.
+if [[ "$GPU" == "cpu" || "$GPU" == "CPU" ]]; then
+  echo "trueref: GPU disabled (TRUEREF_GPU=cpu) — embedder/reranker will run on CPU"
+  SPRING_ARGS+=(
+    "--trueref.embedding.onnx-providers=cpu"
+  )
+else
+  if [[ -d "$CUDNN_LIB" ]]; then
+    export LD_LIBRARY_PATH="${CUDNN_LIB}${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
+  else
+    echo "trueref: TRUEREF_CUDNN_LIB=$CUDNN_LIB not found — CUDA EP will fall back to CPU" >&2
+    echo "trueref: download cu12 cuDNN with:" >&2
+    echo "  mkdir -p runtime/cudnn && cd runtime/cudnn && \\" >&2
+    echo "    pip download --no-deps --only-binary=:all: --python-version 3.12 \\" >&2
+    echo "    --platform manylinux2014_x86_64 'nvidia-cudnn-cu12<10' -d . && \\" >&2
+    echo "    unzip -q -o nvidia_cudnn_cu12-*.whl 'nvidia/cudnn/lib/*' && rm *.whl" >&2
+  fi
+  # CUDA runtime respects CUDA_VISIBLE_DEVICES for all allocations (cudaMalloc, BFC arena,
+  # etc.). By restricting CUDA's view to exactly the target GPU, we prevent the runtime from
+  # creating a default context on device 0 before ORT's cudaSetDevice() takes effect.
+  # We always pass gpu-device-id=0 to ORT because CUDA_VISIBLE_DEVICES makes the target
+  # card the ONLY visible device (index 0 in the runtime's view).
+  #
+  # CUDA_DEVICE_ORDER=PCI_BUS_ID ensures CUDA runtime numbering matches nvidia-smi numbering.
+  # Without it, the default FASTEST_FIRST ordering can rank GPUs differently from nvidia-smi,
+  # so CUDA_VISIBLE_DEVICES=N would expose a different physical card than nvidia-smi GPU N.
+  export CUDA_DEVICE_ORDER="PCI_BUS_ID"
+  export CUDA_VISIBLE_DEVICES="$GPU"
+  SPRING_ARGS+=(
+    "--trueref.embedding.gpu-device-id=0"
+    "--trueref.embedding.gpu-mem-limit-bytes=$MEM_LIMIT"
+  )
+fi
+
+exec "$JAVA" \
+  --enable-native-access=ALL-UNNAMED \
+  --add-modules=jdk.incubator.vector \
+  ${JAVA_OPTS:-} \
+  -jar "$JAR" \
+  "${SPRING_ARGS[@]}" \
+  "$@"
--- a/trueref-adapters/pom.xml
+++ b/trueref-adapters/pom.xml
@@ -0,0 +1,106 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <parent>
+        <groupId>com.trueref</groupId>
+        <artifactId>trueref-parent</artifactId>
+        <version>0.1.0-SNAPSHOT</version>
+    </parent>
+
+    <artifactId>trueref-adapters</artifactId>
+    <name>trueref-adapters</name>
+    <description>All driving (REST, MCP) and driven (H2, Lucene, ONNX, JGit, tree-sitter, disk cache) adapters.</description>
+
+    <dependencies>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-domain</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-application</artifactId>
+        </dependency>
+
+        <!-- Spring Web + JDBC + validation (no auto-config; bootstrap controls @ComponentScan) -->
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-web</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-jdbc</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-validation</artifactId>
+        </dependency>
+
+        <!-- OpenAPI / Swagger UI -->
+        <dependency>
+            <groupId>org.springdoc</groupId>
+            <artifactId>springdoc-openapi-starter-webmvc-ui</artifactId>
+        </dependency>
+
+        <!-- Spring AI MCP server (Streamable HTTP) -->
+        <dependency>
+            <groupId>org.springframework.ai</groupId>
+            <artifactId>spring-ai-starter-mcp-server-webmvc</artifactId>
+        </dependency>
+
+        <!-- H2 + Flyway -->
+        <dependency>
+            <groupId>com.h2database</groupId>
+            <artifactId>h2</artifactId>
+            <version>${h2.version}</version>
+        </dependency>
+        <dependency>
+            <groupId>org.flywaydb</groupId>
+            <artifactId>flyway-core</artifactId>
+            <version>${flyway.version}</version>
+        </dependency>
+
+        <!-- Lucene -->
+        <dependency>
+            <groupId>org.apache.lucene</groupId>
+            <artifactId>lucene-core</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.apache.lucene</groupId>
+            <artifactId>lucene-analysis-common</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.apache.lucene</groupId>
+            <artifactId>lucene-queryparser</artifactId>
+        </dependency>
+
+        <!-- ONNX Runtime: GPU jar contains both CUDA and CPU providers; DirectML jar is added at runtime via classifier on Windows. -->
+        <dependency>
+            <groupId>com.microsoft.onnxruntime</groupId>
+            <artifactId>onnxruntime_gpu</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>ai.djl.huggingface</groupId>
+            <artifactId>tokenizers</artifactId>
+        </dependency>
+
+        <!-- JGit -->
+        <dependency>
+            <groupId>org.eclipse.jgit</groupId>
+            <artifactId>org.eclipse.jgit</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>org.eclipse.jgit</groupId>
+            <artifactId>org.eclipse.jgit.ssh.apache</artifactId>
+        </dependency>
+
+        <!-- Test -->
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-test</artifactId>
+            <scope>test</scope>
+        </dependency>
+    </dependencies>
+</project>
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/McpConfig.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/McpConfig.java
@@ -0,0 +1,24 @@
+package com.trueref.adapter.in.mcp;
+
+import org.springframework.ai.tool.ToolCallbackProvider;
+import org.springframework.ai.tool.method.MethodToolCallbackProvider;
+import org.springframework.boot.context.properties.EnableConfigurationProperties;
+import org.springframework.context.annotation.Bean;
+import org.springframework.context.annotation.Configuration;
+
+/**
+ * Registers the trueref MCP tool callbacks with Spring AI's MCP WebMVC auto-configuration. The
+ * {@link MethodToolCallbackProvider} scans {@link TrueRefMcpTools} for methods annotated with
+ * {@link org.springframework.ai.tool.annotation.Tool} and publishes them on the MCP endpoint
+ * configured in {@code application.yml} (POST {@code /mcp} via
+ * {@code spring.ai.mcp.server.sse-message-endpoint}).
+ */
+@Configuration
+@EnableConfigurationProperties(McpProperties.class)
+public class McpConfig {
+
+    @Bean
+    public ToolCallbackProvider trueRefMcpToolCallbacks(TrueRefMcpTools tools) {
+        return MethodToolCallbackProvider.builder().toolObjects(tools).build();
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/McpProperties.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/McpProperties.java
@@ -0,0 +1,22 @@
+package com.trueref.adapter.in.mcp;
+
+import org.springframework.boot.context.properties.ConfigurationProperties;
+
+/**
+ * Token-budget defaults for {@code get-library-docs}. Matches Context7 semantics: clients may
+ * request an explicit token budget per call; unspecified calls use {@link #tokensDefault}. All
+ * requests are clamped to {@code [tokensMin, tokensMax]}.
+ */
+@ConfigurationProperties(prefix = "trueref.mcp")
+public record McpProperties(int tokensDefault, int tokensMin, int tokensMax) {
+
+    public McpProperties {
+        if (tokensDefault <= 0) tokensDefault = 5000;
+        if (tokensMin <= 0) tokensMin = 500;
+        if (tokensMax <= 0) tokensMax = 50_000;
+    }
+
+    public int clamp(int requested) {
+        return Math.max(tokensMin, Math.min(tokensMax, requested));
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/TrueRefMcpTools.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/TrueRefMcpTools.java
@@ -0,0 +1,297 @@
+package com.trueref.adapter.in.mcp;
+
+import com.trueref.application.resolve.LibraryResolver;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.SearchHit;
+import com.trueref.domain.model.SearchScope;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.IndexVersion;
+import com.trueref.domain.port.in.QueryCatalog;
+import com.trueref.domain.port.in.ResolveLibraryId;
+import com.trueref.domain.port.in.SearchLibraryDocs;
+import java.util.Comparator;
+import java.util.List;
+import java.util.Optional;
+import org.jspecify.annotations.Nullable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.ai.tool.annotation.Tool;
+import org.springframework.ai.tool.annotation.ToolParam;
+import org.springframework.stereotype.Service;
+
+/**
+ * Context7-compatible MCP tool handlers. The two tool names, parameter names, and response
+ * shapes are intentionally 1:1 with upstream Context7 so that any MCP client written against
+ * Context7 works against trueref unchanged.
+ *
+ * <p>Ranking and version→tag mapping are delegated to the application layer
+ * ({@link ResolveLibraryId}, {@link LibraryResolver}); hybrid search goes through
+ * {@link SearchLibraryDocs}; on-demand indexing is enqueued via {@link IndexVersion}.
+ */
+@Service
+public class TrueRefMcpTools {
+
+    private static final Logger log = LoggerFactory.getLogger(TrueRefMcpTools.class);
+    private static final int DEFAULT_MAX_HITS = 50;
+    /** Matches Context7's banner retry hint (see ARCHITECTURE §7 on-demand indexing flow). */
+    private static final int INDEXING_RETRY_AFTER_SEC = 30;
+
+    private final ResolveLibraryId resolver;
+    private final LibraryResolver libraryResolver;
+    private final QueryCatalog catalog;
+    private final SearchLibraryDocs search;
+    private final IndexVersion indexer;
+    private final McpProperties props;
+
+    public TrueRefMcpTools(
+            ResolveLibraryId resolver,
+            LibraryResolver libraryResolver,
+            QueryCatalog catalog,
+            SearchLibraryDocs search,
+            IndexVersion indexer,
+            McpProperties props) {
+        this.resolver = resolver;
+        this.libraryResolver = libraryResolver;
+        this.catalog = catalog;
+        this.search = search;
+        this.indexer = indexer;
+        this.props = props;
+    }
+
+    @Tool(
+            name = "resolve-library-id",
+            description =
+                    "Resolves a package/product name to a trueref-compatible library ID and "
+                            + "returns matching libraries. Context7-compatible. Each result "
+                            + "includes: Title, Context7-compatible library ID (format "
+                            + "/owner/repo[/version]), Description, Code Snippets, Versions, "
+                            + "and a relevance Score.")
+    public String resolveLibraryId(
+            @ToolParam(
+                            description =
+                                    "Library name to search for and retrieve a Context7-"
+                                            + "compatible library ID.")
+                    String libraryName,
+            @ToolParam(
+                            required = false,
+                            description =
+                                    "Optional natural-language query used to rank matching "
+                                            + "libraries by relevance.")
+                    @Nullable String query) {
+        ResolveLibraryId.Result result =
+                resolver.resolve(new ResolveLibraryId.Query(libraryName, query, null));
+        if (result.matches().isEmpty()) {
+            return "No matching libraries found for: " + libraryName;
+        }
+        StringBuilder sb = new StringBuilder();
+        for (ResolveLibraryId.Match m : result.matches()) {
+            appendMatchBlock(sb, m);
+        }
+        return sb.toString();
+    }
+
+    @Tool(
+            name = "get-library-docs",
+            description =
+                    "Fetches up-to-date documentation for a library. You MUST call "
+                            + "'resolve-library-id' first to obtain the exact library ID "
+                            + "required to use this tool, UNLESS the user explicitly provides "
+                            + "a library ID in the format /org/project or /org/project/version.")
+    public String getLibraryDocs(
+            @ToolParam(
+                            description =
+                                    "Exact trueref-compatible library ID (format: "
+                                            + "/org/project or /org/project/version) "
+                                            + "retrieved from 'resolve-library-id'.")
+                    String libraryId,
+            @ToolParam(
+                            required = false,
+                            description =
+                                    "Topic to focus the documentation on (e.g. "
+                                            + "'routing', 'hooks', 'authentication').")
+                    @Nullable String topic,
+            @ToolParam(
+                            required = false,
+                            description =
+                                    "Max number of tokens to return. Clamped to "
+                                            + "[500, 50000]; defaults to 5000.")
+                    @Nullable Integer tokens) {
+        ParsedId parsed = parseLibraryId(libraryId);
+        if (parsed == null) {
+            return "Invalid libraryId: " + libraryId
+                    + ". Expected format: /org/project or /org/project/version.";
+        }
+
+        Optional<Repository> repoOpt = catalog.listRepositories().stream()
+                .filter(r -> r.name().equalsIgnoreCase(parsed.repoName()))
+                .findFirst();
+        if (repoOpt.isEmpty()) {
+            return "No matching library found for ID: " + libraryId;
+        }
+        Repository repo = repoOpt.get();
+        List<Version> versions = catalog.listVersions(repo.id());
+
+        SelectedVersion selected = selectVersion(repo, versions, parsed.version());
+        if (selected.searchTarget() == null) {
+            return "No indexed version available for /" + repo.name()
+                    + ". Indexing has been enqueued; retry in ~"
+                    + INDEXING_RETRY_AFTER_SEC + " seconds.";
+        }
+
+        int budget = props.clamp(tokens == null ? props.tokensDefault() : tokens);
+        String text = (topic == null || topic.isBlank()) ? repo.name() : topic;
+        SearchLibraryDocs.Query q = new SearchLibraryDocs.Query(
+                text,
+                topic,
+                new SearchScope(List.of(new SearchScope.RepoVersionRef(repo.id(), selected.searchTarget().id()))),
+                budget,
+                DEFAULT_MAX_HITS);
+        SearchLibraryDocs.Result res;
+        try {
+            res = search.search(q);
+        } catch (Exception e) {
+            log.warn("MCP search failed for {}: {}", libraryId, e.toString());
+            return "Search failed for " + libraryId + ": " + e.getMessage();
+        }
+
+        return formatDocs(res.hits(), selected.banner());
+    }
+
+    // --- helpers -----------------------------------------------------------
+
+    private void appendMatchBlock(StringBuilder sb, ResolveLibraryId.Match m) {
+        sb.append("----------\n");
+        sb.append("- Title: ").append(m.name()).append('\n');
+        sb.append("- Context7-compatible library ID: ").append(m.libraryId()).append('\n');
+        if (m.description() != null && !m.description().isBlank()) {
+            sb.append("- Description: ").append(m.description()).append('\n');
+        }
+        sb.append("- Code Snippets: ").append(m.snippetCount()).append('\n');
+        if (!m.availableVersions().isEmpty()) {
+            sb.append("- Versions: ");
+            for (int i = 0; i < m.availableVersions().size(); i++) {
+                if (i > 0) sb.append(", ");
+                ResolveLibraryId.VersionRef v = m.availableVersions().get(i);
+                sb.append(v.tag()).append(" (").append(v.status()).append(')');
+            }
+            sb.append('\n');
+        }
+        sb.append("- Score: ").append(String.format("%.2f", m.score())).append('\n');
+    }
+
+    private SelectedVersion selectVersion(
+            Repository repo, List<Version> versions, @Nullable String requestedVersion) {
+        if (requestedVersion == null || requestedVersion.isBlank()) {
+            // No version in libraryId: prefer most-recent INDEXED; else nearest DISCOVERED +
+            // enqueue indexing + banner.
+            Optional<Version> latestIndexed = versions.stream()
+                    .filter(v -> v.status() == VersionStatus.INDEXED)
+                    .max(Comparator.comparing(Version::tag));
+            if (latestIndexed.isPresent()) {
+                return new SelectedVersion(latestIndexed.get(), null);
+            }
+            Optional<Version> latestDiscovered = versions.stream()
+                    .filter(v -> v.status() == VersionStatus.DISCOVERED)
+                    .max(Comparator.comparing(Version::tag));
+            if (latestDiscovered.isPresent()) {
+                Version v = latestDiscovered.get();
+                enqueueSafely(repo, v);
+                return new SelectedVersion(null, indexingBanner(v.tag(), v.tag()));
+            }
+            return new SelectedVersion(null, null);
+        }
+
+        // Explicit version requested: use application-layer mapper.
+        Optional<Version> mapped = libraryResolver.mapVersion(repo, versions, requestedVersion);
+        if (mapped.isEmpty()) {
+            // Fall back to nearest INDEXED; if any, show banner for the requested version.
+            Optional<Version> nearest = versions.stream()
+                    .filter(v -> v.status() == VersionStatus.INDEXED)
+                    .max(Comparator.comparing(Version::tag));
+            return new SelectedVersion(
+                    nearest.orElse(null), nearest.map(v -> indexingBanner(requestedVersion, v.tag())).orElse(null));
+        }
+        Version target = mapped.get();
+        if (target.status() == VersionStatus.INDEXED) {
+            return new SelectedVersion(target, null);
+        }
+        enqueueSafely(repo, target);
+        Optional<Version> nearestIndexed = versions.stream()
+                .filter(v -> v.status() == VersionStatus.INDEXED)
+                .max(Comparator.comparing(Version::tag));
+        return new SelectedVersion(
+                nearestIndexed.orElse(null),
+                indexingBanner(target.tag(), nearestIndexed.map(Version::tag).orElse("none")));
+    }
+
+    private void enqueueSafely(Repository repo, Version v) {
+        if (v.status() == VersionStatus.INDEXING) return;
+        try {
+            indexer.enqueue(repo.id(), v.id(), false);
+        } catch (Exception e) {
+            log.warn("MCP on-demand indexing enqueue failed for {}@{}: {}", repo.name(), v.tag(), e.toString());
+        }
+    }
+
+    private String indexingBanner(String requestedTag, String fallbackTag) {
+        return "[indexing] version " + requestedTag
+                + " is being indexed now; showing nearest indexed version " + fallbackTag
+                + " (retryAfterSec=" + INDEXING_RETRY_AFTER_SEC + ")";
+    }
+
+    private String formatDocs(List<SearchHit> hits, @Nullable String banner) {
+        StringBuilder sb = new StringBuilder();
+        if (banner != null) {
+            sb.append(banner).append('\n').append('\n');
+        }
+        sb.append("================\n");
+        sb.append("CODE SNIPPETS\n");
+        sb.append("================\n");
+        if (hits.isEmpty()) {
+            sb.append("(no matching snippets)\n");
+            return sb.toString();
+        }
+        for (SearchHit h : hits) {
+            sb.append("TITLE: ")
+                    .append(h.filePath())
+                    .append(':')
+                    .append(h.startLine())
+                    .append('-')
+                    .append(h.endLine())
+                    .append(" (")
+                    .append(h.language())
+                    .append(")\n");
+            sb.append("SOURCE: ")
+                    .append(h.repoName())
+                    .append('@')
+                    .append(h.tag())
+                    .append(" — ")
+                    .append(h.filePath())
+                    .append("\n\n");
+            sb.append("```").append(h.language()).append('\n');
+            sb.append(h.content());
+            if (!h.content().endsWith("\n")) sb.append('\n');
+            sb.append("```\n");
+            sb.append("----------------------------------------\n");
+        }
+        return sb.toString();
+    }
+
+    static @Nullable ParsedId parseLibraryId(String raw) {
+        if (raw == null || raw.isBlank()) return null;
+        String s = raw.startsWith("/") ? raw.substring(1) : raw;
+        String[] parts = s.split("/");
+        if (parts.length == 2) {
+            return new ParsedId(parts[0] + "/" + parts[1], null);
+        }
+        if (parts.length == 3) {
+            return new ParsedId(parts[0] + "/" + parts[1], parts[2]);
+        }
+        return null;
+    }
+
+    record ParsedId(String repoName, @Nullable String version) {}
+
+    private record SelectedVersion(@Nullable Version searchTarget, @Nullable String banner) {}
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/package-info.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/mcp/package-info.java
@@ -0,0 +1,16 @@
+/**
+ * Driving adapter: Model Context Protocol (MCP) server exposing the two Context7-compatible
+ * tools ({@code resolve-library-id}, {@code get-library-docs}) over Spring AI's MCP WebMVC
+ * transport. The HTTP message endpoint is wired to {@code POST /mcp} via
+ * {@code spring.ai.mcp.server.sse-message-endpoint}.
+ *
+ * <p>Spring AI 1.0.0 ships the SSE-based WebMVC transport
+ * ({@code WebMvcSseServerTransportProvider}); the 2025-03-26 "Streamable HTTP" transport is
+ * not a separate selectable property in this version. Clients that POST JSON-RPC to the
+ * configured message endpoint receive JSON-RPC responses; the server additionally opens an
+ * SSE stream on the configured {@code sse-endpoint} for server-initiated notifications. This
+ * is the closest equivalent Spring AI 1.0.0 provides to Streamable HTTP and is the
+ * intended/only transport of this adapter.
+ */
+@org.jspecify.annotations.NullMarked
+package com.trueref.adapter.in.mcp;
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ErrorResponse.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ErrorResponse.java
@@ -0,0 +1,16 @@
+package com.trueref.adapter.in.rest;
+
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.util.List;
+
+/** Uniform error envelope returned by {@link GlobalExceptionHandler}. */
+@Schema(description = "Error response envelope.")
+public record ErrorResponse(String code, String message, List<FieldError> fieldErrors) {
+
+    public static ErrorResponse of(String code, String message) {
+        return new ErrorResponse(code, message, List.of());
+    }
+
+    @Schema(description = "A single field-level validation error.")
+    public record FieldError(String field, String message) {}
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/GlobalExceptionHandler.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/GlobalExceptionHandler.java
@@ -0,0 +1,120 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.domain.error.IngestionFailed;
+import com.trueref.domain.error.InvalidSearchRequest;
+import com.trueref.domain.error.RepositoryAlreadyRegistered;
+import com.trueref.domain.error.RepositoryNotFound;
+import com.trueref.domain.error.TagNotFound;
+import com.trueref.domain.error.TrueRefException;
+import com.trueref.domain.error.VersionNotFound;
+import com.trueref.domain.error.VersionNotIndexed;
+import jakarta.validation.ConstraintViolationException;
+import java.util.List;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.http.HttpStatus;
+import org.springframework.http.ResponseEntity;
+import org.springframework.validation.FieldError;
+import org.springframework.web.bind.MethodArgumentNotValidException;
+import org.springframework.web.bind.annotation.ExceptionHandler;
+import org.springframework.web.bind.annotation.RestControllerAdvice;
+import org.springframework.web.server.ResponseStatusException;
+import org.springframework.web.servlet.resource.NoResourceFoundException;
+
+/** Central translator from domain / validation exceptions to HTTP + {@link ErrorResponse} JSON. */
+@RestControllerAdvice
+public class GlobalExceptionHandler {
+
+    private static final Logger log = LoggerFactory.getLogger(GlobalExceptionHandler.class);
+
+    @ExceptionHandler({RepositoryNotFound.class, VersionNotFound.class, TagNotFound.class})
+    public ResponseEntity<ErrorResponse> handleNotFound(TrueRefException ex) {
+        return status(HttpStatus.NOT_FOUND, ex);
+    }
+
+    @ExceptionHandler(RepositoryAlreadyRegistered.class)
+    public ResponseEntity<ErrorResponse> handleConflict(RepositoryAlreadyRegistered ex) {
+        return status(HttpStatus.CONFLICT, ex);
+    }
+
+    @ExceptionHandler(VersionNotIndexed.class)
+    public ResponseEntity<ErrorResponse> handleNotIndexed(VersionNotIndexed ex) {
+        return status(HttpStatus.CONFLICT, ex);
+    }
+
+    @ExceptionHandler(InvalidSearchRequest.class)
+    public ResponseEntity<ErrorResponse> handleInvalidSearch(InvalidSearchRequest ex) {
+        return status(HttpStatus.BAD_REQUEST, ex);
+    }
+
+    @ExceptionHandler(MethodArgumentNotValidException.class)
+    public ResponseEntity<ErrorResponse> handleValidation(MethodArgumentNotValidException ex) {
+        List<ErrorResponse.FieldError> fieldErrors = ex.getBindingResult().getFieldErrors().stream()
+                .map(this::toFieldError)
+                .toList();
+        ErrorResponse body = new ErrorResponse("validation_failed", "Request validation failed", fieldErrors);
+        return ResponseEntity.status(HttpStatus.BAD_REQUEST).body(body);
+    }
+
+    @ExceptionHandler(ConstraintViolationException.class)
+    public ResponseEntity<ErrorResponse> handleConstraintViolation(ConstraintViolationException ex) {
+        List<ErrorResponse.FieldError> fieldErrors = ex.getConstraintViolations().stream()
+                .map(v -> new ErrorResponse.FieldError(v.getPropertyPath().toString(), v.getMessage()))
+                .toList();
+        ErrorResponse body = new ErrorResponse("validation_failed", "Request validation failed", fieldErrors);
+        return ResponseEntity.status(HttpStatus.BAD_REQUEST).body(body);
+    }
+
+    @ExceptionHandler(IllegalArgumentException.class)
+    public ResponseEntity<ErrorResponse> handleIllegalArgument(IllegalArgumentException ex) {
+        ErrorResponse body = new ErrorResponse("invalid_request", safeMessage(ex), List.of());
+        return ResponseEntity.status(HttpStatus.BAD_REQUEST).body(body);
+    }
+
+    @ExceptionHandler(ResponseStatusException.class)
+    public ResponseEntity<ErrorResponse> handleResponseStatus(ResponseStatusException ex) {
+        HttpStatus resolved = HttpStatus.resolve(ex.getStatusCode().value());
+        HttpStatus status = resolved == null ? HttpStatus.INTERNAL_SERVER_ERROR : resolved;
+        String code = ex.getReason() == null ? status.name().toLowerCase() : ex.getReason();
+        return ResponseEntity.status(status).body(ErrorResponse.of(code, code));
+    }
+
+    @ExceptionHandler(NoResourceFoundException.class)
+    public ResponseEntity<Void> handleNoResource(NoResourceFoundException ex) {
+        // Static resource not found (e.g. browser/.well-known probes). Return 404 without
+        // logging — these are not application errors and would flood the log at ERROR level.
+        return ResponseEntity.notFound().build();
+    }
+
+    @ExceptionHandler(IngestionFailed.class)
+    public ResponseEntity<ErrorResponse> handleIngestionFailed(IngestionFailed ex) {
+        log.error("ingestion failed", ex);
+        return status(HttpStatus.INTERNAL_SERVER_ERROR, ex);
+    }
+
+    @ExceptionHandler(TrueRefException.class)
+    public ResponseEntity<ErrorResponse> handleDomain(TrueRefException ex) {
+        log.error("unhandled domain error code={}", ex.code(), ex);
+        return status(HttpStatus.INTERNAL_SERVER_ERROR, ex);
+    }
+
+    @ExceptionHandler(Exception.class)
+    public ResponseEntity<ErrorResponse> handleUnexpected(Exception ex) {
+        log.error("unexpected error", ex);
+        ErrorResponse body = new ErrorResponse("internal_error", "An unexpected error occurred", List.of());
+        return ResponseEntity.status(HttpStatus.INTERNAL_SERVER_ERROR).body(body);
+    }
+
+    private ResponseEntity<ErrorResponse> status(HttpStatus status, TrueRefException ex) {
+        return ResponseEntity.status(status).body(ErrorResponse.of(ex.code(), safeMessage(ex)));
+    }
+
+    private ErrorResponse.FieldError toFieldError(FieldError fe) {
+        String msg = fe.getDefaultMessage() == null ? "invalid" : fe.getDefaultMessage();
+        return new ErrorResponse.FieldError(fe.getField(), msg);
+    }
+
+    private static String safeMessage(Throwable t) {
+        return t.getMessage() == null ? t.getClass().getSimpleName() : t.getMessage();
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/JobController.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/JobController.java
@@ -0,0 +1,145 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.adapter.in.rest.dto.JobDto;
+import com.trueref.adapter.in.rest.dto.JobLogEventDto;
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.port.in.ObserveJobs;
+import io.swagger.v3.oas.annotations.Operation;
+import io.swagger.v3.oas.annotations.tags.Tag;
+import java.io.IOException;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.http.MediaType;
+import org.springframework.web.bind.annotation.GetMapping;
+import org.springframework.web.bind.annotation.PathVariable;
+import org.springframework.web.bind.annotation.RequestMapping;
+import org.springframework.web.bind.annotation.RequestParam;
+import org.springframework.web.bind.annotation.RestController;
+import org.springframework.web.server.ResponseStatusException;
+import org.springframework.web.servlet.mvc.method.annotation.SseEmitter;
+
+/** REST + SSE resource: {@code /api/jobs}. */
+@RestController
+@RequestMapping("/api/jobs")
+@Tag(name = "jobs", description = "Inspect ingestion jobs and stream live progress via SSE.")
+public class JobController {
+
+    private static final Logger log = LoggerFactory.getLogger(JobController.class);
+
+    private final ObserveJobs observeJobs;
+
+    public JobController(ObserveJobs observeJobs) {
+        this.observeJobs = observeJobs;
+    }
+
+    @Operation(summary = "List jobs, optionally filtered by repo / version / status.")
+    @GetMapping
+    public List<JobDto> list(
+            @RequestParam(value = "repoId", required = false) @Nullable String repoId,
+            @RequestParam(value = "versionId", required = false) @Nullable String versionId,
+            @RequestParam(value = "status", required = false) @Nullable JobStatus status,
+            @RequestParam(value = "limit", defaultValue = "100") int limit) {
+        RepositoryId repo = repoId == null ? null : RepositoryId.of(repoId);
+        VersionId ver = versionId == null ? null : VersionId.of(versionId);
+        return observeJobs.listJobs(repo, ver, status, limit).stream()
+                .map(JobDto::of)
+                .toList();
+    }
+
+    @Operation(summary = "Get a single job by id.")
+    @GetMapping("/{id}")
+    public JobDto detail(@PathVariable("id") String id) {
+        JobId jobId = JobId.of(id);
+        return observeJobs
+                .findJob(jobId)
+                .map(JobDto::of)
+                .orElseThrow(() ->
+                        new ResponseStatusException(org.springframework.http.HttpStatus.NOT_FOUND, "job_not_found"));
+    }
+
+    @Operation(summary = "Server-Sent Events stream of log events for a single job.")
+    @GetMapping(value = "/{id}/log", produces = MediaType.TEXT_EVENT_STREAM_VALUE)
+    public SseEmitter logStream(@PathVariable("id") String id) {
+        JobId jobId = JobId.of(id);
+        SseEmitter emitter = new SseEmitter(0L);
+        AutoCloseable subscription = observeJobs.subscribeLogs(jobId, event -> {
+            try {
+                emitter.send(SseEmitter.event().name("log").data(JobLogEventDto.of(event)));
+            } catch (IOException ex) {
+                emitter.completeWithError(ex);
+            }
+        });
+        attachCleanup(emitter, subscription, "job-log " + id);
+        return emitter;
+    }
+
+    @Operation(summary = "Server-Sent Events stream of status updates for all jobs.")
+    @GetMapping(value = "/stream", produces = MediaType.TEXT_EVENT_STREAM_VALUE)
+    public SseEmitter stream() {
+        SseEmitter emitter = new SseEmitter(0L);
+
+        // Send an immediate ping so Tomcat flushes the response headers to the client.
+        // Without this, the response buffer may not be flushed until the first job event
+        // arrives, keeping the EventSource in CONNECTING state and never firing 'open'.
+        try {
+            emitter.send(SseEmitter.event().name("ping").data(""));
+        } catch (IOException e) {
+            emitter.completeWithError(e);
+            return emitter;
+        }
+
+        AutoCloseable subscription = observeJobs.subscribeJobs(job -> {
+            try {
+                emitter.send(SseEmitter.event().name("job").data(JobDto.of(job)));
+            } catch (IOException ex) {
+                emitter.completeWithError(ex);
+            }
+        });
+
+        // Keepalive: send a ping every 20 s to keep the connection alive through idle
+        // periods and detect disconnected clients promptly.
+        Thread keepalive = Thread.startVirtualThread(() -> {
+            try {
+                while (!Thread.currentThread().isInterrupted()) {
+                    Thread.sleep(20_000);
+                    emitter.send(SseEmitter.event().name("ping").data(""));
+                }
+            } catch (InterruptedException ignored) {
+                // normal shutdown
+            } catch (Exception ignored) {
+                // emitter already completed or errored; exit
+            }
+        });
+
+        Runnable cleanup = () -> {
+            keepalive.interrupt();
+            try {
+                subscription.close();
+            } catch (Exception ex) {
+                log.debug("failed to close SSE subscription job-stream: {}", ex.toString());
+            }
+        };
+        emitter.onCompletion(cleanup);
+        emitter.onTimeout(cleanup);
+        emitter.onError(e -> cleanup.run());
+        return emitter;
+    }
+
+    private static void attachCleanup(SseEmitter emitter, AutoCloseable subscription, String label) {
+        Runnable cleanup = () -> {
+            try {
+                subscription.close();
+            } catch (Exception ex) {
+                log.debug("failed to close SSE subscription {}: {}", label, ex.toString());
+            }
+        };
+        emitter.onCompletion(cleanup);
+        emitter.onTimeout(cleanup);
+        emitter.onError(e -> cleanup.run());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ObservabilityController.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ObservabilityController.java
@@ -0,0 +1,200 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.adapter.in.rest.dto.JobDto;
+import com.trueref.adapter.out.embedding.onnx.OnnxProperties;
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.ObserveJobs;
+import com.trueref.domain.port.in.QueryCatalog;
+import io.swagger.v3.oas.annotations.Operation;
+import io.swagger.v3.oas.annotations.tags.Tag;
+import java.io.BufferedReader;
+import java.io.IOException;
+import java.io.InputStreamReader;
+import java.nio.charset.StandardCharsets;
+import java.nio.file.FileVisitResult;
+import java.nio.file.Files;
+import java.nio.file.Path;
+import java.nio.file.SimpleFileVisitor;
+import java.nio.file.attribute.BasicFileAttributes;
+import java.util.EnumMap;
+import java.util.HashMap;
+import java.util.List;
+import java.util.Map;
+import org.jspecify.annotations.Nullable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.beans.factory.annotation.Value;
+import org.springframework.web.bind.annotation.GetMapping;
+import org.springframework.web.bind.annotation.RequestMapping;
+import org.springframework.web.bind.annotation.RestController;
+
+/** REST resource: {@code /api/observability}. */
+@RestController
+@RequestMapping("/api/observability")
+@Tag(name = "observability", description = "UI-friendly aggregates: metrics + resource usage.")
+public class ObservabilityController {
+
+    private static final Logger log = LoggerFactory.getLogger(ObservabilityController.class);
+    private static final int JOB_SAMPLE_LIMIT = 10_000;
+
+    private final ObserveJobs observeJobs;
+    private final QueryCatalog queryCatalog;
+    private final Path trueRefHome;
+    private final boolean embedderAvailable;
+    private final boolean rerankerAvailable;
+    private final int gpuDeviceId;
+
+    public ObservabilityController(
+            ObserveJobs observeJobs,
+            QueryCatalog queryCatalog,
+            OnnxProperties onnxProperties,
+            @Value("${trueref.home:./data}") String trueRefHome,
+            @Value("${trueref.embedder.available:true}") boolean embedderAvailable,
+            @Value("${trueref.reranker.available:true}") boolean rerankerAvailable) {
+        this.observeJobs = observeJobs;
+        this.queryCatalog = queryCatalog;
+        this.trueRefHome = Path.of(trueRefHome);
+        this.embedderAvailable = embedderAvailable;
+        this.rerankerAvailable = rerankerAvailable;
+        this.gpuDeviceId = onnxProperties.gpuDeviceId();
+    }
+
+    @Operation(summary = "Aggregated metrics for the dashboard (job counts, totals, availability).")
+    @GetMapping("/metrics")
+    public Map<String, Object> metrics() {
+        Map<JobStatus, Long> jobsByStatus = new EnumMap<>(JobStatus.class);
+        for (JobStatus status : JobStatus.values()) {
+            jobsByStatus.put(status, 0L);
+        }
+        List<IngestionJob> jobs = observeJobs.listJobs(null, null, null, JOB_SAMPLE_LIMIT);
+        for (IngestionJob job : jobs) {
+            jobsByStatus.merge(job.status(), 1L, Long::sum);
+        }
+
+        long totalChunks = 0L;
+        long totalVersionsIndexed = 0L;
+        long totalRepos = 0L;
+        for (Repository repo : queryCatalog.listRepositories()) {
+            totalRepos++;
+            for (Version v : queryCatalog.listVersions(repo.id())) {
+                totalChunks += v.chunkCount();
+                if (v.status() == VersionStatus.INDEXED) {
+                    totalVersionsIndexed++;
+                }
+            }
+        }
+
+        Map<String, Object> result = new HashMap<>();
+        result.put("jobsByStatus", toStringKeys(jobsByStatus));
+        result.put("jobsSampled", jobs.size());
+        result.put("jobsSampleLimit", JOB_SAMPLE_LIMIT);
+        result.put("totalRepositories", totalRepos);
+        result.put("totalChunks", totalChunks);
+        result.put("totalVersionsIndexed", totalVersionsIndexed);
+        result.put("embedderAvailable", embedderAvailable);
+        result.put("rerankerAvailable", rerankerAvailable);
+        return result;
+    }
+
+    @Operation(summary = "Heap / index-size / cache-size snapshot.")
+    @GetMapping("/resources")
+    public Map<String, Object> resources() {
+        Runtime runtime = Runtime.getRuntime();
+        long heapMax = runtime.maxMemory();
+        long heapTotal = runtime.totalMemory();
+        long heapFree = runtime.freeMemory();
+        long heapUsed = heapTotal - heapFree;
+
+        long luceneBytes = directorySizeBytes(trueRefHome.resolve("lucene"));
+        long cacheBytes = directorySizeBytes(trueRefHome.resolve("embedding-cache"));
+
+        Map<String, Object> heap = new HashMap<>();
+        heap.put("usedBytes", heapUsed);
+        heap.put("totalBytes", heapTotal);
+        heap.put("maxBytes", heapMax);
+
+        Map<String, Object> result = new HashMap<>();
+        result.put("heap", heap);
+        result.put("luceneIndexBytes", luceneBytes);
+        result.put("embeddingCacheBytes", cacheBytes);
+        result.put("trueRefHome", trueRefHome.toAbsolutePath().toString());
+        result.put("gpu", queryGpuMemory(gpuDeviceId));
+        return result;
+    }
+
+    /**
+     * Queries {@code nvidia-smi} for memory stats of the given GPU device index.
+     * Returns {@code null} when nvidia-smi is absent or the device index is out of range.
+     */
+    private @Nullable Map<String, Object> queryGpuMemory(int deviceId) {
+        try {
+            Process proc = new ProcessBuilder(
+                            "nvidia-smi",
+                            "--query-gpu=memory.used,memory.free,memory.total",
+                            "--format=csv,noheader,nounits",
+                            "-i",
+                            String.valueOf(deviceId))
+                    .redirectErrorStream(true)
+                    .start();
+            String line;
+            try (BufferedReader br =
+                    new BufferedReader(new InputStreamReader(proc.getInputStream(), StandardCharsets.UTF_8))) {
+                line = br.readLine();
+            }
+            int exit = proc.waitFor();
+            if (exit != 0 || line == null || line.isBlank()) {
+                return null;
+            }
+            // Output: "usedMiB, freeMiB, totalMiB"
+            String[] parts = line.split(",");
+            if (parts.length < 3) return null;
+            long usedMiB = Long.parseLong(parts[0].trim());
+            long freeMiB = Long.parseLong(parts[1].trim());
+            long totalMiB = Long.parseLong(parts[2].trim());
+            long mibToBytes = 1024L * 1024L;
+            Map<String, Object> gpu = new HashMap<>();
+            gpu.put("deviceId", deviceId);
+            gpu.put("usedBytes", usedMiB * mibToBytes);
+            gpu.put("freeBytes", freeMiB * mibToBytes);
+            gpu.put("totalBytes", totalMiB * mibToBytes);
+            return gpu;
+        } catch (IOException | InterruptedException | NumberFormatException e) {
+            log.debug("nvidia-smi query failed: {}", e.toString());
+            return null;
+        }
+    }
+
+    private static Map<String, Long> toStringKeys(Map<JobStatus, Long> in) {
+        Map<String, Long> out = new HashMap<>();
+        in.forEach((k, v) -> out.put(k.name(), v));
+        return out;
+    }
+
+    private static long directorySizeBytes(Path dir) {
+        if (!Files.isDirectory(dir)) {
+            return 0L;
+        }
+        long[] total = {0L};
+        try {
+            Files.walkFileTree(dir, new SimpleFileVisitor<>() {
+                @Override
+                public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) {
+                    total[0] += attrs.size();
+                    return FileVisitResult.CONTINUE;
+                }
+
+                @Override
+                public FileVisitResult visitFileFailed(Path file, IOException exc) {
+                    return FileVisitResult.CONTINUE;
+                }
+            });
+        } catch (IOException e) {
+            log.warn("failed to walk {}: {}", dir, e.toString());
+        }
+        return total[0];
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/OpenApiConfig.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/OpenApiConfig.java
@@ -0,0 +1,25 @@
+package com.trueref.adapter.in.rest;
+
+import io.swagger.v3.oas.models.OpenAPI;
+import io.swagger.v3.oas.models.info.Info;
+import io.swagger.v3.oas.models.servers.Server;
+import java.util.List;
+import org.springframework.context.annotation.Bean;
+import org.springframework.context.annotation.Configuration;
+
+/** Springdoc OpenAPI customization — title, version, summary and default local server. */
+@Configuration
+public class OpenApiConfig {
+
+    @Bean
+    public OpenAPI trueRefOpenApi() {
+        Info info = new Info()
+                .title("trueref API")
+                .version("0.1.0")
+                .summary("Self-hosted Context7-compatible ingestion + retrieval HTTP API.")
+                .description(
+                        "REST endpoints for repository registration, ingestion orchestration, hybrid search and library resolution.");
+        Server local = new Server().url("http://localhost:8080").description("Local development server");
+        return new OpenAPI().info(info).servers(List.of(local));
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/RepositoryController.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/RepositoryController.java
@@ -0,0 +1,156 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.adapter.in.rest.dto.RegisterRepositoryRequest;
+import com.trueref.adapter.in.rest.dto.RepositoryDto;
+import com.trueref.adapter.in.rest.dto.VersionDto;
+import com.trueref.domain.error.RepositoryNotFound;
+import com.trueref.domain.error.TagNotFound;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.port.in.DiscoverVersions;
+import com.trueref.domain.port.in.IndexVersion;
+import com.trueref.domain.port.in.QueryCatalog;
+import com.trueref.domain.port.in.RegisterRepository;
+import io.swagger.v3.oas.annotations.Operation;
+import io.swagger.v3.oas.annotations.tags.Tag;
+import jakarta.validation.Valid;
+import java.util.List;
+import java.util.Map;
+import java.util.Optional;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.http.HttpStatus;
+import org.springframework.http.ResponseEntity;
+import org.springframework.web.bind.annotation.DeleteMapping;
+import org.springframework.web.bind.annotation.GetMapping;
+import org.springframework.web.bind.annotation.PathVariable;
+import org.springframework.web.bind.annotation.PostMapping;
+import org.springframework.web.bind.annotation.RequestBody;
+import org.springframework.web.bind.annotation.RequestMapping;
+import org.springframework.web.bind.annotation.ResponseStatus;
+import org.springframework.web.bind.annotation.RestController;
+
+/** REST resource: {@code /api/repos}. */
+@RestController
+@RequestMapping("/api/repos")
+@Tag(name = "repositories", description = "Register, list, and index git repositories.")
+public class RepositoryController {
+
+    private static final Logger log = LoggerFactory.getLogger(RepositoryController.class);
+
+    private final RegisterRepository registerRepository;
+    private final QueryCatalog queryCatalog;
+    private final DiscoverVersions discoverVersions;
+    private final IndexVersion indexVersion;
+
+    public RepositoryController(
+            RegisterRepository registerRepository,
+            QueryCatalog queryCatalog,
+            DiscoverVersions discoverVersions,
+            IndexVersion indexVersion) {
+        this.registerRepository = registerRepository;
+        this.queryCatalog = queryCatalog;
+        this.discoverVersions = discoverVersions;
+        this.indexVersion = indexVersion;
+    }
+
+    @Operation(summary = "List all registered repositories.")
+    @GetMapping
+    public List<RepositoryDto> list() {
+        return queryCatalog.listRepositories().stream().map(RepositoryDto::of).toList();
+    }
+
+    @Operation(summary = "Register a new repository (remote URL or local path).")
+    @PostMapping
+    @ResponseStatus(HttpStatus.CREATED)
+    public RepositoryDto register(@Valid @RequestBody RegisterRepositoryRequest request) {
+        log.info("registering repository name={}", request.name());
+        Repository repo = registerRepository.register(request.toCommand());
+        return RepositoryDto.of(repo);
+    }
+
+    @Operation(summary = "Get details of a single repository.")
+    @GetMapping("/{id}")
+    public RepositoryDto detail(@PathVariable("id") String id) {
+        RepositoryId repoId = parseRepoId(id);
+        return queryCatalog.findRepository(repoId).map(RepositoryDto::of).orElseThrow(() -> new RepositoryNotFound(id));
+    }
+
+    @Operation(summary = "Unregister a repository and soft-delete its versions.")
+    @DeleteMapping("/{id}")
+    @ResponseStatus(HttpStatus.NO_CONTENT)
+    public void unregister(@PathVariable("id") String id) {
+        RepositoryId repoId = parseRepoId(id);
+        log.info("unregistering repository id={}", id);
+        registerRepository.unregister(repoId);
+    }
+
+    @Operation(summary = "Force tag discovery (git fetch + enumerate tags). Returns the resulting versions.")
+    @PostMapping("/{id}/discover")
+    public List<VersionDto> discover(@PathVariable("id") String id) {
+        RepositoryId repoId = parseRepoId(id);
+        log.info("discovering versions for repository id={}", id);
+        return discoverVersions.discover(repoId).stream().map(VersionDto::of).toList();
+    }
+
+    @Operation(summary = "List all versions (git tags) known for this repository.")
+    @GetMapping("/{id}/versions")
+    public List<VersionDto> versions(@PathVariable("id") String id) {
+        RepositoryId repoId = parseRepoId(id);
+        // Ensure 404 if the repo does not exist.
+        queryCatalog.findRepository(repoId).orElseThrow(() -> new RepositoryNotFound(id));
+        return queryCatalog.listVersions(repoId).stream().map(VersionDto::of).toList();
+    }
+
+    @Operation(summary = "Enqueue indexing of a specific tag. If the tag is unknown, discovery runs first.")
+    @PostMapping("/{id}/versions/{tag}/index")
+    public ResponseEntity<Map<String, String>> index(
+            @PathVariable("id") String id,
+            @PathVariable("tag") String tag,
+            @RequestBody(required = false) IndexBody body) {
+        boolean force = body != null && Boolean.TRUE.equals(body.force());
+        return enqueueIndex(id, tag, force);
+    }
+
+    @Operation(summary = "Force re-indexing of a specific tag (equivalent to index with force=true).")
+    @PostMapping("/{id}/versions/{tag}/reindex")
+    public ResponseEntity<Map<String, String>> reindex(
+            @PathVariable("id") String id, @PathVariable("tag") String tag) {
+        return enqueueIndex(id, tag, true);
+    }
+
+    private ResponseEntity<Map<String, String>> enqueueIndex(String id, String tag, boolean force) {
+        RepositoryId repoId = parseRepoId(id);
+        Repository repo = queryCatalog.findRepository(repoId).orElseThrow(() -> new RepositoryNotFound(id));
+
+        Optional<Version> existing = findByTag(repoId, tag);
+        if (existing.isEmpty()) {
+            log.info("tag {} unknown for repo {}, triggering discovery", tag, repo.name());
+            discoverVersions.discover(repoId);
+            existing = findByTag(repoId, tag);
+        }
+        Version version = existing.orElseThrow(() -> new TagNotFound(repo.name(), tag));
+
+        log.info("enqueueing index job repo={} tag={} force={}", repo.name(), tag, force);
+        var jobId = indexVersion.enqueue(repoId, version.id(), force);
+        return ResponseEntity.accepted().body(Map.of("jobId", jobId.toString()));
+    }
+
+    private Optional<Version> findByTag(RepositoryId repoId, String tag) {
+        return queryCatalog.listVersions(repoId).stream()
+                .filter(v -> v.tag().equals(tag))
+                .findFirst();
+    }
+
+    private static RepositoryId parseRepoId(String id) {
+        try {
+            return RepositoryId.of(id);
+        } catch (IllegalArgumentException e) {
+            throw new RepositoryNotFound(id);
+        }
+    }
+
+    /** Body of {@code POST /api/repos/{id}/versions/{tag}/index}. */
+    public record IndexBody(Boolean force) {}
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ResolveController.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/ResolveController.java
@@ -0,0 +1,41 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.adapter.in.rest.dto.ResolveMatchDto;
+import com.trueref.adapter.in.rest.dto.ResolveRequest;
+import com.trueref.adapter.in.rest.dto.ResolveResponse;
+import com.trueref.domain.port.in.ResolveLibraryId;
+import io.swagger.v3.oas.annotations.Operation;
+import io.swagger.v3.oas.annotations.tags.Tag;
+import org.jspecify.annotations.Nullable;
+import org.springframework.web.bind.annotation.GetMapping;
+import org.springframework.web.bind.annotation.RequestMapping;
+import org.springframework.web.bind.annotation.RequestParam;
+import org.springframework.web.bind.annotation.RestController;
+
+/** REST resource: {@code /api/resolve}. */
+@RestController
+@RequestMapping("/api/resolve")
+@Tag(name = "resolve", description = "Turn a fuzzy library name (and optional version) into concrete (repo, version) handles.")
+public class ResolveController {
+
+    private final ResolveLibraryId resolveLibraryId;
+
+    public ResolveController(ResolveLibraryId resolveLibraryId) {
+        this.resolveLibraryId = resolveLibraryId;
+    }
+
+    @Operation(summary = "Preview library ID resolution for the given name / version.")
+    @GetMapping
+    public ResolveResponse resolve(
+            @RequestParam("libraryName") String libraryName,
+            @RequestParam(value = "version", required = false) @Nullable String version,
+            @RequestParam(value = "query", required = false) @Nullable String query) {
+        if (libraryName == null || libraryName.isBlank()) {
+            throw new IllegalArgumentException("libraryName must not be blank");
+        }
+        ResolveRequest req = new ResolveRequest(libraryName, query, version);
+        ResolveLibraryId.Result result = resolveLibraryId.resolve(req.toQuery());
+        return new ResolveResponse(
+                result.matches().stream().map(ResolveMatchDto::of).toList());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/SearchController.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/SearchController.java
@@ -0,0 +1,47 @@
+package com.trueref.adapter.in.rest;
+
+import com.trueref.adapter.in.rest.dto.SearchHitDto;
+import com.trueref.adapter.in.rest.dto.SearchRequest;
+import com.trueref.adapter.in.rest.dto.SearchResponse;
+import com.trueref.domain.port.in.SearchLibraryDocs;
+import io.swagger.v3.oas.annotations.Operation;
+import io.swagger.v3.oas.annotations.tags.Tag;
+import jakarta.validation.Valid;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.web.bind.annotation.PostMapping;
+import org.springframework.web.bind.annotation.RequestBody;
+import org.springframework.web.bind.annotation.RequestMapping;
+import org.springframework.web.bind.annotation.RestController;
+
+/** REST resource: {@code /api/search}. */
+@RestController
+@RequestMapping("/api/search")
+@Tag(name = "search", description = "Hybrid BM25 + dense search with cross-encoder rerank.")
+public class SearchController {
+
+    private static final Logger log = LoggerFactory.getLogger(SearchController.class);
+
+    private final SearchLibraryDocs searchLibraryDocs;
+
+    public SearchController(SearchLibraryDocs searchLibraryDocs) {
+        this.searchLibraryDocs = searchLibraryDocs;
+    }
+
+    @Operation(summary = "Hybrid search scoped to one or more (repo, version) pairs.")
+    @PostMapping
+    public SearchResponse search(@Valid @RequestBody SearchRequest request) {
+        log.debug(
+                "search text='{}' topic={} scopes={} tokensBudget={} maxHits={}",
+                request.text(),
+                request.topic(),
+                request.scope().size(),
+                request.tokensBudget(),
+                request.maxHits());
+        SearchLibraryDocs.Result result = searchLibraryDocs.search(request.toQuery());
+        return new SearchResponse(
+                result.hits().stream().map(SearchHitDto::of).toList(),
+                result.totalTokensReturned(),
+                request.topic());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/WebConfig.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/WebConfig.java
@@ -0,0 +1,54 @@
+package com.trueref.adapter.in.rest;
+
+import java.io.IOException;
+import java.util.Set;
+import org.jspecify.annotations.Nullable;
+import org.springframework.context.annotation.Configuration;
+import org.springframework.core.io.Resource;
+import org.springframework.web.servlet.config.annotation.ResourceHandlerRegistry;
+import org.springframework.web.servlet.config.annotation.WebMvcConfigurer;
+import org.springframework.web.servlet.resource.PathResourceResolver;
+
+/**
+ * SPA fallback: any unmatched request that looks like a client-side route (no file extension in
+ * the final path segment) is served {@code index.html}. API, MCP, springdoc and actuator paths are
+ * explicitly excluded — Spring routes those first anyway, but we exclude them defensively so the
+ * resource resolver does not attempt a fallback for them.
+ */
+@Configuration
+public class WebConfig implements WebMvcConfigurer {
+
+    private static final Set<String> EXCLUDED_PREFIXES =
+            Set.of("api/", "mcp", "swagger-ui/", "v3/api-docs", "actuator/");
+
+    @Override
+    public void addResourceHandlers(ResourceHandlerRegistry registry) {
+        registry.addResourceHandler("/**")
+                .addResourceLocations("classpath:/static/")
+                .resourceChain(true)
+                .addResolver(new SpaFallbackResolver());
+    }
+
+    static final class SpaFallbackResolver extends PathResourceResolver {
+        @Override
+        protected @Nullable Resource getResource(String resourcePath, Resource location) throws IOException {
+            for (String prefix : EXCLUDED_PREFIXES) {
+                if (resourcePath.equals(prefix) || resourcePath.startsWith(prefix)) {
+                    return null;
+                }
+            }
+            Resource requested = location.createRelative(resourcePath);
+            if (requested.exists() && requested.isReadable()) {
+                return requested;
+            }
+            // Fallback to index.html only for client-side route-like paths (no extension on last segment).
+            int lastSlash = resourcePath.lastIndexOf('/');
+            String lastSegment = lastSlash < 0 ? resourcePath : resourcePath.substring(lastSlash + 1);
+            if (!lastSegment.isEmpty() && lastSegment.contains(".")) {
+                return null;
+            }
+            Resource indexHtml = location.createRelative("index.html");
+            return indexHtml.exists() && indexHtml.isReadable() ? indexHtml : null;
+        }
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobDto.java
@@ -0,0 +1,33 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.JobType;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.time.Instant;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "An orchestrated ingestion job with its stages.")
+public record JobDto(
+        String id,
+        String repoId,
+        @Nullable String versionId,
+        JobType type,
+        JobStatus status,
+        @Nullable Instant startedAt,
+        @Nullable Instant finishedAt,
+        List<JobStageDto> stages) {
+
+    public static JobDto of(IngestionJob j) {
+        return new JobDto(
+                j.id().toString(),
+                j.repoId().toString(),
+                j.versionId() == null ? null : j.versionId().toString(),
+                j.type(),
+                j.status(),
+                j.startedAt(),
+                j.finishedAt(),
+                j.stages().stream().map(JobStageDto::of).toList());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobLogEventDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobLogEventDto.java
@@ -0,0 +1,16 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.JobLogEvent;
+import com.trueref.domain.model.JobStage;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A single log event emitted by an ingestion job.")
+public record JobLogEventDto(
+        String jobId, Instant ts, JobLogEvent.Level level, JobStage.@Nullable StageName stage, String message) {
+
+    public static JobLogEventDto of(JobLogEvent e) {
+        return new JobLogEventDto(e.jobId().toString(), e.ts(), e.level(), e.stage(), e.message());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobStageDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/JobStageDto.java
@@ -0,0 +1,32 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.JobStage;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A single stage within an ingestion job.")
+public record JobStageDto(
+        String jobId,
+        JobStage.StageName name,
+        JobStage.StageStatus status,
+        @Nullable Instant startedAt,
+        @Nullable Instant finishedAt,
+        long itemsProcessed,
+        long itemsTotal,
+        long bytesProcessed,
+        @Nullable String errorMessage) {
+
+    public static JobStageDto of(JobStage s) {
+        return new JobStageDto(
+                s.jobId().toString(),
+                s.name(),
+                s.status(),
+                s.startedAt(),
+                s.finishedAt(),
+                s.itemsProcessed(),
+                s.itemsTotal(),
+                s.bytesProcessed(),
+                s.errorMessage());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/RegisterRepositoryRequest.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/RegisterRepositoryRequest.java
@@ -0,0 +1,45 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.TagPattern;
+import com.trueref.domain.port.in.RegisterRepository;
+import io.swagger.v3.oas.annotations.media.Schema;
+import jakarta.validation.Valid;
+import jakarta.validation.constraints.NotBlank;
+import java.time.Duration;
+import java.time.format.DateTimeParseException;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "Request payload to register a new repository.")
+public record RegisterRepositoryRequest(
+        @NotBlank @Schema(description = "Human-readable display name, e.g. spring-projects/spring-boot") String name,
+        @Schema(description = "Remote git URL (mutually exclusive with localPath)") @Nullable String remoteUrl,
+        @Schema(description = "Absolute local path to an already-cloned repo") @Nullable String localPath,
+        @Schema(description = "Per-repo ignore globs, ANDed with .gitignore") @Nullable List<String> ignoreGlobs,
+        @Schema(description = "Max file size in bytes; default 1MiB") @Nullable Long maxFileSizeBytes,
+        @Schema(description = "ISO-8601 duration (e.g. PT1H); 0 disables polling") @Nullable String pollInterval,
+        @Schema(description = "Max most-recent tags auto-indexed") @Nullable Integer tagCap,
+        @Schema(description = "Ordered tag-pattern rules for client version → tag mapping") @Valid @Nullable
+                List<TagPatternDto> versionMappingRules) {
+
+    public RegisterRepository.Command toCommand() {
+        Duration poll = parseDuration(pollInterval);
+        List<String> globs = ignoreGlobs == null ? List.of() : List.copyOf(ignoreGlobs);
+        List<TagPattern> rules = versionMappingRules == null
+                ? List.of()
+                : versionMappingRules.stream().map(TagPatternDto::toModel).toList();
+        return new RegisterRepository.Command(
+                name, remoteUrl, localPath, globs, maxFileSizeBytes, poll, tagCap, rules);
+    }
+
+    private static @Nullable Duration parseDuration(@Nullable String iso) {
+        if (iso == null || iso.isBlank()) {
+            return null;
+        }
+        try {
+            return Duration.parse(iso);
+        } catch (DateTimeParseException e) {
+            throw new IllegalArgumentException("Invalid ISO-8601 duration: " + iso, e);
+        }
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/RepositoryDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/RepositoryDto.java
@@ -0,0 +1,39 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.Repository;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.time.Instant;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A registered repository.")
+public record RepositoryDto(
+        String id,
+        String name,
+        @Nullable String remoteUrl,
+        String localPath,
+        boolean managedClone,
+        List<String> ignoreGlobs,
+        long maxFileSizeBytes,
+        @Schema(description = "ISO-8601 duration, e.g. PT1H") String pollInterval,
+        int tagCap,
+        List<TagPatternDto> versionMappingRules,
+        Instant createdAt,
+        Instant updatedAt) {
+
+    public static RepositoryDto of(Repository repo) {
+        return new RepositoryDto(
+                repo.id().toString(),
+                repo.name(),
+                repo.remoteUrl(),
+                repo.localPath(),
+                repo.managedClone(),
+                List.copyOf(repo.ignoreGlobs()),
+                repo.maxFileSizeBytes(),
+                repo.pollInterval().toString(),
+                repo.tagCap(),
+                repo.versionMappingRules().stream().map(TagPatternDto::of).toList(),
+                repo.createdAt(),
+                repo.updatedAt());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveMatchDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveMatchDto.java
@@ -0,0 +1,28 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.port.in.ResolveLibraryId;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A single candidate library matching a resolve request.")
+public record ResolveMatchDto(
+        String repoId,
+        String libraryId,
+        String name,
+        @Nullable String description,
+        int snippetCount,
+        List<ResolveVersionRefDto> availableVersions,
+        double score) {
+
+    public static ResolveMatchDto of(ResolveLibraryId.Match m) {
+        return new ResolveMatchDto(
+                m.repoId().toString(),
+                m.libraryId(),
+                m.name(),
+                m.description(),
+                m.snippetCount(),
+                m.availableVersions().stream().map(ResolveVersionRefDto::of).toList(),
+                m.score());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveRequest.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveRequest.java
@@ -0,0 +1,17 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.port.in.ResolveLibraryId;
+import io.swagger.v3.oas.annotations.media.Schema;
+import jakarta.validation.constraints.NotBlank;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "Fuzzy library resolution request.")
+public record ResolveRequest(
+        @NotBlank String libraryName,
+        @Schema(description = "Optional hint to rerank candidates by relevance") @Nullable String query,
+        @Nullable String version) {
+
+    public ResolveLibraryId.Query toQuery() {
+        return new ResolveLibraryId.Query(libraryName, query, version);
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveResponse.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveResponse.java
@@ -0,0 +1,7 @@
+package com.trueref.adapter.in.rest.dto;
+
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.util.List;
+
+@Schema(description = "Ranked library matches for a resolve request.")
+public record ResolveResponse(List<ResolveMatchDto> matches) {}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveVersionRefDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/ResolveVersionRefDto.java
@@ -0,0 +1,13 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.ResolveLibraryId;
+import io.swagger.v3.oas.annotations.media.Schema;
+
+@Schema(description = "One available version of a resolved library.")
+public record ResolveVersionRefDto(String versionId, String tag, VersionStatus status) {
+
+    public static ResolveVersionRefDto of(ResolveLibraryId.VersionRef v) {
+        return new ResolveVersionRefDto(v.versionId().toString(), v.tag(), v.status());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchHitDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchHitDto.java
@@ -0,0 +1,37 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.SearchHit;
+import io.swagger.v3.oas.annotations.media.Schema;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A single ranked snippet returned by search.")
+public record SearchHitDto(
+        String chunkId,
+        String repoId,
+        String versionId,
+        String repoName,
+        String tag,
+        String filePath,
+        int startLine,
+        int endLine,
+        String language,
+        @Nullable String symbol,
+        String content,
+        double score) {
+
+    public static SearchHitDto of(SearchHit h) {
+        return new SearchHitDto(
+                h.chunkId().toString(),
+                h.repoId().toString(),
+                h.versionId().toString(),
+                h.repoName(),
+                h.tag(),
+                h.filePath(),
+                h.startLine(),
+                h.endLine(),
+                h.language(),
+                h.symbol(),
+                h.content(),
+                h.score());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchRequest.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchRequest.java
@@ -0,0 +1,41 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.SearchScope;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.port.in.SearchLibraryDocs;
+import io.swagger.v3.oas.annotations.media.Schema;
+import jakarta.validation.Valid;
+import jakarta.validation.constraints.NotBlank;
+import jakarta.validation.constraints.NotEmpty;
+import jakarta.validation.constraints.Positive;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "Hybrid search request scoped to one or more (repo, version) pairs.")
+public record SearchRequest(
+        @NotBlank String text,
+        @Nullable String topic,
+        @NotEmpty @Valid List<ScopeRef> scope,
+        @Schema(description = "Token budget; clamped by the service to [500, 50000]") @Positive @Nullable
+                Integer tokensBudget,
+        @Positive @Nullable Integer maxHits) {
+
+    public static final int DEFAULT_TOKENS_BUDGET = 5000;
+    public static final int DEFAULT_MAX_HITS = 20;
+
+    public SearchLibraryDocs.Query toQuery() {
+        List<SearchScope.RepoVersionRef> refs = scope.stream()
+                .map(r -> new SearchScope.RepoVersionRef(RepositoryId.of(r.repoId()), VersionId.of(r.versionId())))
+                .toList();
+        return new SearchLibraryDocs.Query(
+                text,
+                topic,
+                new SearchScope(refs),
+                tokensBudget == null ? DEFAULT_TOKENS_BUDGET : tokensBudget,
+                maxHits == null ? DEFAULT_MAX_HITS : maxHits);
+    }
+
+    @Schema(description = "A (repo, version) pair to scope the search on.")
+    public record ScopeRef(@NotBlank String repoId, @NotBlank String versionId) {}
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchResponse.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/SearchResponse.java
@@ -0,0 +1,8 @@
+package com.trueref.adapter.in.rest.dto;
+
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "Response body for search.")
+public record SearchResponse(List<SearchHitDto> hits, int totalTokensReturned, @Nullable String topic) {}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/TagPatternDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/TagPatternDto.java
@@ -0,0 +1,41 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.TagPattern;
+import io.swagger.v3.oas.annotations.media.Schema;
+import org.jspecify.annotations.Nullable;
+
+/** DTO for {@link TagPattern}. The sealed hierarchy is flattened into a {@code type} + optional {@code template}. */
+@Schema(description = "Rule mapping a client-supplied version string to a git tag.")
+public record TagPatternDto(
+        @Schema(
+                        description = "Pattern kind",
+                        allowableValues = {"EXACT", "V_PREFIX", "RELEASE_PREFIX", "SEMVER_FUZZY", "CUSTOM"})
+                String type,
+        @Schema(description = "Required when type=CUSTOM, e.g. release-{semver}") @Nullable String template) {
+
+    public static TagPatternDto of(TagPattern pattern) {
+        return switch (pattern) {
+            case TagPattern.Exact e -> new TagPatternDto("EXACT", null);
+            case TagPattern.VPrefix v -> new TagPatternDto("V_PREFIX", null);
+            case TagPattern.ReleasePrefix r -> new TagPatternDto("RELEASE_PREFIX", null);
+            case TagPattern.SemverFuzzy s -> new TagPatternDto("SEMVER_FUZZY", null);
+            case TagPattern.Custom c -> new TagPatternDto("CUSTOM", c.template());
+        };
+    }
+
+    public TagPattern toModel() {
+        return switch (type) {
+            case "EXACT" -> new TagPattern.Exact();
+            case "V_PREFIX" -> new TagPattern.VPrefix();
+            case "RELEASE_PREFIX" -> new TagPattern.ReleasePrefix();
+            case "SEMVER_FUZZY" -> new TagPattern.SemverFuzzy();
+            case "CUSTOM" -> {
+                if (template == null || template.isBlank()) {
+                    throw new IllegalArgumentException("CUSTOM tag pattern requires a template");
+                }
+                yield new TagPattern.Custom(template);
+            }
+            default -> throw new IllegalArgumentException("Unknown tag pattern type: " + type);
+        };
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/VersionDto.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/VersionDto.java
@@ -0,0 +1,31 @@
+package com.trueref.adapter.in.rest.dto;
+
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionStatus;
+import io.swagger.v3.oas.annotations.media.Schema;
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+@Schema(description = "A specific git tag (or branch) of a repository.")
+public record VersionDto(
+        String id,
+        String repoId,
+        String tag,
+        String commitSha,
+        VersionStatus status,
+        @Nullable Instant indexedAt,
+        int chunkCount,
+        @Nullable String errorMessage) {
+
+    public static VersionDto of(Version v) {
+        return new VersionDto(
+                v.id().toString(),
+                v.repoId().toString(),
+                v.tag(),
+                v.commitSha(),
+                v.status(),
+                v.indexedAt(),
+                v.chunkCount(),
+                v.errorMessage());
+    }
+}
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/package-info.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/dto/package-info.java
@@ -0,0 +1,3 @@
+/** Transport-layer DTO records used by {@code com.trueref.adapter.in.rest} controllers. */
+@org.jspecify.annotations.NullMarked
+package com.trueref.adapter.in.rest.dto;
--- a/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/package-info.java
+++ b/trueref-adapters/src/main/java/com/trueref/adapter/in/rest/package-info.java
@@ -0,0 +1,6 @@
+/**
+ * REST driving adapter: controllers, DTOs, OpenAPI configuration, exception handling and SSE
+ * streaming for the trueref HTTP API.
+ */
+@org.jspecify.annotations.NullMarked
+package com.trueref.adapter.in.rest;
--- a/trueref-adapters/src/main/resources/db/migration/V1__init_schema.sql
+++ b/trueref-adapters/src/main/resources/db/migration/V1__init_schema.sql
@@ -0,0 +1,67 @@
+-- trueref schema V1
+-- All UUIDs stored as CHAR(36) for H2 portability.
+
+CREATE TABLE repositories (
+    id                       CHAR(36)      PRIMARY KEY,
+    name                     VARCHAR(512)  NOT NULL UNIQUE,
+    remote_url               VARCHAR(2048) NULL,
+    local_path               VARCHAR(2048) NOT NULL,
+    managed_clone            BOOLEAN       NOT NULL,
+    ignore_globs             CLOB          NOT NULL,        -- JSON array of strings
+    max_file_size_bytes      BIGINT        NOT NULL,
+    poll_interval_seconds    BIGINT        NOT NULL,
+    tag_cap                  INT           NOT NULL,
+    version_mapping_rules    CLOB          NOT NULL,        -- JSON array of TagPattern
+    created_at               TIMESTAMP(9) WITH TIME ZONE NOT NULL,
+    updated_at               TIMESTAMP(9) WITH TIME ZONE NOT NULL
+);
+
+CREATE TABLE versions (
+    id              CHAR(36)      PRIMARY KEY,
+    repo_id         CHAR(36)      NOT NULL REFERENCES repositories(id) ON DELETE CASCADE,
+    tag             VARCHAR(512)  NOT NULL,
+    commit_sha      CHAR(40)      NOT NULL,
+    status          VARCHAR(32)   NOT NULL,
+    indexed_at      TIMESTAMP(9) WITH TIME ZONE NULL,
+    chunk_count     INT           NOT NULL DEFAULT 0,
+    error_message   CLOB          NULL,
+    UNIQUE (repo_id, tag)
+);
+CREATE INDEX idx_versions_repo_status ON versions(repo_id, status);
+
+CREATE TABLE ingestion_jobs (
+    id           CHAR(36)     PRIMARY KEY,
+    repo_id      CHAR(36)     NOT NULL REFERENCES repositories(id) ON DELETE CASCADE,
+    version_id   CHAR(36)     NULL     REFERENCES versions(id) ON DELETE CASCADE,
+    type         VARCHAR(32)  NOT NULL,
+    status       VARCHAR(32)  NOT NULL,
+    started_at   TIMESTAMP(9) WITH TIME ZONE NULL,
+    finished_at  TIMESTAMP(9) WITH TIME ZONE NULL,
+    created_at   TIMESTAMP(9) WITH TIME ZONE NOT NULL
+);
+CREATE INDEX idx_jobs_repo_status ON ingestion_jobs(repo_id, status);
+CREATE INDEX idx_jobs_status_created ON ingestion_jobs(status, created_at);
+
+CREATE TABLE job_stages (
+    job_id           CHAR(36)     NOT NULL REFERENCES ingestion_jobs(id) ON DELETE CASCADE,
+    name             VARCHAR(32)  NOT NULL,
+    status           VARCHAR(32)  NOT NULL,
+    started_at       TIMESTAMP(9) WITH TIME ZONE NULL,
+    finished_at      TIMESTAMP(9) WITH TIME ZONE NULL,
+    items_processed  BIGINT       NOT NULL DEFAULT 0,
+    items_total      BIGINT       NOT NULL DEFAULT 0,
+    bytes_processed  BIGINT       NOT NULL DEFAULT 0,
+    error_message    CLOB         NULL,
+    PRIMARY KEY (job_id, name)
+);
+
+-- Persisted log buffer (last N per job kept by application logic; SSE streams from in-memory bus).
+CREATE TABLE job_log_events (
+    id        BIGINT       GENERATED BY DEFAULT AS IDENTITY PRIMARY KEY,
+    job_id    CHAR(36)     NOT NULL REFERENCES ingestion_jobs(id) ON DELETE CASCADE,
+    ts        TIMESTAMP(9) WITH TIME ZONE NOT NULL,
+    level     VARCHAR(8)   NOT NULL,
+    stage     VARCHAR(32),
+    message   CLOB         NOT NULL
+);
+CREATE INDEX idx_job_log_job_ts ON job_log_events(job_id, ts);
--- a/trueref-application/pom.xml
+++ b/trueref-application/pom.xml
@@ -0,0 +1,28 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <parent>
+        <groupId>com.trueref</groupId>
+        <artifactId>trueref-parent</artifactId>
+        <version>0.1.0-SNAPSHOT</version>
+    </parent>
+
+    <artifactId>trueref-application</artifactId>
+    <name>trueref-application</name>
+    <description>Use-case implementations. Depends only on the domain.</description>
+
+    <dependencies>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-domain</artifactId>
+        </dependency>
+        <!-- SLF4J only; orchestration may use virtual threads via JDK -->
+        <dependency>
+            <groupId>org.slf4j</groupId>
+            <artifactId>slf4j-api</artifactId>
+        </dependency>
+    </dependencies>
+</project>
--- a/trueref-application/src/main/java/com/trueref/application/catalog/CatalogService.java
+++ b/trueref-application/src/main/java/com/trueref/application/catalog/CatalogService.java
@@ -0,0 +1,89 @@
+package com.trueref.application.catalog;
+
+import com.trueref.domain.error.RepositoryAlreadyRegistered;
+import com.trueref.domain.error.RepositoryNotFound;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.TagPattern;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.port.in.QueryCatalog;
+import com.trueref.domain.port.in.RegisterRepository;
+import com.trueref.domain.port.out.RepositoryStore;
+import java.nio.file.Path;
+import java.time.Duration;
+import java.time.Instant;
+import java.util.List;
+import java.util.Optional;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+/** Implements {@link RegisterRepository} and {@link QueryCatalog}. */
+public final class CatalogService implements RegisterRepository, QueryCatalog {
+
+    private static final Logger log = LoggerFactory.getLogger(CatalogService.class);
+
+    private static final List<TagPattern> DEFAULT_RULES = List.of(
+            new TagPattern.Exact(),
+            new TagPattern.VPrefix(),
+            new TagPattern.ReleasePrefix(),
+            new TagPattern.SemverFuzzy());
+
+    private final RepositoryStore store;
+    private final Path trueRefHome;
+
+    public CatalogService(RepositoryStore store, Path trueRefHome) {
+        this.store = store;
+        this.trueRefHome = trueRefHome;
+    }
+
+    @Override
+    public Repository register(Command cmd) {
+        store.findByName(cmd.name()).ifPresent(r -> {
+            throw new RepositoryAlreadyRegistered(cmd.name());
+        });
+        boolean managed = cmd.remoteUrl() != null && cmd.localPath() == null;
+        String localPath = cmd.localPath() != null
+                ? cmd.localPath()
+                : trueRefHome.resolve("repos").resolve(cmd.name().replace('/', '_')).toString();
+        Instant now = Instant.now();
+        Repository repo = new Repository(
+                RepositoryId.random(),
+                cmd.name(),
+                cmd.remoteUrl(),
+                localPath,
+                managed,
+                cmd.ignoreGlobs(),
+                cmd.maxFileSizeBytes() == null ? 1_048_576L : cmd.maxFileSizeBytes(),
+                cmd.pollInterval() == null ? Duration.ofHours(1) : cmd.pollInterval(),
+                cmd.tagCap() == null ? 100 : cmd.tagCap(),
+                cmd.versionMappingRules().isEmpty() ? DEFAULT_RULES : cmd.versionMappingRules(),
+                now,
+                now);
+        Repository saved = store.save(repo);
+        log.info("registered repository name={} id={} managed={} localPath={}",
+                saved.name(), saved.id(), managed, localPath);
+        return saved;
+    }
+
+    @Override
+    public void unregister(RepositoryId id) {
+        Repository existing = store.findById(id).orElseThrow(() -> new RepositoryNotFound(id.toString()));
+        store.delete(id);
+        log.info("unregistered repository name={} id={}", existing.name(), id);
+    }
+
+    @Override
+    public List<Repository> listRepositories() {
+        return store.findAll();
+    }
+
+    @Override
+    public Optional<Repository> findRepository(RepositoryId id) {
+        return store.findById(id);
+    }
+
+    @Override
+    public List<Version> listVersions(RepositoryId repoId) {
+        return store.findVersionsByRepo(repoId);
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/ingest/DiscoveryService.java
+++ b/trueref-application/src/main/java/com/trueref/application/ingest/DiscoveryService.java
@@ -0,0 +1,87 @@
+package com.trueref.application.ingest;
+
+import com.trueref.domain.error.RepositoryNotFound;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.DiscoverVersions;
+import com.trueref.domain.port.out.GitClient;
+import com.trueref.domain.port.out.RepositoryStore;
+import com.trueref.domain.model.RepositoryId;
+import java.nio.file.Files;
+import java.nio.file.Path;
+import java.util.Comparator;
+import java.util.List;
+import java.util.Optional;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+/** Fetches tags (git fetch + tag list) and persists new/updated {@link Version}s. */
+public final class DiscoveryService implements DiscoverVersions {
+
+    private static final Logger log = LoggerFactory.getLogger(DiscoveryService.class);
+
+    private final RepositoryStore store;
+    private final GitClient git;
+
+    public DiscoveryService(RepositoryStore store, GitClient git) {
+        this.store = store;
+        this.git = git;
+    }
+
+    @Override
+    public List<Version> discover(RepositoryId repoId) {
+        Repository repo = store.findById(repoId).orElseThrow(() -> new RepositoryNotFound(repoId.toString()));
+        Path path = Path.of(repo.localPath());
+
+        // clone if managed and not present
+        if (repo.managedClone() && !Files.exists(path.resolve(".git"))) {
+            log.info("cloning for discovery: {}", repo.name());
+            git.cloneRepo(repo.remoteUrl(), path);
+        } else {
+            try { git.fetch(path); } catch (Exception e) {
+                log.warn("fetch failed for {}: {}", repo.name(), e.toString());
+            }
+        }
+
+        List<GitClient.TagInfo> tags = git.listTags(path);
+        // apply tag cap: keep top-N by epoch DESC (already sorted)
+        List<GitClient.TagInfo> capped = tags.stream().limit(Math.max(1, repo.tagCap())).toList();
+
+        for (GitClient.TagInfo t : capped) {
+            Optional<Version> existing = store.findVersionByTag(repoId, t.name());
+            if (existing.isPresent()) {
+                // refresh commit sha only if changed
+                if (!existing.get().commitSha().equalsIgnoreCase(t.commitSha())) {
+                    Version updated = new Version(
+                            existing.get().id(),
+                            existing.get().repoId(),
+                            t.name(),
+                            t.commitSha(),
+                            VersionStatus.DISCOVERED, // needs re-index
+                            existing.get().indexedAt(),
+                            existing.get().chunkCount(),
+                            null);
+                    store.saveVersion(updated);
+                    log.info("tag {} changed commit; marked DISCOVERED", t.name());
+                }
+            } else {
+                Version v = new Version(
+                        VersionId.random(),
+                        repoId,
+                        t.name(),
+                        t.commitSha(),
+                        VersionStatus.DISCOVERED,
+                        null,
+                        0,
+                        null);
+                store.saveVersion(v);
+                log.info("discovered new tag {}", t.name());
+            }
+        }
+        return store.findVersionsByRepo(repoId).stream()
+                .sorted(Comparator.comparing(Version::tag))
+                .toList();
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/ingest/IngestionOrchestrator.java
+++ b/trueref-application/src/main/java/com/trueref/application/ingest/IngestionOrchestrator.java
@@ -0,0 +1,604 @@
+package com.trueref.application.ingest;
+
+import com.trueref.domain.model.Chunk;
+import com.trueref.domain.model.ChunkId;
+import com.trueref.domain.model.ChunkVersion;
+import com.trueref.domain.model.Embedding;
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.JobLogEvent;
+import com.trueref.domain.model.JobStage;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.JobType;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.IndexVersion;
+import com.trueref.domain.port.out.ChunkStore;
+import com.trueref.domain.port.out.CodeParser;
+import com.trueref.domain.port.out.EmbeddingCache;
+import com.trueref.domain.port.out.EmbeddingService;
+import com.trueref.domain.port.out.GitClient;
+import com.trueref.domain.port.out.JobEventBus;
+import com.trueref.domain.port.out.JobStore;
+import com.trueref.domain.port.out.RepositoryStore;
+import java.io.IOException;
+import java.nio.charset.StandardCharsets;
+import java.nio.file.FileSystems;
+import java.nio.file.FileVisitResult;
+import java.nio.file.Files;
+import java.nio.file.Path;
+import java.nio.file.PathMatcher;
+import java.nio.file.SimpleFileVisitor;
+import java.nio.file.attribute.BasicFileAttributes;
+import java.security.MessageDigest;
+import java.time.Instant;
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.Collection;
+import java.util.HashMap;
+import java.util.HashSet;
+import java.util.List;
+import java.util.Map;
+import java.util.Set;
+import java.util.concurrent.BlockingQueue;
+import java.util.concurrent.ConcurrentHashMap;
+import java.util.concurrent.ExecutorService;
+import java.util.concurrent.Executors;
+import java.util.concurrent.LinkedBlockingQueue;
+import java.util.concurrent.Semaphore;
+import java.util.concurrent.TimeUnit;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.slf4j.MDC;
+
+/**
+ * Orchestrates the full ingestion pipeline for one (repo, version): clone/fetch → checkout →
+ * discover files → diff-vs-parent → parse → chunk → dedupe by hash → embed (with cache) → index
+ * into Lucene → commit.
+ *
+ * <p>The pipeline is split into two concurrent stages:
+ * <ol>
+ *   <li><b>Parse phase</b> (virtual threads, up to {@code maxParseJobs} in parallel):
+ *       FETCH/CLONE → CHECKOUT → DISCOVER_FILES → DIFF_FILES → PARSE.
+ *       I/O-bound; no GPU use; worktree is removed immediately after parse.
+ *   </li>
+ *   <li><b>Embed phase</b> (single dedicated platform thread):
+ *       EMBED → INDEX → COMMIT. GPU-bound; serialises ONNX inference to prevent CUDA
+ *       context races. Runs on a platform thread for a stable OS thread identity.
+ *   </li>
+ * </ol>
+ * Completed parse batches are handed off via a bounded {@link BlockingQueue}: if the embed
+ * worker is busy, parse workers block before queuing, naturally capping in-memory pressure.
+ * One orchestrator instance is shared across all jobs.
+ */
+public final class IngestionOrchestrator implements IndexVersion {
+
+    private static final Logger log = LoggerFactory.getLogger(IngestionOrchestrator.class);
+
+    // Built-in ignore globs (applied in addition to .gitignore + per-repo globs).
+    private static final List<String> BUILTIN_IGNORES = List.of(
+            "**/.git/**",
+            "**/node_modules/**",
+            "**/target/**",
+            "**/build/**",
+            "**/dist/**",
+            "**/out/**",
+            "**/.idea/**",
+            "**/.vscode/**",
+            "**/__pycache__/**",
+            "**/*.png", "**/*.jpg", "**/*.jpeg", "**/*.gif", "**/*.webp", "**/*.ico",
+            "**/*.pdf", "**/*.zip", "**/*.tar", "**/*.gz", "**/*.jar", "**/*.class",
+            "**/*.so", "**/*.dll", "**/*.dylib", "**/*.exe", "**/*.bin");
+
+    private final RepositoryStore repoStore;
+    private final JobStore jobStore;
+    private final ChunkStore chunkStore;
+    private final EmbeddingService embeddings;
+    private final EmbeddingCache embeddingCache;
+    private final GitClient git;
+    private final CodeParser parser;
+    private final JobEventBus bus;
+
+    private final ExecutorService parseExecutor;
+    private final Semaphore parseConcurrencyLimit;
+    private final BlockingQueue<ParsedBatch> embedQueue;
+    private final Thread embedWorker;
+    private volatile boolean shuttingDown = false;
+    private final Map<JobId, Boolean> running = new ConcurrentHashMap<>();
+
+    public IngestionOrchestrator(
+            RepositoryStore repoStore,
+            JobStore jobStore,
+            ChunkStore chunkStore,
+            EmbeddingService embeddings,
+            EmbeddingCache embeddingCache,
+            GitClient git,
+            CodeParser parser,
+            JobEventBus bus,
+            int maxParseJobs,
+            int embedQueueCapacity) {
+        this.repoStore = repoStore;
+        this.jobStore = jobStore;
+        this.chunkStore = chunkStore;
+        this.embeddings = embeddings;
+        this.embeddingCache = embeddingCache;
+        this.git = git;
+        this.parser = parser;
+        this.bus = bus;
+        this.parseExecutor = Executors.newVirtualThreadPerTaskExecutor();
+        // Fair semaphore caps parallel parse jobs (I/O + CPU heavy, no GPU).
+        this.parseConcurrencyLimit = new Semaphore(Math.max(1, maxParseJobs), true);
+        // Bounded queue between parse workers and the embed worker.
+        // Backpressure: parse workers block here when the embed worker is saturated,
+        // preventing unbounded in-memory accumulation of parsed chunks.
+        this.embedQueue = new LinkedBlockingQueue<>(Math.max(1, embedQueueCapacity));
+        // Single platform thread for GPU inference. Platform (not virtual) gives a
+        // stable OS thread identity for CUDA — the synchronized(session) in OnnxEmbeddingService
+        // already pins virtual threads, but a dedicated platform thread removes all doubt.
+        this.embedWorker = Thread.ofPlatform()
+                .name("embed-worker")
+                .daemon(false)
+                .start(this::drainEmbedQueue);
+        log.info("IngestionOrchestrator ready: maxParseJobs={} embedQueueCapacity={}",
+                Math.max(1, maxParseJobs), Math.max(1, embedQueueCapacity));
+    }
+
+    @Override
+    public JobId enqueue(RepositoryId repoId, VersionId versionId, boolean force) {
+        Repository repo = repoStore.findById(repoId).orElseThrow();
+        Version ver = repoStore.findVersion(versionId).orElseThrow();
+        if (!force && ver.status() == VersionStatus.INDEXED) {
+            log.info("version already indexed and not forcing; skipping repo={} tag={}", repo.name(), ver.tag());
+            JobId id = JobId.random();
+            IngestionJob skipped = new IngestionJob(
+                    id,
+                    repoId,
+                    versionId,
+                    JobType.INDEX_VERSION,
+                    JobStatus.SUCCEEDED,
+                    Instant.now(),
+                    Instant.now(),
+                    List.of());
+            jobStore.save(skipped);
+            bus.publishJob(skipped);
+            return id;
+        }
+
+        JobId jobId = JobId.random();
+        IngestionJob job = new IngestionJob(
+                jobId,
+                repoId,
+                versionId,
+                JobType.INDEX_VERSION,
+                JobStatus.QUEUED,
+                null,
+                null,
+                List.of());
+        jobStore.save(job);
+        bus.publishJob(job);
+        running.put(jobId, Boolean.TRUE);
+        parseExecutor.submit(() -> runParsePhase(jobId, repo, ver));
+        return jobId;
+    }
+
+    /**
+     * Carry struct that transfers parse-phase output to the embed worker.
+     * The git worktree has already been removed before this batch enters the queue;
+     * only in-memory chunk data travels across the thread boundary.
+     */
+    private record ParsedBatch(
+            JobId jobId, Repository repo, Version ver, List<ParsedPiece> pieces) {}
+
+    /**
+     * Parse phase — runs on a virtual thread, up to {@code maxParseJobs} in parallel.
+     * Stages: FETCH/CLONE → CHECKOUT → DISCOVER_FILES → DIFF_FILES → PARSE.
+     * On completion, removes the worktree, releases the parse slot so the next job can
+     * start immediately, then blocks on {@link #embedQueue} until the embed worker has
+     * room (natural backpressure).
+     */
+    private void runParsePhase(JobId jobId, Repository repo, Version ver) {
+        MDC.put("jobId", jobId.toString());
+        MDC.put("repo", repo.name());
+        MDC.put("tag", ver.tag());
+        try {
+            parseConcurrencyLimit.acquire();
+        } catch (InterruptedException e) {
+            Thread.currentThread().interrupt();
+            log.warn("job {} interrupted while waiting for parse slot — failing", jobId);
+            repoStore.updateVersionStatus(ver.id(), VersionStatus.FAILED, "interrupted");
+            transitionJob(jobId, JobStatus.FAILED, null, Instant.now());
+            running.remove(jobId);
+            MDC.clear();
+            return;
+        }
+        boolean slotReleased = false;
+        Path worktree = null;
+        Path repoPath = Path.of(repo.localPath());
+        try {
+            transitionJob(jobId, JobStatus.RUNNING, Instant.now(), null);
+            repoStore.updateVersionStatus(ver.id(), VersionStatus.INDEXING, null);
+
+            // STAGE: FETCH (or CLONE if managed and absent)
+            if (repo.managedClone() && !Files.exists(repoPath.resolve(".git"))) {
+                stage(jobId, JobStage.StageName.CLONE, () -> {
+                    logEvent(jobId, JobLogEvent.Level.INFO, JobStage.StageName.CLONE,
+                            "cloning " + repo.remoteUrl() + " → " + repoPath);
+                    git.cloneRepo(repo.remoteUrl(), repoPath);
+                    return 1L;
+                });
+            } else {
+                stage(jobId, JobStage.StageName.FETCH, () -> {
+                    git.fetch(repoPath);
+                    return 1L;
+                });
+            }
+
+            // STAGE: CHECKOUT
+            final Path wt = stageReturning(jobId, JobStage.StageName.CHECKOUT, () -> {
+                Path w = git.checkoutWorktree(repoPath, ver.tag());
+                logEvent(jobId, JobLogEvent.Level.INFO, JobStage.StageName.CHECKOUT,
+                        "checked out at " + w);
+                return w;
+            });
+            worktree = wt;
+
+            // STAGE: DISCOVER_FILES
+            List<Path> files = stageReturning(jobId, JobStage.StageName.DISCOVER_FILES, () ->
+                    discoverFiles(wt, repo));
+            logEvent(jobId, JobLogEvent.Level.INFO, JobStage.StageName.DISCOVER_FILES,
+                    "found " + files.size() + " indexable files");
+
+            // STAGE: DIFF_FILES (select subset)
+            String baseRef = pickParentIndexedTag(repo, ver);
+            final List<Path> selectedFiles;
+            if (baseRef != null) {
+                Set<String> changedRel = stageReturning(jobId, JobStage.StageName.DIFF_FILES, () -> {
+                    List<GitClient.DiffEntry> diff = git.diff(repoPath, baseRef, ver.tag());
+                    Set<String> s = new HashSet<>();
+                    for (GitClient.DiffEntry e : diff) {
+                        if (e.change() != GitClient.DiffEntry.ChangeType.DELETED) s.add(e.path());
+                    }
+                    return s;
+                });
+                selectedFiles = files.stream()
+                        .filter(f -> changedRel.contains(wt.relativize(f).toString().replace('\\', '/')))
+                        .toList();
+                logEvent(jobId, JobLogEvent.Level.INFO, JobStage.StageName.DIFF_FILES,
+                        "diff vs " + baseRef + " selects " + selectedFiles.size() + "/" + files.size());
+            } else {
+                selectedFiles = files;
+            }
+
+            // STAGE: PARSE + CHUNK + HASH (combined)
+            List<ParsedPiece> pieces = stageReturning(jobId, JobStage.StageName.PARSE, () ->
+                    parseAll(selectedFiles, wt));
+
+            // Worktree no longer needed — free disk space before blocking on embed queue.
+            removeWorktreeQuietly(jobId, repoPath, wt);
+            worktree = null;
+
+            // Release parse slot before blocking so the next job can start parsing
+            // while this batch waits for the embed worker (maximises CPU/GPU overlap).
+            parseConcurrencyLimit.release();
+            slotReleased = true;
+
+            // Hand off to embed worker — blocks if the queue is at capacity.
+            embedQueue.put(new ParsedBatch(jobId, repo, ver, pieces));
+
+        } catch (InterruptedException e) {
+            Thread.currentThread().interrupt();
+            log.warn("job {} interrupted during parse — failing", jobId);
+            repoStore.updateVersionStatus(ver.id(), VersionStatus.FAILED, "interrupted");
+            transitionJob(jobId, JobStatus.FAILED, null, Instant.now());
+            running.remove(jobId);
+        } catch (Exception e) {
+            log.error("parse phase failed for job {}", jobId, e);
+            logEvent(jobId, JobLogEvent.Level.ERROR, null, "parse phase failed: " + e.getMessage());
+            repoStore.updateVersionStatus(ver.id(), VersionStatus.FAILED, e.getMessage());
+            transitionJob(jobId, JobStatus.FAILED, null, Instant.now());
+            running.remove(jobId);
+        } finally {
+            if (worktree != null) removeWorktreeQuietly(jobId, repoPath, worktree);
+            if (!slotReleased) parseConcurrencyLimit.release();
+            MDC.clear();
+        }
+    }
+
+    /**
+     * Embed worker — runs on a single dedicated platform thread.
+     * Drains {@link #embedQueue} until {@link #shutdown()} signals stop.
+     * Stages per batch: EMBED → INDEX → COMMIT → mark version indexed → transition SUCCEEDED.
+     */
+    private void drainEmbedQueue() {
+        log.info("embed worker started ({})", Thread.currentThread().getName());
+        while (!shuttingDown || !embedQueue.isEmpty()) {
+            ParsedBatch batch;
+            try {
+                batch = embedQueue.poll(500, TimeUnit.MILLISECONDS);
+            } catch (InterruptedException e) {
+                Thread.currentThread().interrupt();
+                break;
+            }
+            if (batch == null) continue;
+            runEmbedPhase(batch);
+        }
+        log.info("embed worker stopped");
+    }
+
+    private void runEmbedPhase(ParsedBatch batch) {
+        MDC.put("jobId", batch.jobId().toString());
+        MDC.put("repo", batch.repo().name());
+        MDC.put("tag", batch.ver().tag());
+        try {
+            // STAGE: EMBED
+            List<Chunk> chunks = stageReturning(batch.jobId(), JobStage.StageName.EMBED, () ->
+                    embedAll(batch.jobId(), batch.pieces()));
+
+            // STAGE: INDEX
+            stage(batch.jobId(), JobStage.StageName.INDEX, () -> {
+                chunkStore.unlinkVersion(batch.ver().id());
+                List<ChunkVersion> links = buildLinks(batch.ver().id(), batch.pieces());
+                chunkStore.linkChunks(links);
+                return (long) links.size();
+            });
+
+            // STAGE: COMMIT
+            stage(batch.jobId(), JobStage.StageName.COMMIT, () -> {
+                chunkStore.commit();
+                return 1L;
+            });
+
+            repoStore.markVersionIndexed(batch.ver().id(), batch.pieces().size());
+            transitionJob(batch.jobId(), JobStatus.SUCCEEDED, null, Instant.now());
+            logEvent(batch.jobId(), JobLogEvent.Level.INFO, null,
+                    "indexed " + chunks.size() + " chunks across " + batch.pieces().size() + " pieces");
+        } catch (Exception e) {
+            log.error("embed phase failed for job {}", batch.jobId(), e);
+            logEvent(batch.jobId(), JobLogEvent.Level.ERROR, null,
+                    "embed phase failed: " + e.getMessage());
+            repoStore.updateVersionStatus(batch.ver().id(), VersionStatus.FAILED, e.getMessage());
+            transitionJob(batch.jobId(), JobStatus.FAILED, null, Instant.now());
+        } finally {
+            running.remove(batch.jobId());
+            MDC.clear();
+        }
+    }
+
+    private void removeWorktreeQuietly(JobId jobId, Path repoPath, Path worktree) {
+        try {
+            git.removeWorktree(repoPath, worktree);
+        } catch (Exception e) {
+            logEvent(jobId, JobLogEvent.Level.WARN, null, "worktree cleanup failed: " + e.getMessage());
+        }
+    }
+
+    /**
+     * Orderly shutdown: stops the parse executor, signals the embed worker to stop after
+     * finishing its current batch, then fails any batches still in the queue.
+     * Called by Spring via {@code @Bean(destroyMethod = "shutdown")} in ApplicationBeans.
+     */
+    void shutdown() {
+        log.info("IngestionOrchestrator shutting down — stopping embed worker");
+        shuttingDown = true;
+        parseExecutor.shutdownNow();
+        embedWorker.interrupt();
+        try {
+            embedWorker.join(10_000);
+        } catch (InterruptedException e) {
+            Thread.currentThread().interrupt();
+        }
+        // Fail any batches that parsed OK but never got to embed (restart mid-queue).
+        ParsedBatch orphan;
+        while ((orphan = embedQueue.poll()) != null) {
+            log.warn("failing orphaned embed batch for job {} (shutdown)", orphan.jobId());
+            repoStore.updateVersionStatus(orphan.ver().id(), VersionStatus.FAILED, "application shutdown");
+            transitionJob(orphan.jobId(), JobStatus.FAILED, null, Instant.now());
+            running.remove(orphan.jobId());
+        }
+    }
+
+    /* ------------------------------------------------------------------ */
+    /* Stage helpers                                                       */
+    /* ------------------------------------------------------------------ */
+
+    private interface StageBody {
+        long execute() throws Exception;
+    }
+
+    private interface StageBodyReturning<T> {
+        T execute() throws Exception;
+    }
+
+    private void stage(JobId id, JobStage.StageName name, StageBody body) {
+        stageReturning(id, name, () -> {
+            long n = body.execute();
+            return n;
+        });
+    }
+
+    private <T> T stageReturning(JobId id, JobStage.StageName name, StageBodyReturning<T> body) {
+        Instant start = Instant.now();
+        JobStage running = new JobStage(
+                id, name, JobStage.StageStatus.RUNNING, start, null, 0, 0, 0, null);
+        jobStore.upsertStage(running);
+        publishJob(id);
+        try {
+            T out = body.execute();
+            long items = (out instanceof Long l) ? l : (out instanceof List<?> l ? l.size() : 1);
+            JobStage done = new JobStage(
+                    id, name, JobStage.StageStatus.SUCCEEDED, start, Instant.now(), items, items, 0, null);
+            jobStore.upsertStage(done);
+            publishJob(id);
+            return out;
+        } catch (Exception e) {
+            JobStage failed = new JobStage(
+                    id, name, JobStage.StageStatus.FAILED, start, Instant.now(), 0, 0, 0, e.getMessage());
+            jobStore.upsertStage(failed);
+            publishJob(id);
+            throw new RuntimeException(e);
+        }
+    }
+
+    private void transitionJob(JobId id, JobStatus s, Instant startedAt, Instant finishedAt) {
+        jobStore.updateStatus(id, s, startedAt, finishedAt);
+        publishJob(id);
+    }
+
+    private void publishJob(JobId id) {
+        jobStore.findById(id).ifPresent(bus::publishJob);
+    }
+
+    private void logEvent(JobId id, JobLogEvent.Level level, JobStage.StageName stage, String msg) {
+        bus.publishLog(new JobLogEvent(id, Instant.now(), level, stage, msg));
+    }
+
+    /* ------------------------------------------------------------------ */
+    /* Pipeline steps                                                      */
+    /* ------------------------------------------------------------------ */
+
+    private List<Path> discoverFiles(Path root, Repository repo) throws IOException {
+        List<PathMatcher> matchers = new ArrayList<>();
+        for (String g : BUILTIN_IGNORES) matchers.add(FileSystems.getDefault().getPathMatcher("glob:" + g));
+        for (String g : repo.ignoreGlobs()) matchers.add(FileSystems.getDefault().getPathMatcher("glob:" + g));
+        long maxBytes = repo.maxFileSizeBytes();
+        List<Path> out = new ArrayList<>();
+        Files.walkFileTree(root, new SimpleFileVisitor<>() {
+            @Override
+            public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) {
+                Path rel = root.relativize(file);
+                for (PathMatcher m : matchers) {
+                    if (m.matches(rel) || m.matches(file.getFileName())) return FileVisitResult.CONTINUE;
+                }
+                if (attrs.size() > maxBytes) return FileVisitResult.CONTINUE;
+                out.add(file);
+                return FileVisitResult.CONTINUE;
+            }
+
+            @Override
+            public FileVisitResult preVisitDirectory(Path dir, BasicFileAttributes attrs) {
+                if (dir.equals(root)) return FileVisitResult.CONTINUE;
+                Path rel = root.relativize(dir);
+                for (PathMatcher m : matchers) {
+                    if (m.matches(rel)) return FileVisitResult.SKIP_SUBTREE;
+                }
+                return FileVisitResult.CONTINUE;
+            }
+        });
+        return out;
+    }
+
+    private record ParsedPiece(
+            String contentHash,
+            String content,
+            String language,
+            String symbol,
+            int tokenCount,
+            String filePath,
+            int startLine,
+            int endLine) {}
+
+    private List<ParsedPiece> parseAll(List<Path> files, Path root) {
+        List<ParsedPiece> out = new ArrayList<>();
+        MessageDigest sha;
+        try {
+            sha = MessageDigest.getInstance("SHA-256");
+        } catch (Exception e) {
+            throw new IllegalStateException(e);
+        }
+        for (Path f : files) {
+            String rel = root.relativize(f).toString().replace('\\', '/');
+            try {
+                List<CodeParser.ParsedChunk> parsed = parser.parse(f, rel);
+                for (var pc : parsed) {
+                    String hash = bytesToHex(sha.digest(pc.content().getBytes(StandardCharsets.UTF_8)));
+                    sha.reset();
+                    int tokens = Math.max(1, pc.content().length() / 4); // heuristic; refined if needed
+                    out.add(new ParsedPiece(
+                            hash, pc.content(), pc.language(), pc.symbol(), tokens, rel, pc.startLine(), pc.endLine()));
+                }
+            } catch (Exception e) {
+                log.warn("parse failed for {}: {}", rel, e.toString());
+            }
+        }
+        return out;
+    }
+
+    private List<Chunk> embedAll(JobId jobId, List<ParsedPiece> pieces) {
+        // Dedupe by hash across this batch AND against existing chunks in the store/cache.
+        Map<String, Chunk> resolved = new HashMap<>();
+        List<ParsedPiece> toEmbed = new ArrayList<>();
+        for (var p : pieces) {
+            if (resolved.containsKey(p.contentHash())) continue;
+            var existing = chunkStore.findByContentHash(p.contentHash());
+            if (existing.isPresent()) {
+                resolved.put(p.contentHash(), existing.get());
+                continue;
+            }
+            // cache?
+            var cached = embeddingCache.get(p.contentHash());
+            if (cached.isPresent()) {
+                Chunk c = upsert(p, cached.get());
+                resolved.put(p.contentHash(), c);
+                continue;
+            }
+            toEmbed.add(p);
+        }
+
+        if (!toEmbed.isEmpty()) {
+            List<String> texts = toEmbed.stream().map(ParsedPiece::content).toList();
+            List<float[]> vecs = embeddings.embed(texts);
+            for (int i = 0; i < toEmbed.size(); i++) {
+                var p = toEmbed.get(i);
+                float[] v = vecs.get(i);
+                embeddingCache.put(p.contentHash(), v);
+                Chunk c = upsert(p, v);
+                resolved.put(p.contentHash(), c);
+            }
+            logEvent(jobId, JobLogEvent.Level.INFO, JobStage.StageName.EMBED,
+                    "embedded " + toEmbed.size() + " new chunks (cache/dedupe hits = " + (pieces.size() - toEmbed.size()) + ")");
+        }
+        return new ArrayList<>(resolved.values());
+    }
+
+    private Chunk upsert(ParsedPiece p, float[] vector) {
+        Chunk c = new Chunk(ChunkId.random(), p.contentHash(), p.content(), p.language(), p.symbol(), p.tokenCount());
+        return chunkStore.upsertChunk(c, new Embedding(c.id(), vector));
+    }
+
+    private List<ChunkVersion> buildLinks(VersionId versionId, List<ParsedPiece> pieces) {
+        // Piece → ChunkId requires knowing the chunk id assigned on upsert.
+        // We re-resolve via findByContentHash — cheap because it's a Term query.
+        List<ChunkVersion> links = new ArrayList<>(pieces.size());
+        Map<String, ChunkId> hashToId = new HashMap<>();
+        for (var p : pieces) {
+            ChunkId id = hashToId.computeIfAbsent(p.contentHash(), h ->
+                    chunkStore.findByContentHash(h).orElseThrow().id());
+            links.add(new ChunkVersion(id, versionId, p.filePath(), p.startLine(), p.endLine()));
+        }
+        return links;
+    }
+
+    private String pickParentIndexedTag(Repository repo, Version ver) {
+        // Most recent previously-indexed version for this repo that isn't this one.
+        List<Version> indexed = repoStore.findVersionsByStatus(repo.id(), VersionStatus.INDEXED);
+        return indexed.stream()
+                .filter(v -> !v.id().equals(ver.id()))
+                .max((a, b) -> a.tag().compareTo(b.tag()))
+                .map(Version::tag)
+                .orElse(null);
+    }
+
+    private static String bytesToHex(byte[] bytes) {
+        char[] hex = "0123456789abcdef".toCharArray();
+        char[] out = new char[bytes.length * 2];
+        for (int i = 0; i < bytes.length; i++) {
+            int v = bytes[i] & 0xff;
+            out[i * 2] = hex[v >>> 4];
+            out[i * 2 + 1] = hex[v & 0x0f];
+        }
+        return new String(out);
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/observability/InMemoryJobEventBus.java
+++ b/trueref-application/src/main/java/com/trueref/application/observability/InMemoryJobEventBus.java
@@ -0,0 +1,71 @@
+package com.trueref.application.observability;
+
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.JobLogEvent;
+import com.trueref.domain.port.out.JobEventBus;
+import java.util.Map;
+import java.util.concurrent.ConcurrentHashMap;
+import java.util.concurrent.CopyOnWriteArrayList;
+import java.util.function.Consumer;
+
+/**
+ * In-process publish/subscribe implementation of {@link JobEventBus}. Listeners receive events on
+ * the publisher's thread; consumers should defer expensive work (e.g. SSE writes) to a virtual
+ * thread to keep the publisher fast.
+ */
+public final class InMemoryJobEventBus implements JobEventBus {
+
+    private final CopyOnWriteArrayList<Consumer<IngestionJob>> jobListeners = new CopyOnWriteArrayList<>();
+    private final Map<JobId, CopyOnWriteArrayList<Consumer<JobLogEvent>>> logListeners = new ConcurrentHashMap<>();
+    private final CopyOnWriteArrayList<Consumer<JobLogEvent>> globalLogListeners = new CopyOnWriteArrayList<>();
+
+    @Override
+    public void publishJob(IngestionJob job) {
+        for (Consumer<IngestionJob> l : jobListeners) {
+            try {
+                l.accept(job);
+            } catch (Exception ignored) {
+                // listener failures must not break publishing
+            }
+        }
+    }
+
+    @Override
+    public void publishLog(JobLogEvent event) {
+        var perJob = logListeners.get(event.jobId());
+        if (perJob != null) {
+            for (Consumer<JobLogEvent> l : perJob) {
+                try {
+                    l.accept(event);
+                } catch (Exception ignored) {
+                }
+            }
+        }
+        for (Consumer<JobLogEvent> l : globalLogListeners) {
+            try {
+                l.accept(event);
+            } catch (Exception ignored) {
+            }
+        }
+    }
+
+    @Override
+    public AutoCloseable subscribeJobs(Consumer<IngestionJob> listener) {
+        jobListeners.add(listener);
+        return () -> jobListeners.remove(listener);
+    }
+
+    @Override
+    public AutoCloseable subscribeLogs(JobId jobId, Consumer<JobLogEvent> listener) {
+        var list = logListeners.computeIfAbsent(jobId, k -> new CopyOnWriteArrayList<>());
+        list.add(listener);
+        return () -> list.remove(listener);
+    }
+
+    /** Subscribe to ALL log events regardless of job (used by the dashboard). */
+    public AutoCloseable subscribeAllLogs(Consumer<JobLogEvent> listener) {
+        globalLogListeners.add(listener);
+        return () -> globalLogListeners.remove(listener);
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/observability/JobObservationService.java
+++ b/trueref-application/src/main/java/com/trueref/application/observability/JobObservationService.java
@@ -0,0 +1,47 @@
+package com.trueref.application.observability;
+
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.JobLogEvent;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.port.in.ObserveJobs;
+import com.trueref.domain.port.out.JobEventBus;
+import com.trueref.domain.port.out.JobStore;
+import java.util.List;
+import java.util.Optional;
+import java.util.function.Consumer;
+import org.jspecify.annotations.Nullable;
+
+public final class JobObservationService implements ObserveJobs {
+
+    private final JobStore jobs;
+    private final JobEventBus bus;
+
+    public JobObservationService(JobStore jobs, JobEventBus bus) {
+        this.jobs = jobs;
+        this.bus = bus;
+    }
+
+    @Override
+    public Optional<IngestionJob> findJob(JobId id) {
+        return jobs.findById(id);
+    }
+
+    @Override
+    public List<IngestionJob> listJobs(
+            @Nullable RepositoryId repoId, @Nullable VersionId versionId, @Nullable JobStatus status, int limit) {
+        return jobs.find(repoId, versionId, status, limit);
+    }
+
+    @Override
+    public AutoCloseable subscribeJobs(Consumer<IngestionJob> listener) {
+        return bus.subscribeJobs(listener);
+    }
+
+    @Override
+    public AutoCloseable subscribeLogs(JobId jobId, Consumer<JobLogEvent> listener) {
+        return bus.subscribeLogs(jobId, listener);
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/observability/package-info.java
+++ b/trueref-application/src/main/java/com/trueref/application/observability/package-info.java
@@ -0,0 +1,3 @@
+/** In-process implementations of cross-cutting application services. */
+@org.jspecify.annotations.NullMarked
+package com.trueref.application.observability;
--- a/trueref-application/src/main/java/com/trueref/application/package-info.java
+++ b/trueref-application/src/main/java/com/trueref/application/package-info.java
@@ -0,0 +1,3 @@
+/** Application services: use-case implementations. */
+@org.jspecify.annotations.NullMarked
+package com.trueref.application;
--- a/trueref-application/src/main/java/com/trueref/application/resolve/LibraryResolver.java
+++ b/trueref-application/src/main/java/com/trueref/application/resolve/LibraryResolver.java
@@ -0,0 +1,161 @@
+package com.trueref.application.resolve;
+
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.TagPattern;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.IndexVersion;
+import com.trueref.domain.port.in.ResolveLibraryId;
+import com.trueref.domain.port.out.RepositoryStore;
+import java.util.ArrayList;
+import java.util.Comparator;
+import java.util.List;
+import java.util.Optional;
+import java.util.regex.Matcher;
+import java.util.regex.Pattern;
+import org.jspecify.annotations.Nullable;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+/**
+ * Fuzzy library-name matching + version→tag mapping. Mirrors Context7's {@code resolve-library-id}
+ * semantics. When {@code version} is provided and maps to a known-but-not-yet-indexed tag, triggers
+ * an async index job (fire-and-forget).
+ */
+public final class LibraryResolver implements ResolveLibraryId {
+
+    private static final Logger log = LoggerFactory.getLogger(LibraryResolver.class);
+
+    private static final Pattern SEMVER = Pattern.compile("^v?(\\d+)(?:\\.(\\d+))?(?:\\.(\\d+))?.*$");
+
+    private final RepositoryStore store;
+    private final IndexVersion indexer;
+
+    public LibraryResolver(RepositoryStore store, IndexVersion indexer) {
+        this.store = store;
+        this.indexer = indexer;
+    }
+
+    @Override
+    public Result resolve(Query q) {
+        String needle = q.libraryName().toLowerCase();
+        List<Repository> all = store.findAll();
+        List<Match> matches = new ArrayList<>();
+        for (Repository r : all) {
+            double score = nameScore(r.name().toLowerCase(), needle);
+            if (score <= 0) continue;
+            List<Version> versions = store.findVersionsByRepo(r.id());
+
+            // If a version was requested, map it to a tag and ensure indexing.
+            if (q.version() != null && !q.version().isBlank()) {
+                Optional<Version> target = mapVersion(r, versions, q.version());
+                target.ifPresent(v -> ensureIndexed(r.id(), v));
+            }
+
+            int snippetCount = versions.stream()
+                    .filter(v -> v.status() == VersionStatus.INDEXED)
+                    .mapToInt(Version::chunkCount)
+                    .sum();
+            List<VersionRef> refs = versions.stream()
+                    .sorted(Comparator.comparing(Version::tag).reversed())
+                    .map(v -> new VersionRef(v.id(), v.tag(), v.status()))
+                    .toList();
+            String libraryId = "/" + r.name();
+            matches.add(new Match(r.id(), libraryId, r.name(), null, snippetCount, refs, score));
+        }
+        matches.sort(Comparator.comparingDouble(Match::score).reversed());
+        return new Result(matches);
+    }
+
+    /** Fuzzy name scoring: exact 1.0, prefix 0.9, contains 0.7, token overlap otherwise. */
+    private double nameScore(String haystack, String needle) {
+        if (haystack.equals(needle)) return 1.0;
+        if (haystack.endsWith("/" + needle) || haystack.startsWith(needle + "/")) return 0.95;
+        if (haystack.contains(needle)) return 0.8;
+        // token overlap
+        String[] hTok = haystack.split("[^a-z0-9]+");
+        String[] nTok = needle.split("[^a-z0-9]+");
+        int hit = 0;
+        for (String nt : nTok) {
+            if (nt.isBlank()) continue;
+            for (String ht : hTok) {
+                if (ht.equals(nt)) { hit++; break; }
+            }
+        }
+        if (hit == 0) return 0.0;
+        return 0.3 + 0.4 * ((double) hit / Math.max(1, nTok.length));
+    }
+
+    /**
+     * Maps a version string to the closest matching tag using the repo's configured mapping rules.
+     * Rules are tried in order.
+     */
+    public Optional<Version> mapVersion(Repository repo, List<Version> versions, String requested) {
+        for (TagPattern rule : repo.versionMappingRules()) {
+            String candidate = switch (rule) {
+                case TagPattern.Exact e -> requested;
+                case TagPattern.VPrefix v -> "v" + stripV(requested);
+                case TagPattern.ReleasePrefix r -> "release-" + stripV(requested);
+                case TagPattern.Custom c -> c.template()
+                        .replace("{version}", requested)
+                        .replace("{semver}", stripV(requested));
+                case TagPattern.SemverFuzzy s -> null; // handled below
+            };
+            if (candidate == null) continue;
+            Optional<Version> exact = versions.stream()
+                    .filter(v -> v.tag().equalsIgnoreCase(candidate))
+                    .findFirst();
+            if (exact.isPresent()) return exact;
+        }
+        // Semver fuzzy: pick tag with closest semver distance
+        return semverClosest(versions, requested);
+    }
+
+    private Optional<Version> semverClosest(List<Version> versions, String requested) {
+        int[] r = parseSemver(requested);
+        if (r == null) return Optional.empty();
+        return versions.stream()
+                .map(v -> new Object[] {v, parseSemver(v.tag())})
+                .filter(t -> t[1] != null)
+                .min(Comparator.comparingLong(t -> semverDist((int[]) t[1], r)))
+                .map(t -> (Version) t[0]);
+    }
+
+    private static @Nullable int[] parseSemver(String s) {
+        Matcher m = SEMVER.matcher(s);
+        if (!m.matches()) return null;
+        return new int[] {
+            parseIntOrZero(m.group(1)),
+            parseIntOrZero(m.group(2)),
+            parseIntOrZero(m.group(3))
+        };
+    }
+
+    private static int parseIntOrZero(String s) {
+        if (s == null || s.isEmpty()) return 0;
+        try { return Integer.parseInt(s); } catch (NumberFormatException e) { return 0; }
+    }
+
+    private static long semverDist(int[] a, int[] b) {
+        long d = 0;
+        d += Math.abs(a[0] - b[0]) * 1_000_000L;
+        d += Math.abs(a[1] - b[1]) * 1_000L;
+        d += Math.abs(a[2] - b[2]);
+        return d;
+    }
+
+    private static String stripV(String s) {
+        return s.startsWith("v") || s.startsWith("V") ? s.substring(1) : s;
+    }
+
+    private void ensureIndexed(RepositoryId repoId, Version v) {
+        if (v.status() == VersionStatus.INDEXED || v.status() == VersionStatus.INDEXING) return;
+        try {
+            log.info("on-demand indexing: repo={} tag={}", repoId, v.tag());
+            indexer.enqueue(repoId, v.id(), false);
+        } catch (Exception e) {
+            log.warn("on-demand indexing enqueue failed: {}", e.toString());
+        }
+    }
+}
--- a/trueref-application/src/main/java/com/trueref/application/search/HybridSearchService.java
+++ b/trueref-application/src/main/java/com/trueref/application/search/HybridSearchService.java
@@ -0,0 +1,238 @@
+package com.trueref.application.search;
+
+import com.trueref.domain.error.InvalidSearchRequest;
+import com.trueref.domain.model.ChunkId;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.SearchHit;
+import com.trueref.domain.model.SearchScope;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.port.in.SearchLibraryDocs;
+import com.trueref.domain.port.out.ChunkStore;
+import com.trueref.domain.port.out.EmbeddingService;
+import com.trueref.domain.port.out.RepositoryStore;
+import com.trueref.domain.port.out.RerankerService;
+import java.util.ArrayList;
+import java.util.Comparator;
+import java.util.HashMap;
+import java.util.List;
+import java.util.Map;
+import java.util.Objects;
+import java.util.regex.Matcher;
+import java.util.regex.Pattern;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+/**
+ * Hybrid search: BM25 + dense kNN fused by Reciprocal Rank Fusion (RRF), then reranked by a
+ * cross-encoder, then packed to a token budget.
+ */
+public final class HybridSearchService implements SearchLibraryDocs {
+
+    private static final Logger log = LoggerFactory.getLogger(HybridSearchService.class);
+
+    /**
+     * Matches camelCase identifiers that are likely to be Phaser API method/class names (≥6 chars,
+     * must contain at least one uppercase letter after the first char, not all-caps).
+     * Examples: setCollideWorldBounds, createBitmapMask, addOverlap.
+     */
+    private static final Pattern CAMEL_IDENT = Pattern.compile(
+            "\\b([a-z][a-zA-Z0-9]{5,})(?=\\b)");
+
+    private final ChunkStore chunks;
+    private final EmbeddingService embedder;
+    private final RerankerService reranker;
+    private final RepositoryStore repos;
+    private final int rrfK;
+    private final int rerankTopK;
+    private final int finalTopK;
+
+    public HybridSearchService(
+            ChunkStore chunks,
+            EmbeddingService embedder,
+            RerankerService reranker,
+            RepositoryStore repos,
+            int rrfK,
+            int rerankTopK,
+            int finalTopK) {
+        this.chunks = chunks;
+        this.embedder = embedder;
+        this.reranker = reranker;
+        this.repos = repos;
+        this.rrfK = rrfK;
+        this.rerankTopK = rerankTopK;
+        this.finalTopK = finalTopK;
+    }
+
+    @Override
+    public Result search(Query q) {
+        if (q.text() == null || q.text().isBlank()) {
+            throw new InvalidSearchRequest("query text must not be blank");
+        }
+        if (q.scope().refs().isEmpty()) {
+            throw new InvalidSearchRequest("search scope must not be empty");
+        }
+
+        String text = rewrite(q.text(), q.topic());
+        // Augment BM25 query with camelCase identifiers found in the text so that the exact
+        // method-name chunk scores higher in BM25 even when it competes with generic mentions.
+        String bm25Text = augmentWithCamelIdents(text);
+
+        List<SearchHit> bm25 = chunks.bm25Search(bm25Text, q.scope(), rerankTopK);
+        float[] vec = embedder.embed(List.of(text)).get(0);
+        List<SearchHit> dense = chunks.denseSearch(vec, q.scope(), rerankTopK);
+
+        List<SearchHit> fused = rrf(bm25, dense);
+        if (fused.size() > rerankTopK) fused = fused.subList(0, rerankTopK);
+
+        // Demote changelog / synthetic-skill / docs paths before the reranker sees them so that
+        // authoritative source-code chunks aren't squeezed out by historical migration notes.
+        List<SearchHit> biased = applyFilePathBias(fused);
+
+        // Enrich with repo name + tag (ChunkStore leaves these empty).
+        List<SearchHit> enriched = enrich(biased);
+
+        List<SearchHit> reranked = reranker.rerank(text, enriched);
+
+        List<SearchHit> packed = packByTokenBudget(reranked, q.tokensBudget(), q.maxHits() > 0 ? q.maxHits() : finalTopK);
+        int totalTokens = packed.stream().mapToInt(h -> estimateTokens(h.content())).sum();
+        return new Result(packed, totalTokens);
+    }
+
+    /* ------------------------------------------------------------------ */
+
+    private String rewrite(String text, String topic) {
+        String base = text.trim();
+        if (topic != null && !topic.isBlank()) {
+            return base + " " + topic.trim();
+        }
+        return base;
+    }
+
+    /**
+     * Returns a copy of {@code text} with each camelCase identifier repeated at the end (once).
+     * This lifts their BM25 term-frequency contribution without altering the semantic meaning
+     * used for the dense embedding query.
+     *
+     * <p>Example: "how to use setCollideWorldBounds" →
+     * "how to use setCollideWorldBounds setCollideWorldBounds"
+     */
+    private static String augmentWithCamelIdents(String text) {
+        Matcher m = CAMEL_IDENT.matcher(text);
+        StringBuilder extra = new StringBuilder();
+        while (m.find()) {
+            String ident = m.group(1);
+            // Only repeat identifiers that contain at least one uppercase letter
+            // (filters out short common words like "should", "create").
+            if (!ident.equals(ident.toLowerCase())) {
+                extra.append(' ').append(ident);
+            }
+        }
+        return extra.isEmpty() ? text : text + extra;
+    }
+
+    /**
+     * Applies a path-based multiplier to RRF scores before handing candidates to the reranker.
+     * Changelogs and synthetic skill docs are semantically relevant but tend to outrank the
+     * authoritative source-code chunks when the query mentions API migration or breaking changes.
+     * Demoting them here keeps them retrievable while giving source files priority.
+     *
+     * <p>Multipliers (tuned against the phaser_rag_eval suite):
+     * <ul>
+     *   <li>{@code changelog/}  → ×0.50 — migration notes, not current API reference
+     *   <li>{@code skills/} / {@code SKILL.md} → ×0.60 — synthetic summaries, not authoritative
+     *   <li>{@code docs/} → ×0.75 — curated docs; useful but prefer source JSDoc
+     *   <li>everything else (source, tests, configs) → ×1.0
+     * </ul>
+     */
+    private static List<SearchHit> applyFilePathBias(List<SearchHit> hits) {
+        boolean anyChanged = false;
+        List<SearchHit> out = new ArrayList<>(hits.size());
+        for (SearchHit h : hits) {
+            double mult = filePathMultiplier(h.filePath());
+            if (mult == 1.0) {
+                out.add(h);
+            } else {
+                out.add(new SearchHit(
+                        h.chunkId(), h.repoId(), h.versionId(), h.repoName(), h.tag(),
+                        h.filePath(), h.startLine(), h.endLine(), h.language(), h.symbol(),
+                        h.content(), h.score() * mult));
+                anyChanged = true;
+            }
+        }
+        if (!anyChanged) return hits;
+        out.sort(Comparator.comparingDouble(SearchHit::score).reversed());
+        return out;
+    }
+
+    private static double filePathMultiplier(String filePath) {
+        if (filePath == null || filePath.isEmpty()) return 1.0;
+        String lp = filePath.toLowerCase();
+        if (lp.startsWith("changelog/") || lp.contains("/changelog/")) return 0.50;
+        if (lp.contains("/skills/") || lp.endsWith("skill.md")) return 0.60;
+        if (lp.startsWith("docs/") || lp.contains("/docs/")) return 0.75;
+        return 1.0;
+    }
+
+    private List<SearchHit> rrf(List<SearchHit> a, List<SearchHit> b) {
+        Map<ChunkId, Double> scores = new HashMap<>();
+        Map<ChunkId, SearchHit> firstSeen = new HashMap<>();
+        addRankContribution(a, scores, firstSeen);
+        addRankContribution(b, scores, firstSeen);
+        return scores.entrySet().stream()
+                .sorted(Map.Entry.<ChunkId, Double>comparingByValue().reversed())
+                .map(e -> {
+                    SearchHit h = firstSeen.get(e.getKey());
+                    return new SearchHit(
+                            h.chunkId(), h.repoId(), h.versionId(), h.repoName(), h.tag(),
+                            h.filePath(), h.startLine(), h.endLine(), h.language(), h.symbol(),
+                            h.content(), e.getValue());
+                })
+                .toList();
+    }
+
+    private void addRankContribution(List<SearchHit> hits, Map<ChunkId, Double> scores, Map<ChunkId, SearchHit> seen) {
+        for (int rank = 0; rank < hits.size(); rank++) {
+            SearchHit h = hits.get(rank);
+            scores.merge(h.chunkId(), 1.0 / (rrfK + rank + 1.0), Double::sum);
+            seen.putIfAbsent(h.chunkId(), h);
+        }
+    }
+
+    private List<SearchHit> enrich(List<SearchHit> hits) {
+        Map<String, String> repoNameByRepoId = new HashMap<>();
+        Map<String, String> tagByVersionId = new HashMap<>();
+        List<SearchHit> out = new ArrayList<>(hits.size());
+        for (SearchHit h : hits) {
+            String repoName = repoNameByRepoId.computeIfAbsent(
+                    h.repoId().toString(),
+                    k -> repos.findById(h.repoId()).map(Repository::name).orElse("?"));
+            String tag = tagByVersionId.computeIfAbsent(
+                    h.versionId().toString(),
+                    k -> repos.findVersion(h.versionId()).map(Version::tag).orElse("?"));
+            out.add(new SearchHit(
+                    h.chunkId(), h.repoId(), h.versionId(),
+                    repoName, tag,
+                    h.filePath(), h.startLine(), h.endLine(), h.language(), h.symbol(),
+                    h.content(), h.score()));
+        }
+        return out;
+    }
+
+    private List<SearchHit> packByTokenBudget(List<SearchHit> ranked, int tokenBudget, int maxHits) {
+        List<SearchHit> out = new ArrayList<>();
+        int used = 0;
+        for (SearchHit h : ranked) {
+            if (out.size() >= maxHits) break;
+            int t = estimateTokens(h.content());
+            if (used + t > tokenBudget && !out.isEmpty()) break;
+            out.add(h);
+            used += t;
+        }
+        return out;
+    }
+
+    /** 4 chars ≈ 1 token — same rule of thumb Context7 uses for packing. */
+    private static int estimateTokens(String s) {
+        return Math.max(1, s.length() / 4);
+    }
+}
--- a/trueref-bootstrap/pom.xml
+++ b/trueref-bootstrap/pom.xml
@@ -0,0 +1,68 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <parent>
+        <groupId>com.trueref</groupId>
+        <artifactId>trueref-parent</artifactId>
+        <version>0.1.0-SNAPSHOT</version>
+    </parent>
+
+    <artifactId>trueref-bootstrap</artifactId>
+    <name>trueref-bootstrap</name>
+    <description>Spring Boot entry point. Wires beans across modules. Produces the executable fat JAR.</description>
+
+    <dependencies>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-domain</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-application</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-adapters</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>com.trueref</groupId>
+            <artifactId>trueref-frontend</artifactId>
+        </dependency>
+
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-actuator</artifactId>
+        </dependency>
+        <dependency>
+            <groupId>io.micrometer</groupId>
+            <artifactId>micrometer-registry-prometheus</artifactId>
+        </dependency>
+
+        <dependency>
+            <groupId>org.springframework.boot</groupId>
+            <artifactId>spring-boot-starter-test</artifactId>
+            <scope>test</scope>
+        </dependency>
+    </dependencies>
+
+    <build>
+        <finalName>trueref</finalName>
+        <plugins>
+            <plugin>
+                <groupId>org.springframework.boot</groupId>
+                <artifactId>spring-boot-maven-plugin</artifactId>
+                <configuration>
+                    <mainClass>com.trueref.bootstrap.TrueRefApplication</mainClass>
+                </configuration>
+                <executions>
+                    <execution>
+                        <goals><goal>repackage</goal></goals>
+                    </execution>
+                </executions>
+            </plugin>
+        </plugins>
+    </build>
+</project>
--- a/trueref-bootstrap/src/main/java/com/trueref/bootstrap/ApplicationBeans.java
+++ b/trueref-bootstrap/src/main/java/com/trueref/bootstrap/ApplicationBeans.java
@@ -0,0 +1,89 @@
+package com.trueref.bootstrap;
+
+import com.trueref.application.catalog.CatalogService;
+import com.trueref.application.ingest.DiscoveryService;
+import com.trueref.application.ingest.IngestionOrchestrator;
+import com.trueref.application.observability.InMemoryJobEventBus;
+import com.trueref.application.observability.JobObservationService;
+import com.trueref.application.resolve.LibraryResolver;
+import com.trueref.application.search.HybridSearchService;
+import com.trueref.domain.port.out.ChunkStore;
+import com.trueref.domain.port.out.CodeParser;
+import com.trueref.domain.port.out.EmbeddingCache;
+import com.trueref.domain.port.out.EmbeddingService;
+import com.trueref.domain.port.out.GitClient;
+import com.trueref.domain.port.out.JobStore;
+import com.trueref.domain.port.out.RepositoryStore;
+import com.trueref.domain.port.out.RerankerService;
+import java.nio.file.Path;
+import org.springframework.beans.factory.annotation.Value;
+import org.springframework.context.annotation.Bean;
+import org.springframework.context.annotation.Configuration;
+
+/**
+ * Explicit bean wiring for the application layer (which stays Spring-annotation-free).
+ * We expose only concrete beans; Spring resolves interface dependencies against the single
+ * concrete implementation.
+ */
+@Configuration
+public class ApplicationBeans {
+
+    @Bean
+    Path trueRefHome(@Value("${trueref.home:./data}") String home) {
+        return Path.of(home);
+    }
+
+    @Bean
+    InMemoryJobEventBus jobEventBus() {
+        return new InMemoryJobEventBus();
+    }
+
+    @Bean
+    CatalogService catalogService(RepositoryStore store, Path trueRefHome) {
+        return new CatalogService(store, trueRefHome);
+    }
+
+    @Bean
+    DiscoveryService discoveryService(RepositoryStore store, GitClient git) {
+        return new DiscoveryService(store, git);
+    }
+
+    @Bean(destroyMethod = "shutdown")
+    IngestionOrchestrator ingestionOrchestrator(
+            RepositoryStore repoStore,
+            JobStore jobStore,
+            ChunkStore chunkStore,
+            EmbeddingService embeddings,
+            EmbeddingCache embeddingCache,
+            GitClient git,
+            CodeParser parser,
+            InMemoryJobEventBus bus,
+            @Value("${trueref.ingestion.max-parse-jobs:4}") int maxParseJobs,
+            @Value("${trueref.ingestion.embed-queue-capacity:4}") int embedQueueCapacity) {
+        return new IngestionOrchestrator(
+                repoStore, jobStore, chunkStore, embeddings, embeddingCache, git, parser, bus,
+                maxParseJobs, embedQueueCapacity);
+    }
+
+    @Bean
+    LibraryResolver libraryResolver(RepositoryStore store, IngestionOrchestrator indexer) {
+        return new LibraryResolver(store, indexer);
+    }
+
+    @Bean
+    HybridSearchService hybridSearchService(
+            ChunkStore chunks,
+            EmbeddingService embedder,
+            RerankerService reranker,
+            RepositoryStore repos,
+            @Value("${trueref.search.rrf-k:60}") int rrfK,
+            @Value("${trueref.reranker.top-k:50}") int rerankTopK,
+            @Value("${trueref.search.final-top-k:20}") int finalTopK) {
+        return new HybridSearchService(chunks, embedder, reranker, repos, rrfK, rerankTopK, finalTopK);
+    }
+
+    @Bean
+    JobObservationService jobObservationService(JobStore jobs, InMemoryJobEventBus bus) {
+        return new JobObservationService(jobs, bus);
+    }
+}
--- a/trueref-bootstrap/src/main/java/com/trueref/bootstrap/ScheduledPoller.java
+++ b/trueref-bootstrap/src/main/java/com/trueref/bootstrap/ScheduledPoller.java
@@ -0,0 +1,57 @@
+package com.trueref.bootstrap;
+
+import com.trueref.application.ingest.DiscoveryService;
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.Version;
+import com.trueref.domain.model.VersionStatus;
+import com.trueref.domain.port.in.IndexVersion;
+import com.trueref.domain.port.out.RepositoryStore;
+import java.time.Instant;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.boot.autoconfigure.condition.ConditionalOnProperty;
+import org.springframework.scheduling.annotation.EnableScheduling;
+import org.springframework.scheduling.annotation.Scheduled;
+import org.springframework.stereotype.Component;
+
+/** Periodically fetches tags for registered repos and enqueues indexing for new ones. */
+@Component
+@EnableScheduling
+@ConditionalOnProperty(name = "trueref.ingestion.poller-enabled", havingValue = "true", matchIfMissing = true)
+public class ScheduledPoller {
+
+    private static final Logger log = LoggerFactory.getLogger(ScheduledPoller.class);
+
+    private final RepositoryStore repoStore;
+    private final DiscoveryService discovery;
+    private final IndexVersion indexer;
+
+    public ScheduledPoller(RepositoryStore repoStore, DiscoveryService discovery, IndexVersion indexer) {
+        this.repoStore = repoStore;
+        this.discovery = discovery;
+        this.indexer = indexer;
+    }
+
+    @Scheduled(fixedDelayString = "${trueref.ingestion.poll-interval-default:PT1H}")
+    public void pollAll() {
+        Instant start = Instant.now();
+        int scanned = 0;
+        int enqueued = 0;
+        for (Repository repo : repoStore.findAll()) {
+            try {
+                discovery.discover(repo.id());
+                for (Version v : repoStore.findVersionsByRepo(repo.id())) {
+                    if (v.status() == VersionStatus.DISCOVERED) {
+                        indexer.enqueue(repo.id(), v.id(), false);
+                        enqueued++;
+                    }
+                }
+                scanned++;
+            } catch (Exception e) {
+                log.warn("poll failed for repo={}: {}", repo.name(), e.toString());
+            }
+        }
+        log.info("poll completed in {}ms: repos scanned={} jobs enqueued={}",
+                java.time.Duration.between(start, Instant.now()).toMillis(), scanned, enqueued);
+    }
+}
--- a/trueref-bootstrap/src/main/java/com/trueref/bootstrap/StaleJobCleanupStartup.java
+++ b/trueref-bootstrap/src/main/java/com/trueref/bootstrap/StaleJobCleanupStartup.java
@@ -0,0 +1,59 @@
+package com.trueref.bootstrap;
+
+import com.trueref.domain.port.out.JobStore;
+import com.trueref.domain.port.out.RepositoryStore;
+import java.time.Instant;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+import org.springframework.boot.context.event.ApplicationReadyEvent;
+import org.springframework.context.event.EventListener;
+import org.springframework.stereotype.Component;
+
+/**
+ * On-startup cleanup for stale job state left by a previous crash or SIGKILL.
+ *
+ * <p>Any job that is RUNNING or QUEUED when the application starts must have been orphaned by a
+ * previous JVM exit (clean or unclean). We fail them all atomically before accepting traffic so
+ * the UI and API never show phantom RUNNING jobs. Matching INDEXING versions are also reset to
+ * FAILED so they can be re-queued immediately.
+ *
+ * <p>This fires <em>after</em> Flyway migrations and all beans are initialised, but before the
+ * application starts accepting HTTP requests (ApplicationReadyEvent fires before the embedded
+ * Tomcat connector starts accepting connections).
+ */
+@Component
+class StaleJobCleanupStartup {
+
+    private static final Logger log = LoggerFactory.getLogger(StaleJobCleanupStartup.class);
+
+    private static final String RESTART_REASON = "interrupted by server restart";
+
+    private final JobStore jobStore;
+    private final RepositoryStore repositoryStore;
+
+    StaleJobCleanupStartup(JobStore jobStore, RepositoryStore repositoryStore) {
+        this.jobStore = jobStore;
+        this.repositoryStore = repositoryStore;
+    }
+
+    @EventListener(ApplicationReadyEvent.class)
+    public void cleanupStaleJobs() {
+        Instant now = Instant.now();
+
+        int failedJobs = jobStore.failStaleJobs(now);
+        if (failedJobs > 0) {
+            log.warn(
+                    "Startup cleanup: marked {} orphaned job(s) as FAILED (were RUNNING or QUEUED at shutdown).",
+                    failedJobs);
+        } else {
+            log.info("Startup cleanup: no stale jobs found.");
+        }
+
+        int failedVersions = repositoryStore.failStaleIndexingVersions(RESTART_REASON);
+        if (failedVersions > 0) {
+            log.warn(
+                    "Startup cleanup: reset {} INDEXING version(s) to FAILED (their jobs did not complete).",
+                    failedVersions);
+        }
+    }
+}
--- a/trueref-bootstrap/src/main/java/com/trueref/bootstrap/TrueRefApplication.java
+++ b/trueref-bootstrap/src/main/java/com/trueref/bootstrap/TrueRefApplication.java
@@ -0,0 +1,17 @@
+package com.trueref.bootstrap;
+
+import org.springframework.boot.SpringApplication;
+import org.springframework.boot.autoconfigure.SpringBootApplication;
+
+/**
+ * Trueref entry point. The only place where Spring component scanning is allowed across the
+ * {@code com.trueref} package tree. Adapters and application modules expose explicit
+ * {@code @Configuration} classes that this class imports via component scanning.
+ */
+@SpringBootApplication(scanBasePackages = "com.trueref")
+public class TrueRefApplication {
+
+    public static void main(String[] args) {
+        SpringApplication.run(TrueRefApplication.class, args);
+    }
+}
--- a/trueref-bootstrap/src/main/resources/application.yml
+++ b/trueref-bootstrap/src/main/resources/application.yml
@@ -0,0 +1,114 @@
+spring:
+  application:
+    name: trueref
+  threads:
+    virtual:
+      enabled: true
+  datasource:
+    url: jdbc:h2:file:${trueref.home:./data}/h2/trueref;MODE=PostgreSQL;DB_CLOSE_DELAY=-1;DB_CLOSE_ON_EXIT=FALSE;MV_STORE=TRUE
+    username: sa
+    password: ""
+    driver-class-name: org.h2.Driver
+    hikari:
+      # Embedded H2 serialises writes internally; 8 connections is ample for virtual-thread
+      # workloads. 32 is wasteful and causes unnecessary H2 lock contention.
+      maximum-pool-size: 8
+      minimum-idle: 2
+  flyway:
+    enabled: true
+    locations: classpath:db/migration
+  mvc:
+    async:
+      request-timeout: 0   # SSE streams must not time out
+  # Spring AI MCP server. In Spring AI 1.0.0 the WebMVC transport is SSE-based
+  # (WebMvcSseServerTransportProvider) — the closest available transport to the 2025-03-26
+  # "Streamable HTTP" spec; there is no separate "protocol: streamable" property in this
+  # starter. JSON-RPC POSTs land on `sse-message-endpoint` (/mcp); server-initiated
+  # notifications stream over `sse-endpoint` (/sse). See com.trueref.adapter.in.mcp.
+  ai:
+    mcp:
+      server:
+        enabled: true
+        name: trueref
+        version: 0.1.0
+        type: SYNC
+        sse-message-endpoint: /mcp
+        sse-endpoint: /sse
+
+server:
+  port: 8080
+  shutdown: graceful
+
+management:
+  endpoints:
+    web:
+      exposure:
+        include: health,info,metrics,prometheus
+  endpoint:
+    health:
+      show-details: always
+
+springdoc:
+  api-docs:
+    path: /v3/api-docs
+  swagger-ui:
+    path: /swagger-ui.html
+
+trueref:
+  home: ${TRUEREF_HOME:./data}
+  ingestion:
+    poll-interval-default: PT1H
+    tag-cap-default: 100
+    max-file-size-bytes-default: 1048576
+    watched-folder: ${trueref.home}/watched
+    # Max parallel parse jobs (FETCH/CLONE → CHECKOUT → DISCOVER → DIFF → PARSE).
+    # Parse is I/O + CPU only — no GPU. 4 is safe on this machine (Ryzen 9 3900X, 62 GB RAM);
+    # increase for repos with small files, decrease if git I/O saturates disk.
+    max-parse-jobs: 4
+    # Max parsed batches buffered between parse workers and the embed worker.
+    # When the embed worker is busy, parse workers block here — natural backpressure.
+    # Total peak in-memory batches = max-parse-jobs + embed-queue-capacity.
+    embed-queue-capacity: 4
+  embedding:
+    model: bge-base-en-v1.5
+    onnx-providers: cuda,directml,cpu
+    session-count: 1
+    batch-size: 32
+    max-seq-len: 512
+    # Which CUDA device to bind ONNX sessions to. Passed directly to ORT's CUDA EP
+    # as the physical device index — ORT uses the CUDA driver/NVML API which can bypass
+    # CUDA_VISIBLE_DEVICES remapping. The ./trueref script sets this to $TRUEREF_GPU (default: 1 = RTX 3060).
+    gpu-device-id: 0
+    # Per-session GPU memory cap in bytes. 0 = unbounded. With session-count=1 there
+    # is no pool contention, so leave this unbounded — capping it risks exhausting the
+    # BFC arena during model-weight loading before inference starts. The ./trueref script
+    # defaults to 0 and can be overridden with TRUEREF_MEM_LIMIT.
+    gpu-mem-limit-bytes: 0
+    # Override download URLs per (model, file). The built-in defaults (in ModelDownloader)
+    # cover bge-base-en-v1.5, ms-marco-MiniLM-L6-v2, bge-m3, and bge-reranker-v2-m3.
+    # Set HF_TOKEN in the environment for higher rate limits or gated models.
+    # model-sources:
+    #   bge-base-en-v1.5:
+    #     model.onnx:
+    #       - https://huggingface.co/BAAI/bge-base-en-v1.5/resolve/main/onnx/model.onnx
+  reranker:
+    model: ms-marco-MiniLM-L6-v2
+    top-k: 100
+  embedding-cache:
+    # Must match the embedding model's output dimension. Changing this automatically
+    # wipes the stale .f32 files in the cache directory on next startup.
+    dimension: 768
+  search:
+    rrf-k: 60
+    final-top-k: 20
+  mcp:
+    tokens-default: 5000
+    tokens-min: 500
+    tokens-max: 50000
+
+logging:
+  level:
+    root: INFO
+    com.trueref: INFO
+    org.eclipse.jgit: WARN
+    org.apache.lucene: WARN
--- a/trueref-bootstrap/src/main/scripts/trueref
+++ b/trueref-bootstrap/src/main/scripts/trueref
@@ -0,0 +1,62 @@
+#!/usr/bin/env bash
+# trueref launcher — wraps the fat JAR with the JVM flags required to silence
+# the FFM (foreign linker) restricted-method warning emitted by JNA-based
+# tokenizer libraries and to make Lucene's Vector API path readable.
+#
+#   --enable-native-access=ALL-UNNAMED
+#       Lucene 10 + DJL HuggingFace Tokenizers use the new java.lang.foreign
+#       Linker API; on Java 21 this requires explicit native-access opt-in.
+#   --add-modules jdk.incubator.vector
+#       Lucene 10 ships an incubator-vector codepath that is significantly
+#       faster for cosine/dot-product math but only loads if the module is
+#       made readable from the unnamed module.
+#
+# Usage:
+#   bin/trueref                              # default settings
+#   bin/trueref --server.port=18080          # forward Spring properties
+#   TRUEREF_JAR=/path/to/trueref.jar bin/trueref
+#
+# Environment overrides:
+#   TRUEREF_JAR    Path to the fat JAR (default: <script-dir>/../trueref.jar)
+#   JAVA          Path to the java binary (default: ${JAVA_HOME:-}/bin/java or `java` on PATH)
+#   JAVA_OPTS     Extra JVM flags (e.g. -Xmx16g, -XX:+UseZGC)
+
+set -euo pipefail
+
+SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
+JAR_DEFAULT="${SCRIPT_DIR}/../trueref.jar"
+JAR="${TRUEREF_JAR:-$JAR_DEFAULT}"
+
+if [[ ! -f "$JAR" ]]; then
+  echo "trueref: jar not found at $JAR" >&2
+  echo "trueref: set TRUEREF_JAR or place trueref.jar next to this script" >&2
+  exit 1
+fi
+
+if [[ -n "${JAVA:-}" ]]; then
+  :
+elif [[ -n "${JAVA_HOME:-}" && -x "${JAVA_HOME}/bin/java" ]]; then
+  JAVA="${JAVA_HOME}/bin/java"
+else
+  JAVA="$(command -v java || true)"
+fi
+
+if [[ -z "${JAVA:-}" || ! -x "${JAVA}" ]]; then
+  echo "trueref: java not found; set JAVA_HOME or install JDK 21+" >&2
+  exit 1
+fi
+
+# ONNX Runtime CUDA EP needs cuDNN 9 on LD_LIBRARY_PATH. Many distros only ship
+# cuDNN via the system package manager or via a Python wheel (nvidia-cudnn-cu12).
+# If the user sets TRUEREF_CUDNN_LIB we trust it; otherwise we leave LD_LIBRARY_PATH
+# alone and let CUDA fall back to CPU with a logged warning.
+if [[ -n "${TRUEREF_CUDNN_LIB:-}" ]]; then
+  export LD_LIBRARY_PATH="${TRUEREF_CUDNN_LIB}${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
+fi
+
+exec "$JAVA" \
+  --enable-native-access=ALL-UNNAMED \
+  --add-modules=jdk.incubator.vector \
+  ${JAVA_OPTS:-} \
+  -jar "$JAR" \
+  "$@"
--- a/trueref-domain/pom.xml
+++ b/trueref-domain/pom.xml
@@ -0,0 +1,21 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <parent>
+        <groupId>com.trueref</groupId>
+        <artifactId>trueref-parent</artifactId>
+        <version>0.1.0-SNAPSHOT</version>
+    </parent>
+
+    <artifactId>trueref-domain</artifactId>
+    <name>trueref-domain</name>
+    <description>Pure domain model + ports. No Spring, no I/O, no third-party libs beyond JSpecify.</description>
+
+    <!-- Hexagonal contract: domain has ZERO runtime dependencies beyond JSpecify (annotations only). -->
+    <dependencies>
+        <!-- inherits jspecify + test deps from parent -->
+    </dependencies>
+</project>
--- a/trueref-domain/src/main/java/com/trueref/domain/error/IngestionFailed.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/IngestionFailed.java
@@ -0,0 +1,9 @@
+package com.trueref.domain.error;
+
+import org.jspecify.annotations.Nullable;
+
+public final class IngestionFailed extends TrueRefException {
+    public IngestionFailed(String message, @Nullable Throwable cause) {
+        super("ingestion_failed", message, cause);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/InvalidSearchRequest.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/InvalidSearchRequest.java
@@ -0,0 +1,7 @@
+package com.trueref.domain.error;
+
+public final class InvalidSearchRequest extends TrueRefException {
+    public InvalidSearchRequest(String message) {
+        super("invalid_search_request", message, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/RepositoryAlreadyRegistered.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/RepositoryAlreadyRegistered.java
@@ -0,0 +1,7 @@
+package com.trueref.domain.error;
+
+public final class RepositoryAlreadyRegistered extends TrueRefException {
+    public RepositoryAlreadyRegistered(String name) {
+        super("repository_already_registered", "Repository already registered: " + name, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/RepositoryNotFound.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/RepositoryNotFound.java
@@ -0,0 +1,7 @@
+package com.trueref.domain.error;
+
+public final class RepositoryNotFound extends TrueRefException {
+    public RepositoryNotFound(String idOrName) {
+        super("repository_not_found", "Repository not found: " + idOrName, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/TagNotFound.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/TagNotFound.java
@@ -0,0 +1,7 @@
+package com.trueref.domain.error;
+
+public final class TagNotFound extends TrueRefException {
+    public TagNotFound(String repo, String tag) {
+        super("tag_not_found", "Tag not found in repository: " + repo + "@" + tag, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/TrueRefException.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/TrueRefException.java
@@ -0,0 +1,25 @@
+package com.trueref.domain.error;
+
+import org.jspecify.annotations.Nullable;
+
+/** Root of all domain errors. Carries a stable string {@link #code()} for client localization. */
+public abstract sealed class TrueRefException extends RuntimeException
+        permits RepositoryAlreadyRegistered,
+                RepositoryNotFound,
+                VersionNotFound,
+                VersionNotIndexed,
+                TagNotFound,
+                IngestionFailed,
+                InvalidSearchRequest {
+
+    private final String code;
+
+    protected TrueRefException(String code, String message, @Nullable Throwable cause) {
+        super(message, cause);
+        this.code = code;
+    }
+
+    public String code() {
+        return code;
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/VersionNotFound.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/VersionNotFound.java
@@ -0,0 +1,7 @@
+package com.trueref.domain.error;
+
+public final class VersionNotFound extends TrueRefException {
+    public VersionNotFound(String repo, String version) {
+        super("version_not_found", "Version not found: " + repo + "@" + version, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/VersionNotIndexed.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/VersionNotIndexed.java
@@ -0,0 +1,8 @@
+package com.trueref.domain.error;
+
+/** Thrown when a search request targets a known version that has not been indexed yet. */
+public final class VersionNotIndexed extends TrueRefException {
+    public VersionNotIndexed(String repo, String version) {
+        super("version_not_indexed", "Version not yet indexed: " + repo + "@" + version, null);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/error/package-info.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/error/package-info.java
@@ -0,0 +1,5 @@
+/**
+ * Sealed exception hierarchy for the domain. Adapters translate these to HTTP / JSON-RPC responses.
+ */
+@org.jspecify.annotations.NullMarked
+package com.trueref.domain.error;
--- a/trueref-domain/src/main/java/com/trueref/domain/model/Chunk.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/Chunk.java
@@ -0,0 +1,18 @@
+package com.trueref.domain.model;
+
+import org.jspecify.annotations.Nullable;
+
+/**
+ * A globally-deduplicated piece of content (function, class, markdown section, sliding-window
+ * fallback). Identified by {@link #contentHash()}: two chunks with the same hash are the same
+ * chunk, regardless of which repo/tag/file they originated from.
+ *
+ * @param symbol AST symbol name when applicable (e.g. function or class), null for prose chunks
+ */
+public record Chunk(
+        ChunkId id,
+        String contentHash,
+        String content,
+        String language,
+        @Nullable String symbol,
+        int tokenCount) {}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/ChunkId.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/ChunkId.java
@@ -0,0 +1,16 @@
+package com.trueref.domain.model;
+
+public record ChunkId(java.util.UUID value) {
+    public static ChunkId random() {
+        return new ChunkId(java.util.UUID.randomUUID());
+    }
+
+    public static ChunkId of(String s) {
+        return new ChunkId(java.util.UUID.fromString(s));
+    }
+
+    @Override
+    public String toString() {
+        return value.toString();
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/ChunkVersion.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/ChunkVersion.java
@@ -0,0 +1,8 @@
+package com.trueref.domain.model;
+
+/**
+ * Many-to-many edge between a {@link Chunk} and a {@link Version}. Carries the location of the
+ * chunk inside the version's source tree.
+ */
+public record ChunkVersion(
+        ChunkId chunkId, VersionId versionId, String filePath, int startLine, int endLine) {}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/Embedding.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/Embedding.java
@@ -0,0 +1,19 @@
+package com.trueref.domain.model;
+
+/** Vector representation of a {@link Chunk}. Dense float vector; sparse channel deferred. */
+public record Embedding(ChunkId chunkId, float[] vector) {
+
+    public Embedding {
+        // Defensive copy to make the record effectively immutable.
+        vector = vector.clone();
+    }
+
+    @Override
+    public float[] vector() {
+        return vector.clone();
+    }
+
+    public int dimension() {
+        return vector.length;
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/IngestionJob.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/IngestionJob.java
@@ -0,0 +1,25 @@
+package com.trueref.domain.model;
+
+import java.time.Instant;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+/**
+ * A unit of orchestrated work. One job has many {@link JobStage stages} executed in sequence.
+ *
+ * @param versionId null for repo-level jobs (e.g. {@link JobType#DISCOVER_TAGS})
+ */
+public record IngestionJob(
+        JobId id,
+        RepositoryId repoId,
+        @Nullable VersionId versionId,
+        JobType type,
+        JobStatus status,
+        @Nullable Instant startedAt,
+        @Nullable Instant finishedAt,
+        List<JobStage> stages) {
+
+    public IngestionJob {
+        stages = List.copyOf(stages);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/JobId.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/JobId.java
@@ -0,0 +1,16 @@
+package com.trueref.domain.model;
+
+public record JobId(java.util.UUID value) {
+    public static JobId random() {
+        return new JobId(java.util.UUID.randomUUID());
+    }
+
+    public static JobId of(String s) {
+        return new JobId(java.util.UUID.fromString(s));
+    }
+
+    @Override
+    public String toString() {
+        return value.toString();
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/JobLogEvent.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/JobLogEvent.java
@@ -0,0 +1,20 @@
+package com.trueref.domain.model;
+
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+/** A single emitted observability event for an ingestion job. Streamed via SSE to the UI. */
+public record JobLogEvent(
+        JobId jobId,
+        Instant ts,
+        Level level,
+        JobStage.@Nullable StageName stage,
+        String message) {
+
+    public enum Level {
+        DEBUG,
+        INFO,
+        WARN,
+        ERROR
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/JobStage.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/JobStage.java
@@ -0,0 +1,37 @@
+package com.trueref.domain.model;
+
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+public record JobStage(
+        JobId jobId,
+        StageName name,
+        StageStatus status,
+        @Nullable Instant startedAt,
+        @Nullable Instant finishedAt,
+        long itemsProcessed,
+        long itemsTotal,
+        long bytesProcessed,
+        @Nullable String errorMessage) {
+
+    public enum StageName {
+        CLONE,
+        FETCH,
+        CHECKOUT,
+        DISCOVER_FILES,
+        DIFF_FILES,
+        PARSE,
+        CHUNK,
+        EMBED,
+        INDEX,
+        COMMIT
+    }
+
+    public enum StageStatus {
+        PENDING,
+        RUNNING,
+        SUCCEEDED,
+        FAILED,
+        SKIPPED
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/JobStatus.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/JobStatus.java
@@ -0,0 +1,9 @@
+package com.trueref.domain.model;
+
+public enum JobStatus {
+    QUEUED,
+    RUNNING,
+    SUCCEEDED,
+    FAILED,
+    CANCELLED
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/JobType.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/JobType.java
@@ -0,0 +1,8 @@
+package com.trueref.domain.model;
+
+public enum JobType {
+    DISCOVER_TAGS,
+    INDEX_VERSION,
+    REFRESH,
+    COMPACT
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/Repository.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/Repository.java
@@ -0,0 +1,37 @@
+package com.trueref.domain.model;
+
+import java.time.Duration;
+import java.time.Instant;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+/**
+ * A registered git repository (local or remote-cloned). The {@code localPath} is always present;
+ * for remote repositories it points to our managed clone directory and {@code managedClone} is true.
+ *
+ * @param remoteUrl    git URL when {@code managedClone} is true; null otherwise
+ * @param ignoreGlobs  per-repo globs ANDed with .gitignore + built-in defaults
+ * @param maxFileSizeBytes files larger than this are skipped during ingestion
+ * @param pollInterval scheduled fetch interval; {@link Duration#ZERO} disables polling
+ * @param tagCap       max most-recent tags to auto-index; UI/MCP can index more on demand
+ * @param versionMappingRules ordered patterns mapping a client version (e.g. {@code "1.2.3"}) to a tag
+ */
+public record Repository(
+        RepositoryId id,
+        String name,
+        @Nullable String remoteUrl,
+        String localPath,
+        boolean managedClone,
+        List<String> ignoreGlobs,
+        long maxFileSizeBytes,
+        Duration pollInterval,
+        int tagCap,
+        List<TagPattern> versionMappingRules,
+        Instant createdAt,
+        Instant updatedAt) {
+
+    public Repository {
+        ignoreGlobs = List.copyOf(ignoreGlobs);
+        versionMappingRules = List.copyOf(versionMappingRules);
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/RepositoryId.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/RepositoryId.java
@@ -0,0 +1,17 @@
+package com.trueref.domain.model;
+
+/** Type-safe identifier for a registered repository. */
+public record RepositoryId(java.util.UUID value) {
+    public static RepositoryId random() {
+        return new RepositoryId(java.util.UUID.randomUUID());
+    }
+
+    public static RepositoryId of(String s) {
+        return new RepositoryId(java.util.UUID.fromString(s));
+    }
+
+    @Override
+    public String toString() {
+        return value.toString();
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/SearchHit.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/SearchHit.java
@@ -0,0 +1,18 @@
+package com.trueref.domain.model;
+
+import org.jspecify.annotations.Nullable;
+
+/** A single ranked snippet returned from a search. */
+public record SearchHit(
+        ChunkId chunkId,
+        RepositoryId repoId,
+        VersionId versionId,
+        String repoName,
+        String tag,
+        String filePath,
+        int startLine,
+        int endLine,
+        String language,
+        @Nullable String symbol,
+        String content,
+        double score) {}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/SearchScope.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/SearchScope.java
@@ -0,0 +1,16 @@
+package com.trueref.domain.model;
+
+import java.util.List;
+
+/**
+ * Defines the (repo, version) scope of a search request. Multiple scopes can be ORed together so a
+ * single query may span "spring-boot v3.5.4" and "spring-boot v3.4.0", for example.
+ */
+public record SearchScope(List<RepoVersionRef> refs) {
+
+    public SearchScope {
+        refs = List.copyOf(refs);
+    }
+
+    public record RepoVersionRef(RepositoryId repoId, VersionId versionId) {}
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/TagPattern.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/TagPattern.java
@@ -0,0 +1,24 @@
+package com.trueref.domain.model;
+
+/**
+ * Strategy for mapping a client-supplied version string to a git tag in a repository. Patterns are
+ * tried in order; the first match wins. Built-in patterns: EXACT, V_PREFIX, RELEASE_PREFIX,
+ * SEMVER_FUZZY. CUSTOM allows a user-supplied template like {@code "release-{semver}"}.
+ */
+public sealed interface TagPattern {
+
+    /** {@code "1.2.3"} → tag {@code "1.2.3"}. */
+    record Exact() implements TagPattern {}
+
+    /** {@code "1.2.3"} → tag {@code "v1.2.3"}. */
+    record VPrefix() implements TagPattern {}
+
+    /** {@code "1.2.3"} → tag {@code "release-1.2.3"}. */
+    record ReleasePrefix() implements TagPattern {}
+
+    /** Any tag whose semver is closest to the requested version. */
+    record SemverFuzzy() implements TagPattern {}
+
+    /** Custom template containing {@code {version}} or {@code {semver}} placeholders. */
+    record Custom(String template) implements TagPattern {}
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/Version.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/Version.java
@@ -0,0 +1,15 @@
+package com.trueref.domain.model;
+
+import java.time.Instant;
+import org.jspecify.annotations.Nullable;
+
+/** A specific git tag (or branch) of a {@link Repository} that may be indexed independently. */
+public record Version(
+        VersionId id,
+        RepositoryId repoId,
+        String tag,
+        String commitSha,
+        VersionStatus status,
+        @Nullable Instant indexedAt,
+        int chunkCount,
+        @Nullable String errorMessage) {}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/VersionId.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/VersionId.java
@@ -0,0 +1,16 @@
+package com.trueref.domain.model;
+
+public record VersionId(java.util.UUID value) {
+    public static VersionId random() {
+        return new VersionId(java.util.UUID.randomUUID());
+    }
+
+    public static VersionId of(String s) {
+        return new VersionId(java.util.UUID.fromString(s));
+    }
+
+    @Override
+    public String toString() {
+        return value.toString();
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/VersionStatus.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/VersionStatus.java
@@ -0,0 +1,14 @@
+package com.trueref.domain.model;
+
+public enum VersionStatus {
+    /** Tag known but not yet indexed. */
+    DISCOVERED,
+    /** Indexing job currently running. */
+    INDEXING,
+    /** Successfully indexed and queryable. */
+    INDEXED,
+    /** Last indexing attempt failed; see {@link Version#errorMessage()}. */
+    FAILED,
+    /** Tag no longer exists upstream; chunks reclaimable by compaction. */
+    INACTIVE
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/model/package-info.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/model/package-info.java
@@ -0,0 +1,7 @@
+/**
+ * Pure domain model for trueref. Contains records and enums describing repositories, versions,
+ * chunks, ingestion jobs, and search results. <strong>Must remain free of any I/O, Spring,
+ * Jackson, or other framework concerns.</strong> JSpecify nullability annotations are allowed.
+ */
+@org.jspecify.annotations.NullMarked
+package com.trueref.domain.model;
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/DiscoverVersions.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/DiscoverVersions.java
@@ -0,0 +1,12 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.Version;
+import java.util.List;
+
+/** Use case: discover/refresh git tags of a repository. */
+public interface DiscoverVersions {
+
+    /** Performs git fetch (if managed) + tag enumeration. Returns the now-known versions. */
+    List<Version> discover(RepositoryId repoId);
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/IndexVersion.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/IndexVersion.java
@@ -0,0 +1,12 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.VersionId;
+
+/** Use case: schedule indexing of a specific (repo, tag/version). */
+public interface IndexVersion {
+
+    /** Enqueues an INDEX_VERSION job. Returns immediately with the job id. */
+    JobId enqueue(RepositoryId repoId, VersionId versionId, boolean force);
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/ObserveJobs.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/ObserveJobs.java
@@ -0,0 +1,27 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.IngestionJob;
+import com.trueref.domain.model.JobId;
+import com.trueref.domain.model.JobLogEvent;
+import com.trueref.domain.model.JobStatus;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.VersionId;
+import java.util.List;
+import java.util.Optional;
+import java.util.function.Consumer;
+import org.jspecify.annotations.Nullable;
+
+/** Use case: read jobs and subscribe to job/log streams (for SSE in the UI). */
+public interface ObserveJobs {
+
+    Optional<IngestionJob> findJob(JobId id);
+
+    List<IngestionJob> listJobs(
+            @Nullable RepositoryId repoId, @Nullable VersionId versionId, @Nullable JobStatus status, int limit);
+
+    /** Subscribes to live status updates of all jobs. Returns an unsubscribe handle. */
+    AutoCloseable subscribeJobs(Consumer<IngestionJob> listener);
+
+    /** Subscribes to log events of a single job. Returns an unsubscribe handle. */
+    AutoCloseable subscribeLogs(JobId jobId, Consumer<JobLogEvent> listener);
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/QueryCatalog.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/QueryCatalog.java
@@ -0,0 +1,17 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.Version;
+import java.util.List;
+import java.util.Optional;
+
+/** Use case: read-only access to repositories and their versions. */
+public interface QueryCatalog {
+
+    List<Repository> listRepositories();
+
+    Optional<Repository> findRepository(RepositoryId id);
+
+    List<Version> listVersions(RepositoryId repoId);
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/RegisterRepository.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/RegisterRepository.java
@@ -0,0 +1,32 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.Repository;
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.TagPattern;
+import java.time.Duration;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+/** Use case: register a new repository (local path or remote URL). */
+public interface RegisterRepository {
+
+    Repository register(Command cmd);
+
+    record Command(
+            String name,
+            @Nullable String remoteUrl,
+            @Nullable String localPath,
+            List<String> ignoreGlobs,
+            @Nullable Long maxFileSizeBytes,
+            @Nullable Duration pollInterval,
+            @Nullable Integer tagCap,
+            List<TagPattern> versionMappingRules) {
+
+        public Command {
+            ignoreGlobs = List.copyOf(ignoreGlobs);
+            versionMappingRules = List.copyOf(versionMappingRules);
+        }
+    }
+
+    void unregister(RepositoryId id);
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/ResolveLibraryId.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/ResolveLibraryId.java
@@ -0,0 +1,40 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.RepositoryId;
+import com.trueref.domain.model.VersionId;
+import com.trueref.domain.model.VersionStatus;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+/**
+ * Use case: turn a fuzzy library name (and optional version) into one or more concrete (repo,
+ * version) handles, ranked by relevance. Mirrors Context7's {@code resolve-library-id}.
+ */
+public interface ResolveLibraryId {
+
+    Result resolve(Query query);
+
+    record Query(String libraryName, @Nullable String query, @Nullable String version) {}
+
+    record Result(List<Match> matches) {
+        public Result {
+            matches = List.copyOf(matches);
+        }
+    }
+
+    record Match(
+            RepositoryId repoId,
+            String libraryId, // "/owner/repo[/version]"
+            String name,
+            @Nullable String description,
+            int snippetCount,
+            List<VersionRef> availableVersions,
+            double score) {
+
+        public Match {
+            availableVersions = List.copyOf(availableVersions);
+        }
+    }
+
+    record VersionRef(VersionId versionId, String tag, VersionStatus status) {}
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/SearchLibraryDocs.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/SearchLibraryDocs.java
@@ -0,0 +1,30 @@
+package com.trueref.domain.port.in;
+
+import com.trueref.domain.model.SearchHit;
+import com.trueref.domain.model.SearchScope;
+import java.util.List;
+import org.jspecify.annotations.Nullable;
+
+/** Use case: hybrid (BM25 + dense) search with rerank, scoped to specific (repo, version) pairs. */
+public interface SearchLibraryDocs {
+
+    Result search(Query query);
+
+    record Query(
+            String text,
+            @Nullable String topic,
+            SearchScope scope,
+            int tokensBudget,
+            int maxHits) {}
+
+    /**
+     * @param hits ranked snippets, packed to fit within {@link Query#tokensBudget()}
+     * @param totalTokensReturned cumulative token count of returned snippets
+     */
+    record Result(List<SearchHit> hits, int totalTokensReturned) {
+
+        public Result {
+            hits = List.copyOf(hits);
+        }
+    }
+}
--- a/trueref-domain/src/main/java/com/trueref/domain/port/in/package-info.java
+++ b/trueref-domain/src/main/java/com/trueref/domain/port/in/package-info.java
@@ -0,0 +1,6 @@
+/**
+ * Driving ports — interfaces implemented by the application layer and called by adapters
+ * (REST controllers, MCP tool handlers, scheduled tasks, etc.).
+ */
+@org.jspecify.annotations.NullMarked
+package com.trueref.domain.port.in;
--- a/trueref-frontend/pom.xml
+++ b/trueref-frontend/pom.xml
@@ -0,0 +1,63 @@
+<?xml version="1.0" encoding="UTF-8"?>
+<project xmlns="http://maven.apache.org/POM/4.0.0"
+         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
+         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd">
+    <modelVersion>4.0.0</modelVersion>
+
+    <parent>
+        <groupId>com.trueref</groupId>
+        <artifactId>trueref-parent</artifactId>
+        <version>0.1.0-SNAPSHOT</version>
+    </parent>
+
+    <artifactId>trueref-frontend</artifactId>
+    <name>trueref-frontend</name>
+    <description>SvelteKit static UI built with frontend-maven-plugin and packaged as a resource jar.</description>
+    <packaging>jar</packaging>
+
+    <build>
+        <resources>
+            <!-- Point directly at the SvelteKit build output so resources:resources
+                 (bound to process-resources) finds the files that npm-build already
+                 created in generate-resources. The intermediate copy-frontend-build
+                 step used target/frontend-dist as the resource directory, but that
+                 directory is only populated later in process-resources, causing an
+                 empty JAR on clean builds. -->
+            <resource>
+                <directory>web/build</directory>
+                <targetPath>static</targetPath>
+            </resource>
+        </resources>
+        <plugins>
+            <plugin>
+                <groupId>com.github.eirslett</groupId>
+                <artifactId>frontend-maven-plugin</artifactId>
+                <configuration>
+                    <workingDirectory>web</workingDirectory>
+                    <installDirectory>${project.build.directory}</installDirectory>
+                    <nodeVersion>${node.version}</nodeVersion>
+                    <npmVersion>${npm.version}</npmVersion>
+                </configuration>
+                <executions>
+                    <execution>
+                        <id>install-node-and-npm</id>
+                        <goals><goal>install-node-and-npm</goal></goals>
+                        <phase>generate-resources</phase>
+                    </execution>
+                    <execution>
+                        <id>npm-install</id>
+                        <goals><goal>npm</goal></goals>
+                        <phase>generate-resources</phase>
+                        <configuration><arguments>install</arguments></configuration>
+                    </execution>
+                    <execution>
+                        <id>npm-build</id>
+                        <goals><goal>npm</goal></goals>
+                        <phase>generate-resources</phase>
+                        <configuration><arguments>run build</arguments></configuration>
+                    </execution>
+                </executions>
+            </plugin>
+        </plugins>
+    </build>
+</project>
--- a/trueref-frontend/web/.gitignore
+++ b/trueref-frontend/web/.gitignore
@@ -0,0 +1,9 @@
+node_modules
+/build
+/.svelte-kit
+/package
+.env
+.env.*
+!.env.example
+.DS_Store
+*.log
--- a/trueref-frontend/web/.npmrc
+++ b/trueref-frontend/web/.npmrc
@@ -0,0 +1 @@
+engine-strict=false
--- a/trueref-frontend/web/.prettierrc
+++ b/trueref-frontend/web/.prettierrc
@@ -0,0 +1,9 @@
+{
+  "useTabs": false,
+  "tabWidth": 2,
+  "singleQuote": true,
+  "trailingComma": "none",
+  "printWidth": 100,
+  "plugins": ["prettier-plugin-svelte"],
+  "overrides": [{ "files": "*.svelte", "options": { "parser": "svelte" } }]
+}
--- a/Show More
+++ b/Show More