auto-git:

[add] README.md
 [add] backend/libraries/punk/library.json
 [add] backend/libraries/punk/stage/19f1e5d2ceaab5fd1f1dc58ff07422388f156610d16dfdea2bdb35a5b9e70813--GeorgeJordac-TheVoiceOfHumanJustice.pdf
 [add] backend/libraries/punk/stage/85fce554ff7685f7bccb136aff5768e54b9ba8361672fe45dbce599598c4be4b--4_Strings_-_Take_Me_Away_Into_The_Night_Vocal_Radio_Mix_.mp3
 [add] backend/libraries/punk/stage/e816ca61aebd84159747d248fedd6d5ff318c471c36bcc31b1ac6bf9aebcd3c1--The_Evolution_of_Cooperation_Robert_Axelrod_liber3.pdf
 [add] backend/local_rag.py
 [add] backend/rag/__init__.py
 [add] backend/rag/corpus_builder.py
 [add] backend/rag/corpus_enricher.py
 [add] backend/rag/index_builder.py
 [add] backend/rag/unified_rag.py
 [add] dist/assets/index-Cc0DLWqA.css
 [add] dist/assets/index-DKAz6gtp.js
 [add] dist/index.html
 [add] src/LibraryManager.jsx
 [add] wheelcheck2117/pydantic-2.11.7-py3-none-any.whl
 [add] wheelcheck274/pydantic-2.7.4-py3-none-any.whl
 [change] backend/main.py
 [change] backend/requirements.txt
 [change] backend/schemas.py
 [change] electron/main.cjs
 [change] electron/preload.cjs
 [change] package.json
 [change] run.sh
 [change] src/App.jsx
 [change] src/InterfaceSettings.jsx
 [change] src/colorSchemes.js
 [change] src/main.jsx
 [change] src/styles.css
2026-03-19 21:07:22 +01:00
parent 9c6143a55a
commit 795f4d272a
29 changed files with 6023 additions and 359 deletions

README.md Normal file

@@ -0,0 +1,89 @@
# Heimgeist
Heimgeist is a local desktop chat client for Ollama. It combines an Electron + React renderer with a FastAPI backend, stores chat history in SQLite, supports optional SearXNG-backed web search, and can enrich prompts with context from local library indexes.
## Features
- Local desktop chat UI with Electron
- Ollama-backed chat with streaming and non-streaming replies
- Persistent chat sessions and automatic title generation
- Edit-and-regenerate flow for earlier user messages
- Optional web search enrichment with source chips
- Local library management for RAG-style prompt enrichment
- Theme selection and UI scale controls
## Local Libraries
The `DBs` tab is no longer a placeholder. You can:
- create and rename libraries
- register files and folders
- build, enrich, and index library content
- mark one library as active for chat context
- open or remove registered files from the UI
When a chat library is active, Heimgeist queries it before sending a message and appends the returned context block to the prompt.
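As a rough sketch of what that enrichment step produces (the tag name mirrors the backend's `<local_rag_context>` block; the helper functions here are illustrative, not the backend's actual API):

```python
# A minimal, illustrative sketch (not the backend's actual API): render
# retrieved snippets as a tagged context block and append it to the prompt,
# mirroring the <local_rag_context> format used by backend/local_rag.py.

def build_context_block(sources):
    """Render (title, snippet) pairs as a tagged context block."""
    lines = ["<local_rag_context>"]
    for idx, (title, snippet) in enumerate(sources, start=1):
        lines.append(f"[L{idx}] {title}\n{snippet}")
    lines.append("</local_rag_context>")
    return "\n".join(lines)

def enrich_prompt(prompt, sources):
    """Append the context block after the user's message; no-op without hits."""
    if not sources:
        return prompt
    return prompt + "\n\n" + build_context_block(sources)

enriched = enrich_prompt(
    "What does Axelrod say about reciprocity?",
    [("The Evolution of Cooperation", "Tit-for-tat sustains cooperation...")],
)
```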
## Stack
- Frontend: Electron, React, Vite
- Backend: FastAPI, SQLAlchemy, SQLite
- Search enrichment: SearXNG + page fetching/reranking
- Local RAG pipeline: corpus build, enrichment, embedding, and retrieval helpers under `backend/rag/`
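The content index splits documents with a paragraph-first greedy packer; this is a simplified sketch of the strategy used by `chunk_text` in `backend/rag/index_builder.py` (parameter values here are for illustration only):

```python
# Simplified sketch of the paragraph-first chunker behind the content index:
# greedily pack paragraphs up to a target size, then carry a character-overlap
# tail from each emitted chunk into the next one for retrieval continuity.

def chunk_text(txt, target_chars=2500, overlap_chars=200):
    paras = [p.strip() for p in (txt or "").split("\n\n") if p.strip()]
    if not paras:
        if txt.strip():
            yield txt.strip()
        return
    buf, size = [], 0
    for p in paras:
        if size + len(p) + 2 > target_chars and buf:
            chunk = "\n\n".join(buf)
            yield chunk
            if 0 < overlap_chars < len(chunk):
                tail = chunk[-overlap_chars:]  # overlap carried forward
                buf, size = [tail], len(tail)
            else:
                buf, size = [], 0
        buf.append(p)
        size += len(p) + 2
    if buf:
        yield "\n\n".join(buf)

# Tiny demo with deliberately small limits to force multiple chunks.
chunks = list(chunk_text("para one\n\npara two\n\npara three",
                         target_chars=12, overlap_chars=4))
```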
## Development
Requirements:
- Node.js 18+
- Python 3.13
- Ollama running locally
- Optional: SearXNG on `http://localhost:8888`
Quick start:
```bash
./run.sh
```
This creates or refreshes `backend/.venv`, installs Python dependencies, installs npm dependencies, and starts the dev stack.
Manual startup:
```bash
python3.13 -m venv backend/.venv
backend/.venv/bin/python -m pip install -r backend/requirements.txt
npm install
npm run dev
```
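If you only need the API (for example, to poke at it with curl), the backend can also run standalone; this mirrors the comment at the bottom of `backend/main.py`:

```bash
# start only the FastAPI backend, without Electron/Vite
backend/.venv/bin/python -m uvicorn backend.main:app --host 127.0.0.1 --port 8000
```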
## Project Layout
```text
.
├── backend/
│   ├── main.py
│   ├── local_rag.py
│   ├── rag/
│   ├── websearch.py
│   ├── ollama_client.py
│   ├── models.py
│   ├── database.py
│   ├── schemas.py
│   └── requirements.txt
├── electron/
│   ├── main.cjs
│   └── preload.cjs
├── src/
│   ├── App.jsx
│   ├── LibraryManager.jsx
│   ├── GeneralSettings.jsx
│   ├── InterfaceSettings.jsx
│   ├── WebsearchSettings.jsx
│   ├── markdown.js
│   ├── colorSchemes.js
│   └── styles.css
├── package.json
├── run.sh
└── vite.config.js
```

backend/libraries/punk/library.json Normal file

@@ -0,0 +1,32 @@
{
"id": "f5194228933140b68625347333749baf",
"name": "Punk",
"slug": "punk",
"created_at": "2026-03-19T20:02:20Z",
"files": [
{
"sha256": "e816ca61aebd84159747d248fedd6d5ff318c471c36bcc31b1ac6bf9aebcd3c1",
"path": "/Users/giers/Documents/The Evolution of Cooperation_Robert Axelrod_liber3.pdf",
"rel": "e816ca61aebd84159747d248fedd6d5ff318c471c36bcc31b1ac6bf9aebcd3c1--The_Evolution_of_Cooperation_Robert_Axelrod_liber3.pdf",
"name": "The Evolution of Cooperation_Robert Axelrod_liber3.pdf",
"size": 1208035,
"added_at": "2026-03-19T20:02:53Z"
},
{
"sha256": "19f1e5d2ceaab5fd1f1dc58ff07422388f156610d16dfdea2bdb35a5b9e70813",
"path": "/Users/giers/Documents/GeorgeJordac-TheVoiceOfHumanJustice.pdf",
"rel": "19f1e5d2ceaab5fd1f1dc58ff07422388f156610d16dfdea2bdb35a5b9e70813--GeorgeJordac-TheVoiceOfHumanJustice.pdf",
"name": "GeorgeJordac-TheVoiceOfHumanJustice.pdf",
"size": 849816,
"added_at": "2026-03-19T20:04:17Z"
},
{
"sha256": "85fce554ff7685f7bccb136aff5768e54b9ba8361672fe45dbce599598c4be4b",
"path": "/Users/giers/Music/4 Strings - Take Me Away (Into The Night) (Vocal Radio Mix).mp3",
"rel": "85fce554ff7685f7bccb136aff5768e54b9ba8361672fe45dbce599598c4be4b--4_Strings_-_Take_Me_Away_Into_The_Night_Vocal_Radio_Mix_.mp3",
"name": "4 Strings - Take Me Away (Into The Night) (Vocal Radio Mix).mp3",
"size": 7994108,
"added_at": "2026-03-19T20:06:30Z"
}
]
}

backend/libraries/punk/stage/19f1e5d2ceaab5fd1f1dc58ff07422388f156610d16dfdea2bdb35a5b9e70813--GeorgeJordac-TheVoiceOfHumanJustice.pdf Normal file

@@ -0,0 +1 @@
/Users/giers/Documents/GeorgeJordac-TheVoiceOfHumanJustice.pdf

backend/libraries/punk/stage/85fce554ff7685f7bccb136aff5768e54b9ba8361672fe45dbce599598c4be4b--4_Strings_-_Take_Me_Away_Into_The_Night_Vocal_Radio_Mix_.mp3 Normal file

@@ -0,0 +1 @@
/Users/giers/Music/4 Strings - Take Me Away (Into The Night) (Vocal Radio Mix).mp3

backend/libraries/punk/stage/e816ca61aebd84159747d248fedd6d5ff318c471c36bcc31b1ac6bf9aebcd3c1--The_Evolution_of_Cooperation_Robert_Axelrod_liber3.pdf Normal file

@@ -0,0 +1 @@
/Users/giers/Documents/The Evolution of Cooperation_Robert Axelrod_liber3.pdf

backend/local_rag.py Normal file

@@ -0,0 +1,526 @@
from __future__ import annotations
import asyncio
import functools
import hashlib
import importlib
import json
import os
import re
import shutil
import threading
import uuid
from concurrent.futures import ThreadPoolExecutor
from datetime import datetime
from pathlib import Path
from typing import Any, Dict, List, Optional
from urllib.parse import quote
from fastapi import APIRouter, HTTPException
from pydantic import BaseModel
router = APIRouter(tags=["local-rag"])
LIB_ROOT = Path(__file__).parent / "libraries"
LIB_ROOT.mkdir(parents=True, exist_ok=True)
JOB_EXECUTOR = ThreadPoolExecutor(max_workers=2)
JOBS: Dict[str, Dict[str, Any]] = {}
LIB_LOCKS: Dict[str, asyncio.Lock] = {}
class CreateLibraryRequest(BaseModel):
    name: str

class RenameLibraryRequest(BaseModel):
    name: str

class RegisterPathsRequest(BaseModel):
    paths: List[str]

class RemoveFileRequest(BaseModel):
    rel: str

class EmbedLibraryRequest(BaseModel):
    embed_model: str = "dengcao/Qwen3-Embedding-0.6B:F16"
    ollama: str = "http://localhost:11434"
    target_chars: int = 2000
    overlap_chars: int = 200
    concurrency: int = 6

class LibraryContextRequest(BaseModel):
    prompt: str
    top_k: int = 5
    ollama: str = "http://localhost:11434"
    embed_model: str = "dengcao/Qwen3-Embedding-0.6B:F16"
    gen_model: str = "qwen3:4b"

def now_iso() -> str:
    return datetime.utcnow().isoformat(timespec="seconds") + "Z"

def slugify(name: str) -> str:
    cleaned = re.sub(r"[^a-zA-Z0-9\- ]+", "", name).strip().lower()
    cleaned = re.sub(r"\s+", "-", cleaned)
    return cleaned or f"lib-{uuid.uuid4().hex[:8]}"

def lib_dir(slug: str) -> Path:
    return LIB_ROOT / slug

def lib_json(slug: str) -> Path:
    return lib_dir(slug) / "library.json"

def stage_dir(slug: str) -> Path:
    path = lib_dir(slug) / "stage"
    path.mkdir(parents=True, exist_ok=True)
    return path

def indexes_dir(slug: str) -> Path:
    path = lib_dir(slug) / "indexes"
    path.mkdir(parents=True, exist_ok=True)
    return path

def default_library_data(name: str, slug: str) -> Dict[str, Any]:
    return {
        "id": uuid.uuid4().hex,
        "name": name,
        "slug": slug,
        "created_at": now_iso(),
        "files": [],
    }

def _read_json(path: Path) -> Dict[str, Any]:
    return json.loads(path.read_text(encoding="utf-8"))

def read_library(slug: str) -> Dict[str, Any]:
    path = lib_json(slug)
    if not path.exists():
        raise HTTPException(status_code=404, detail="Library not found")
    return _read_json(path)

def write_library(slug: str, data: Dict[str, Any]) -> None:
    path = lib_json(slug)
    path.parent.mkdir(parents=True, exist_ok=True)
    tmp = path.with_suffix(".tmp")
    tmp.write_text(json.dumps(data, indent=2), encoding="utf-8")
    tmp.replace(path)

def _line_count(path: Path) -> int:
    if not path.exists():
        return 0
    with path.open("r", encoding="utf-8", errors="ignore") as handle:
        return sum(1 for line in handle if line.strip())

def _file_uri(path_value: str) -> str:
    return f"file://{quote(path_value)}"
def _collect_library_paths(slug: str) -> Dict[str, Path]:
    base = lib_dir(slug)
    return {
        "base": base,
        "stage": stage_dir(slug),
        "corpus": base / "corpus.jsonl",
        "enhanced": base / "corpus.enhanced.jsonl",
        "shadow": base / "corpus.shadow.jsonl",
        "indexes": indexes_dir(slug),
        "shadow_index": indexes_dir(slug) / "shadow.index.faiss",
        "shadow_store": indexes_dir(slug) / "shadow.meta.jsonl",
        "content_index": indexes_dir(slug) / "content.index.faiss",
        "content_store": indexes_dir(slug) / "content.meta.jsonl",
    }

def library_payload(data: Dict[str, Any]) -> Dict[str, Any]:
    paths = _collect_library_paths(data["slug"])
    files = list(data.get("files", []))
    stages = {
        "has_files": len(files) > 0,
        "has_corpus": paths["corpus"].exists(),
        "is_enriched": paths["enhanced"].exists() and paths["shadow"].exists(),
        "is_indexed": paths["shadow_index"].exists() and paths["content_index"].exists(),
    }
    artifacts = {
        "corpus_records": _line_count(paths["corpus"]),
        "enhanced_records": _line_count(paths["enhanced"]),
        "shadow_records": _line_count(paths["shadow"]),
    }
    return {
        **data,
        "files": files,
        "states": stages,
        "artifacts": artifacts,
    }

def _walk_input_paths(paths: List[str]) -> List[Path]:
    out: List[Path] = []
    for raw in paths:
        current = Path(raw).expanduser().resolve()
        if not current.exists():
            continue
        if current.is_file():
            out.append(current)
            continue
        for child in current.rglob("*"):
            if child.is_file():
                out.append(child.resolve())
    return out

def _sha256_file(path: Path) -> str:
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()

def _stage_name(sha: str, path: Path) -> str:
    safe_name = re.sub(r"[^A-Za-z0-9._-]+", "_", path.name).strip("._") or "file"
    return f"{sha}--{safe_name}"

def _job_public(job: Dict[str, Any]) -> Dict[str, Any]:
    return {
        "id": job["id"],
        "slug": job["slug"],
        "type": job["type"],
        "status": job["status"],
        "phase": job.get("phase"),
        "progress": job.get("progress", 0.0),
        "detail": job.get("detail", ""),
        "error": job.get("error"),
        "result": job.get("result"),
        "created_at": job["created_at"],
        "finished_at": job.get("finished_at"),
    }

def _has_active_job(slug: str) -> bool:
    return any(
        job["slug"] == slug and job["status"] in {"queued", "running"}
        for job in JOBS.values()
    )

def _load_pipeline_fn(module_name: str, attr: str):
    try:
        module = importlib.import_module(f"backend.rag.{module_name}")
    except ModuleNotFoundError:
        module = importlib.import_module(f".rag.{module_name}", package=__package__)
    return getattr(module, attr)
async def _run_job(job_id: str, fn_name: str, **kwargs):
    loop = asyncio.get_running_loop()
    job = JOBS[job_id]

    def on_progress(phase: str, pct: float, detail: str):
        job["phase"] = phase
        job["progress"] = round(float(pct) * 100.0, 1)
        job["detail"] = detail

    job["status"] = "running"
    try:
        if fn_name == "build":
            runner = _load_pipeline_fn("corpus_builder", "run_build")
        elif fn_name == "enrich":
            runner = _load_pipeline_fn("corpus_enricher", "run_enrich")
        elif fn_name == "embed":
            runner = _load_pipeline_fn("index_builder", "run_index")
        else:
            raise RuntimeError(f"Unknown job type: {fn_name}")
        call = functools.partial(runner, on_progress=on_progress, **kwargs)
        result = await loop.run_in_executor(JOB_EXECUTOR, call)
        job["status"] = "succeeded"
        job["progress"] = 100.0
        job["phase"] = "done"
        job["detail"] = "Completed."
        job["result"] = result
    except Exception as exc:
        job["status"] = "failed"
        job["error"] = f"{type(exc).__name__}: {exc}"
    finally:
        job["finished_at"] = now_iso()

def _start_job(slug: str, job_type: str, **kwargs) -> str:
    job_id = uuid.uuid4().hex
    JOBS[job_id] = {
        "id": job_id,
        "slug": slug,
        "type": job_type,
        "status": "queued",
        "phase": "queued",
        "progress": 0.0,
        "detail": "",
        "created_at": now_iso(),
        "finished_at": None,
        "result": None,
        "error": None,
    }
    asyncio.create_task(_run_job(job_id, job_type, **kwargs))
    return job_id

def _build_local_context(prompt: str, results: Dict[str, Any], top_k: int = 5) -> Dict[str, Any]:
    sources = results.get("sources") or []
    selected = sources[: max(1, top_k)]
    if not selected:
        context_block = (
            "<local_rag_context>\n"
            "No useful results were found in the selected local knowledge base.\n"
            "</local_rag_context>"
        )
        return {"context_block": context_block, "sources": []}
    blocks: List[str] = ["<local_rag_context>"]
    file_sources: List[str] = []
    for idx, source in enumerate(selected, start=1):
        title = (source.get("title") or Path(source.get("url") or source.get("doc_id") or f"Source {idx}").name).strip()
        snippet = re.sub(r"\s+", " ", (source.get("snippet") or "")).strip()
        if len(snippet) > 1400:
            snippet = snippet[:1400].rstrip() + "..."
        raw_path = source.get("url") or source.get("doc_id") or ""
        if raw_path and os.path.isabs(raw_path):
            file_sources.append(_file_uri(raw_path))
        blocks.append(f"[L{idx}] {title}\n{snippet}")
    blocks.append("</local_rag_context>")
    blocks.append(
        "Use the local knowledge base context when it is relevant. "
        "If it does not answer the question, say so clearly instead of inventing details."
    )
    return {"context_block": "\n".join(blocks), "sources": file_sources}
@router.get("/libraries")
def list_libraries():
    libraries: List[Dict[str, Any]] = []
    for path in LIB_ROOT.iterdir():
        if not path.is_dir():
            continue
        meta = path / "library.json"
        if not meta.exists():
            continue
        try:
            libraries.append(library_payload(_read_json(meta)))
        except Exception:
            continue
    libraries.sort(key=lambda item: item.get("created_at", ""), reverse=True)
    return {"libraries": libraries}

@router.post("/libraries")
def create_library(req: CreateLibraryRequest):
    slug = slugify(req.name)
    base_slug = slug
    idx = 2
    while lib_dir(slug).exists():
        slug = f"{base_slug}-{idx}"
        idx += 1
    data = default_library_data(req.name, slug)
    stage_dir(slug)
    indexes_dir(slug)
    write_library(slug, data)
    return library_payload(data)

@router.get("/libraries/{slug}")
def get_library(slug: str):
    return library_payload(read_library(slug))

@router.patch("/libraries/{slug}")
def rename_library(slug: str, req: RenameLibraryRequest):
    data = read_library(slug)
    data["name"] = req.name.strip() or data["name"]
    write_library(slug, data)
    return library_payload(data)

@router.delete("/libraries/{slug}")
def delete_library(slug: str):
    path = lib_dir(slug)
    if not path.exists():
        raise HTTPException(status_code=404, detail="Library not found")
    shutil.rmtree(path)
    return {"ok": True}

@router.post("/libraries/{slug}/files/register")
def register_paths(slug: str, req: RegisterPathsRequest):
    data = read_library(slug)
    stage = stage_dir(slug)
    existing = {entry.get("sha256"): entry for entry in data.get("files", [])}
    added: List[Dict[str, Any]] = []
    skipped: List[str] = []
    for file_path in _walk_input_paths(req.paths):
        sha = _sha256_file(file_path)
        if sha in existing:
            skipped.append(str(file_path))
            continue
        stage_name = _stage_name(sha, file_path)
        symlink_path = stage / stage_name
        if symlink_path.exists():
            symlink_path.unlink()
        symlink_path.symlink_to(file_path)
        entry = {
            "sha256": sha,
            "path": str(file_path),
            "rel": stage_name,
            "name": file_path.name,
            "size": file_path.stat().st_size,
            "added_at": now_iso(),
        }
        data.setdefault("files", []).append(entry)
        added.append(entry)
        existing[sha] = entry
    write_library(slug, data)
    return {
        "added": added,
        "skipped": skipped,
        "library": library_payload(data),
    }

@router.delete("/libraries/{slug}/files")
def remove_file(slug: str, req: RemoveFileRequest):
    data = read_library(slug)
    before = len(data.get("files", []))
    data["files"] = [entry for entry in data.get("files", []) if entry.get("rel") != req.rel]
    symlink_path = stage_dir(slug) / req.rel
    if symlink_path.exists():
        symlink_path.unlink()
    write_library(slug, data)
    if len(data["files"]) == before:
        raise HTTPException(status_code=404, detail="File not found")
    return {"ok": True, "library": library_payload(data)}

@router.post("/libraries/{slug}/jobs/build")
async def build_library(slug: str):
    data = read_library(slug)
    if not data.get("files"):
        raise HTTPException(status_code=400, detail="Add files before building a library.")
    lock = LIB_LOCKS.setdefault(slug, asyncio.Lock())
    async with lock:
        if _has_active_job(slug):
            raise HTTPException(status_code=409, detail="This library already has an active job.")
        job_id = _start_job(
            slug,
            "build",
            root=stage_dir(slug),
            out=_collect_library_paths(slug)["corpus"],
        )
    return {"job_id": job_id}

@router.post("/libraries/{slug}/jobs/enrich")
async def enrich_library(slug: str):
    paths = _collect_library_paths(slug)
    if not paths["corpus"].exists():
        raise HTTPException(status_code=400, detail="Build the corpus before enrichment.")
    lock = LIB_LOCKS.setdefault(slug, asyncio.Lock())
    async with lock:
        if _has_active_job(slug):
            raise HTTPException(status_code=409, detail="This library already has an active job.")
        job_id = _start_job(
            slug,
            "enrich",
            inp=paths["corpus"],
            out=paths["enhanced"],
            shadow_out=paths["shadow"],
        )
    return {"job_id": job_id}

@router.post("/libraries/{slug}/jobs/embed")
async def embed_library(slug: str, req: EmbedLibraryRequest):
    paths = _collect_library_paths(slug)
    if not paths["corpus"].exists():
        raise HTTPException(status_code=400, detail="Build the corpus before indexing.")
    lock = LIB_LOCKS.setdefault(slug, asyncio.Lock())
    async with lock:
        if _has_active_job(slug):
            raise HTTPException(status_code=409, detail="This library already has an active job.")
        job_id = _start_job(
            slug,
            "embed",
            raw=paths["corpus"],
            enhanced=paths["enhanced"] if paths["enhanced"].exists() else None,
            shadow=paths["shadow"] if paths["shadow"].exists() else None,
            out_dir=paths["indexes"],
            embed_model=req.embed_model,
            ollama=req.ollama,
            target_chars=req.target_chars,
            overlap_chars=req.overlap_chars,
            concurrency=req.concurrency,
        )
    return {"job_id": job_id}

@router.get("/jobs")
def list_jobs(slug: Optional[str] = None):
    jobs = [_job_public(job) for job in JOBS.values() if slug is None or job["slug"] == slug]
    jobs.sort(key=lambda item: item.get("created_at", ""), reverse=True)
    return {"jobs": jobs}

@router.get("/jobs/{job_id}")
def get_job(job_id: str):
    job = JOBS.get(job_id)
    if not job:
        raise HTTPException(status_code=404, detail="Job not found")
    return _job_public(job)

@router.post("/libraries/{slug}/context")
def library_context(slug: str, req: LibraryContextRequest):
    paths = _collect_library_paths(slug)
    if not paths["shadow_index"].exists() or not paths["content_index"].exists():
        raise HTTPException(status_code=400, detail="Index the library before using it in chat.")
    try:
        run_query = _load_pipeline_fn("unified_rag", "run_query")
        result = run_query(
            shadow_index=paths["shadow_index"],
            shadow_store=paths["shadow_store"],
            content_index=paths["content_index"],
            content_store=paths["content_store"],
            query=req.prompt,
            answer=False,
            ollama=req.ollama,
            embed_model=req.embed_model,
            gen_model=req.gen_model,
            no_rerank=True,
            k=max(1, req.top_k),
        )
    except Exception as exc:
        raise HTTPException(status_code=500, detail=f"Local retrieval failed: {type(exc).__name__}: {exc}") from exc
    context = _build_local_context(req.prompt, result, top_k=req.top_k)
    return {
        "context_block": context["context_block"],
        "sources": context["sources"],
        "result": result,
    }

backend/main.py

@@ -8,6 +8,7 @@ import html
 import json
 from . import models, schemas
 from .database import Base, engine, SessionLocal, ensure_sources_column
+from .local_rag import router as local_rag_router
 from .ollama_client import list_models as ollama_list, chat as ollama_chat, chat_stream as ollama_chat_stream
 from .websearch import enrich_prompt
@@ -25,6 +26,7 @@ app.add_middleware(
     allow_methods=["*"],
     allow_headers=["*"],
 )
+app.include_router(local_rag_router)

 def get_db():
     db = SessionLocal()
@@ -331,8 +333,11 @@ async def websearch_route(req: schemas.WebSearchRequest):
             searx_url=req.searx_url,
             engines=req.engines,
         )
-        return {"enriched_prompt": enriched, "sources": sources}
+        context_block = ""
+        if "<websearch_context>" in enriched:
+            context_block = enriched[enriched.index("<websearch_context>"):].strip()
+        return {"enriched_prompt": enriched, "sources": sources, "context_block": context_block}
     except Exception:
-        return {"enriched_prompt": req.prompt, "sources": []}
+        return {"enriched_prompt": req.prompt, "sources": [], "context_block": ""}
 # To run standalone: python -m uvicorn backend.main:app --host 127.0.0.1 --port 8000

backend/rag/__init__.py Normal file

backend/rag/corpus_builder.py: file diff suppressed because it is too large

backend/rag/corpus_enricher.py: file diff suppressed because it is too large

backend/rag/index_builder.py Normal file

@@ -0,0 +1,525 @@
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""
03_index_builder.py
Flexible FAISS index builder for hybrid RAG.
Supports these inputs (any subset):
- --raw : corpus.jsonl from 01_corpus_builder.py (no enrichment)
- --enhanced : corpus.enhanced.jsonl from 02_corpus_enricher.py
- --shadow : corpus.shadow.jsonl from 02_corpus_enricher.py
Outputs (by default into ./indexes):
- shadow.index.faiss : FAISS IP index over vectors of "shadow_text"
- shadow.meta.jsonl : metadata for each FAISS id (id, doc_id, record_id, title, url, record_type, mime, lang, kind, shadow_text)
- content.index.faiss : FAISS IP index over vectors of chunked "text"
- content.meta.jsonl : metadata for each FAISS id (id, doc_id, record_id, chunk_no, title, url, text, record_type, mime, lang)
Behavior
- If you provide --shadow → build shadow from it.
- Else if you provide --enhanced → synthesize shadow from enriched fields (headline+summary+keywords+entities+qa).
- Else if you provide --raw → synthesize shadow from raw (title + first sentences + hints).
- If you provide --enhanced → build content from it.
- Else if you provide --raw → build content from raw text (chunking).
- You can disable either side with --no-shadow or --no-content.
Embedding
- Uses Ollama /api/embeddings with cosine similarity (L2-normalize then IP).
Examples:
# Full hybrid from enriched+shadow
python 03_index_builder.py \
--enhanced corpus.enhanced.jsonl \
--shadow corpus.shadow.jsonl \
--out-dir indexes \
--embed-model "dengcao/Qwen3-Embedding-0.6B:F16" \
--target-chars 2500 --overlap-chars 200 \
--concurrency 6
# Raw-only (no enricher) → builds content from raw text and a proxy shadow
python 03_index_builder.py \
--raw corpus.jsonl \
--out-dir indexes \
--embed-model "dengcao/Qwen3-Embedding-0.6B:F16"
"""
from __future__ import annotations
import argparse, json, sys, uuid, os, re
from pathlib import Path
from typing import Dict, Any, Iterable, List, Tuple, Optional, Callable
from concurrent.futures import ThreadPoolExecutor, as_completed
import threading
import numpy as np
import requests
import faiss
from tqdm import tqdm
# -----------------------------
# IO
# -----------------------------
def read_jsonl(path: Path) -> Iterable[Dict[str, Any]]:
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            if line.strip():
                try:
                    yield json.loads(line)
                except Exception:
                    continue

def ensure_dir(p: Path):
    p.mkdir(parents=True, exist_ok=True)

# -----------------------------
# Text helpers
# -----------------------------
def pick_text(rec: Dict[str, Any]) -> str:
    return rec.get("text") or rec.get("content") or rec.get("body") or ""

def first_sentences(s: str, max_chars: int = 500) -> str:
    s = (s or "").strip()
    if not s:
        return ""
    # cheap sentence-ish split
    parts = re.split(r"(?<=[\.\!\?])\s+", s)
    out = []
    total = 0
    for p in parts:
        if not p:
            continue
        out.append(p)
        total += len(p) + 1
        if total >= max_chars:
            break
    joined = " ".join(out).strip()
    return joined[:max_chars].rstrip()

def chunk_text(txt: str, target_chars: int = 2500, overlap_chars: int = 200) -> Iterable[str]:
    # paragraph-first greedy pack
    paras = [p.strip() for p in (txt or "").split("\n\n") if p.strip()]
    if not paras:
        if txt.strip():
            yield txt.strip()
        return
    buf, size = [], 0
    for p in paras:
        if size + len(p) + 2 > target_chars and buf:
            chunk = "\n\n".join(buf)
            yield chunk
            if overlap_chars > 0 and len(chunk) > overlap_chars:
                tail = chunk[-overlap_chars:]
                buf, size = [tail], len(tail)
            else:
                buf, size = [], 0
        buf.append(p)
        size += len(p) + 2
    if buf:
        yield "\n\n".join(buf)

def norm_f32(mat: np.ndarray) -> np.ndarray:
    mat = np.asarray(mat, dtype="float32")
    norms = np.linalg.norm(mat, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    return mat / norms
# -----------------------------
# Embedding
# -----------------------------
def embed_many(ollama_url: str, model: str, texts: List[str], *, concurrency: int = 4, timeout: int = 120, on_progress=None) -> List[np.ndarray]:
    def _embed_one(t: str) -> np.ndarray:
        r = requests.post(f"{ollama_url.rstrip('/')}/api/embeddings", json={"model": model, "prompt": t}, timeout=timeout)
        r.raise_for_status()
        data = r.json()
        vec = data.get("embedding") or (data.get("embeddings") or [None])[0]
        if vec is None:
            raise RuntimeError("No 'embedding' in response")
        return np.array(vec, dtype="float32")

    out: List[Optional[np.ndarray]] = [None] * len(texts)
    with ThreadPoolExecutor(max_workers=max(1, concurrency)) as ex:
        futures = {ex.submit(_embed_one, t): i for i, t in enumerate(texts)}
        progress_bar = None
        if on_progress is None and 'tqdm' in globals() and tqdm is not None:
            progress_bar = tqdm(as_completed(futures), total=len(futures), desc="embed")
        iterator = progress_bar if progress_bar else as_completed(futures)
        count = 0
        for fut in iterator:
            i = futures[fut]
            out[i] = fut.result()
            count += 1
            if on_progress:
                on_progress("embed", count / len(texts), f"Embedding {count}/{len(texts)}")
    return out  # type: ignore  # List[np.ndarray] once all futures resolve
# -----------------------------
# Meta helpers
# -----------------------------
def derive_doc_id_from_any(any_id: Optional[str], parent_id: Optional[str]) -> str:
    """Prefer parent_id if present (file-level), else base of 'id' before '#...'."""
    if parent_id:
        return str(parent_id)
    if not any_id:
        return ""
    return any_id.split("#", 1)[0]

def kind_from_rec(rec: Dict[str, Any]) -> str:
    rt = (rec.get("record_type") or "").lower()
    mime = (rec.get("mime") or "").lower()
    if rt == "image" or mime.startswith("image/"):
        return "image"
    if rt == "av" or mime.startswith(("audio/", "video/")):
        return "av"
    if "html" in mime or rt in {"html-section"}:
        return "html"
    if "pdf" in mime or rt == "page":
        return "pdf"
    if rt == "code-summary" or mime.startswith("text/x-code"):
        return "code"
    return rt or "file"

# -----------------------------
# Shadow text synthesis (fallbacks)
# -----------------------------
def synth_shadow_from_enhanced(rec: Dict[str, Any]) -> str:
    """
    Build a compact shadow_text from enriched fields if present.
    """
    parts: List[str] = []
    h = (rec.get("headline") or rec.get("title") or "").strip()
    s = (rec.get("summary") or "").strip()
    kws = rec.get("keywords") or []
    ents = rec.get("entities") or []
    qas = rec.get("qa") or []
    if h:
        parts.append(f"headline: {h}")
    if s:
        parts.append(f"summary: {s}")
    if kws:
        parts.append("keywords: " + ", ".join([str(k).strip() for k in kws if str(k).strip()]))
    if ents:
        uniq = {}
        for e in ents:
            if not isinstance(e, dict):
                continue
            name = (e.get("name") or "").strip()
            typ = (e.get("type") or "OTHER").strip().upper()
            if name and name.lower() not in uniq:
                uniq[name.lower()] = (name, typ)
        if uniq:
            parts.append("entities: " + "; ".join(f"{n} [{t}]" for n, t in uniq.values()))
    if qas:
        qa_lines = []
        for qa in qas[:4]:
            if not isinstance(qa, dict):
                continue
            q = (qa.get("q") or "").strip()
            a = (qa.get("a") or "").strip()
            if q and a:
                qa_lines.append(f"Q: {q}\nA: {a}")
        if qa_lines:
            parts.append("qa:\n" + "\n".join(qa_lines))
    return "\n".join(parts).strip()

def synth_shadow_from_raw(rec: Dict[str, Any]) -> str:
    """
    Build a proxy shadow_text without any LLM: title + first sentences + light hints.
    """
    title = (rec.get("title") or "").strip()
    text = pick_text(rec)
    kind = kind_from_rec(rec)
    url = rec.get("url") or rec.get("source_path") or ""
    head = f"headline: {title}" if title else ""
    summary = first_sentences(text, 500)
    parts = []
    if head:
        parts.append(head)
    if summary:
        parts.append(f"summary: {summary}")
    hints = []
    if kind:
        hints.append(kind)
    if rec.get("mime"):
        hints.append(rec.get("mime").split(";")[0])
    if url:
        hints.append(Path(url).name)
    if hints:
        parts.append("keywords: " + ", ".join(hints))
    return "\n".join(parts).strip()
# -----------------------------
# Builders
# -----------------------------
def build_shadow_any(
    shadow_jsonl: Optional[Path],
    enhanced_jsonl: Optional[Path],
    raw_jsonl: Optional[Path],
    out_index: Path,
    out_meta: Path,
    *,
    ollama: str,
    model: str,
    concurrency: int,
) -> Tuple[int, int, int]:
    """
    Build FAISS over shadow_text from best available source.
    Priority: shadow_jsonl > enhanced_jsonl (synth) > raw_jsonl (synth).
    Returns (n_input_records, n_indexed, dim)
    """
    src_records: List[Dict[str, Any]] = []
    mode = ""
    if shadow_jsonl and shadow_jsonl.exists():
        src_records = list(read_jsonl(shadow_jsonl))
        mode = "shadow"
    elif enhanced_jsonl and enhanced_jsonl.exists():
        src_records = list(read_jsonl(enhanced_jsonl))
        mode = "enhanced->shadow"
    elif raw_jsonl and raw_jsonl.exists():
        src_records = list(read_jsonl(raw_jsonl))
        mode = "raw->shadow"
    else:
        raise SystemExit("[ERR] No input for shadow index (need --shadow OR --enhanced OR --raw).")
    if not src_records:
        raise SystemExit("[ERR] Empty input for shadow index.")
    texts: List[str] = []
    metas: List[Dict[str, Any]] = []
    for rec in src_records:
        if mode == "shadow":
            st = rec.get("shadow_text") or ""
        elif mode == "enhanced->shadow":
            st = synth_shadow_from_enhanced(rec)
        else:
            st = synth_shadow_from_raw(rec)
        if not st.strip():
            continue
        record_id = rec.get("id") or rec.get("record_id") or str(uuid.uuid4())
        doc_id = derive_doc_id_from_any(record_id, rec.get("parent_id"))
        meta = {
            "id": None,  # numeric FAISS id later
            "record_id": record_id,
            "doc_id": doc_id,
            "title": rec.get("title"),
            "url": rec.get("url") or rec.get("source_path"),
            "record_type": rec.get("record_type"),
            "mime": rec.get("mime"),
            "lang": rec.get("lang"),
            "kind": kind_from_rec(rec),
            "shadow_text": st,
        }
        metas.append(meta)
        texts.append(st)
    if not texts:
        raise SystemExit("[ERR] no shadow_text to embed")
    vecs = embed_many(ollama, model, texts, concurrency=concurrency)
    d = len(vecs[0])
    mat = norm_f32(np.vstack(vecs))
    base = faiss.IndexFlatIP(d)
    index = faiss.IndexIDMap2(base)
    out_meta.parent.mkdir(parents=True, exist_ok=True)
    with open(out_meta, "w", encoding="utf-8") as mf:
        buf_vecs, buf_ids = [], []
        next_id = 0
        for m, v in zip(metas, mat):
            m["id"] = next_id
            mf.write(json.dumps(m, ensure_ascii=False) + "\n")
            buf_vecs.append(v)
            buf_ids.append(next_id)
            next_id += 1
            if len(buf_vecs) >= 512:
                index.add_with_ids(np.vstack(buf_vecs), np.array(buf_ids, dtype="int64"))
                buf_vecs, buf_ids = [], []
        if buf_vecs:
            index.add_with_ids(np.vstack(buf_vecs), np.array(buf_ids, dtype="int64"))
    faiss.write_index(index, str(out_index))
    return (len(src_records), index.ntotal, d)
def build_content_any(
enhanced_jsonl: Optional[Path],
raw_jsonl: Optional[Path],
out_index: Path,
out_meta: Path,
*,
ollama: str,
model: str,
target_chars: int,
overlap_chars: int,
concurrency: int
) -> Tuple[int, int, int]:
"""
Build FAISS over chunked 'text' from best available source.
Priority: enhanced_jsonl > raw_jsonl.
Returns (n_input_records, n_chunks, dim)
"""
src_records: List[Dict[str, Any]] = []
mode = ""
if enhanced_jsonl and enhanced_jsonl.exists():
src_records = list(read_jsonl(enhanced_jsonl))
mode = "enhanced"
elif raw_jsonl and raw_jsonl.exists():
src_records = list(read_jsonl(raw_jsonl))
mode = "raw"
else:
raise SystemExit("[ERR] No input for content index (need --enhanced OR --raw).")
metas: List[Dict[str, Any]] = []
texts: List[str] = []
for rec in src_records:
base_text = pick_text(rec)
if not base_text.strip():
continue
record_id = rec.get("id") or rec.get("record_id") or str(uuid.uuid4())
doc_id = derive_doc_id_from_any(record_id, rec.get("parent_id"))
title = rec.get("title")
url = rec.get("url") or rec.get("source_path")
chunks = list(chunk_text(base_text, target_chars, overlap_chars))
if not chunks:
continue
for ci, chunk in enumerate(chunks):
meta = {
"id": None, # numeric FAISS id later
"doc_id": doc_id,
"record_id": record_id,
"chunk_no": ci,
"title": title,
"url": url,
"text": chunk,
"record_type": rec.get("record_type"),
"mime": rec.get("mime"),
"lang": rec.get("lang"),
}
metas.append(meta)
texts.append(chunk)
if not texts:
raise SystemExit("[ERR] no content chunks to embed")
vecs = embed_many(ollama, model, texts, concurrency=concurrency)
d = len(vecs[0])
mat = norm_f32(np.vstack(vecs))
base = faiss.IndexFlatIP(d)
index = faiss.IndexIDMap2(base)
out_meta.parent.mkdir(parents=True, exist_ok=True)
with open(out_meta, "w", encoding="utf-8") as mf:
buf_vecs, buf_ids = [], []
next_id = 0
for m, v in zip(metas, mat):
m["id"] = next_id
mf.write(json.dumps(m, ensure_ascii=False) + "\n")
buf_vecs.append(v)
buf_ids.append(next_id)
next_id += 1
if len(buf_vecs) >= 512:
index.add_with_ids(np.vstack(buf_vecs), np.array(buf_ids, dtype="int64"))
buf_vecs, buf_ids = [], []
if buf_vecs:
index.add_with_ids(np.vstack(buf_vecs), np.array(buf_ids, dtype="int64"))
faiss.write_index(index, str(out_index))
return (len(src_records), index.ntotal, d)
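`build_content_any` splits each record into overlapping character windows via `chunk_text`, which is defined earlier in this module. A self-contained sketch of that target/overlap contract, assuming a simple sliding window (the real `chunk_text` may differ in detail):

```python
def chunk_text_sketch(text, target_chars, overlap_chars):
    # Hypothetical stand-in for chunk_text: fixed-size windows that overlap
    # by overlap_chars so text straddling a boundary appears in both chunks.
    step = max(1, target_chars - overlap_chars)
    start = 0
    while start < len(text):
        piece = text[start:start + target_chars]
        if piece.strip():
            yield piece
        if start + target_chars >= len(text):
            break
        start += step

chunks = list(chunk_text_sketch("a" * 6000, target_chars=2500, overlap_chars=200))
# → three chunks of lengths 2500, 2500, 1400
```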
# -----------------------------
# CLI
# -----------------------------
def run_index(raw: Path|None, enhanced: Path|None, shadow: Path|None, out_dir: Path, *,
on_progress=None, **opts) -> dict:
args = argparse.Namespace(
raw=raw,
enhanced=enhanced,
shadow=shadow,
out_dir=out_dir,
embed_model=opts.get("embed_model", "dengcao/Qwen3-Embedding-0.6B:F16"),
ollama=opts.get("ollama", "http://localhost:11434"),
target_chars=opts.get("target_chars", 2500),
overlap_chars=opts.get("overlap_chars", 200),
concurrency=opts.get("concurrency", 6),
no_shadow=opts.get("no_shadow", False),
no_content=opts.get("no_content", False),
)
ensure_dir(out_dir)
shadow_index_path = out_dir / "shadow.index.faiss"
shadow_meta_path = out_dir / "shadow.meta.jsonl"
content_index_path = out_dir / "content.index.faiss"
content_meta_path = out_dir / "content.meta.jsonl"
results = {}
built_any = False
if not args.no_shadow:
if on_progress: on_progress("shadow", 0.1, "Building shadow index...")
s_tot, s_ix, s_dim = build_shadow_any(
args.shadow, args.enhanced, args.raw,
shadow_index_path, shadow_meta_path,
ollama=args.ollama, model=args.embed_model, concurrency=args.concurrency
)
results["shadow"] = {"records": s_tot, "indexed": s_ix, "dim": s_dim}
if on_progress: on_progress("shadow", 0.5, "Shadow index complete.")
built_any = True
if not args.no_content:
if on_progress: on_progress("content", 0.6, "Building content index...")
c_tot, c_ix, c_dim = build_content_any(
args.enhanced, args.raw,
content_index_path, content_meta_path,
ollama=args.ollama, model=args.embed_model,
target_chars=args.target_chars, overlap_chars=args.overlap_chars,
concurrency=args.concurrency
)
results["content"] = {"records": c_tot, "chunks": c_ix, "dim": c_dim}
if on_progress: on_progress("content", 0.9, "Content index complete.")
built_any = True
if not built_any:
return {"status": "warning", "message": "Nothing built."}
if on_progress: on_progress("done", 1.0, "Indexing complete.")
return {"status": "ok", "results": results}
def main():
ap = argparse.ArgumentParser(description="Build FAISS indexes (shadow + content) for hybrid RAG with or without enrichment.")
ap.add_argument("--raw", help="Raw corpus JSONL (from 01_corpus_builder.py)")
ap.add_argument("--enhanced", help="Enhanced corpus JSONL (from 02_corpus_enricher.py)")
ap.add_argument("--shadow", help="Shadow corpus JSONL (from 02_corpus_enricher.py)")
ap.add_argument("--out-dir", default="indexes", help="Output directory for indexes + metadata")
ap.add_argument("--embed-model", default="dengcao/Qwen3-Embedding-0.6B:F16", help="Ollama embedding model")
ap.add_argument("--ollama", default="http://localhost:11434", help="Ollama base URL")
ap.add_argument("--target-chars", type=int, default=2500, help="Chunk size for content index")
ap.add_argument("--overlap-chars", type=int, default=200, help="Overlap size for content index")
ap.add_argument("--concurrency", type=int, default=6, help="Parallel HTTP workers for embeddings")
ap.add_argument("--no-shadow", action="store_true", help="Do not build shadow index")
ap.add_argument("--no-content", action="store_true", help="Do not build content index")
args = ap.parse_args()
    # Drop keys passed positionally so **opts does not raise
    # "got multiple values for argument 'raw'" etc.
    opts = {k: v for k, v in vars(args).items()
            if k not in {"raw", "enhanced", "shadow", "out_dir"}}
    run_index(
        Path(args.raw) if args.raw else None,
        Path(args.enhanced) if args.enhanced else None,
        Path(args.shadow) if args.shadow else None,
        Path(args.out_dir),
        on_progress=lambda p, pct, d: print(f"[{p}] {pct*100:.1f}%: {d}"),
        **opts
    )
if __name__ == "__main__":
main()

backend/rag/unified_rag.py Normal file

@@ -0,0 +1,687 @@
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
"""
04_unified_rag.py
Hybrid retrieval + (optional) rerank + (optional) answer generation.
Now supports:
- HYBRID: shadow+content indexes (best quality)
- SINGLE-INDEX:
* legacy pair (--index/--store) ← back-compat
* content-only pair (--content-index/--content-store)
* shadow-only pair (--shadow-index/--shadow-store)
If you skipped enrichment:
- Build only content + proxy shadow with 03_index_builder.py (raw → content; raw/enhanced→proxy shadow)
- Query with:
* HYBRID: provide both pairs
* SINGLE-INDEX: provide only one pair (content OR shadow)
"""
from __future__ import annotations
import argparse, json, os, sys, subprocess, math
from pathlib import Path
from typing import List, Dict, Tuple, Optional
import faiss
import numpy as np
import requests
import threading
from typing import Callable
# -----------------------------
# Utilities
# -----------------------------
def norm_f32(mat: np.ndarray) -> np.ndarray:
mat = np.asarray(mat, dtype="float32")
norms = np.linalg.norm(mat, axis=1, keepdims=True)
norms[norms == 0] = 1.0
return mat / norms
def zscore(x: List[float]) -> List[float]:
if not x:
return []
mu = float(np.mean(x))
sd = float(np.std(x))
if sd == 0.0:
return [0.0 for _ in x]
return [(v - mu) / sd for v in x]
def sanitize(s: Optional[str]) -> str:
if not s:
return ""
import re
s = re.sub(r"<\s*think\s*>.*?<\s*/\s*think\s*>", "", s, flags=re.S|re.I)
s = re.sub(r"^\s*```(?:\w+)?\s*|\s*```\s*$", "", s, flags=re.M)
s = re.sub(r"[ \t]+", " ", s)
s = re.sub(r"\n{3,}", "\n\n", s)
return s.strip()
def pick_any_text(rec: Dict) -> str:
"""Use 'text' if present else 'shadow_text' for rerank/answer/pretty."""
return rec.get("text") or rec.get("shadow_text") or rec.get("content") or rec.get("body") or ""
def embed_query(ollama_url: str, model: str, text: str, timeout_s: int = 60) -> np.ndarray:
r = requests.post(
f"{ollama_url.rstrip('/')}/api/embeddings",
json={"model": model, "prompt": text},
timeout=timeout_s,
)
r.raise_for_status()
data = r.json()
vec = data.get("embedding") or (data.get("embeddings") or [None])[0]
if vec is None:
raise RuntimeError("Ollama /api/embeddings returned no vector.")
return np.array(vec, dtype="float32")
def load_meta(store_path: str) -> Dict[int, Dict]:
id2meta: Dict[int, Dict] = {}
with open(store_path, "r", encoding="utf-8") as f:
for line in f:
line = line.strip()
if not line:
continue
rec = json.loads(line)
id2meta[int(rec["id"])] = rec
return id2meta
def truncate_text(s: Optional[str], limit: int) -> str:
if not s:
return ""
return s if len(s) <= limit else s[:limit]
def derive_doc_id(rec: Dict) -> str:
# Prefer explicit doc_id if provided by meta builder
did = rec.get("doc_id")
if did:
return did
rid = rec.get("record_id") or rec.get("id") or ""
return rid.split("#", 1)[0]
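Chunk-level records are assumed to carry ids of the form `<doc>#<chunk>`; `derive_doc_id` falls back to splitting on the first `#` when no explicit `doc_id` is present. For illustration (the helper is re-declared so the snippet is self-contained):

```python
def derive_doc_id(rec):
    did = rec.get("doc_id")
    if did:
        return did
    rid = rec.get("record_id") or rec.get("id") or ""
    return rid.split("#", 1)[0]

# explicit doc_id wins; otherwise the "#chunk" suffix is stripped
doc_a = derive_doc_id({"doc_id": "doc-1", "record_id": "doc-1#0003"})
doc_b = derive_doc_id({"record_id": "abc123#0007"})
```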
# -----------------------------
# Rerank (subprocess worker)
# -----------------------------
def sentence_transformers_available() -> bool:
try:
import importlib.util as _ilu
spec = _ilu.find_spec("sentence_transformers")
return spec is not None
except Exception:
return False
def rerank_subprocess(
query: str,
docs: List[str],
*,
worker_path: Path,
model: str,
device: str,
dtype: str,
batch: int,
maxlen: int,
) -> Optional[List[Tuple[int, float]]]:
"""
Call this same script with --mode rerank-worker via a clean Python subprocess.
Returns: list of (local_index, score) sorted desc, or None on failure.
"""
payload = {"query": query, "docs": docs}
cmd = [
sys.executable,
str(worker_path),
"--mode", "rerank-worker",
"--rerank-model", model,
"--rerank-device", device,
"--rerank-dtype", dtype,
"--rerank-batch", str(batch),
"--rerank-maxlen", str(maxlen),
"--stdio"
]
env = os.environ.copy()
env.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")
env.setdefault("TOKENIZERS_PARALLELISM", "false")
env.setdefault("KMP_DUPLICATE_LIB_OK", "TRUE")
try:
proc = subprocess.run(
cmd,
input=json.dumps(payload).encode("utf-8"),
stdout=subprocess.PIPE,
stderr=subprocess.PIPE,
check=False,
env=env,
)
except Exception as e:
sys.stderr.write(f"[rerank] failed to launch worker: {e}\n")
return None
if proc.returncode != 0:
sys.stderr.write(proc.stderr.decode("utf-8", errors="ignore") + "\n")
return None
try:
data = json.loads(proc.stdout.decode("utf-8"))
results = data.get("results") or []
pairs = [(int(r["index"]), float(r["score"])) for r in results]
pairs.sort(key=lambda x: x[1], reverse=True)
return pairs
except Exception as e:
sys.stderr.write(f"[rerank] parse error: {e}\n")
return None
# -----------------------------
# Simple diversity (per-source cap)
# -----------------------------
def apply_per_source_cap(ordered: List[Dict], per_source_limit: int) -> List[Dict]:
if per_source_limit <= 0:
return ordered
counts = {}
out = []
for rec in ordered:
key = rec.get("url") or rec.get("doc_id") or rec.get("title") or str(rec.get("id"))
c = counts.get(key, 0)
if c < per_source_limit:
out.append(rec)
counts[key] = c + 1
return out
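The cap walks the already-ranked list and keeps at most `per_source_limit` hits per source key, which spreads the final top-k across documents instead of letting one URL dominate. Re-stated standalone for illustration:

```python
def apply_per_source_cap(ordered, per_source_limit):
    # same logic as above: keep at most per_source_limit hits per source key
    counts, out = {}, []
    for rec in ordered:
        key = rec.get("url") or rec.get("doc_id") or rec.get("title") or str(rec.get("id"))
        c = counts.get(key, 0)
        if c < per_source_limit:
            out.append(rec)
            counts[key] = c + 1
    return out

hits = [{"id": 1, "url": "a"}, {"id": 2, "url": "a"},
        {"id": 3, "url": "a"}, {"id": 4, "url": "b"}]
capped = apply_per_source_cap(hits, 2)
# → keeps ids 1, 2, 4 (third hit for "a" is dropped)
```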
# -----------------------------
# Generation
# -----------------------------
def generate(
ollama_url: str,
model: str,
prompt: str,
system: Optional[str] = None,
temperature: float = 0.2,
timeout_s: int = 180,
on_stream=None,
) -> str:
payload = {"model": model, "prompt": prompt, "options": {"temperature": temperature}}
if system:
payload["system"] = system
r = requests.post(f"{ollama_url.rstrip('/')}/api/generate", json=payload, timeout=timeout_s, stream=True)
r.raise_for_status()
out = []
for chunk in r.iter_lines(decode_unicode=True):
if not chunk:
continue
try:
obj = json.loads(chunk)
delta = obj.get("response", "")
out.append(delta)
if on_stream:
on_stream({"delta": delta})
if obj.get("done"):
break
except Exception:
pass
return sanitize("".join(out))
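`generate` consumes Ollama's streaming `/api/generate` output, which is newline-delimited JSON: each line carries a `"response"` delta plus a `"done"` flag. A sketch of the assembly loop over canned lines (no server needed):

```python
import json

stream_lines = [
    '{"response": "Hel", "done": false}',
    '{"response": "lo", "done": false}',
    '{"response": "", "done": true}',
]

parts = []
for line in stream_lines:
    obj = json.loads(line)
    parts.append(obj.get("response", ""))  # accumulate each delta
    if obj.get("done"):
        break
answer = "".join(parts)
```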
# -----------------------------
# Search helpers
# -----------------------------
def faiss_search(index: faiss.Index, qvec: np.ndarray, k: int) -> Tuple[List[int], List[float]]:
sims, ids = index.search(qvec, k)
ids = [int(i) for i in ids[0] if i != -1]
sims = [float(s) for s in sims[0][: len(ids)]]
return ids, sims
# -----------------------------
# Output / answer
# -----------------------------
def output_or_answer(final: List[Dict], args, on_stream=None):
if not args.answer:
# Return top-k results without generating an answer
return {
"done": True,
"sources": [
{
"doc_id": rec.get("doc_id"),
"title": rec.get("title"),
"url": rec.get("url"),
"record_type": rec.get("record_type"),
"mime": rec.get("mime"),
"lang": rec.get("lang"),
"snippet": pick_any_text(rec),
"scores": {
"final": float(rec.get("_score", 0.0)),
"shadow": float(rec.get("_shadow")) if rec.get("_shadow") is not None else None,
"content": float(rec.get("_ann", 0.0)),
"rerank": float(rec.get("_rerank")) if rec.get("_rerank") is not None else None,
},
}
for rec in final
],
}
# Build prompt for answering
context_blocks, sources = [], []
for i, rec in enumerate(final, start=1):
text = pick_any_text(rec)
title = rec.get("title") or "(untitled)"
url = rec.get("url") or title
sources.append(f"[{i}] {url}")
context_blocks.append(f"[{i}] {title}\n{text}")
system = (
"You are a careful researcher. Answer ONLY from the provided sources. "
"Cite like [1], [2] in-line. If the answer is not in the sources, say you can't find it. "
"Do not include private chain-of-thought or <think> tags."
)
    prompt = (
        f"Question: {args.query}\n\n"
        "Use the sources below. If the question is not answerable from them, say so clearly.\n\n"
        "Sources:\n" + "\n\n".join(context_blocks) + "\n\n----\n\n"
        "Remember: only use these sources. Provide a concise answer with citations.\n\n"
        f"To restate, the question to answer is: {args.query}"
    )
full_answer = generate(args.ollama, args.gen_model, prompt, system=system, temperature=args.temperature, on_stream=on_stream)
final_result = {
"done": True,
"answer": full_answer,
"sources": [
{
"doc_id": rec.get("doc_id"),
"title": rec.get("title"),
"url": rec.get("url"),
}
for rec in final
],
}
if on_stream:
on_stream(final_result)
return final_result
# -----------------------------
# Main CLI (search / answer)
# -----------------------------
def run_cli(args, on_stream=None):
# Determine mode
hybrid_ok = all([args.shadow_index, args.shadow_store, args.content_index, args.content_store])
single_pair: Optional[Tuple[str, str]] = None
single_kind = None
if not hybrid_ok:
# Prefer legacy if provided
if args.index and args.store:
single_pair = (args.index, args.store)
single_kind = "legacy"
elif args.content_index and args.content_store:
single_pair = (args.content_index, args.content_store)
single_kind = "content"
elif args.shadow_index and args.shadow_store:
single_pair = (args.shadow_index, args.shadow_store)
single_kind = "shadow"
# Embed query
q = norm_f32(embed_query(args.ollama, args.embed_model, args.query).reshape(1, -1))
if single_pair:
# SINGLE-INDEX path (works for legacy/content-only/shadow-only)
index = faiss.read_index(single_pair[0])
id2meta = load_meta(single_pair[1])
ids, sims = faiss_search(index, q, min(args.candidates, index.ntotal))
candidates = []
for pos, _id in enumerate(ids):
base = id2meta[_id]
rec = dict(base)
rec["_ann"] = sims[pos]
candidates.append(rec)
# Optional rerank
reranked_scores = None
if not args.no_rerank and sentence_transformers_available():
docs = [truncate_text(pick_any_text(c), args.max_doc_chars) for c in candidates]
pairs = rerank_subprocess(
args.query, docs,
worker_path=Path(__file__),
model=args.rerank_model,
device=args.rerank_device,
dtype=args.rerank_dtype,
batch=args.rerank_batch,
maxlen=args.rerank_maxlen,
)
if pairs is not None:
reranked_scores = [None] * len(candidates)
for local_idx, score in pairs:
if 0 <= local_idx < len(reranked_scores):
reranked_scores[local_idx] = float(score)
min_score = min([s for s in reranked_scores if s is not None], default=0.0)
reranked_scores = [s if s is not None else min_score for s in reranked_scores]
else:
print("[info] rerank disabled (worker failed).", file=sys.stderr)
# Blend
if reranked_scores is not None:
z_ann = zscore([c["_ann"] for c in candidates])
z_rr = zscore(reranked_scores)
alpha = float(args.blend)
final_scores = [(1 - alpha) * a + alpha * r for a, r in zip(z_ann, z_rr)]
for rec, fs, rr in zip(candidates, final_scores, reranked_scores):
rec["_score"] = float(fs)
rec["_rerank"] = float(rr)
candidates.sort(key=lambda r: r["_score"], reverse=True)
else:
for rec in candidates:
rec["_score"] = rec["_ann"]
final = candidates[: max(1, min(args.k, len(candidates)))]
return output_or_answer(final, args)
# HYBRID path
shadow_index = faiss.read_index(args.shadow_index)
shadow_meta = load_meta(args.shadow_store)
content_index = faiss.read_index(args.content_index)
content_meta = load_meta(args.content_store)
# Stage A: Shadow search → doc shortlist
sid_list, s_sim = faiss_search(shadow_index, q, min(args.shadow_candidates, shadow_index.ntotal))
s_hits = [{"id": sid, "sim": sim, **shadow_meta[sid]} for sid, sim in zip(sid_list, s_sim)]
# optional shadow weighting by kind
kw = {}
for kv in args.shadow_kind_weights.split(","):
kv = kv.strip()
if not kv:
continue
if ":" in kv:
k, v = kv.split(":", 1)
try:
kw[k.strip().lower()] = float(v)
except Exception:
pass
if kw:
for h in s_hits:
w = kw.get((h.get("kind") or "").lower(), 1.0)
h["sim"] *= float(w)
# group to doc_id
doc_scores: Dict[str, float] = {}
for h in s_hits:
did = derive_doc_id(h)
doc_scores[did] = max(doc_scores.get(did, 0.0), float(h["sim"])) # max over shadow signals
# Stage B: Content search (global)
cid_list, c_sim = faiss_search(content_index, q, min(args.content_candidates, content_index.ntotal))
c_hits = [{"id": cid, "sim": sim, **content_meta[cid]} for cid, sim in zip(cid_list, c_sim)]
# Stage C: filter to doc shortlist
ordered_docs = sorted(doc_scores.items(), key=lambda kv: kv[1], reverse=True)[: args.doc_top]
if not ordered_docs:
# Fallback: derive docs from top content hits
tmp_docs = []
seen = set()
for h in c_hits:
did = derive_doc_id(h)
if did not in seen:
seen.add(did)
tmp_docs.append((did, float(h['sim'])))
if len(tmp_docs) >= args.doc_top:
break
ordered_docs = tmp_docs
shortlist = set([d for d, _ in ordered_docs])
# keep content hits belonging to shortlist (fallback to global if empty)
content_for_docs = [h for h in c_hits if derive_doc_id(h) in shortlist] or c_hits
# per-doc cap
per_doc = max(1, args.per_doc_chunks)
doc_buckets: Dict[str, List[Dict]] = {}
for h in content_for_docs:
did = derive_doc_id(h)
doc_buckets.setdefault(did, []).append(h)
for did, arr in doc_buckets.items():
arr.sort(key=lambda r: r["sim"], reverse=True)
doc_buckets[did] = arr[:per_doc]
# flatten, compute final score as blend of shadow(doc) + content(chunk)
out_candidates: List[Dict] = []
for did, doc_sim in ordered_docs:
for ch in doc_buckets.get(did, []):
final = dict(ch)
final["_shadow"] = float(doc_sim)
final["_ann"] = float(ch["sim"])
alpha = float(args.doc_blend) # weight of shadow
beta = float(args.chunk_blend) # weight of chunk ann
final["_score"] = alpha * float(doc_sim) + beta * float(ch["sim"])
out_candidates.append(final)
    if not out_candidates:
        print("No retrieval results.", file=sys.stderr)
        # Always return a dict so callers (e.g. run_query) get a uniform shape
        return {"done": True, "answer": None, "sources": []}
out_candidates.sort(key=lambda r: r["_score"], reverse=True)
# Optional rerank of the first pool
reranked_scores = None
if not args.no_rerank and sentence_transformers_available():
pool = out_candidates[: args.candidates]
docs = [truncate_text(pick_any_text(c), args.max_doc_chars) for c in pool]
pairs = rerank_subprocess(
args.query, docs,
worker_path=Path(__file__),
model=args.rerank_model,
device=args.rerank_device,
dtype=args.rerank_dtype,
batch=args.rerank_batch,
maxlen=args.rerank_maxlen,
)
if pairs is not None:
reranked_scores = [None] * len(pool)
for local_idx, score in pairs:
if 0 <= local_idx < len(pool):
reranked_scores[local_idx] = float(score)
min_score = min([s for s in reranked_scores if s is not None], default=0.0)
reranked_scores = [s if s is not None else min_score for s in reranked_scores]
z_ann = zscore([c["_score"] for c in pool])
z_rr = zscore(reranked_scores)
alpha = float(args.blend)
blended = [(1 - alpha) * a + alpha * r for a, r in zip(z_ann, z_rr)]
for rec, fs, rr in zip(pool, blended, reranked_scores):
rec["_score"] = float(fs)
rec["_rerank"] = float(rr)
out_candidates[: len(pool)] = sorted(pool, key=lambda r: r["_score"], reverse=True)
else:
print("[info] rerank disabled (worker failed).", file=sys.stderr)
# per-source cap and top-k
ordered = apply_per_source_cap(out_candidates, args.per_source_limit)
final = ordered[: max(1, min(args.k, len(ordered)))]
return output_or_answer(final, args)
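The reranker blend above z-normalizes both score lists before mixing, so the ANN and cross-encoder scales cannot dominate each other; with the default `--blend 0.75` the reranker carries most of the weight. A stdlib-only sketch:

```python
import statistics

def zscore(xs):
    # mirrors the zscore helper above: zero-variance lists collapse to 0.0
    sd = statistics.pstdev(xs)
    if sd == 0.0:
        return [0.0] * len(xs)
    mu = statistics.fmean(xs)
    return [(x - mu) / sd for x in xs]

ann = [0.9, 0.5, 0.1]      # ANN similarities favor doc 0
rerank = [2.0, 8.0, 5.0]   # cross-encoder favors doc 1
alpha = 0.75               # weight on the reranker
blended = [(1 - alpha) * a + alpha * r
           for a, r in zip(zscore(ann), zscore(rerank))]
best = max(range(len(blended)), key=lambda i: blended[i])
# → doc 1 wins: the reranker outvotes the ANN ordering
```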
# -----------------------------
# Rerank worker mode
# -----------------------------
def run_rerank_worker(args):
"""
Reads JSON from stdin: {"query": str, "docs": [str, ...]}
Writes JSON to stdout: {"results": [{"index": int, "score": float}, ...]}
"""
try:
import torch
from sentence_transformers import CrossEncoder
except Exception as e:
# Gracefully tell parent we failed by returning empty results
out = {"results": []}
json.dump(out, sys.stdout)
sys.stdout.flush()
print(f"[worker] sentence_transformers unavailable: {e}", file=sys.stderr)
return
try:
torch.set_num_threads(1)
except Exception:
pass
device = args.rerank_device
if device == "auto":
device = "mps" if torch.backends.mps.is_available() else "cpu"
if args.rerank_dtype == "auto":
dtype = torch.float16 if device == "mps" else torch.float32
else:
dtype = torch.float16 if args.rerank_dtype == "float16" else torch.float32
model = CrossEncoder(
args.rerank_model,
device=device,
max_length=args.rerank_maxlen,
automodel_args={"torch_dtype": dtype},
)
data = json.load(sys.stdin)
query = data["query"]
docs = data["docs"]
pairs = [(query, d) for d in docs]
scores = model.predict(
pairs,
batch_size=args.rerank_batch,
convert_to_numpy=True,
show_progress_bar=False,
).tolist()
ordered = sorted(enumerate(scores), key=lambda x: x[1], reverse=True)
out = {"results": [{"index": int(i), "score": float(s)} for i, s in ordered]}
json.dump(out, sys.stdout)
sys.stdout.flush()
# -----------------------------
# Argparse
# -----------------------------
def build_parser():
ap = argparse.ArgumentParser(allow_abbrev=False)
ap.add_argument("--mode", default="cli", choices=["cli", "rerank-worker"])
# Legacy single-index I/O (kept for back-compat)
ap.add_argument("--index", help="Single FAISS index (legacy)")
ap.add_argument("--store", help="Single metadata JSONL (legacy)")
ap.add_argument("--candidates", type=int, default=200, help="ANN neighbors to fetch (legacy or rerank pool).")
# Hybrid I/O
ap.add_argument("--shadow-index", help="FAISS index over shadow_text")
ap.add_argument("--shadow-store", help="Metadata JSONL for shadow index")
ap.add_argument("--content-index", help="FAISS index over content chunks")
ap.add_argument("--content-store", help="Metadata JSONL for content index")
ap.add_argument("--query", required=False)
ap.add_argument("--ollama", default="http://localhost:11434")
ap.add_argument("--embed-model", default="dengcao/Qwen3-Embedding-0.6B:F16")
# Shadow/content retrieval sizes (hybrid)
ap.add_argument("--shadow-candidates", type=int, default=400, help="Shadow ANN pool size")
ap.add_argument("--content-candidates", type=int, default=600, help="Content ANN pool size")
ap.add_argument("--doc-top", type=int, default=40, help="Top-N documents from shadow shortlist")
ap.add_argument("--per-doc-chunks", type=int, default=2, help="Max chunks per doc from content pool")
ap.add_argument("--doc-blend", type=float, default=0.6, help="Weight for shadow score in final blend [0..1]")
ap.add_argument("--chunk-blend", type=float, default=0.4, help="Weight for content-ANN score in final blend [0..1]")
ap.add_argument("--shadow-kind-weights", default="image:1.2,code:1.1", help="Comma list 'kind:weight' to bias doc ranking")
# Rerank knobs
ap.add_argument("--no-rerank", action="store_true", help="Disable reranking.")
ap.add_argument("--blend", type=float, default=0.75, help="Blend weight for reranker in normalized score [0..1].")
ap.add_argument("--rerank-model", default="cross-encoder/ms-marco-MiniLM-L-6-v2")
ap.add_argument("--rerank-device", default="auto", choices=["auto", "mps", "cpu"])
ap.add_argument("--rerank-dtype", default="auto", choices=["auto", "float16", "float32"])
ap.add_argument("--rerank-batch", type=int, default=64)
ap.add_argument("--rerank-maxlen", type=int, default=256)
ap.add_argument("--stdio", action="store_true", help=argparse.SUPPRESS) # worker-only flag
# Output / answer
ap.add_argument("--json", action="store_true", help="Print search results as JSON.")
ap.add_argument("--pretty", action="store_true", help="Pretty-print search results.")
ap.add_argument("--show-scores", action="store_true", help="Show ANN/rerank scores in pretty output.")
ap.add_argument("--answer", action="store_true", help="Generate an answer with an LLM using top-k contexts.")
ap.add_argument("--gen-model", default="qwen3:4b",
help="Any chat-capable model in Ollama (e.g., 'qwen2.5:7b-instruct', 'llama3.1:8b-instruct').")
ap.add_argument("--temperature", type=float, default=0.2)
ap.add_argument("--k", type=int, default=10, help="Number of final results to return/use.")
# Misc
ap.add_argument("--max-doc-chars", type=int, default=4000, help="Truncate each candidate before rerank.")
ap.add_argument("--per-source-limit", type=int, default=3, help="Max results per source (url/doc) to diversify.")
return ap
def run_query(shadow_index: Path, shadow_store: Path,
content_index: Path, content_store: Path,
query: str, *, answer: bool = False,
on_stream: Optional[Callable[[Dict], None]] = None, **opts) -> dict:
# Ensure all paths are strings for argparse.Namespace
_shadow_index = str(shadow_index) if shadow_index else None
_shadow_store = str(shadow_store) if shadow_store else None
_content_index = str(content_index) if content_index else None
_content_store = str(content_store) if content_store else None
args = argparse.Namespace(
shadow_index=_shadow_index,
shadow_store=_shadow_store,
        content_index=_content_index,
        content_store=_content_store,
        index=None,   # legacy single-index pair (not used by this entry point)
        store=None,
        candidates=opts.get("candidates", 200),
query=query,
answer=answer,
ollama=opts.get("ollama", "http://localhost:11434"),
embed_model=opts.get("embed_model", "dengcao/Qwen3-Embedding-0.6B:F16"),
shadow_candidates=opts.get("shadow_candidates", 400),
content_candidates=opts.get("content_candidates", 600),
doc_top=opts.get("doc_top", 40),
per_doc_chunks=opts.get("per_doc_chunks", 2),
doc_blend=opts.get("doc_blend", 0.6),
chunk_blend=opts.get("chunk_blend", 0.4),
shadow_kind_weights=opts.get("shadow_kind_weights", "image:1.2,code:1.1"),
no_rerank=opts.get("no_rerank", True), # Reranker OFF by default
blend=opts.get("blend", 0.75),
rerank_model=opts.get("rerank_model", "cross-encoder/ms-marco-MiniLM-L-6-v2"),
rerank_device=opts.get("rerank_device", "auto"),
rerank_dtype=opts.get("rerank_dtype", "auto"),
rerank_batch=opts.get("rerank_batch", 64),
rerank_maxlen=opts.get("rerank_maxlen", 256),
gen_model=opts.get("gen_model", "qwen3:4b"),
temperature=opts.get("temperature", 0.2),
k=opts.get("k", 10),
max_doc_chars=opts.get("max_doc_chars", 4000),
per_source_limit=opts.get("per_source_limit", 3),
json=True # Force JSON-like output dict
)
return run_cli(args, on_stream=on_stream)
def main():
ap = build_parser()
args = ap.parse_args()
if args.mode == "rerank-worker":
return run_rerank_worker(args)
if not args.query:
ap.error("--query is required in cli mode")
hybrid_ok = all([args.shadow_index, args.shadow_store, args.content_index, args.content_store])
if not hybrid_ok:
ap.error("For CLI use, all four index/store paths are required for hybrid retrieval.")
    # Drop keys passed explicitly so **opts does not raise
    # "got multiple values for argument 'shadow_index'" etc.
    opts = {k: v for k, v in vars(args).items()
            if k not in {"shadow_index", "shadow_store", "content_index",
                         "content_store", "query", "answer", "mode",
                         "index", "store", "stdio"}}
    result = run_query(
        shadow_index=Path(args.shadow_index),
        shadow_store=Path(args.shadow_store),
        content_index=Path(args.content_index),
        content_store=Path(args.content_store),
        query=args.query,
        answer=args.answer,
        on_stream=(lambda d: print(json.dumps(d, ensure_ascii=False), flush=True)) if args.answer else None,
        **opts
    )
if not args.answer:
print(json.dumps(result, ensure_ascii=False))
if __name__ == "__main__":
main()

backend/requirements.txt

@@ -3,9 +3,24 @@ fastapi==0.111.0
 uvicorn[standard]==0.30.1
 SQLAlchemy==2.0.32
 httpx==0.27.0
-pydantic==2.7.4
+pydantic==2.11.7
+requests>=2.32.0
 # Web search enrichment dependencies
 beautifulsoup4==4.12.3
 httpx[http2]>=0.27.0
 numpy
+
+# Local RAG pipeline dependencies
+faiss-cpu>=1.8.0
+PyMuPDF>=1.24.0
+ebooklib>=0.18
+chardet>=5.2.0
+Pillow>=10.0.0
+langid>=1.1.6
+langdetect>=1.0.9
+
+# Optional but recommended for richer ingestion / reranking:
+# openai-whisper
+# opencv-python-headless
+# sentence-transformers

backend/schemas.py

@@ -1,4 +1,4 @@
-from pydantic import BaseModel
+from pydantic import BaseModel, ConfigDict
 from typing import List, Optional
 from datetime import datetime
@@ -38,8 +38,7 @@ class ChatSession(BaseModel):
     name: str
     created_at: datetime
-    class Config:
-        orm_mode = True
+    model_config = ConfigDict(from_attributes=True)
 class SessionsResponse(BaseModel):
     sessions: List[ChatSession]
@@ -67,3 +66,4 @@ class WebSearchRequest(BaseModel):
 class WebSearchResponse(BaseModel):
     enriched_prompt: str
     sources: List[str] = []
+    context_block: str = ""

dist/assets/index-Cc0DLWqA.css vendored Normal file

File diff suppressed because one or more lines are too long

dist/assets/index-DKAz6gtp.js vendored Normal file

File diff suppressed because one or more lines are too long

dist/index.html vendored Normal file

@@ -0,0 +1,14 @@
<!doctype html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<title>LLM Desktop</title>
<script type="module" crossorigin src="/assets/index-DKAz6gtp.js"></script>
<link rel="stylesheet" crossorigin href="/assets/index-Cc0DLWqA.css">
</head>
<body>
<div id="root"></div>
</body>
</html>

electron/main.cjs

@@ -1,5 +1,4 @@
-const { app, BrowserWindow, Menu, dialog, ipcMain, shell } = require('electron')
+const { app, BrowserWindow, Menu, ipcMain, shell } = require('electron')
 const path = require('path')
 const { is } = require('@electron-toolkit/utils')
 const fs = require('fs')
@@ -9,12 +8,36 @@
 let settingsWindow = null
 const settingsFilePath = path.join(app.getPath('userData'), 'settings.json')
 let appSettings = {}
+const DEFAULT_UI_SCALE = 1
+const MIN_UI_SCALE = 0.7
+const MAX_UI_SCALE = 1.3
-// Default settings
 const defaultSettings = {
   ollamaApiUrl: 'http://127.0.0.1:8000',
   colorScheme: 'Default',
-  chatModel: 'llama3' // Set a default model here
+  uiScale: DEFAULT_UI_SCALE,
+  chatModel: 'llama3',
+}
+
+function normalizeUiScale(value) {
+  const numericValue = Number(value)
+  if (!Number.isFinite(numericValue)) {
+    return DEFAULT_UI_SCALE
+  }
+  return Math.min(MAX_UI_SCALE, Math.max(MIN_UI_SCALE, Math.round(numericValue * 100) / 100))
+}
+
+function applyUiScaleToWindow(window) {
+  if (!window || window.isDestroyed()) {
+    return
+  }
+  window.webContents.setZoomFactor(normalizeUiScale(appSettings.uiScale))
+}
+
+function applyUiScaleToAllWindows() {
+  BrowserWindow.getAllWindows().forEach(applyUiScaleToWindow)
 }

 function loadSettings() {
@@ -24,8 +47,9 @@ function loadSettings() {
       appSettings = { ...defaultSettings, ...JSON.parse(data) }
     } else {
       appSettings = { ...defaultSettings }
-      saveSettings() // Create the file with default settings
+      saveSettings()
     }
+    appSettings.uiScale = normalizeUiScale(appSettings.uiScale)
   } catch (error) {
     console.error('Failed to load settings:', error)
     appSettings = { ...defaultSettings }
@@ -40,7 +64,7 @@ function saveSettings() {
   }
 }

-async function createMainWindow () {
+async function createMainWindow() {
   mainWindow = new BrowserWindow({
     width: 1000,
     height: 720,
@@ -50,17 +74,23 @@ async function createMainWindow () {
     webPreferences: {
       preload: path.join(__dirname, 'preload.cjs'),
       contextIsolation: true,
-      nodeIntegration: false
-    }
+      nodeIntegration: false,
+    },
   })

+  applyUiScaleToWindow(mainWindow)
+
   mainWindow.on('ready-to-show', () => {
     mainWindow.show()
   })

+  mainWindow.webContents.on('did-finish-load', () => {
+    applyUiScaleToWindow(mainWindow)
+  })
+
   mainWindow.on('focus', () => {
-    mainWindow.webContents.send('window-focused');
-  });
+    mainWindow.webContents.send('window-focused')
+  })

   if (is.dev && process.env.VITE_DEV_SERVER_URL) {
     await mainWindow.loadURL(process.env.VITE_DEV_SERVER_URL)
@@ -70,12 +100,12 @@ async function createMainWindow () {
   }

   mainWindow.webContents.setWindowOpenHandler(({ url }) => {
-    shell.openExternal(url);
-    return { action: 'deny' };
-  });
+    shell.openExternal(url)
+    return { action: 'deny' }
+  })
 }

-async function createSettingsWindow () {
+async function createSettingsWindow() {
   if (settingsWindow) {
     settingsWindow.focus()
     return
@@ -91,14 +121,20 @@ async function createSettingsWindow () {
     webPreferences: {
       preload: path.join(__dirname, 'preload.cjs'),
       contextIsolation: true,
-      nodeIntegration: false
-    }
+      nodeIntegration: false,
+    },
   })

+  applyUiScaleToWindow(settingsWindow)
+
   settingsWindow.on('ready-to-show', () => {
     settingsWindow.show()
   })

+  settingsWindow.webContents.on('did-finish-load', () => {
+    applyUiScaleToWindow(settingsWindow)
+  })
+
   settingsWindow.on('closed', () => {
     settingsWindow = null
   })
@@ -112,7 +148,7 @@ async function createSettingsWindow () {
 }

 app.whenReady().then(() => {
-  loadSettings() // Load settings when the app is ready
+  loadSettings()
   createMainWindow()
const menuTemplate = [ const menuTemplate = [
@@ -122,11 +158,11 @@ app.whenReady().then(() => {
{ {
label: 'Settings', label: 'Settings',
accelerator: 'CmdOrCtrl+,', accelerator: 'CmdOrCtrl+,',
click: createSettingsWindow click: createSettingsWindow,
}, },
{ type: 'separator' }, { type: 'separator' },
{ role: 'quit' } { role: 'quit' },
] ],
}, },
{ {
label: 'Edit', label: 'Edit',
@@ -139,8 +175,8 @@ app.whenReady().then(() => {
{ role: 'paste' }, { role: 'paste' },
{ role: 'delete' }, { role: 'delete' },
{ type: 'separator' }, { type: 'separator' },
{ role: 'selectAll' } { role: 'selectAll' },
] ],
}, },
{ {
label: 'View', label: 'View',
@@ -153,9 +189,9 @@ app.whenReady().then(() => {
{ role: 'zoomin' }, { role: 'zoomin' },
{ role: 'zoomout' }, { role: 'zoomout' },
{ type: 'separator' }, { type: 'separator' },
{ role: 'togglefullscreen' } { role: 'togglefullscreen' },
] ],
} },
] ]
const menu = Menu.buildFromTemplate(menuTemplate) const menu = Menu.buildFromTemplate(menuTemplate)
@@ -166,23 +202,40 @@ app.whenReady().then(() => {
}) })
}) })
// IPC handlers for settings ipcMain.handle('get-settings', () => appSettings)
ipcMain.handle('get-settings', () => {
return appSettings
})
ipcMain.handle('set-setting', (event, key, value) => { ipcMain.handle('set-setting', (event, key, value) => {
appSettings[key] = value appSettings[key] = key === 'uiScale' ? normalizeUiScale(value) : value
saveSettings() saveSettings()
if (key === 'uiScale') {
applyUiScaleToAllWindows()
}
return true return true
}) })
ipcMain.handle('update-settings', (event, settings) => { ipcMain.handle('update-settings', (event, settings) => {
appSettings = { ...appSettings, ...settings } appSettings = { ...appSettings, ...settings }
appSettings.uiScale = normalizeUiScale(appSettings.uiScale)
saveSettings() saveSettings()
if (Object.prototype.hasOwnProperty.call(settings, 'uiScale')) {
applyUiScaleToAllWindows()
}
return true return true
}) })
ipcMain.handle('pick-paths', async () => {
const result = await dialog.showOpenDialog(mainWindow, {
properties: ['openFile', 'openDirectory', 'multiSelections'],
})
return result.canceled ? [] : result.filePaths
})
ipcMain.handle('open-path', async (event, filePath) => {
if (!filePath) return false
const err = await shell.openPath(filePath)
return err === ''
})
ipcMain.on('open-external-link', (event, url) => { ipcMain.on('open-external-link', (event, url) => {
shell.openExternal(url) shell.openExternal(url)
}) })
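The clamp in `normalizeUiScale` is what keeps both persisted settings and live slider input inside the supported zoom range. A standalone sketch of its behaviour, runnable outside Electron (the helper and its constants are copied from the diff; `DEFAULT_UI_SCALE` is assumed to be 1, matching the value declared in src/InterfaceSettings.jsx):

```javascript
// Copy of the UI-scale clamp helper, with its constants, so the behaviour
// can be exercised without an Electron main process.
const DEFAULT_UI_SCALE = 1
const MIN_UI_SCALE = 0.7
const MAX_UI_SCALE = 1.3

function normalizeUiScale(value) {
  const numericValue = Number(value)
  if (!Number.isFinite(numericValue)) {
    return DEFAULT_UI_SCALE
  }
  // Round to two decimals first, then clamp into [MIN_UI_SCALE, MAX_UI_SCALE].
  return Math.min(MAX_UI_SCALE, Math.max(MIN_UI_SCALE, Math.round(numericValue * 100) / 100))
}

console.log(normalizeUiScale('0.853')) // 0.85 — string input is coerced and rounded
console.log(normalizeUiScale(2))       // 1.3  — clamped to MAX_UI_SCALE
console.log(normalizeUiScale('abc'))   // 1    — non-numeric input falls back to the default
```

Because range inputs report their value as a string, the `Number(value)` coercion is load-bearing: the same helper handles both the renderer's `event.target.value` and whatever was stored in the settings file.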

electron/preload.cjs

@@ -6,6 +6,8 @@ contextBridge.exposeInMainWorld('electronAPI', {
   getSettings: () => ipcRenderer.invoke('get-settings'),
   setSetting: (key, value) => ipcRenderer.invoke('set-setting', key, value),
   updateSettings: (settings) => ipcRenderer.invoke('update-settings', settings),
+  pickPaths: () => ipcRenderer.invoke('pick-paths'),
+  openPath: (filePath) => ipcRenderer.invoke('open-path', filePath),
   openExternalLink: (event) => {
     event.preventDefault();
     const url = event.currentTarget.href;

package.json

@@ -6,7 +6,7 @@
   "type": "module",
   "scripts": {
     "dev": "concurrently -k \"npm:dev:backend\" \"npm:dev:renderer\" \"npm:dev:electron\"",
-    "dev:backend": "python3 -m uvicorn backend.main:app --host 127.0.0.1 --port 8000 --reload",
+    "dev:backend": "backend/.venv/bin/python -m uvicorn backend.main:app --host 127.0.0.1 --port 8000 --reload",
    "dev:renderer": "vite --port 5173 --strictPort",
     "dev:electron": "wait-on http://localhost:5173 tcp:8000 && cross-env VITE_DEV_SERVER_URL=http://localhost:5173 electron .",
     "build": "vite build",

run.sh

@@ -1,6 +1,19 @@
 #!/bin/sh
-python -m venv backend/.venv
-source backend/.venv/bin/activate
-pip install -r backend/requirements.txt
+set -eu
+
+PYTHON_BIN="${PYTHON_BIN:-python3.13}"
+VENV_DIR="backend/.venv"
+
+if ! command -v "$PYTHON_BIN" >/dev/null 2>&1; then
+  echo "Python 3.13 is required. Set PYTHON_BIN to a Python 3.13 executable if needed." >&2
+  exit 1
+fi
+
+if [ ! -x "$VENV_DIR/bin/python" ] || ! "$VENV_DIR/bin/python" -c 'import sys; raise SystemExit(0 if sys.version_info[:2] == (3, 13) else 1)'; then
+  rm -rf "$VENV_DIR"
+  "$PYTHON_BIN" -m venv "$VENV_DIR"
+fi
+
+"$VENV_DIR/bin/python" -m pip install -r backend/requirements.txt
 npm install
 npm run dev

(File diff suppressed because it is too large.)

src/InterfaceSettings.jsx

@@ -1,27 +1,58 @@
-import React, { useState, useEffect } from 'react';
-import { colorSchemes, applyColorScheme } from './colorSchemes';
+import React, { useEffect, useState } from 'react'
+import { colorSchemes, applyColorScheme } from './colorSchemes'
 
-const COLOR_SCHEME_KEY = 'colorScheme';
+const COLOR_SCHEME_KEY = 'colorScheme'
+const UI_SCALE_KEY = 'uiScale'
+const DEFAULT_UI_SCALE = 1
+const MIN_UI_SCALE = 0.7
+const MAX_UI_SCALE = 1.3
+const UI_SCALE_STEP = 0.05
+
+function normalizeUiScale(value) {
+  const numericValue = Number(value)
+  if (!Number.isFinite(numericValue)) {
+    return DEFAULT_UI_SCALE
+  }
+  return Math.min(MAX_UI_SCALE, Math.max(MIN_UI_SCALE, Math.round(numericValue * 100) / 100))
+}
 
 export default function InterfaceSettings() {
-  const [selectedColorScheme, setSelectedColorScheme] = useState('Default');
+  const [selectedColorScheme, setSelectedColorScheme] = useState('Default')
+  const [uiScale, setUiScale] = useState(DEFAULT_UI_SCALE)
 
   useEffect(() => {
     window.electronAPI.getSettings().then(settings => {
-      setSelectedColorScheme(settings.colorScheme);
-      applyColorScheme(settings.colorScheme);
-    });
-  }, []);
+      const schemeName = settings.colorScheme || 'Default'
+      setSelectedColorScheme(schemeName)
+      setUiScale(normalizeUiScale(settings.uiScale))
+      applyColorScheme(schemeName)
+    })
+  }, [])
 
   useEffect(() => {
-    applyColorScheme(selectedColorScheme);
-  }, [selectedColorScheme]);
+    applyColorScheme(selectedColorScheme)
+  }, [selectedColorScheme])
 
-  const handleColorSchemeChange = (e) => {
-    const newScheme = e.target.value;
-    setSelectedColorScheme(newScheme);
-    window.electronAPI.setSetting(COLOR_SCHEME_KEY, newScheme);
-  };
+  const handleColorSchemeChange = (event) => {
+    const newScheme = event.target.value
+    setSelectedColorScheme(newScheme)
+    window.electronAPI.setSetting(COLOR_SCHEME_KEY, newScheme)
+  }
+
+  const persistUiScale = (value) => {
+    const nextScale = normalizeUiScale(value)
+    setUiScale(nextScale)
+    window.electronAPI.setSetting(UI_SCALE_KEY, nextScale)
+  }
+
+  const handleUiScaleChange = (event) => {
+    persistUiScale(event.target.value)
+  }
+
+  const handleUiScaleReset = () => {
+    persistUiScale(DEFAULT_UI_SCALE)
+  }
 
   return (
     <div className="settings-content-panel">

@@ -39,6 +70,32 @@ export default function InterfaceSettings() {
           ))}
         </select>
       </div>
+      <div className="setting-section">
+        <h3>UI Scale</h3>
+        <div className="setting-control-row">
+          <input
+            type="range"
+            className="range-input"
+            min={MIN_UI_SCALE}
+            max={MAX_UI_SCALE}
+            step={UI_SCALE_STEP}
+            value={uiScale}
+            onChange={handleUiScaleChange}
+          />
+          <span className="setting-value">{Math.round(uiScale * 100)}%</span>
+          <button
+            type="button"
+            className="button"
+            onClick={handleUiScaleReset}
+            disabled={uiScale === DEFAULT_UI_SCALE}
+          >
+            Reset
+          </button>
+        </div>
+        <p className="setting-description">
+          Scales the whole interface, including fonts, spacing, and controls. 100% is the default size.
+        </p>
+      </div>
     </div>
-  );
+  )
 }

src/LibraryManager.jsx (new file, 274 lines):
import React, { useEffect, useState } from 'react'
function statusLabel(job) {
if (!job) return null
const progress = typeof job.progress === 'number' ? `${job.progress.toFixed(0)}%` : null
const detail = job.detail ? ` ${job.detail}` : ''
return `${job.type} · ${job.status}${progress ? ` · ${progress}` : ''}${detail}`
}
export default function LibraryManager({
apiBase,
library,
jobs,
chatLibrarySlug,
onRefresh,
onToggleChatLibrary,
onDeleted
}) {
const [busy, setBusy] = useState(false)
const [isRenaming, setIsRenaming] = useState(false)
const [renameValue, setRenameValue] = useState('')
const [confirmDelete, setConfirmDelete] = useState(false)
const [errorMessage, setErrorMessage] = useState('')
useEffect(() => {
setIsRenaming(false)
setRenameValue(library?.name || '')
setConfirmDelete(false)
setErrorMessage('')
}, [library?.slug, library?.name])
async function expectOk(response) {
if (response.ok) return response
const detail = await response.text()
throw new Error(detail || `HTTP ${response.status}`)
}
async function runAction(fn) {
setBusy(true)
try {
setErrorMessage('')
await fn()
setConfirmDelete(false)
} finally {
setBusy(false)
await onRefresh()
}
}
async function addPaths() {
if (!library) return
const paths = await window.electronAPI?.pickPaths?.()
if (!Array.isArray(paths) || paths.length === 0) return
try {
await runAction(async () => {
const response = await fetch(`${apiBase}/libraries/${library.slug}/files/register`, {
method: 'POST',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({ paths })
})
await expectOk(response)
})
} catch (error) {
setErrorMessage(String(error?.message || error))
}
}
async function removeFile(rel) {
if (!library) return
try {
await runAction(async () => {
const response = await fetch(`${apiBase}/libraries/${library.slug}/files`, {
method: 'DELETE',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({ rel })
})
await expectOk(response)
})
} catch (error) {
setErrorMessage(String(error?.message || error))
}
}
async function renameLibrary() {
if (!library) return
const name = renameValue.trim()
if (!name || name === library.name) {
setIsRenaming(false)
setRenameValue(library.name || '')
return
}
await runAction(async () => {
const response = await fetch(`${apiBase}/libraries/${library.slug}`, {
method: 'PATCH',
headers: { 'Content-Type': 'application/json' },
body: JSON.stringify({ name })
})
await expectOk(response)
})
setIsRenaming(false)
}
async function deleteLibrary() {
if (!library) return
await runAction(async () => {
const response = await fetch(`${apiBase}/libraries/${library.slug}`, { method: 'DELETE' })
await expectOk(response)
})
onDeleted?.(library.slug)
}
async function startJob(kind) {
if (!library) return
try {
await runAction(async () => {
const endpoint = `${apiBase}/libraries/${library.slug}/jobs/${kind}`
const options = {
method: 'POST',
headers: { 'Content-Type': 'application/json' }
}
if (kind === 'embed') {
options.body = JSON.stringify({})
}
const response = await fetch(endpoint, options)
await expectOk(response)
})
} catch (error) {
setErrorMessage(String(error?.message || error))
}
}
if (!library) {
return (
<div className="placeholder-view">
<p>Create a database, add files or folders, then build and index it for local RAG.</p>
</div>
)
}
const activeJobs = (jobs || []).filter(job => job.slug === library.slug && (job.status === 'queued' || job.status === 'running'))
const usingInChat = chatLibrarySlug === library.slug
const canStartRename = () => {
setRenameValue(library.name || '')
setErrorMessage('')
setIsRenaming(true)
setConfirmDelete(false)
}
return (
<div className="library-panel">
{isRenaming && (
<div className="library-inline-form">
<input
type="text"
className="rename-input"
value={renameValue}
onChange={(e) => setRenameValue(e.target.value)}
onKeyDown={(e) => {
if (e.key === 'Enter') {
renameLibrary().catch((error) => setErrorMessage(String(error?.message || error)))
} else if (e.key === 'Escape') {
setIsRenaming(false)
setRenameValue(library.name || '')
}
}}
autoFocus
/>
<div className="new-db-actions">
<button
className="button"
disabled={busy}
onClick={() => renameLibrary().catch((error) => setErrorMessage(String(error?.message || error)))}
>
Save
</button>
<button
className="button ghost"
onClick={() => {
setIsRenaming(false)
setRenameValue(library.name || '')
}}
>
Cancel
</button>
</div>
</div>
)}
{confirmDelete && (
<div className="library-inline-form danger-zone">
<div className="muted-copy">Delete "{library.name}"? This removes the local index and metadata for this database.</div>
<div className="new-db-actions">
<button
className="button danger"
disabled={busy}
onClick={() => deleteLibrary().catch((error) => setErrorMessage(String(error?.message || error)))}
>
Confirm Delete
</button>
<button className="button ghost" onClick={() => setConfirmDelete(false)}>Cancel</button>
</div>
</div>
)}
{errorMessage && <div className="form-error">{errorMessage}</div>}
<div className="library-toolbar">
<button className="button" disabled={busy} onClick={addPaths}>Add Files</button>
<button className="button" disabled={busy || !library.files?.length} onClick={() => startJob('build')}>Build Corpus</button>
<button className="button" disabled={busy || !library.states?.has_corpus} onClick={() => startJob('enrich')}>Enrich</button>
<button className="button" disabled={busy || !library.states?.has_corpus} onClick={() => startJob('embed')}>Index</button>
<button className="button" onClick={canStartRename}>Rename</button>
<button className="button" onClick={() => onToggleChatLibrary(usingInChat ? null : library.slug)}>
{usingInChat ? 'Stop Using In Chat' : 'Use In Chat'}
</button>
<button
className="button danger"
onClick={() => {
setConfirmDelete(true)
setIsRenaming(false)
setErrorMessage('')
}}
>
Delete
</button>
</div>
<div className="library-states">
<div className={`state-pill ${library.states?.has_files ? 'ready' : ''}`}>Files: {library.files?.length || 0}</div>
<div className={`state-pill ${library.states?.has_corpus ? 'ready' : ''}`}>Corpus: {library.artifacts?.corpus_records || 0}</div>
<div className={`state-pill ${library.states?.is_enriched ? 'ready' : ''}`}>Enriched: {library.artifacts?.enhanced_records || 0}</div>
<div className={`state-pill ${library.states?.is_indexed ? 'ready' : ''}`}>Indexed</div>
</div>
{usingInChat && (
<div className="library-chat-note">
This database will be queried before each chat request and its context will be appended to the prompt.
</div>
)}
{activeJobs.length > 0 && (
<div className="library-jobs">
{activeJobs.map(job => (
<div key={job.id} className={`job-card ${job.status}`}>
{statusLabel(job)}
</div>
))}
</div>
)}
<div className="library-files">
<h2>Files</h2>
{library.files?.length ? (
<div className="library-file-list">
{library.files.map(file => (
<div key={file.sha256 || file.rel} className="library-file-row">
<div className="library-file-meta">
<div className="library-file-name">{file.name || file.path}</div>
<div className="library-file-path">{file.path}</div>
</div>
<div className="library-file-actions">
<button className="button ghost" onClick={() => window.electronAPI?.openPath?.(file.path)}>Open</button>
<button className="button ghost" onClick={() => removeFile(file.rel)}>Remove</button>
</div>
</div>
))}
</div>
) : (
<p className="muted-copy">No files registered yet.</p>
)}
</div>
</div>
)
}
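The `statusLabel` helper at the top of LibraryManager.jsx composes the job-card text shown while a build, enrich, or embed job runs. Copied standalone (nothing assumed beyond the job shape used above), it formats like this:

```javascript
// Standalone copy of statusLabel from LibraryManager.jsx: renders a job as
// "type · status[ · NN%][ detail]", omitting any part that is missing.
function statusLabel(job) {
  if (!job) return null
  const progress = typeof job.progress === 'number' ? `${job.progress.toFixed(0)}%` : null
  const detail = job.detail ? ` ${job.detail}` : ''
  return `${job.type} · ${job.status}${progress ? ` · ${progress}` : ''}${detail}`
}

console.log(statusLabel({ type: 'embed', status: 'running', progress: 42.4 })) // embed · running · 42%
console.log(statusLabel({ type: 'build', status: 'queued' }))                  // build · queued
```

Note the `typeof job.progress === 'number'` check: a progress of `0` still renders as `0%`, which a simple truthiness test would silently drop.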

src/colorSchemes.js

@@ -1,5 +1,18 @@
 const colorSchemes = {
-  'Nightsky': {
+  Default: {
+    '--bg': '#0b1020',
+    '--panel': '#141b34',
+    '--text': '#e6e8ef',
+    '--muted': '#9aa3b2',
+    '--accent': '#6ea8fe',
+    '--border': '#24304f',
+    '--input-bg': '#121933',
+    '--user-msg-bg': '#18213d',
+    '--assistant-msg-bg': '#10172d',
+    '--active-bg': 'rgba(110, 168, 254, 0.16)',
+    '--hover-bg': 'rgba(255, 255, 255, 0.06)',
+  },
+  Nightsky: {
     '--bg': '#0a0e1a',
     '--panel': '#18203a',
     '--text': '#ffffff',

@@ -12,7 +25,7 @@ const colorSchemes = {
     '--active-bg': 'rgba(74, 144, 226, 0.15)',
     '--hover-bg': 'rgba(255, 255, 255, 0.05)',
   },
-  'Grayscale': {
+  Grayscale: {
     '--bg': '#1a1a1a',
     '--panel': '#2a2a2a',
     '--text': '#f0f0f0',

@@ -25,33 +38,33 @@ const colorSchemes = {
     '--active-bg': 'rgba(136, 136, 136, 0.15)',
     '--hover-bg': 'rgba(255, 255, 255, 0.05)',
   },
-  'Japan': {
+  Japan: {
     '--bg': '#ffffff',
     '--panel': '#f5f5f5',
     '--text': '#000000',
     '--muted': '#444444',
-    '--accent': '#e74c3c', /* Vibrant Red */
+    '--accent': '#e74c3c',
     '--border': '#999999',
     '--input-bg': '#ffffff',
     '--user-msg-bg': '#f0f0f0',
     '--assistant-msg-bg': '#f0f0f0',
-    '--active-bg': 'rgba(231, 76, 60, 0.15)', /* Light red for active */
-    '--hover-bg': 'rgba(231, 76, 60, 0.08)', /* Lighter red for hover */
+    '--active-bg': 'rgba(231, 76, 60, 0.15)',
+    '--hover-bg': 'rgba(231, 76, 60, 0.08)',
   },
-  'Lime': {
+  Lime: {
     '--bg': '#f0fff0',
     '--panel': '#e0ffe0',
     '--text': '#1a1a1a',
     '--muted': '#72a272ff',
-    '--accent': '#deef88',
+    '--accent': '#8e9f38ff',
     '--border': '#a0c0a0',
     '--input-bg': '#ffffff',
-    '--user-msg-bg': '#f8f7ad',
+    '--user-msg-bg': '#f8f7adff',
     '--assistant-msg-bg': '#f5fff5',
     '--active-bg': 'rgba(104, 159, 56, 0.2)',
     '--hover-bg': 'rgba(104, 159, 56, 0.1)',
   },
-  'Vampire': {
+  Vampire: {
     '--bg': '#1a050a',
     '--panel': '#2a1015',
     '--text': '#ffefff',

@@ -64,15 +77,80 @@ const colorSchemes = {
     '--active-bg': 'rgba(216, 27, 96, 0.15)',
     '--hover-bg': 'rgba(255, 255, 255, 0.05)',
   },
-};
+  'Sunset Drive': {
+    '--bg': '#1f1024',
+    '--panel': '#2e1632',
+    '--text': '#fff2ea',
+    '--muted': '#caa8b7',
+    '--accent': '#ff8a5b',
+    '--border': '#593050',
+    '--input-bg': '#26132a',
+    '--user-msg-bg': '#442038',
+    '--assistant-msg-bg': '#32172c',
+    '--active-bg': 'rgba(255, 138, 91, 0.18)',
+    '--hover-bg': 'rgba(255, 138, 91, 0.08)',
+  },
+  'Aurora Pulse': {
+    '--bg': '#07171d',
+    '--panel': '#102730',
+    '--text': '#eafcff',
+    '--muted': '#9bc8cf',
+    '--accent': '#54f2c2',
+    '--border': '#214853',
+    '--input-bg': '#0b2028',
+    '--user-msg-bg': '#12313d',
+    '--assistant-msg-bg': '#0f2530',
+    '--active-bg': 'rgba(84, 242, 194, 0.18)',
+    '--hover-bg': 'rgba(84, 242, 194, 0.08)',
+  },
+  'Sakura Neon': {
+    '--bg': '#160b1d',
+    '--panel': '#251331',
+    '--text': '#fff5fd',
+    '--muted': '#d4abc7',
+    '--accent': '#ff4fb6',
+    '--border': '#52315f',
+    '--input-bg': '#1d1027',
+    '--user-msg-bg': '#341844',
+    '--assistant-msg-bg': '#281534',
+    '--active-bg': 'rgba(255, 79, 182, 0.18)',
+    '--hover-bg': 'rgba(255, 79, 182, 0.09)',
+  },
+  'Cobalt Punch': {
+    '--bg': '#081527',
+    '--panel': '#102643',
+    '--text': '#eef6ff',
+    '--muted': '#9fb7d0',
+    '--accent': '#ffb703',
+    '--border': '#234164',
+    '--input-bg': '#0d1f37',
+    '--user-msg-bg': '#162f54',
+    '--assistant-msg-bg': '#102640',
+    '--active-bg': 'rgba(255, 183, 3, 0.18)',
+    '--hover-bg': 'rgba(255, 183, 3, 0.08)',
+  },
+  'Mango Mojito': {
+    '--bg': '#fff7ea',
+    '--panel': '#ffe9c8',
+    '--text': '#2a1c13',
+    '--muted': '#7c6150',
+    '--accent': '#ff6b35',
+    '--border': '#e6bf91',
+    '--input-bg': '#fffdf9',
+    '--user-msg-bg': '#fff0d7',
+    '--assistant-msg-bg': '#fff8ed',
+    '--active-bg': 'rgba(255, 107, 53, 0.14)',
+    '--hover-bg': 'rgba(255, 107, 53, 0.08)',
+  },
+}
 
 function applyColorScheme(schemeName) {
-  const scheme = colorSchemes[schemeName];
-  if (scheme) {
-    for (const [key, value] of Object.entries(scheme)) {
-      document.documentElement.style.setProperty(key, value);
-    }
-  }
+  const scheme = colorSchemes[schemeName] || colorSchemes.Default
+  if (!scheme) return
+
+  for (const [key, value] of Object.entries(scheme)) {
+    document.documentElement.style.setProperty(key, value)
+  }
 }
 
-export { colorSchemes, applyColorScheme };
+export { colorSchemes, applyColorScheme }
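One consequence of the `|| colorSchemes.Default` fallback in `applyColorScheme` is that a saved setting naming a scheme that no longer exists degrades gracefully instead of leaving the previous palette applied. A minimal sketch of just the lookup rule, using a stub scheme table so it runs without a DOM (only the two accent values are taken from the diff; the rest of each palette is omitted):

```javascript
// Illustrative stub of the fallback lookup used by applyColorScheme:
// an unknown scheme name resolves to the Default palette.
const colorSchemes = {
  Default: { '--accent': '#6ea8fe' },
  Nightsky: { '--accent': '#4a90e2' },
}

function resolveScheme(schemeName) {
  return colorSchemes[schemeName] || colorSchemes.Default
}

console.log(resolveScheme('Nightsky')['--accent'])     // #4a90e2
console.log(resolveScheme('NoSuchScheme')['--accent']) // #6ea8fe — falls back to Default
```

With this fallback in place, the `if (!scheme) return` guard in `applyColorScheme` appears to be unreachable as long as the `Default` entry exists; it only matters if that entry is ever removed.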

src/main.jsx

@@ -9,9 +9,7 @@ import { applyColorScheme } from './colorSchemes'
 function Main() {
   useEffect(() => {
     window.electronAPI.getSettings().then(settings => {
-      if (settings.colorScheme) {
-        applyColorScheme(settings.colorScheme)
-      }
+      applyColorScheme(settings.colorScheme || 'Default')
     })
   }, [])

src/styles.css

@@ -245,6 +245,22 @@ body { background: var(--bg); color: var(--text); font-family: ui-sans-serif, sy
   background: var(--panel);
 }
 
+.new-db-form,
+.library-inline-form {
+  display: grid;
+  gap: 8px;
+}
+
+.new-db-actions {
+  display: flex;
+  gap: 8px;
+}
+
+.form-error {
+  color: #ff9aa8;
+  font-size: 12px;
+}
+
 .new-chat-button {
   width: 100%;
   padding: 10px;

@@ -291,6 +307,13 @@ body { background: var(--bg); color: var(--text); font-family: ui-sans-serif, sy
 .select { min-width: 220px; }
 .button { cursor: pointer; }
 .button:hover { border-color: var(--accent); }
+.button.ghost { background: transparent; }
+.button.danger { border-color: #8f3d49; color: #ffb8c2; }
+.button.danger:hover { border-color: #d86a79; }
+
+.header-subtle {
+  color: var(--muted);
+  font-size: 13px;
+}
 
 .chat {
   display: grid;

@@ -510,6 +533,30 @@ textarea.input {
   min-width: unset;
 }
 
+.setting-control-row {
+  display: flex;
+  align-items: center;
+  gap: 12px;
+  flex-wrap: wrap;
+}
+
+.range-input {
+  width: min(360px, 100%);
+  accent-color: var(--accent);
+}
+
+.setting-value {
+  min-width: 48px;
+  color: var(--text);
+  font-variant-numeric: tabular-nums;
+}
+
+.setting-description {
+  margin: 10px 0 0;
+  color: var(--muted);
+  line-height: 1.5;
+}
+
 /* Markdown Styles */
 .msg h1, .msg h2, .msg h3, .msg h4 {
   margin: 10px 0;

@@ -972,3 +1019,113 @@ input:checked + .slider:before {
   white-space: nowrap;
   margin-top: 0.5rem;
 }
+
+.db-active-badge {
+  margin-left: 8px;
+  padding: 2px 8px;
+  border-radius: 999px;
+  background: color-mix(in srgb, var(--accent) 20%, transparent);
+  color: var(--accent);
+  font-size: 11px;
+}
+
+.placeholder-view,
+.library-panel {
+  overflow: auto;
+  padding: 20px;
+}
+
+.placeholder-view h1 {
+  margin-top: 0;
+}
+
+.library-toolbar {
+  display: flex;
+  flex-wrap: wrap;
+  gap: 10px;
+  margin-bottom: 18px;
+}
+
+.library-states {
+  display: flex;
+  flex-wrap: wrap;
+  gap: 10px;
+  margin-bottom: 14px;
+}
+
+.state-pill {
+  padding: 6px 10px;
+  border-radius: 999px;
+  border: 1px solid var(--border);
+  color: var(--muted);
+  font-size: 13px;
+}
+
+.state-pill.ready {
+  color: var(--text);
+  border-color: color-mix(in srgb, var(--accent) 45%, var(--border));
+}
+
+.library-chat-note,
+.job-card {
+  margin-bottom: 12px;
+  padding: 12px 14px;
+  border-radius: 12px;
+  background: color-mix(in srgb, var(--panel) 82%, black);
+  border: 1px solid var(--border);
+}
+
+.library-inline-form {
+  margin-bottom: 14px;
+  padding: 12px 14px;
+  border-radius: 12px;
+  border: 1px solid var(--border);
+  background: color-mix(in srgb, var(--panel) 88%, black);
+}
+
+.danger-zone {
+  border-color: #8f3d49;
+}
+
+.library-files h2 {
+  margin: 18px 0 12px;
+  font-size: 16px;
+}
+
+.library-file-list {
+  display: grid;
+  gap: 10px;
+}
+
+.library-file-row {
+  display: flex;
+  justify-content: space-between;
+  gap: 14px;
+  align-items: flex-start;
+  padding: 12px 14px;
+  border-radius: 12px;
+  border: 1px solid var(--border);
+  background: color-mix(in srgb, var(--panel) 88%, black);
+}
+
+.library-file-meta {
+  min-width: 0;
+}
+
+.library-file-name {
+  font-weight: 600;
+  margin-bottom: 4px;
+}
+
+.library-file-path,
+.muted-copy {
+  color: var(--muted);
+  font-size: 13px;
+  word-break: break-word;
+}
+
+.library-file-actions {
+  display: flex;
+  gap: 8px;
+  flex-shrink: 0;
+}

Binary files not shown.