initial commit

2026-03-15 14:51:29 +01:00
commit 94051dd0f8
20 changed files with 2842 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -0,0 +1,113 @@
+# YouTube Summarizer
+
+This is a local-first desktop app for summarizing YouTube videos with Ollama.
+
+It uses:
+
+- Tauri for the desktop shell
+- a bundled Python backend for transcript/audio processing in release builds
+- Ollama on `localhost` for summarization and translation
+- SQLite for local history
+
+## What It Does
+
+Given a YouTube URL, the app can:
+
+- fetch a transcript via the YouTube transcript API or via Whisper
+- generate an English summary with a local Ollama model
+- optionally translate that summary into German and Japanese
+- store the results locally so they can be reopened later
+
+## Local-Only Behavior
+
+This repository is intentionally reset to a clean publishable state:
+
+- no Discord webhook integration
+- no remote PHP/MySQL sync
+- no bundled production data or pre-filled database
+- runtime data is stored in the OS app data directory, not in the repo
+
+## End User Requirements
+
+If you ship a built installer, the user should only need:
+
+- Ollama installed locally
+- the Ollama model they want to use pulled locally
+
+Notes:
+
+- The installer is designed to bundle the backend helper plus `ffmpeg` / `ffprobe`.
+- Whisper model weights are not bundled; the selected Whisper model is downloaded on first use and then cached locally.
+
+## Developer Requirements
+
+For development in this repo you still need:
+
+- Python 3.8+
+- Rust/Cargo
+- FFmpeg in `PATH`
+- Ollama running locally on `http://localhost:11434`
+
+Python dependencies are listed in [requirements.txt](/Users/giers/youtube_summarizer/requirements.txt).
+
+## Run In Development
+
+macOS/Linux:
+
+```bash
+./run.sh
+```
+
+Windows:
+
+```bat
+run.bat
+```
+
+Or directly:
+
+```bash
+python3 -m venv venv
+source venv/bin/activate
+pip install -r requirements.txt
+cargo run --manifest-path src-tauri/Cargo.toml
+```
+
+The app prefers a bundled backend executable when one is present under [src-tauri/resources/backend](/Users/giers/youtube_summarizer/src-tauri/resources/backend), and otherwise falls back to the local Python environment for development.
+
+## Build A Shippable Bundle
+
+1. Make sure the build machine has Python, Rust/Cargo, and `ffmpeg` / `ffprobe` available on `PATH`.
+2. Run:
+
+```bash
+python3 tools/prepare_bundle.py
+```
+
+3. Then build the installer:
+
+```bash
+cargo tauri build
+```
+
+What `tools/prepare_bundle.py` does:
+
+- installs PyInstaller into the current Python environment
+- builds a single-file backend executable from [backend_cli.py](/Users/giers/youtube_summarizer/backend_cli.py)
+- copies that executable into [src-tauri/resources/backend](/Users/giers/youtube_summarizer/src-tauri/resources/backend)
+- copies `ffmpeg` and `ffprobe` from the build machine into [src-tauri/resources/ffmpeg](/Users/giers/youtube_summarizer/src-tauri/resources/ffmpeg)
+
+Build once on each target OS you want to ship. For Windows 10, build on Windows.
+
+## Build On GitHub Actions
+
+A Windows build workflow is included at [.github/workflows/windows-installer.yml](/Users/giers/youtube_summarizer/.github/workflows/windows-installer.yml).
+
+It runs on `windows-latest`, installs `ffmpeg` and NSIS, prepares the bundled Python backend with [tools/prepare_bundle.py](/Users/giers/youtube_summarizer/tools/prepare_bundle.py), builds an NSIS installer, and uploads the result as a workflow artifact named `windows-installer`.
+
+## Notes
+
+- If Python is not on your `PATH` for development, set `YTS_PYTHON` to the interpreter you want the Tauri backend to use.
+- If you want to test a prebuilt backend executable during development, set `YTS_BACKEND_BIN` to its full path.
+- If `ffmpeg` or `ffprobe` are not on `PATH` during bundle prep, set `YTS_FFMPEG` and `YTS_FFPROBE` to their full paths before running [tools/prepare_bundle.py](/Users/giers/youtube_summarizer/tools/prepare_bundle.py).
+- Generated thumbnails and the SQLite database are created on first run in the app's local data directory.
--- a/backend_cli.py
+++ b/backend_cli.py
@@ -0,0 +1,93 @@
+#!/usr/bin/env python3
+"""
+Single CLI entrypoint for the bundled summarizer backend.
+
+This wrapper lets the Tauri app launch one helper executable in production
+while still supporting direct Python execution during development.
+"""
+
+import argparse
+import json
+import sys
+from pathlib import Path
+
+from translate_summary import translate_summary_text
+from youtube_summarizer import process_video
+
+
+DEFAULT_MODEL = "mistral:latest"
+
+
+def configure_stdio() -> None:
+    """Keep progress output line-buffered for the desktop app."""
+    if hasattr(sys.stdout, "reconfigure"):
+        sys.stdout.reconfigure(line_buffering=True)
+    if hasattr(sys.stderr, "reconfigure"):
+        sys.stderr.reconfigure(line_buffering=True)
+
+
+def summarize(args: argparse.Namespace) -> int:
+    meta = process_video(
+        args.url,
+        use_whisper=args.use_whisper,
+        model=args.model,
+        output_json=args.output_json,
+    )
+    if not args.output_json:
+        print(json.dumps(meta, ensure_ascii=False), flush=True)
+    return 0
+
+
+def translate(args: argparse.Namespace) -> int:
+    summary_path = Path(args.summary_file)
+    summary_text = summary_path.read_text(encoding="utf-8").strip()
+    if not summary_text:
+        raise SystemExit("Empty summary text!")
+
+    translation = translate_summary_text(summary_text, args.lang, args.model)
+
+    if args.output_file:
+        Path(args.output_file).write_text(translation, encoding="utf-8")
+    else:
+        print(translation, flush=True)
+    return 0
+
+
+def build_parser() -> argparse.ArgumentParser:
+    parser = argparse.ArgumentParser(description="Bundled backend for YouTube Summarizer")
+    subparsers = parser.add_subparsers(dest="command", required=True)
+
+    summarize_parser = subparsers.add_parser("summarize", help="Summarize a YouTube video")
+    summarize_parser.add_argument("--url", required=True, help="YouTube video URL")
+    summarize_parser.add_argument("--model", default=DEFAULT_MODEL, help="Ollama model to use")
+    summarize_parser.add_argument(
+        "--no-whisper",
+        dest="use_whisper",
+        action="store_false",
+        help="Use transcript/subtitle workflows instead of Whisper",
+    )
+    summarize_parser.add_argument(
+        "--output-json",
+        help="Write the result metadata to a JSON file instead of stdout",
+    )
+    summarize_parser.set_defaults(use_whisper=True, handler=summarize)
+
+    translate_parser = subparsers.add_parser("translate", help="Translate an English summary")
+    translate_parser.add_argument("--summary-file", required=True, help="Path to the English summary text")
+    translate_parser.add_argument("--lang", required=True, choices=["de", "jp"], help="Target language")
+    translate_parser.add_argument("--model", default=DEFAULT_MODEL, help="Ollama model to use")
+    translate_parser.add_argument("--output-file", help="Optional path to write the translated text")
+    translate_parser.set_defaults(handler=translate)
+
+    return parser
+
+
+def main() -> int:
+    configure_stdio()
+    parser = build_parser()
+    args = parser.parse_args()
+    return args.handler(args)
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())
--- a/icon.png
+++ b/icon.png
--- a/requirements.txt
+++ b/requirements.txt
@@ -0,0 +1,5 @@
+requests
+yt-dlp
+webvtt-py
+youtube-transcript-api
+openai-whisper
--- a/run.bat
+++ b/run.bat
@@ -0,0 +1,27 @@
+@echo off
+setlocal
+
+REM 1. Prüfen, ob venv existiert, sonst erstellen
+if not exist venv (
+    echo Erstelle Python venv...
+    python -m venv venv
+)
+
+REM 2. venv aktivieren
+echo Aktiviere venv...
+call venv\Scripts\activate
+
+REM 3. Python-Abhängigkeiten installieren
+echo Installiere Python requirements...
+pip install --upgrade pip
+pip install -r requirements.txt
+
+REM 4. Tauri App starten
+echo Starte die Tauri App...
+cargo run --manifest-path src-tauri/Cargo.toml
+
+REM 6. Deaktivieren (optional)
+deactivate
+
+endlocal
+pause
--- a/run.sh
+++ b/run.sh
@@ -0,0 +1,25 @@
+#!/usr/bin/env bash
+set -e
+
+# 1. Python venv einrichten
+GREEN="\033[0;32m"
+CYAN="\033[0;36m"
+NC="\033[0m" # No Color
+echo -e "${CYAN}1. Python venv einrichten …${NC}"
+if [ ! -d "venv" ]; then
+    python3 -m venv venv
+fi
+
+# 2. venv aktivieren
+echo -e "${CYAN}2. Aktiviere venv …${NC}"
+source venv/bin/activate
+
+# 3. Python-Abhängigkeiten installieren
+echo -e "${CYAN}3. Python-Abhängigkeiten installieren …${NC}"
+pip install --upgrade pip
+pip install -r requirements.txt
+pip install --upgrade yt-dlp
+
+# 4. Tauri App starten
+echo -e "${CYAN}4. Starte die Tauri App …${NC}"
+cargo run --manifest-path src-tauri/Cargo.toml
--- a/src-tauri/Cargo.toml
+++ b/src-tauri/Cargo.toml
@@ -0,0 +1,18 @@
+[package]
+name = "youtube-summarizer"
+version = "1.0.0"
+description = "A local-first desktop tool for summarizing YouTube videos"
+authors = ["Victor Giers <mail@victorgiers.com>"]
+edition = "2021"
+
+[build-dependencies]
+tauri-build = { version = "2.5.6", features = [] }
+
+[dependencies]
+open = "5.3.3"
+reqwest = { version = "0.12.24", default-features = false, features = ["blocking", "json", "rustls-tls"] }
+rusqlite = { version = "0.37.0", features = ["bundled"] }
+serde = { version = "1.0", features = ["derive"] }
+serde_json = "1.0"
+tauri = { version = "2.10.3", features = ["protocol-asset"] }
+tauri-plugin-dialog = "2.6.0"
--- a/src-tauri/build.rs
+++ b/src-tauri/build.rs
@@ -0,0 +1,7 @@
+fn main() {
+    println!(
+        "cargo:rustc-env=TAURI_BUILD_TARGET={}",
+        std::env::var("TARGET").expect("TARGET not set by cargo")
+    );
+    tauri_build::build();
+}
--- a/src-tauri/capabilities/default.json
+++ b/src-tauri/capabilities/default.json
@@ -0,0 +1,6 @@
+{
+  "identifier": "default",
+  "description": "Default capability set for the main window.",
+  "windows": ["main"],
+  "permissions": ["core:default", "dialog:allow-confirm"]
+}
--- a/src-tauri/icons/icon.png
+++ b/src-tauri/icons/icon.png
--- a/src-tauri/resources/backend/.gitkeep
+++ b/src-tauri/resources/backend/.gitkeep
@@ -0,0 +1 @@
+
--- a/src-tauri/resources/ffmpeg/.gitkeep
+++ b/src-tauri/resources/ffmpeg/.gitkeep
@@ -0,0 +1 @@
+
--- a/src-tauri/src/main.rs
+++ b/src-tauri/src/main.rs
@@ -0,0 +1,755 @@
+#![cfg_attr(not(debug_assertions), windows_subsystem = "windows")]
+
+use std::{
+    env, fs,
+    io::{BufRead, BufReader, ErrorKind},
+    path::{Path, PathBuf},
+    process::{Command, Stdio},
+    sync::{Arc, Mutex},
+    thread,
+    time::{SystemTime, UNIX_EPOCH},
+};
+
+use open::that;
+use reqwest::blocking::Client;
+use rusqlite::{params, Connection, OptionalExtension};
+use serde::{Deserialize, Serialize};
+use tauri::{AppHandle, Emitter, Manager, State, WebviewWindow};
+
+const DEFAULT_MODEL: &str = "mistral:latest";
+const OLLAMA_TAGS_URL: &str = "http://localhost:11434/api/tags";
+const BACKEND_EXECUTABLE_NAME: &str = "yts-backend";
+const TARGET_TRIPLE: &str = env!("TAURI_BUILD_TARGET");
+
+#[derive(Clone)]
+enum BackendRuntime {
+    Bundled {
+        executable: PathBuf,
+    },
+    Python {
+        python: PathBuf,
+        script_dir: PathBuf,
+    },
+}
+
+#[derive(Clone)]
+struct AppState {
+    app_dir: PathBuf,
+    media_dir: PathBuf,
+    db_path: PathBuf,
+    backend: BackendRuntime,
+    ffmpeg_path: Option<PathBuf>,
+    ffprobe_path: Option<PathBuf>,
+    whisper_cache_dir: PathBuf,
+}
+
+#[derive(Debug, Deserialize)]
+#[serde(rename_all = "camelCase")]
+struct SummarizeVideoRequest {
+    url: String,
+    use_whisper: bool,
+    model: Option<String>,
+}
+
+#[derive(Debug, Deserialize)]
+struct DeleteSummaryRequest {
+    id: i64,
+}
+
+#[derive(Debug, Deserialize)]
+struct TranslateSummaryRequest {
+    id: i64,
+    lang: String,
+    model: Option<String>,
+}
+
+#[derive(Debug, Deserialize)]
+struct BackendSummaryMeta {
+    timestamp: String,
+    video_id: String,
+    url: String,
+    video_name: String,
+    channel: Option<String>,
+    thumbnail: Option<String>,
+    audio: Option<String>,
+    transcript: Option<String>,
+    summary: String,
+}
+
+#[derive(Debug, Deserialize)]
+struct OllamaTagsResponse {
+    models: Vec<OllamaModel>,
+}
+
+#[derive(Debug, Deserialize)]
+struct OllamaModel {
+    name: String,
+}
+
+#[derive(Debug)]
+struct StoredSummary {
+    id: i64,
+    timestamp: Option<String>,
+    video_id: Option<String>,
+    url: Option<String>,
+    video_name: Option<String>,
+    channel: Option<String>,
+    thumbnail: Option<String>,
+    audio: Option<String>,
+    transcript: Option<String>,
+    summary_en: Option<String>,
+    summary_de: Option<String>,
+    summary_jp: Option<String>,
+}
+
+#[derive(Debug, Serialize)]
+struct SummaryEntry {
+    id: i64,
+    timestamp: Option<String>,
+    video_id: Option<String>,
+    url: Option<String>,
+    video_name: Option<String>,
+    channel: Option<String>,
+    thumbnail: Option<String>,
+    audio: Option<String>,
+    transcript: Option<String>,
+    summary_en: Option<String>,
+    summary_de: Option<String>,
+    summary_jp: Option<String>,
+}
+
+impl StoredSummary {
+    fn from_row(row: &rusqlite::Row<'_>) -> rusqlite::Result<Self> {
+        Ok(Self {
+            id: row.get("id")?,
+            timestamp: row.get("timestamp")?,
+            video_id: row.get("video_id")?,
+            url: row.get("url")?,
+            video_name: row.get("video_name")?,
+            channel: row.get("channel")?,
+            thumbnail: row.get("thumbnail")?,
+            audio: row.get("audio")?,
+            transcript: row.get("transcript")?,
+            summary_en: row.get("summary_en")?,
+            summary_de: row.get("summary_de")?,
+            summary_jp: row.get("summary_jp")?,
+        })
+    }
+
+    fn into_entry(self, state: &AppState) -> SummaryEntry {
+        SummaryEntry {
+            id: self.id,
+            timestamp: self.timestamp,
+            video_id: self.video_id,
+            url: self.url,
+            video_name: self.video_name,
+            channel: self.channel,
+            thumbnail: absolute_media_path(state, self.thumbnail),
+            audio: absolute_media_path(state, self.audio),
+            transcript: absolute_media_path(state, self.transcript),
+            summary_en: self.summary_en,
+            summary_de: self.summary_de,
+            summary_jp: self.summary_jp,
+        }
+    }
+}
+
+fn absolute_media_path(state: &AppState, file_name: Option<String>) -> Option<String> {
+    file_name.map(|name| state.media_dir.join(name).to_string_lossy().into_owned())
+}
+
+fn normalize_model(model: Option<String>) -> String {
+    model
+        .map(|value| value.trim().to_string())
+        .filter(|value| !value.is_empty())
+        .unwrap_or_else(|| DEFAULT_MODEL.to_string())
+}
+
+fn now_millis() -> u128 {
+    SystemTime::now()
+        .duration_since(UNIX_EPOCH)
+        .unwrap_or_default()
+        .as_millis()
+}
+
+fn resolve_project_root() -> Result<PathBuf, String> {
+    PathBuf::from(env!("CARGO_MANIFEST_DIR"))
+        .join("..")
+        .canonicalize()
+        .map_err(|err| format!("Failed to resolve project root: {err}"))
+}
+
+fn platform_executable_name(base_name: &str) -> String {
+    if cfg!(windows) {
+        format!("{base_name}.exe")
+    } else {
+        base_name.to_string()
+    }
+}
+
+fn resolve_resource_file(app: &AppHandle, relative_path: &Path) -> Option<PathBuf> {
+    let mut candidates = Vec::new();
+
+    if let Ok(resource_dir) = app.path().resource_dir() {
+        candidates.push(resource_dir.join(relative_path));
+    }
+
+    candidates.push(
+        PathBuf::from(env!("CARGO_MANIFEST_DIR"))
+            .join("resources")
+            .join(relative_path),
+    );
+
+    candidates.into_iter().find(|path| path.exists())
+}
+
+fn resolve_backend_binary(app: &AppHandle) -> Option<PathBuf> {
+    if let Ok(path) = env::var("YTS_BACKEND_BIN") {
+        let trimmed = path.trim();
+        if !trimmed.is_empty() {
+            return Some(PathBuf::from(trimmed));
+        }
+    }
+
+    let relative_path = Path::new("backend")
+        .join(TARGET_TRIPLE)
+        .join(platform_executable_name(BACKEND_EXECUTABLE_NAME));
+    resolve_resource_file(app, &relative_path)
+}
+
+fn resolve_script_dir(app: &AppHandle) -> Result<PathBuf, String> {
+    if let Ok(resource_dir) = app.path().resource_dir() {
+        if resource_dir.join("backend_cli.py").exists() {
+            return Ok(resource_dir);
+        }
+    }
+
+    let project_dir = resolve_project_root()?;
+    if project_dir.join("backend_cli.py").exists() {
+        return Ok(project_dir);
+    }
+
+    Err("Unable to locate bundled or development backend Python scripts.".to_string())
+}
+
+fn resolve_python_command(script_dir: &Path) -> Result<PathBuf, String> {
+    if let Ok(path) = env::var("YTS_PYTHON") {
+        let trimmed = path.trim();
+        if !trimmed.is_empty() {
+            return Ok(PathBuf::from(trimmed));
+        }
+    }
+
+    let mut candidates = Vec::new();
+    candidates.push(script_dir.join("venv").join("bin").join("python3"));
+    candidates.push(script_dir.join("venv").join("bin").join("python"));
+    candidates.push(script_dir.join("venv").join("Scripts").join("python.exe"));
+    candidates.push(PathBuf::from("python3"));
+    candidates.push(PathBuf::from("python"));
+
+    for candidate in candidates {
+        if Command::new(&candidate).arg("--version").output().is_ok() {
+            return Ok(candidate);
+        }
+    }
+
+    Err("Unable to find a usable Python interpreter. Set YTS_PYTHON to override.".to_string())
+}
+
+fn resolve_backend_runtime(app: &AppHandle) -> Result<BackendRuntime, String> {
+    if let Some(executable) = resolve_backend_binary(app) {
+        return Ok(BackendRuntime::Bundled { executable });
+    }
+
+    let script_dir = resolve_script_dir(app)?;
+    let python = resolve_python_command(&script_dir)?;
+    Ok(BackendRuntime::Python { python, script_dir })
+}
+
+fn resolve_optional_tool_path(app: &AppHandle, env_name: &str, tool_name: &str) -> Option<PathBuf> {
+    if let Ok(path) = env::var(env_name) {
+        let trimmed = path.trim();
+        if !trimmed.is_empty() {
+            return Some(PathBuf::from(trimmed));
+        }
+    }
+
+    let relative_path = Path::new("ffmpeg")
+        .join(TARGET_TRIPLE)
+        .join(platform_executable_name(tool_name));
+    resolve_resource_file(app, &relative_path)
+}
+
+fn resolve_whisper_cache_dir(app: &AppHandle) -> Result<PathBuf, String> {
+    let cache_root = app
+        .path()
+        .app_cache_dir()
+        .or_else(|_| app.path().app_local_data_dir())
+        .map_err(|err| format!("Failed to resolve application cache directory: {err}"))?;
+    let whisper_cache_dir = cache_root.join("whisper");
+    fs::create_dir_all(&whisper_cache_dir)
+        .map_err(|err| format!("Failed to create Whisper cache directory: {err}"))?;
+    Ok(whisper_cache_dir)
+}
+
+fn open_connection(state: &AppState) -> Result<Connection, String> {
+    Connection::open(&state.db_path).map_err(|err| format!("Failed to open SQLite database: {err}"))
+}
+
+fn init_db(state: &AppState) -> Result<(), String> {
+    let db = open_connection(state)?;
+    db.execute_batch(
+        r#"
+      CREATE TABLE IF NOT EXISTS summaries (
+        id INTEGER PRIMARY KEY AUTOINCREMENT,
+        timestamp TEXT,
+        video_id TEXT,
+        url TEXT,
+        video_name TEXT,
+        channel TEXT,
+        thumbnail TEXT,
+        audio TEXT,
+        transcript TEXT,
+        summary_en TEXT,
+        summary_de TEXT,
+        summary_jp TEXT
+      );
+    "#,
+    )
+    .map_err(|err| format!("Failed to initialize SQLite schema: {err}"))?;
+    Ok(())
+}
+
+fn remove_named_media_file(media_dir: &Path, file_name: &str) {
+    let path = media_dir.join(file_name);
+    if let Err(err) = fs::remove_file(&path) {
+        if err.kind() != ErrorKind::NotFound {
+            eprintln!("Failed to remove {}: {}", path.display(), err);
+        }
+    }
+}
+
+fn cleanup_artifacts(state: &AppState, audio: Option<&str>, transcript: Option<&str>) {
+    if let Some(audio_file) = audio.filter(|value| !value.trim().is_empty()) {
+        remove_named_media_file(&state.media_dir, audio_file);
+    }
+    if let Some(transcript_file) = transcript.filter(|value| !value.trim().is_empty()) {
+        remove_named_media_file(&state.media_dir, transcript_file);
+    }
+}
+
+fn purge_existing_artifacts(state: &AppState) -> Result<(), String> {
+    let db = open_connection(state)?;
+    let mut stmt = db
+    .prepare("SELECT id, audio, transcript FROM summaries WHERE audio IS NOT NULL OR transcript IS NOT NULL")
+    .map_err(|err| format!("Failed to prepare artifact cleanup query: {err}"))?;
+
+    let rows = stmt
+        .query_map([], |row| {
+            Ok((
+                row.get::<_, i64>(0)?,
+                row.get::<_, Option<String>>(1)?,
+                row.get::<_, Option<String>>(2)?,
+            ))
+        })
+        .map_err(|err| format!("Failed to load stored artifacts: {err}"))?;
+
+    let mut entries = Vec::new();
+    for row in rows {
+        entries.push(row.map_err(|err| format!("Failed to decode stored artifact row: {err}"))?);
+    }
+    drop(stmt);
+
+    for (id, audio, transcript) in entries {
+        cleanup_artifacts(state, audio.as_deref(), transcript.as_deref());
+        db.execute(
+            "UPDATE summaries SET audio = NULL, transcript = NULL WHERE id = ?",
+            [id],
+        )
+        .map_err(|err| format!("Failed to clear stored artifact references: {err}"))?;
+    }
+
+    Ok(())
+}
+
+fn ensure_app_state(app: &AppHandle) -> Result<AppState, String> {
+    let app_dir = app
+        .path()
+        .app_local_data_dir()
+        .map_err(|err| format!("Failed to resolve application data directory: {err}"))?;
+    let media_dir = app_dir.join("data");
+    fs::create_dir_all(&media_dir)
+        .map_err(|err| format!("Failed to create application data directory: {err}"))?;
+
+    let state = AppState {
+        backend: resolve_backend_runtime(app)?,
+        ffmpeg_path: resolve_optional_tool_path(app, "YTS_FFMPEG", "ffmpeg"),
+        ffprobe_path: resolve_optional_tool_path(app, "YTS_FFPROBE", "ffprobe"),
+        whisper_cache_dir: resolve_whisper_cache_dir(app)?,
+        app_dir: app_dir.clone(),
+        media_dir,
+        db_path: app_dir.join("summaries.db"),
+    };
+
+    init_db(&state)?;
+    purge_existing_artifacts(&state)?;
+    Ok(state)
+}
+
+fn emit_progress(app: &AppHandle, window_label: &str, line: &str) {
+    let trimmed = line.trim();
+    if !trimmed.is_empty() {
+        let _ = app.emit_to(window_label, "summarize-progress", trimmed.to_string());
+    }
+}
+
+fn apply_backend_env(command: &mut Command, state: &AppState) {
+    command.env("PYTHONUNBUFFERED", "1");
+    command.env("YTS_WHISPER_CACHE_DIR", &state.whisper_cache_dir);
+
+    if let Some(ffmpeg_path) = &state.ffmpeg_path {
+        command.env("YTS_FFMPEG", ffmpeg_path);
+    }
+    if let Some(ffprobe_path) = &state.ffprobe_path {
+        command.env("YTS_FFPROBE", ffprobe_path);
+    }
+}
+
+fn build_backend_command(state: &AppState, args: &[String]) -> Command {
+    let mut command = match &state.backend {
+        BackendRuntime::Bundled { executable } => Command::new(executable),
+        BackendRuntime::Python { python, script_dir } => {
+            let mut command = Command::new(python);
+            command.arg(script_dir.join("backend_cli.py"));
+            command
+        }
+    };
+
+    command.args(args).current_dir(&state.media_dir);
+    apply_backend_env(&mut command, state);
+    command
+}
+
+fn run_backend_json_command(
+    state: &AppState,
+    app: &AppHandle,
+    window_label: &str,
+    args: &[String],
+) -> Result<BackendSummaryMeta, String> {
+    let output_path = state.app_dir.join(format!("tmp_{}.json", now_millis()));
+    let mut command_args = args.to_vec();
+    command_args.push("--output-json".to_string());
+    command_args.push(output_path.to_string_lossy().into_owned());
+
+    let mut child = build_backend_command(state, &command_args)
+        .stdout(Stdio::piped())
+        .stderr(Stdio::piped())
+        .spawn()
+        .map_err(|err| format!("Failed to start bundled backend: {err}"))?;
+
+    let stdout = child
+        .stdout
+        .take()
+        .ok_or_else(|| "Backend stdout was not captured.".to_string())?;
+    let stderr = child
+        .stderr
+        .take()
+        .ok_or_else(|| "Backend stderr was not captured.".to_string())?;
+    let stderr_buffer = Arc::new(Mutex::new(String::new()));
+
+    let stdout_app = app.clone();
+    let stdout_label = window_label.to_string();
+    let stdout_handle = thread::spawn(move || {
+        for line in BufReader::new(stdout).lines() {
+            match line {
+                Ok(line) => emit_progress(&stdout_app, &stdout_label, &line),
+                Err(_) => break,
+            }
+        }
+    });
+
+    let stderr_app = app.clone();
+    let stderr_label = window_label.to_string();
+    let stderr_buffer_clone = Arc::clone(&stderr_buffer);
+    let stderr_handle = thread::spawn(move || {
+        for line in BufReader::new(stderr).lines() {
+            match line {
+                Ok(line) => {
+                    emit_progress(&stderr_app, &stderr_label, &line);
+                    if let Ok(mut buffer) = stderr_buffer_clone.lock() {
+                        buffer.push_str(&line);
+                        buffer.push('\n');
+                    }
+                }
+                Err(_) => break,
+            }
+        }
+    });
+
+    let status = child
+        .wait()
+        .map_err(|err| format!("Failed to wait for bundled backend: {err}"))?;
+
+    let _ = stdout_handle.join();
+    let _ = stderr_handle.join();
+
+    if !status.success() {
+        let stderr_output = stderr_buffer
+            .lock()
+            .map(|buffer| buffer.trim().to_string())
+            .unwrap_or_else(|_| String::new());
+        let message = if stderr_output.is_empty() {
+            format!("Bundled backend exited with status {status}.")
+        } else {
+            stderr_output
+        };
+        let _ = fs::remove_file(&output_path);
+        return Err(message);
+    }
+
+    let raw_json = fs::read_to_string(&output_path)
+        .map_err(|err| format!("Failed to read backend output JSON: {err}"))?;
+    let _ = fs::remove_file(&output_path);
+
+    serde_json::from_str(&raw_json).map_err(|err| format!("Invalid backend output JSON: {err}"))
+}
+
+fn run_backend_text_command(state: &AppState, args: &[String]) -> Result<String, String> {
+    let output = build_backend_command(state, args)
+        .output()
+        .map_err(|err| format!("Failed to start translation backend: {err}"))?;
+
+    if !output.status.success() {
+        let stderr = String::from_utf8_lossy(&output.stderr).trim().to_string();
+        return Err(if stderr.is_empty() {
+            format!("Translation backend exited with status {}.", output.status)
+        } else {
+            stderr
+        });
+    }
+
+    let translation = String::from_utf8(output.stdout)
+        .map_err(|err| format!("Translation backend returned invalid UTF-8: {err}"))?
+        .trim()
+        .to_string();
+    if translation.is_empty() {
+        return Err("Translation backend returned an empty result.".to_string());
+    }
+
+    Ok(translation)
+}
+
+fn get_entry_by_id(state: &AppState, id: i64) -> Result<SummaryEntry, String> {
+    let db = open_connection(state)?;
+    let stored = db
+        .query_row(
+            "SELECT * FROM summaries WHERE id = ?",
+            [id],
+            StoredSummary::from_row,
+        )
+        .optional()
+        .map_err(|err| format!("Failed to query summary entry: {err}"))?
+        .ok_or_else(|| "Entry not found.".to_string())?;
+    Ok(stored.into_entry(state))
+}
+
+fn summarize_video_inner(
+    state: &AppState,
+    app: &AppHandle,
+    window_label: &str,
+    request: SummarizeVideoRequest,
+) -> Result<SummaryEntry, String> {
+    let model = normalize_model(request.model);
+    let mut args = vec![
+        "summarize".to_string(),
+        "--url".to_string(),
+        request.url,
+        "--model".to_string(),
+        model,
+    ];
+    if !request.use_whisper {
+        args.push("--no-whisper".to_string());
+    }
+
+    let info = run_backend_json_command(state, app, window_label, &args)?;
+    cleanup_artifacts(state, info.audio.as_deref(), info.transcript.as_deref());
+
+    let db = open_connection(state)?;
+    db.execute(
+    "INSERT INTO summaries (timestamp, video_id, url, video_name, channel, thumbnail, audio, transcript, summary_en, summary_de, summary_jp)
+     VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
+    params![
+      info.timestamp,
+      info.video_id,
+      info.url,
+      info.video_name,
+      info.channel,
+      info.thumbnail,
+      Option::<String>::None,
+      Option::<String>::None,
+      info.summary,
+      Option::<String>::None,
+      Option::<String>::None,
+    ],
+  )
+  .map_err(|err| format!("Failed to save summary entry: {err}"))?;
+
+    get_entry_by_id(state, db.last_insert_rowid())
+}
+
+fn translate_summary_inner(
+    state: &AppState,
+    request: TranslateSummaryRequest,
+) -> Result<SummaryEntry, String> {
+    let db = open_connection(state)?;
+    let summary_text = db
+        .query_row(
+            "SELECT summary_en FROM summaries WHERE id = ?",
+            [request.id],
+            |row| row.get::<_, Option<String>>(0),
+        )
+        .optional()
+        .map_err(|err| format!("Failed to load English summary for translation: {err}"))?
+        .flatten()
+        .ok_or_else(|| "No English summary found for translation.".to_string())?;
+
+    let tmp_summary_path =
+        state
+            .app_dir
+            .join(format!("tmp_summary_{}_{}.txt", request.id, now_millis()));
+    fs::write(&tmp_summary_path, summary_text)
+        .map_err(|err| format!("Failed to write temporary summary file: {err}"))?;
+
+    let model = normalize_model(request.model);
+    let args = vec![
+        "translate".to_string(),
+        "--summary-file".to_string(),
+        tmp_summary_path.to_string_lossy().into_owned(),
+        "--lang".to_string(),
+        request.lang.clone(),
+        "--model".to_string(),
+        model,
+    ];
+    let result = run_backend_text_command(state, &args);
+
+    let _ = fs::remove_file(&tmp_summary_path);
+    let translation = result?;
+
+    let column = match request.lang.as_str() {
+        "de" => "summary_de",
+        "jp" => "summary_jp",
+        _ => return Err("Unsupported language code.".to_string()),
+    };
+
+    db.execute(
+        &format!("UPDATE summaries SET {column} = ? WHERE id = ?"),
+        params![translation, request.id],
+    )
+    .map_err(|err| format!("Failed to save translated summary: {err}"))?;
+
+    get_entry_by_id(state, request.id)
+}
+
+#[tauri::command]
+fn get_models() -> Result<Vec<String>, String> {
+    let payload = Client::new()
+        .get(OLLAMA_TAGS_URL)
+        .send()
+        .and_then(|response| response.error_for_status())
+        .map_err(|err| format!("Failed to query Ollama models: {err}"))?
+        .json::<OllamaTagsResponse>()
+        .map_err(|err| format!("Failed to parse Ollama model list: {err}"))?;
+
+    Ok(payload.models.into_iter().map(|model| model.name).collect())
+}
+
+#[tauri::command]
+fn get_summaries(state: State<'_, AppState>) -> Result<Vec<SummaryEntry>, String> {
+    let db = open_connection(&state)?;
+    let mut stmt = db
+        .prepare("SELECT * FROM summaries ORDER BY id DESC")
+        .map_err(|err| format!("Failed to prepare summary query: {err}"))?;
+    let rows = stmt
+        .query_map([], StoredSummary::from_row)
+        .map_err(|err| format!("Failed to read summaries: {err}"))?;
+
+    let mut items = Vec::new();
+    for row in rows {
+        let entry = row
+            .map_err(|err| format!("Failed to decode summary row: {err}"))?
+            .into_entry(&state);
+        items.push(entry);
+    }
+
+    Ok(items)
+}
+
+#[tauri::command]
+async fn summarize_video(
+    state: State<'_, AppState>,
+    window: WebviewWindow,
+    request: SummarizeVideoRequest,
+) -> Result<SummaryEntry, String> {
+    let state = state.inner().clone();
+    let app = window.app_handle().clone();
+    let window_label = window.label().to_string();
+    tauri::async_runtime::spawn_blocking(move || {
+        summarize_video_inner(&state, &app, &window_label, request)
+    })
+    .await
+    .map_err(|err| format!("Summarize task failed: {err}"))?
+}
+
+#[tauri::command]
+fn delete_summary(state: State<'_, AppState>, request: DeleteSummaryRequest) -> Result<(), String> {
+    let db = open_connection(&state)?;
+    db.execute("DELETE FROM summaries WHERE id = ?", [request.id])
+        .map_err(|err| format!("Failed to delete summary entry: {err}"))?;
+    Ok(())
+}
+
+#[tauri::command]
+async fn translate_summary(
+    state: State<'_, AppState>,
+    request: TranslateSummaryRequest,
+) -> Result<SummaryEntry, String> {
+    let state = state.inner().clone();
+    tauri::async_runtime::spawn_blocking(move || translate_summary_inner(&state, request))
+        .await
+        .map_err(|err| format!("Translate task failed: {err}"))?
+}
+
+#[tauri::command]
+fn open_external(url: String) -> Result<(), String> {
+    that(url).map_err(|err| format!("Failed to open URL: {err}"))
+}
+
+#[tauri::command]
+fn open_file(file_path: String) -> Result<(), String> {
+    let path = Path::new(&file_path);
+    if !path.exists() {
+        return Err("Requested file does not exist.".to_string());
+    }
+    that(path).map_err(|err| format!("Failed to open file: {err}"))
+}
+
+fn main() {
+    tauri::Builder::default()
+        .plugin(tauri_plugin_dialog::init())
+        .setup(|app| {
+            let state = ensure_app_state(app.handle())?;
+            app.manage(state);
+            Ok(())
+        })
+        .invoke_handler(tauri::generate_handler![
+            get_models,
+            get_summaries,
+            summarize_video,
+            delete_summary,
+            translate_summary,
+            open_external,
+            open_file
+        ])
+        .run(tauri::generate_context!())
+        .expect("error while running tauri application");
+}
--- a/src-tauri/tauri.conf.json
+++ b/src-tauri/tauri.conf.json
@@ -0,0 +1,39 @@
+{
+  "$schema": "https://schema.tauri.app/config/2",
+  "productName": "YouTube Summarizer",
+  "version": "1.0.0",
+  "identifier": "com.victorgiers.youtube-summarizer",
+  "build": {
+    "frontendDist": "../ui"
+  },
+  "app": {
+    "withGlobalTauri": true,
+    "security": {
+      "assetProtocol": {
+        "enable": true,
+        "scope": ["$APPLOCALDATA/data/**"]
+      },
+      "csp": null
+    },
+    "windows": [
+      {
+        "label": "main",
+        "title": "YouTube Summarizer",
+        "width": 1104,
+        "height": 800,
+        "resizable": true
+      }
+    ]
+  },
+  "bundle": {
+    "active": true,
+    "resources": [
+      "../backend_cli.py",
+      "../youtube_summarizer.py",
+      "../translate_summary.py",
+      "../requirements.txt",
+      "resources/backend",
+      "resources/ffmpeg"
+    ]
+  }
+}
--- a/tools/autofill_translations.py
+++ b/tools/autofill_translations.py
@@ -0,0 +1,74 @@
+#!/usr/bin/env python3
+
+import os
+import sqlite3
+import subprocess
+import sys
+
+DB_FILE = os.path.join(os.path.dirname(__file__), 'summaries.db')
+TRANSLATE_SCRIPT = os.path.join(os.path.dirname(__file__), 'translate_summary.py')
+MODEL = "mistral-small3.1:24b"
+
+def get_entries_needing_translation(conn):
+    cursor = conn.cursor()
+    cursor.execute(
+        "SELECT id, summary_en, summary_de, summary_jp FROM summaries"
+    )
+    return [
+        (row[0], row[1], row[2], row[3])
+        for row in cursor.fetchall()
+        if row[1] and (not row[2] or not row[3])  # summary_en vorhanden, mind. eine Übersetzung fehlt
+    ]
+
+def translate(summary_text, lang):
+    # Schreibe summary_text temporär in Datei
+    import tempfile
+    with tempfile.NamedTemporaryFile('w+', delete=False, suffix='.txt', encoding='utf-8') as f:
+        f.write(summary_text)
+        tmp_summary_path = f.name
+    try:
+        # Führe das Übersetzungsskript aus
+        cmd = [
+            sys.executable,  # benutzt aktuelles Python
+            TRANSLATE_SCRIPT,
+            "--summary-file", tmp_summary_path,
+            "--lang", lang,
+            "--model", MODEL,
+        ]
+        print(f"[{lang}] Translating with: {' '.join(cmd)}")
+        result = subprocess.run(cmd, capture_output=True, text=True, check=True)
+        translation = result.stdout.strip()
+        return translation
+    finally:
+        os.remove(tmp_summary_path)
+
+def main():
+    conn = sqlite3.connect(DB_FILE)
+    cursor = conn.cursor()
+    entries = get_entries_needing_translation(conn)
+    print(f"Found {len(entries)} entries needing translation.")
+    for entry_id, summary_en, summary_de, summary_jp in entries:
+        updated = False
+        if not summary_de:
+            print(f"Translating to DE for entry id {entry_id}…")
+            try:
+                translation = translate(summary_en, "de")
+                cursor.execute("UPDATE summaries SET summary_de = ? WHERE id = ?", (translation, entry_id))
+                updated = True
+            except Exception as e:
+                print(f"Failed to translate DE for id {entry_id}: {e}")
+        if not summary_jp:
+            print(f"Translating to JP for entry id {entry_id}…")
+            try:
+                translation = translate(summary_en, "jp")
+                cursor.execute("UPDATE summaries SET summary_jp = ? WHERE id = ?", (translation, entry_id))
+                updated = True
+            except Exception as e:
+                print(f"Failed to translate JP for id {entry_id}: {e}")
+        if updated:
+            conn.commit()
+    conn.close()
+    print("Done.")
+
+if __name__ == "__main__":
+    main()
--- a/tools/prepare_bundle.py
+++ b/tools/prepare_bundle.py
@@ -0,0 +1,161 @@
+#!/usr/bin/env python3
+"""
+Prepare local bundle assets for a distributable Tauri build.
+
+This script:
+1. installs PyInstaller into the current Python environment
+2. builds the bundled backend helper as a single executable
+3. copies ffmpeg / ffprobe from the local PATH into Tauri resources
+
+It targets the current host platform. Run it once per build machine before
+`cargo tauri build`.
+"""
+
+from __future__ import annotations
+
+import os
+import shutil
+import stat
+import subprocess
+import sys
+from pathlib import Path
+
+
+ROOT = Path(__file__).resolve().parents[1]
+SRC_TAURI = ROOT / "src-tauri"
+BACKEND_ROOT = SRC_TAURI / "resources" / "backend"
+FFMPEG_ROOT = SRC_TAURI / "resources" / "ffmpeg"
+BUILD_DIR = ROOT / "build"
+DIST_DIR = BUILD_DIR / "pyinstaller-dist"
+WORK_DIR = BUILD_DIR / "pyinstaller-work"
+SPEC_DIR = BUILD_DIR / "pyinstaller-spec"
+BACKEND_NAME = "yts-backend"
+
+
+def run(cmd: list[str]) -> None:
+    subprocess.run(cmd, check=True, cwd=ROOT)
+
+
+def detect_target_triple() -> str:
+    try:
+        output = subprocess.check_output(["rustc", "--print", "host-tuple"], text=True)
+        return output.strip()
+    except subprocess.CalledProcessError:
+        verbose = subprocess.check_output(["rustc", "-Vv"], text=True)
+        for line in verbose.splitlines():
+            if line.startswith("host: "):
+                return line.split(": ", 1)[1].strip()
+    raise SystemExit("Unable to determine the Rust host target triple.")
+
+
+def executable_suffix() -> str:
+    return ".exe" if os.name == "nt" else ""
+
+
+def ensure_pyinstaller() -> None:
+    run([sys.executable, "-m", "pip", "install", "--upgrade", "pip"])
+    run([sys.executable, "-m", "pip", "install", "-r", str(ROOT / "requirements.txt"), "pyinstaller"])
+
+
+def build_backend_binary() -> Path:
+    DIST_DIR.mkdir(parents=True, exist_ok=True)
+    WORK_DIR.mkdir(parents=True, exist_ok=True)
+    SPEC_DIR.mkdir(parents=True, exist_ok=True)
+
+    cmd = [
+        sys.executable,
+        "-m",
+        "PyInstaller",
+        "--noconfirm",
+        "--clean",
+        "--onefile",
+        "--name",
+        BACKEND_NAME,
+        "--distpath",
+        str(DIST_DIR),
+        "--workpath",
+        str(WORK_DIR),
+        "--specpath",
+        str(SPEC_DIR),
+        "--paths",
+        str(ROOT),
+        "--collect-submodules",
+        "whisper",
+        "--collect-submodules",
+        "yt_dlp",
+        "--collect-submodules",
+        "youtube_transcript_api",
+        "--collect-data",
+        "whisper",
+        "--collect-data",
+        "yt_dlp",
+        "--collect-data",
+        "webvtt",
+        "--collect-data",
+        "youtube_transcript_api",
+        str(ROOT / "backend_cli.py"),
+    ]
+    run(cmd)
+    binary = DIST_DIR / f"{BACKEND_NAME}{executable_suffix()}"
+    if not binary.exists():
+        raise SystemExit(f"Expected backend binary was not produced: {binary}")
+    return binary
+
+
+def install_sidecar(binary: Path, target_triple: str) -> Path:
+    target_dir = BACKEND_ROOT / target_triple
+    target_dir.mkdir(parents=True, exist_ok=True)
+    target = target_dir / f"{BACKEND_NAME}{executable_suffix()}"
+    shutil.copy2(binary, target)
+    if os.name != "nt":
+        target.chmod(target.stat().st_mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
+    return target
+
+
+def resolve_tool_source(env_name: str, tool_name: str) -> Path:
+    override = os.environ.get(env_name, "").strip()
+    if override:
+        return Path(override).expanduser().resolve()
+
+    source = shutil.which(tool_name)
+    if not source:
+        raise SystemExit(
+            f"Required build dependency not found: {tool_name}. "
+            f"Put it on PATH or set {env_name}."
+        )
+    return Path(source).resolve()
+
+
+def copy_tool_to_resources(env_name: str, tool_name: str, resource_dir: Path) -> Path:
+    source_path = resolve_tool_source(env_name, tool_name)
+    destination = resource_dir / source_path.name
+    shutil.copy2(source_path, destination)
+    if os.name != "nt":
+        destination.chmod(destination.stat().st_mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
+    return destination
+
+
+def install_ffmpeg_resources(target_triple: str) -> tuple[Path, Path]:
+    resource_dir = FFMPEG_ROOT / target_triple
+    resource_dir.mkdir(parents=True, exist_ok=True)
+    ffmpeg = copy_tool_to_resources("YTS_FFMPEG", "ffmpeg", resource_dir)
+    ffprobe = copy_tool_to_resources("YTS_FFPROBE", "ffprobe", resource_dir)
+    return ffmpeg, ffprobe
+
+
+def main() -> int:
+    target_triple = detect_target_triple()
+    ensure_pyinstaller()
+    backend_binary = build_backend_binary()
+    sidecar = install_sidecar(backend_binary, target_triple)
+    ffmpeg, ffprobe = install_ffmpeg_resources(target_triple)
+
+    print(f"Prepared backend sidecar: {sidecar}")
+    print(f"Prepared ffmpeg resource: {ffmpeg}")
+    print(f"Prepared ffprobe resource: {ffprobe}")
+    print("Next step: cargo tauri build")
+    return 0
+
+
+if __name__ == "__main__":
+    raise SystemExit(main())
--- a/translate_summary.py
+++ b/translate_summary.py
@@ -0,0 +1,80 @@
+#!/usr/bin/env python3
+"""
+translate_summary.py
+
+Usage:
+    python3 translate_summary.py --summary-file <file> --lang <de|jp> [--model <model>] [--output-file <file>]
+
+Arguments:
+    --summary-file   Path to the file containing the English summary text.
+    --lang           Target language ('de' for German, 'jp' for Japanese).
+    --model          (Optional) Ollama model name, defaults to mistral:latest.
+    --output-file    (Optional) Where to write translated summary as plain text.
+
+Example:
+    python3 translate_summary.py --summary-file summary.txt --lang de --model mistral:latest
+"""
+
+import sys
+import argparse
+import json
+import requests
+
+LANG_MAP = {
+    "de": "German",
+    "jp": "Japanese"
+}
+
+def translate_summary_text(summary_text, target_language, model="mistral:latest"):
+    if target_language not in LANG_MAP:
+        raise ValueError("Supported languages: de (German), jp (Japanese)")
+    prompt = (
+        f"Translate the following summary into {LANG_MAP[target_language]}. Only output the translated summary, "
+        "no explanation or intro. If it's already in the target language, do nothing but repeat it.\n\n"
+        f"Summary:\n{summary_text}\n\nTranslation:"
+    )
+    payload = {
+        "model": model,
+        "messages": [
+            {"role": "system", "content": f"You are an expert translator proficient in {LANG_MAP[target_language]} and English."},
+            {"role": "user", "content": prompt}
+        ],
+        "stream": False
+    }
+    resp = requests.post("http://localhost:11434/api/chat", json=payload)
+    resp.raise_for_status()
+    data = resp.json()
+    return data.get("message", {}).get("content", "").strip()
+
+
+def translate_summary_file(summary_file, target_language, model="mistral:latest"):
+    with open(summary_file, "r", encoding="utf-8") as f:
+        summary_text = f.read().strip()
+    if not summary_text:
+        raise ValueError("Empty summary text!")
+    return translate_summary_text(summary_text, target_language, model)
+
+def main():
+    parser = argparse.ArgumentParser(description="Translate summary using Ollama")
+    parser.add_argument("--summary-file", required=True, help="Path to file with English summary text")
+    parser.add_argument("--lang", required=True, choices=["de", "jp"], help="Target language: 'de' or 'jp'")
+    parser.add_argument("--model", default="mistral:latest", help="Ollama model to use")
+    parser.add_argument("--output-file", help="Output file for translated summary")
+    args = parser.parse_args()
+
+    # Read summary
+    try:
+        translation = translate_summary_file(args.summary_file, args.lang, args.model)
+    except Exception as e:
+        print(f"Translation failed: {e}", file=sys.stderr)
+        sys.exit(2)
+
+    # Output result
+    if args.output_file:
+        with open(args.output_file, "w", encoding="utf-8") as f:
+            f.write(translation)
+    else:
+        print(translation)
+
+if __name__ == "__main__":
+    main()
--- a/ui/index.html
+++ b/ui/index.html
@@ -0,0 +1,166 @@
+<!DOCTYPE html>
+<html lang="en">
+<head>
+  <meta charset="UTF-8" />
+  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+  <title>YouTube Summaries</title>
+  <style>
+    body {
+      font-family: Arial, sans-serif;
+      margin: 20px;
+      background-color: #ffe4e6;
+      color: #9f1239;
+    }
+    header {
+      display: flex;
+      align-items: center;
+      gap: 10px;
+    }
+    header input[type="text"] {
+      flex: 1;
+      padding: 8px;
+      font-size: 16px;
+      border-radius: 4px;
+      border: 1px solid #ccc;
+    }
+    header input[type="checkbox"] {
+      accent-color: #9f1239;
+    }
+    header button {
+      padding: 8px 16px;
+      font-size: 16px;
+      border: none;
+      border-radius: 4px;
+      background-color: #9f1239;
+      color: white;
+      cursor: pointer;
+    }
+    header button:hover {
+      background-color: #7c0e2e;
+    }
+    header label {
+      display: flex;
+      align-items: center;
+      gap: 5px;
+      font-size: 14px;
+      color: #9f1239;
+    }
+    .loading {
+      margin-top: 5px;
+      font-style: italic;
+      color: #9f1239;
+    }
+    #summaries-container {
+      margin-top: 20px;
+      display: flex;
+      flex-direction: column;
+      gap: 15px;
+    }
+    .entry {
+      display: flex;
+      gap: 10px;
+      padding: 10px;
+      border: 1px solid #fff;
+      border-radius: 5px;
+      background-color: #fff1f2;
+      box-shadow: 0 1px 3px rgba(0, 0, 0, 0.05);
+    }
+    .entry .left {
+      width: 150px;
+      flex-shrink: 0;
+    }
+    .entry .thumbnail {
+      max-width: 100%;
+      border-radius: 3px;
+    }
+    .entry .middle {
+      flex: 1;
+    }
+    .entry .right {
+      width: 140px;
+      flex-shrink: 0;
+    }
+    .entry ul {
+      list-style-type: none;
+      padding-left: 0;
+      margin: 0;
+    }
+    .entry li {
+      margin-bottom: 5px;
+    }
+    .entry a {
+      color: #9f1239;
+      text-decoration: none;
+    }
+    .entry a:hover {
+      text-decoration: underline;
+    }
+    .entry .summary {
+      transition: max-height 0.2s;
+    }
+    .entry.collapsed .summary {
+      display: -webkit-box !important;
+      -webkit-line-clamp: 2;
+      -webkit-box-orient: vertical;
+      overflow: hidden;
+      max-height: 2.8em;
+    }
+    .pagination {
+      display: flex;
+      flex-wrap: wrap;
+      gap: 5px;
+      justify-content: center;
+      margin: 10px 0;
+    }
+    .pagination button {
+      padding: 4px 8px;
+      font-size: 14px;
+      border: 1px solid #9f1239;
+      background-color: #fff1f2;
+      color: #9f1239;
+      border-radius: 3px;
+      cursor: pointer;
+    }
+    .pagination button:hover {
+      background-color: #9f1239;
+      color: white;
+    }
+    .pagination button.active {
+      background-color: #9f1239;
+      color: white;
+      font-weight: bold;
+    }
+    header button:disabled {
+      background-color: #ffe4e6;
+      color: #9f1239;
+      opacity: 0.7;
+      cursor: default;
+      border: 1px solid #fbb6ce;
+    }
+  </style>
+</head>
+<body>
+  <header style="display:flex; flex-direction:column; gap:5px;">
+    <form id="summarize-form" style="display:flex; width:100%; gap:10px; align-items:center; flex-wrap: wrap;">
+      <input type="text" id="url-input" placeholder="Enter YouTube URL" />
+      <button type="submit">Summarize!</button>
+      <div style="display: flex; flex-direction: column; gap: 2px; min-width:120px;">
+        <label style="font-size:14px; color:#9f1239; display: flex; align-items: center; gap: 5px;">
+          <input type="checkbox" id="whisper-checkbox" checked />Use Whisper
+        </label>
+        <label style="font-size:14px; color:#9f1239; display: flex; align-items: center; gap: 5px;">
+          <input type="checkbox" id="autotranslate-checkbox" checked />Auto Translate
+        </label>
+      </div>
+      <select id="model-select" style="padding:6px; font-size:14px;">
+        <option disabled selected>Loading models…</option>
+      </select>
+    </form>
+    <div id="loading" class="loading" style="display:none;">Loading…</div>
+  </header>
+  <div id="pagination-top" class="pagination" style="display:none;"></div>
+  <div id="summaries-container"></div>
+  <div id="pagination-bottom" class="pagination" style="display:none;"></div>
+  <script src="renderer.js"></script>
+</body>
+</html>
--- a/ui/renderer.js
+++ b/ui/renderer.js
@@ -0,0 +1,578 @@
+const tauriApi = window.__TAURI__;
+const invoke = tauriApi?.core?.invoke;
+const listen = tauriApi?.event?.listen;
+const convertFileSrc = tauriApi?.core?.convertFileSrc;
+const confirmDialog = tauriApi?.dialog?.confirm;
+
+if (!invoke || !listen) {
+  throw new Error('Tauri runtime API is unavailable.');
+}
+
+function toWebviewFileUrl(filePath) {
+  if (!filePath) {
+    return filePath;
+  }
+  if (typeof convertFileSrc === 'function') {
+    return convertFileSrc(filePath);
+  }
+  return filePath;
+}
+
+window.api = {
+  getModels: () => invoke('get_models'),
+  getSummaries: () => invoke('get_summaries'),
+  summarizeVideo: (url, useWhisper, model) => invoke('summarize_video', {
+    request: {
+      url,
+      useWhisper,
+      model: model || null
+    }
+  }),
+  openExternal: (url) => invoke('open_external', { url }),
+  openFile: (filePath) => invoke('open_file', { filePath }),
+  deleteSummary: (id) => invoke('delete_summary', {
+    request: { id }
+  }),
+  translateSummary: (id, lang, model) => invoke('translate_summary', {
+    request: {
+      id,
+      lang,
+      model: model || null
+    }
+  }),
+  onSummarizeProgress: (callback) => listen('summarize-progress', (event) => {
+    callback(String(event.payload || ''));
+  })
+};
+
+window.addEventListener('DOMContentLoaded', async () => {
+  const form = document.getElementById('summarize-form');
+  const urlInput = document.getElementById('url-input');
+  const whisperCheckbox = document.getElementById('whisper-checkbox');
+  const summariesContainer = document.getElementById('summaries-container');
+  const loadingIndicator = document.getElementById('loading');
+  const modelSelect = document.getElementById('model-select');
+  const paginationTop = document.getElementById('pagination-top');
+  const paginationBottom = document.getElementById('pagination-bottom');
+  const summarizeButton = form.querySelector('button[type="submit"]');
+  const autoTranslateCheckbox = document.getElementById('autotranslate-checkbox');
+
+  let fullSummaries = [];
+  let currentPage = 1;
+  const PAGE_SIZE = 20;
+  let isLoading = false;
+  let entryUiState = {};
+
+  function setLoadingMessage(message) {
+    if (!isLoading) {
+      return;
+    }
+    loadingIndicator.style.display = 'inline';
+    loadingIndicator.textContent = message;
+  }
+
+  whisperCheckbox.checked = localStorage.getItem('useWhisper') === '0' ? false : true;
+  autoTranslateCheckbox.checked = localStorage.getItem('autoTranslate') === '1' ? true : false;
+
+  whisperCheckbox.addEventListener('change', () => {
+    localStorage.setItem('useWhisper', whisperCheckbox.checked ? '1' : '0');
+  });
+  autoTranslateCheckbox.addEventListener('change', () => {
+    localStorage.setItem('autoTranslate', autoTranslateCheckbox.checked ? '1' : '0');
+  });
+
+  function renderSummaries(list) {
+    summariesContainer.innerHTML = '';
+    const renderedIds = new Set();
+
+    list.forEach(item => {
+      renderedIds.add(item.id);
+      if (!entryUiState[item.id]) {
+        entryUiState[item.id] = { expanded: false, lang: 'en' };
+      }
+      let { expanded, lang } = entryUiState[item.id];
+
+      const entry = document.createElement('div');
+      entry.classList.add('entry');
+      entry.style.overflow = 'hidden';
+
+      const deleteButton = document.createElement('button');
+      deleteButton.type = 'button';
+      deleteButton.innerHTML = '&times;';
+      deleteButton.classList.add('delete-entry-button');
+      deleteButton.style.width = '24px';
+      deleteButton.style.height = '24px';
+      deleteButton.style.display = 'flex';
+      deleteButton.style.alignItems = 'center';
+      deleteButton.style.justifyContent = 'center';
+      deleteButton.style.border = 'none';
+      deleteButton.style.background = 'transparent';
+      deleteButton.style.color = '#9f1239';
+      deleteButton.style.fontSize = '22px';
+      deleteButton.style.fontWeight = 'normal';
+      deleteButton.style.cursor = 'pointer';
+      deleteButton.style.padding = '0';
+      deleteButton.style.lineHeight = '1';
+      deleteButton.disabled = isLoading;
+      deleteButton.addEventListener('click', (e) => {
+        e.preventDefault();
+        e.stopPropagation();
+        if (isLoading) {
+          return;
+        }
+        if (typeof confirmDialog !== 'function') {
+          alert('Delete confirmation is unavailable.');
+          return;
+        }
+        confirmDialog('Are you sure you want to delete this entry?', {
+          title: 'Delete entry',
+          kind: 'warning'
+        }).then((confirmed) => {
+          if (!confirmed) {
+            return;
+          }
+          window.api.deleteSummary(item.id)
+            .then(() => {
+              delete entryUiState[item.id];
+              return window.api.getSummaries().then(setSummaries);
+            })
+            .catch(err => {
+              alert('Error deleting summary: ' + err.message);
+            });
+        });
+      });
+      const left = document.createElement('div');
+      left.classList.add('left');
+      if (item.thumbnail) {
+        const img = document.createElement('img');
+        img.src = toWebviewFileUrl(item.thumbnail);
+        img.alt = item.video_name;
+        img.classList.add('thumbnail');
+        if (item.url) {
+          img.style.cursor = 'pointer';
+          img.title = 'Open video';
+          img.addEventListener('click', (e) => {
+            e.stopPropagation();
+            window.api.openExternal(item.url);
+          });
+        }
+        left.appendChild(img);
+      }
+
+      const langSwitcher = document.createElement('span');
+      langSwitcher.style.display = 'flex';
+      langSwitcher.style.gap = '6px';
+      langSwitcher.style.marginTop = '8px';
+      langSwitcher.style.marginBottom = '2px';
+
+      const summaryFields = {
+        en: item.summary_en,
+        de: item.summary_de,
+        jp: item.summary_jp
+      };
+
+      ['en', 'de', 'jp'].forEach(thisLang => {
+        const btn = document.createElement('button');
+        btn.type = 'button';
+        btn.textContent = thisLang.toUpperCase();
+        btn.style.fontSize = '12px';
+        btn.style.padding = '2px 8px';
+        btn.style.borderRadius = '5px';
+        btn.style.border = '1px solid #eee';
+        btn.style.background = (thisLang === lang) ? '#9f1239' : '#fff1f2';
+        btn.style.color = (thisLang === lang) ? '#fff' : '#9f1239';
+        btn.disabled = isLoading;
+        btn.addEventListener('click', () => {
+          lang = thisLang;
+          entryUiState[item.id].lang = lang;
+          renderSummaryContent();
+          Array.from(langSwitcher.children).forEach((button, index) => {
+            const language = ['en', 'de', 'jp'][index];
+            button.style.background = (language === lang) ? '#9f1239' : '#fff1f2';
+            button.style.color = (language === lang) ? '#fff' : '#9f1239';
+          });
+        });
+        langSwitcher.appendChild(btn);
+      });
+      left.appendChild(langSwitcher);
+
+      const middle = document.createElement('div');
+      middle.classList.add('middle');
+      const headline = document.createElement('div');
+      headline.style.display = 'flex';
+      headline.style.alignItems = 'center';
+      headline.style.justifyContent = 'space-between';
+      headline.style.gap = '12px';
+      const headlineMain = document.createElement('div');
+      headlineMain.style.display = 'flex';
+      headlineMain.style.alignItems = 'center';
+      headlineMain.style.minWidth = '0';
+      const titleEl = document.createElement('strong');
+      titleEl.style.display = 'block';
+      titleEl.style.fontSize = '16px';
+      titleEl.style.cursor = 'default';
+      titleEl.style.marginLeft = '0';
+      titleEl.textContent = item.video_name;
+
+      const arrow = document.createElement('span');
+      arrow.textContent = expanded ? '▼' : '▶';
+      arrow.style.marginRight = '8px';
+      arrow.style.marginLeft = '0';
+      arrow.style.fontSize = '18px';
+      arrow.style.userSelect = 'none';
+      arrow.style.transition = 'transform 0.15s';
+
+      headlineMain.appendChild(arrow);
+      headlineMain.appendChild(titleEl);
+      headline.appendChild(headlineMain);
+      headline.appendChild(deleteButton);
+
+      const channelEl = document.createElement('span');
+      channelEl.style.fontSize = '14px';
+      channelEl.style.opacity = '0.8';
+      channelEl.style.marginBottom = '12px';
+      channelEl.textContent = item.channel || '';
+      channelEl.style.display = 'block';
+      channelEl.style.marginTop = '2px';
+
+      middle.appendChild(headline);
+      middle.appendChild(channelEl);
+
+      const summaryHTML = document.createElement('div');
+      summaryHTML.classList.add('summary');
+      summaryHTML.style.display = '-webkit-box';
+      summaryHTML.style.webkitBoxOrient = 'vertical';
+      summaryHTML.style.overflow = 'hidden';
+      summaryHTML.style.transition = 'max-height 0.2s';
+
+      function renderSummaryContent() {
+        const text = summaryFields[lang];
+        summaryHTML.innerHTML = '';
+        if (text && text.trim()) {
+          summaryHTML.innerHTML = markdownToHTML(text);
+        } else {
+          const missingMsg = document.createElement('span');
+          missingMsg.textContent = (
+            lang === 'de' ? 'German not available. ' :
+            lang === 'jp' ? 'Japanese not available. ' :
+            'Not available. '
+          );
+          summaryHTML.appendChild(missingMsg);
+        }
+        if (!expanded) {
+          summaryHTML.style.webkitLineClamp = '2';
+          summaryHTML.style.maxHeight = '2.8em';
+        } else {
+          summaryHTML.style.webkitLineClamp = '';
+          summaryHTML.style.maxHeight = '';
+        }
+      }
+
+      middle.appendChild(summaryHTML);
+
+      entry.appendChild(left);
+      entry.appendChild(middle);
+
+      summariesContainer.appendChild(entry);
+
+      function applyCollapsedStyle() {
+        if (!expanded) {
+          entry.classList.add('collapsed');
+          arrow.textContent = '▶';
+        } else {
+          entry.classList.remove('collapsed');
+          arrow.textContent = '▼';
+        }
+        renderSummaryContent();
+      }
+      applyCollapsedStyle();
+
+      middle.addEventListener('click', () => {
+        if (!expanded) {
+          expanded = true;
+          entryUiState[item.id].expanded = true;
+          applyCollapsedStyle();
+        }
+      });
+
+      headline.addEventListener('click', (e) => {
+        if (expanded) {
+          expanded = false;
+          entryUiState[item.id].expanded = false;
+          applyCollapsedStyle();
+          e.stopPropagation();
+        }
+      });
+    });
+
+    Object.keys(entryUiState).forEach(id => {
+      if (!renderedIds.has(Number(id))) {
+        delete entryUiState[id];
+      }
+    });
+
+    setActionLinksDisabled(isLoading);
+  }
+
+  function markdownToHTML(text) {
+    text = text.replace(/<\/think(?:ing)?>[^\S\n]*\n+[^\S\n]*/gi, '</think>');
+    text = text.replace(
+      /(^|\n)\s*<think>[\s\S]*?<\/think(?:ing)?>\s*(\n\s*\n)?/gi,
+      (_, lead) => (lead ? '\n' : '')
+    );
+
+    let tmp = text.replace(/\r\n/g, '\n').replace(/\r/g, '\n');
+    tmp = tmp.replace(
+      /(^|\n)\s*<think>[\s\S]*?<\/think(?:ing)?>\s*(?=\n|$)/gi,
+      (_, lead) => (lead ? '\n' : '')
+    );
+
+    const codeblocks = [];
+    const placeholder = idx => `@@CODEBLOCK${idx}@@`;
+    tmp = tmp.replace(/```([\s\S]*?)```/g, (_, code) => {
+      codeblocks.push(code);
+      return placeholder(codeblocks.length - 1);
+    });
+
+    let escaped = tmp
+      .replace(/&/g, '&amp;')
+      .replace(/</g, '&lt;')
+      .replace(/>/g, '&gt;');
+
+    escaped = escaped
+      .replace(/^#### (.+)$/gm, '<h4>$1</h4>')
+      .replace(/^### (.+)$/gm, '<h3>$1</h3>')
+      .replace(/^## (.+)$/gm, '<h2>$1</h2>')
+      .replace(/^# (.+)$/gm, '<h1>$1</h1>');
+
+    escaped = escaped.replace(
+      /(^|\n)([ \t]*\* .+(?:\n[ \t]*\* .+)*)/g,
+      (_, lead, listBlock) => {
+        const items = listBlock
+          .split(/\n/)
+          .map(line => line.replace(/^[ \t]*\*\s+/, '').trim())
+          .map(item => `<li>${item}</li>`)
+          .join('');
+        return `${lead}<ul>${items}</ul>`;
+      }
+    );
+
+    let html = escaped
+      .replace(/\*\*(.+?)\*\*/g, '<b>$1</b>')
+      .replace(/(?<!\*)\*(.+?)\*(?!\*)/g, '<i>$1</i>')
+      .replace(/`(.+?)`/g, '<code>$1</code>');
+
+    html = html.replace(/@@CODEBLOCK(\d+)@@/g, (_, idx) => {
+      const code = codeblocks[Number(idx)];
+      return `<pre><code>${code}</code></pre>`;
+    });
+
+    html = html.replace(/\n*(<h[1-3]>.*?<\/h[1-3]>)\n*/g, '$1\n');
+    html = html.replace(/\n/g, '<br>');
+    html = html
+      .replace(/<br>\s*(<h[1-3]>)/g, '$1')
+      .replace(/(<\/h[1-3]>)\s*<br>/g, '$1');
+
+    return html;
+  }
+
+  function setActionLinksDisabled(disabled) {
+    document.querySelectorAll('.delete-entry-button').forEach(button => {
+      if (disabled) {
+        button.disabled = true;
+        button.style.opacity = '0.5';
+      } else {
+        button.disabled = false;
+        button.style.opacity = '';
+      }
+    });
+    document.querySelectorAll('.left button').forEach(btn => {
+      btn.disabled = disabled;
+      btn.style.opacity = disabled ? '0.5' : '';
+    });
+  }
+
+  function updatePaginationControls() {
+    if (!fullSummaries || fullSummaries.length <= PAGE_SIZE) {
+      paginationTop.style.display = 'none';
+      paginationBottom.style.display = 'none';
+      return;
+    }
+    paginationTop.style.display = 'flex';
+    paginationBottom.style.display = 'flex';
+    const totalPages = Math.ceil(fullSummaries.length / PAGE_SIZE);
+
+    const buildNav = (container) => {
+      container.innerHTML = '';
+
+      const prevBtn = document.createElement('button');
+      prevBtn.textContent = '«';
+      prevBtn.disabled = currentPage === 1;
+      prevBtn.addEventListener('click', () => {
+        if (currentPage > 1) {
+          showPage(currentPage - 1);
+          updatePaginationControls();
+        }
+      });
+      container.appendChild(prevBtn);
+
+      for (let i = 1; i <= totalPages; i += 1) {
+        const btn = document.createElement('button');
+        btn.textContent = i;
+        if (i === currentPage) {
+          btn.classList.add('active');
+        }
+        btn.addEventListener('click', () => {
+          showPage(i);
+          updatePaginationControls();
+        });
+        container.appendChild(btn);
+      }
+
+      const nextBtn = document.createElement('button');
+      nextBtn.textContent = '»';
+      nextBtn.disabled = currentPage === totalPages;
+      nextBtn.addEventListener('click', () => {
+        if (currentPage < totalPages) {
+          showPage(currentPage + 1);
+          updatePaginationControls();
+        }
+      });
+      container.appendChild(nextBtn);
+    };
+
+    buildNav(paginationTop);
+    buildNav(paginationBottom);
+  }
+
+  function showPage(page) {
+    const totalPages = Math.ceil(fullSummaries.length / PAGE_SIZE);
+    currentPage = Math.max(1, Math.min(page, totalPages || 1));
+    const start = (currentPage - 1) * PAGE_SIZE;
+    const end = start + PAGE_SIZE;
+    renderSummaries(fullSummaries.slice(start, end));
+  }
+
+  function setSummaries(list) {
+    fullSummaries = list || [];
+    const totalPages = Math.ceil(fullSummaries.length / PAGE_SIZE);
+    if (currentPage > totalPages) {
+      currentPage = Math.max(1, totalPages);
+    }
+    showPage(currentPage);
+    updatePaginationControls();
+  }
+
+  try {
+    const models = await window.api.getModels();
+    modelSelect.innerHTML = '';
+    const hasMistral = Array.isArray(models) && models.includes('mistral:latest');
+    const placeholder = document.createElement('option');
+    placeholder.disabled = true;
+    placeholder.value = '';
+    placeholder.innerText = 'Select model';
+    modelSelect.appendChild(placeholder);
+    if (Array.isArray(models)) {
+      models.forEach(name => {
+        const option = document.createElement('option');
+        option.value = name;
+        option.innerText = name;
+        modelSelect.appendChild(option);
+      });
+    }
+    const saved = localStorage.getItem('selectedModel');
+    let toSelect = '';
+    if (saved && models.includes(saved)) {
+      toSelect = saved;
+    } else if (hasMistral) {
+      toSelect = 'mistral:latest';
+    }
+    if (toSelect) {
+      modelSelect.value = toSelect;
+      placeholder.selected = false;
+    } else {
+      placeholder.selected = true;
+    }
+  } catch (err) {
+    console.error('Error loading models:', err);
+    modelSelect.innerHTML = '';
+    const placeholder = document.createElement('option');
+    placeholder.disabled = true;
+    placeholder.selected = true;
+    placeholder.value = '';
+    placeholder.innerText = 'Select model';
+    modelSelect.appendChild(placeholder);
+  }
+
+  modelSelect.addEventListener('change', () => {
+    localStorage.setItem('selectedModel', modelSelect.value);
+  });
+
+  window.api.getSummaries().then(setSummaries).catch(console.error);
+
+  form.addEventListener('submit', (e) => {
+    e.preventDefault();
+    const url = urlInput.value.trim();
+    const useWhisper = whisperCheckbox.checked;
+    const autoTranslate = autoTranslateCheckbox.checked;
+    if (!url || isLoading) {
+      return;
+    }
+
+    isLoading = true;
+    summarizeButton.disabled = true;
+    setLoadingMessage('Summarizing…');
+    setActionLinksDisabled(true);
+
+    const selectedModel = modelSelect.value;
+    window.api.summarizeVideo(url, useWhisper, selectedModel)
+      .then((newEntry) => {
+        if (!newEntry || !newEntry.id) {
+          return window.api.getSummaries().then(setSummaries);
+        }
+
+        entryUiState[newEntry.id] = { expanded: true, lang: 'en' };
+
+        if (!autoTranslate) {
+          return window.api.getSummaries().then(setSummaries);
+        }
+
+        let translationsOk = true;
+        setLoadingMessage('Translating to German (DE)…');
+        return window.api.translateSummary(newEntry.id, 'de', selectedModel)
+          .then(() => {
+            setLoadingMessage('Translating to Japanese (JP)…');
+            return window.api.translateSummary(newEntry.id, 'jp', selectedModel);
+          })
+          .catch(err => {
+            translationsOk = false;
+            alert('Error translating summary: ' + err.message);
+          })
+          .then(() => {
+            entryUiState[newEntry.id] = {
+              expanded: true,
+              lang: translationsOk ? 'jp' : 'en'
+            };
+            return window.api.getSummaries().then(setSummaries);
+          });
+      })
+      .catch(err => {
+        alert('Error summarizing video: ' + err.message);
+      })
+      .finally(() => {
+        loadingIndicator.style.display = 'none';
+        loadingIndicator.textContent = 'Loading…';
+        summarizeButton.disabled = false;
+        isLoading = false;
+        setActionLinksDisabled(false);
+        urlInput.value = '';
+      });
+  });
+
+  window.api.onSummarizeProgress(line => {
+    if (!isLoading || !line) {
+      return;
+    }
+    setLoadingMessage(line);
+  });
+});
--- a/youtube_summarizer.py
+++ b/youtube_summarizer.py
@@ -0,0 +1,693 @@
+#!/usr/bin/env python3
+"""
+youtube_summarizer.py
+
+This script accepts a YouTube URL, retrieves a transcript either via the
+YouTube API or via Whisper (depending on the flags), generates a concise
+summary using Ollama and optionally writes a JSON descriptor containing
+metadata about the processed video.  The metadata includes the video
+identifier, original URL, title, downloaded thumbnail filename, audio
+filename, transcript filename and the summary text itself.  The script
+has been adapted from an earlier command‑line tool to better integrate
+with a GUI.  The summarizer now returns the summary text instead of
+printing it directly and supports additional command line arguments for
+JSON output.
+
+Usage:
+    python3 youtube_summarizer.py <youtube-url> [--no-ai] [--output-json <path>]
+
+Options:
+    --no-ai       Use the classic API/subtitle workflow instead of Whisper for
+                  transcription (default uses Whisper).
+    --output-json Specify a file path where metadata about the processed video
+                  will be written as JSON.  If omitted the metadata is
+                  printed to standard output in JSON format.
+
+This script relies on yt_dlp for fetching video metadata, requests for
+thumbnail download and the whisper and youtube_transcript_api packages for
+transcription.
+
+"""
+
+import sys
+import os
+import re
+import time
+import json
+import glob
+import subprocess
+import multiprocessing
+import requests
+import yt_dlp
+import webvtt
+from datetime import datetime
+from typing import List, Tuple, Optional
+from xml.parsers.expat import ExpatError
+from xml.etree.ElementTree import ParseError
+from youtube_transcript_api import YouTubeTranscriptApi
+from youtube_transcript_api._errors import TranscriptsDisabled, NoTranscriptFound
+
+try:
+    import whisper
+except ImportError:
+    whisper = None  # handle gracefully if whisper isn't installed
+
+# -----------------------
+# Konfiguration & Flags
+# -----------------------
+DEBUG = False
+
+# Whisper‑Settings
+NUM_SLICES = 8
+OVERLAP_SEC = 1
+MAX_OVERLAP_WORDS = 7
+WHISPER_MODEL = "small"  # e.g. "small", "medium", "large-v3" …
+
+
+def debug_print(*args, **kwargs):
+    """Print debug messages when DEBUG is enabled."""
+    if DEBUG:
+        print("[DEBUG]", *args, **kwargs, file=sys.stderr)
+
+
+def get_ffmpeg_binary() -> str:
+    """Return the ffmpeg executable path, preferring a bundled override."""
+    value = os.environ.get("YTS_FFMPEG", "").strip()
+    return value or "ffmpeg"
+
+
+def get_ffprobe_binary() -> str:
+    """Return the ffprobe executable path, preferring a bundled override."""
+    value = os.environ.get("YTS_FFPROBE", "").strip()
+    return value or "ffprobe"
+
+
+def get_whisper_download_root() -> Optional[str]:
+    """Return a stable Whisper cache directory when one is configured."""
+    value = os.environ.get("YTS_WHISPER_CACHE_DIR", "").strip()
+    if not value:
+        return None
+    os.makedirs(value, exist_ok=True)
+    return value
+
+
+# -----------------------
+# 1) Utilities
+# -----------------------
+
+def extract_video_id(url: str) -> Optional[str]:
+    """Extract the eleven character YouTube video ID from a URL."""
+    debug_print(f"Extracting video ID from URL: {url}")
+    m = re.search(r'(?:v=|youtu\.be/)([0-9A-Za-z_-]{11})', url)
+    vid = m.group(1) if m else None
+    debug_print(f"Video ID: {vid}")
+    return vid
+
+
+def get_transcript_api(video_id: str) -> str:
+    """
+    Fetch transcript via YouTubeTranscriptApi, trying 'en', then 'de', then any available language.
+    """
+    debug_print(f"Trying transcript API for {video_id}")
+
+    # Try English first
+    try:
+        data = YouTubeTranscriptApi.get_transcript(video_id, languages=["en"])
+        text = " ".join(item["text"] for item in data)
+        debug_print(f"Transcript fetched in EN, length {len(text)} chars")
+        return text
+    except (TranscriptsDisabled, NoTranscriptFound):
+        pass
+
+    # Try German
+    try:
+        data = YouTubeTranscriptApi.get_transcript(video_id, languages=["de"])
+        text = " ".join(item["text"] for item in data)
+        debug_print(f"Transcript fetched in DE, length {len(text)} chars")
+        return text
+    except (TranscriptsDisabled, NoTranscriptFound):
+        pass
+
+    # Try any available language (prefer auto-generated if possible)
+    try:
+        tx_list = YouTubeTranscriptApi.list_transcripts(video_id)
+        # Try manually created first
+        for tr in tx_list:
+            try:
+                if not tr.is_generated:
+                    data = tr.fetch()
+                    text = " ".join(item["text"] for item in data)
+                    debug_print(f"Transcript fetched: {tr.language_code} (manual)")
+                    return text
+            except Exception:
+                continue
+        # Then fallback to auto-generated
+        for tr in tx_list:
+            try:
+                if tr.is_generated:
+                    data = tr.fetch()
+                    text = " ".join(item["text"] for item in data)
+                    debug_print(f"Transcript fetched: {tr.language_code} (auto-generated)")
+                    return text
+            except Exception:
+                continue
+    except Exception as e:
+        debug_print(f"list_transcripts failed: {e}")
+
+    # Nothing found, fail with info
+    raise SystemExit(
+        "No transcript available in EN, DE or any other language via API. "
+        "Try 'Use Whisper' mode or wait if you hit a YouTube rate limit."
+    )
+
+
+def vtt_to_lines(path: str) -> List[str]:
+    """Convert a VTT file into deduplicated lines of text."""
+    cues, last = [], None
+    for caption in webvtt.read(path):
+        cur = caption.text.replace("\n", " ").strip()
+        if not cur or cur == last:
+            continue
+        if last and cur.startswith(last):
+            cur = cur[len(last):].strip(" -")
+        cues.append(cur)
+        last = caption.text.replace("\n", " ").strip()
+    return cues
+
+
+def remove_consecutive_line_duplicates(lines: List[str]) -> List[str]:
+    """Remove consecutive duplicate lines."""
+    deduped, last = [], None
+    for l in lines:
+        if l != last:
+            deduped.append(l)
+        last = l
+    return deduped
+
+
+def remove_phrase_duplicates_from_lines(lines: List[str]) -> List[str]:
+    """Remove duplicate phrases within lines (used for subtitle deduplication)."""
+    out, last = [], None
+    for l in lines:
+        if last and l.startswith(last):
+            trimmed = l[len(last):].strip()
+            if trimmed:
+                out.append(trimmed)
+        else:
+            out.append(l)
+        last = l
+    return out
+
+
+def remove_empty_lines(lines: List[str]) -> List[str]:
+    """Remove empty lines."""
+    return [l for l in lines if l.strip()]
+
+
+def get_subtitles_via_yt_dlp(url: str) -> Optional[str]:
+    """Try to fetch subtitles via yt_dlp when API transcripts fail."""
+    debug_print(f"Fetching metadata via yt‑dlp for URL: {url}")
+    opts = {'skip_download': True, 'quiet': True, 'ignoreerrors': True}
+    with yt_dlp.YoutubeDL(opts) as ydl:
+        info = ydl.extract_info(url, download=False)
+    available = list(info.get('subtitles', {})) + list(info.get('automatic_captions', {}))
+    debug_print(f"Available subtitle languages: {available}")
+    if not available:
+        return None
+
+    priority = ['en', 'es', 'fr', 'de', 'zh', 'ja']
+    langs = [l for l in priority if l in available] + [l for l in available if l not in priority]
+
+    for lang in langs:
+        debug_print(f"Trying subtitle language {lang}")
+        dl_opts = {
+            'skip_download': True,
+            'writesubtitles': True,
+            'writeautomaticsub': True,
+            'subtitlesformat': 'vtt',
+            'subtitlelangs': [lang],
+            'outtmpl': "transcript.%(language)s.%(ext)s",
+            'quiet': True,
+        }
+        with yt_dlp.YoutubeDL(dl_opts) as ydl:
+            ydl.download([url])
+
+        files = [f for f in os.listdir('.') if f.startswith('transcript') and f.endswith('.vtt')]
+        if not files:
+            continue
+        path = files[0]
+        try:
+            lines = vtt_to_lines(path)
+            lines = remove_consecutive_line_duplicates(lines)
+            lines = remove_phrase_duplicates_from_lines(lines)
+            lines = remove_empty_lines(lines)
+            text = "\n".join(lines)
+            debug_print(f"Subtitle text length: {len(text)}")
+            return text
+        except Exception as e:
+            debug_print(f"Subtitle parsing failed: {e}")
+    return None
+
+
+# --------------------------
+# 2) Whisper‑based workflow
+# --------------------------
+
+def _cleanup_audio_artifacts(vid: str) -> None:
+    """Remove partial audio download artifacts for the given video id."""
+    for path in glob.glob(f"audio_{vid}.*"):
+        # Keep any existing mp3; it may belong to a previous summary.
+        if path.endswith(".mp3"):
+            continue
+        try:
+            os.remove(path)
+        except OSError:
+            pass
+
+
+def _download_audio_with_yt_dlp(url: str, vid: str, extractor_args: Optional[dict] = None) -> str:
+    """Download audio via yt_dlp and extract to wav."""
+    audio_fn = f"audio_{vid}.wav"
+    opts = {
+        "format": "bestaudio/best",
+        "outtmpl": f"audio_{vid}.%(ext)s",
+        "quiet": True,
+        "noprogress": True,
+        "nopart": True,
+        "continuedl": False,
+        "overwrites": True,
+        "noplaylist": True,
+        "retries": 3,
+        "fragment_retries": 3,
+        "postprocessors": [{
+            "key": "FFmpegExtractAudio",
+            "preferredcodec": "wav",
+        }],
+    }
+    if extractor_args:
+        opts["extractor_args"] = extractor_args
+    with yt_dlp.YoutubeDL(opts) as ydl:
+        ydl.download([url])
+    if not os.path.exists(audio_fn):
+        raise RuntimeError("yt_dlp completed but wav file was not created")
+    return audio_fn
+
+
+def download_video_audio(url: str, vid: str) -> str:
+    """Download the best available audio for a YouTube video."""
+    print(f"📥 Downloading audio from {url} …")
+
+    # Clean up any stale partials that can trigger HTTP 416 resume errors.
+    _cleanup_audio_artifacts(vid)
+
+    attempts = [
+        ("android player client", {"youtube": {"player_client": ["android"]}}),
+        ("default player client", None),
+    ]
+
+    last_err = None
+    for label, extractor_args in attempts:
+        try:
+            debug_print(f"yt_dlp audio attempt: {label}")
+            audio_fn = _download_audio_with_yt_dlp(url, vid, extractor_args)
+            debug_print(f"Audio saved as {audio_fn}")
+            return audio_fn
+        except Exception as e:
+            last_err = e
+            debug_print(f"yt_dlp attempt failed ({label}): {e}")
+            _cleanup_audio_artifacts(vid)
+
+    raise RuntimeError("Audio download failed after multiple attempts") from last_err
+
+
+def get_audio_duration(path: str) -> float:
+    """Return the duration of an audio file using ffprobe."""
+    res = subprocess.run([
+        get_ffprobe_binary(), "-v", "error", "-show_entries", "format=duration",
+        "-of", "default=noprint_wrappers=1:nokey=1", path
+    ], capture_output=True, text=True)
+    return float(res.stdout.strip())
+
+
+def slice_audio(audio_path: str, vid: str) -> List[Tuple[str, float, float]]:
+    """Slice a long audio file into overlapping chunks for Whisper."""
+    print("Slicing audio …")
+    duration = get_audio_duration(audio_path)
+    length = duration / NUM_SLICES
+    slices = []
+    for i in range(NUM_SLICES):
+        start = max(0, i * length - (OVERLAP_SEC if i > 0 else 0))
+        end = min(duration, (i + 1) * length + (OVERLAP_SEC if i < NUM_SLICES - 1 else 0))
+        fn = f"audio_{vid}_slice_{i:02d}.wav"
+        subprocess.run([
+            get_ffmpeg_binary(), "-y", "-hide_banner", "-loglevel", "error",
+            "-ss", str(start), "-to", str(end),
+            "-i", audio_path, "-acodec", "copy", fn
+        ], check=True)
+        debug_print(f"  slice {i}: {start:.1f}s→{end:.1f}s ({fn})")
+        slices.append((fn, start, end))
+    return slices
+
+
+def transcribe_slice(args: Tuple[str, int, str, str]) -> str:
+    """Transcribe a single audio slice using Whisper and save to a text file."""
+    slice_path, idx, model_name, vid = args
+    if whisper is None:
+        raise RuntimeError("Whisper package is required but not installed")
+    m = whisper.load_model(model_name, download_root=get_whisper_download_root())
+    res = m.transcribe(slice_path, task="transcribe")
+    out = f"transcript_{vid}_slice_{idx:02d}.txt"
+    with open(out, "w", encoding="utf-8") as f:
+        f.write(res["text"])
+    debug_print(f"Transcribed slice {idx} → {out}")
+    return out
+
+
+def merge_transcripts(files: List[str]) -> str:
+    """Merge transcribed slices by eliminating overlapping words."""
+    merged, prev = [], []
+    for i, fn in enumerate(files):
+        words = open(fn, encoding="utf-8").read().split()
+        if i > 0:
+            p_tail = prev[-MAX_OVERLAP_WORDS:]
+            c_head = words[:MAX_OVERLAP_WORDS]
+            L = min(len(p_tail), len(c_head))
+            best = 0
+            for n in range(L, 4, -1):
+                if p_tail[-n:] == c_head[:n]:
+                    best = n
+                    break
+            if best:
+                debug_print(f"  overlap {best} words between slices {i-1}↔{i}")
+                words = words[best:]
+        merged += words
+        prev = words
+    text = " ".join(merged)
+    debug_print(f"Merged transcript: {len(text)} chars, {len(merged)} words")
+    return text
+
+
+def clean_temp(pattern: str) -> None:
+    """Remove temporary files matching the given glob pattern."""
+    for f in glob.glob(pattern):
+        try:
+            os.remove(f)
+        except Exception:
+            pass
+
+
+def whisper_transcript(url: str, vid: str) -> str:
+    """Run the Whisper pipeline and return the final transcript text."""
+    audio = download_video_audio(url, vid)
+    slices = slice_audio(audio, vid)
+    print("✍️  Transcribing using Whisper...", flush=True)
+    args = [(p, i, WHISPER_MODEL, vid) for i, (p, _, _) in enumerate(slices)]
+    with multiprocessing.Pool(len(slices)) as pool:
+        t_files = pool.map(transcribe_slice, args)
+    text = merge_transcripts(t_files)
+    clean_temp(f"audio_{vid}_slice_*.wav")
+    clean_temp(f"transcript_{vid}_slice_*.txt")
+    # Leave the original audio file so it can be referenced by the GUI
+    return text
+
+
+# -----------------------
+# Ollama‑Summarizer
+# -----------------------
+
+def summarize_with_ollama(title: str, transcript: str, model: str = "mistral:latest") -> str:
+    """
+    Send video title and transcript text to Ollama and return the summary string.
+    """
+    debug_print(f"Preparing summary with model {model}, transcript length={len(transcript)}")
+    prompt = (
+        "You are an expert summarizer. Summarize the following video concisely:\n\n"
+        f"Title: {title}\n\n"
+        f"Transcript:\n{transcript}\n\n"
+        "Summary:"
+    )
+    debug_print(prompt)
+    payload = {
+        "model": model,
+        "messages": [
+            {"role": "system", "content": "You are an intelligent summarizer."},
+            {"role": "user", "content": prompt}
+        ],
+        "stream": True
+    }
+    debug_print("Sending request to Ollama …")
+    resp = requests.post("http://localhost:11434/api/chat", json=payload, stream=True)
+    debug_print(f"Ollama status: {resp.status_code}")
+    summary = ""
+    for line in resp.iter_lines(decode_unicode=True):
+        if not line:
+            continue
+        try:
+            msg = json.loads(line).get("message", {}).get("content", "")
+            summary += msg
+        except Exception:
+            continue
+    debug_print(f"Summary generated, length={len(summary)}")
+    return summary
+
+
+# -----------------------
+# Video metadata and thumbnail download
+# -----------------------
+
+def fetch_video_metadata(url: str) -> Tuple[str, str, str]:
+    """
+    Fetch the title, thumbnail URL and video ID for a YouTube URL using yt_dlp.
+    Returns a tuple: (video_id, title, thumbnail_url)
+    """
+    with yt_dlp.YoutubeDL({'quiet': True}) as ydl:
+        info = ydl.extract_info(url, download=False)
+    vid = info.get('id')
+    title = info.get('title', f"Video {vid}")
+    thumbnail_url = info.get('thumbnail')
+    return vid, title, thumbnail_url
+
+
+def fetch_channel_name(url: str) -> Optional[str]:
+    """
+    Retrieve the channel or uploader name for a YouTube video using yt_dlp.
+    Returns None if it cannot be determined.
+    """
+    try:
+        with yt_dlp.YoutubeDL({'quiet': True}) as ydl:
+            info = ydl.extract_info(url, download=False)
+        # Try channel, uploader, then return None
+        return info.get('channel') or info.get('uploader')
+    except Exception as e:
+        debug_print(f"Failed to fetch channel name: {e}")
+        return None
+
+
+def download_thumbnail(vid: str, thumbnail_url: str) -> Optional[str]:
+    """
+    Download the thumbnail image given its URL and save it as thumb_<vid>.<ext>.
+    Returns the local filename or None if download fails.
+    """
+    if not thumbnail_url:
+        return None
+    try:
+        response = requests.get(thumbnail_url, timeout=10)
+        response.raise_for_status()
+        # Determine extension from content type or URL
+        ext = None
+        if 'content-type' in response.headers:
+            ctype = response.headers['content-type']
+            if 'jpeg' in ctype:
+                ext = 'jpg'
+            elif 'png' in ctype:
+                ext = 'png'
+        if ext is None:
+            ext = thumbnail_url.split('.')[-1].split('?')[0]
+        filename = f"thumb_{vid}.{ext}"
+        with open(filename, 'wb') as f:
+            f.write(response.content)
+        debug_print(f"Thumbnail downloaded as {filename}")
+        return filename
+    except Exception as e:
+        debug_print(f"Thumbnail download failed: {e}")
+        return None
+
+
+# -----------------------
+# Main
+# -----------------------
+
+def process_video(url: str, use_whisper: bool, model: str = "mistral:latest", output_json: Optional[str] = None) -> dict:
+    """
+    Core processing routine.  Retrieves metadata, obtains transcript via the
+    selected workflow, generates a summary using Ollama and writes the
+    transcript, thumbnail and audio (converted to mp3) to disk.  Returns a
+    dictionary containing metadata which may also be dumped to a JSON file if
+    output_json is provided.
+
+    Parameters
+    ----------
+    url : str
+        The YouTube video URL.
+    use_whisper : bool
+        If True, use the Whisper transcription workflow; if False, use the
+        classic API/subtitle workflow.
+    model : str, optional
+        The Ollama model name to use for summarization.  Defaults to
+        "mistral:latest".
+    output_json : str or None, optional
+        If provided, path to a file where JSON metadata should be written.
+
+    Returns
+    -------
+    dict
+        A dictionary containing metadata about the processed video.
+    """
+    vid, title, thumb_url = fetch_video_metadata(url)
+    if not vid:
+        raise SystemExit("Invalid YouTube URL.")
+
+    # Fetch the channel/uploader name
+    channel_name = fetch_channel_name(url)
+
+    # Fetch transcript
+    if use_whisper:
+        print("🤖  Using Whisper parallel transcription…")
+        transcript_text = whisper_transcript(url, vid)
+        if not transcript_text.strip():
+            raise SystemExit("Whisper transcription failed or empty.")
+    else:
+        print("▶️  Using classic API/subtitle workflow…")
+        # Try API first
+        try:
+            transcript_text = get_transcript_api(vid)
+        except Exception:
+            print("API failed, falling back to subtitles…")
+            transcript_text = get_subtitles_via_yt_dlp(url)
+        if not transcript_text:
+            raise SystemExit("No transcript/subtitles available.")
+
+    # Save transcript to file
+    transcript_filename = f"transcript_{vid}.txt"
+    with open(transcript_filename, 'w', encoding='utf-8') as f:
+        f.write(transcript_text)
+    debug_print(f"Transcript saved to {transcript_filename}")
+
+    # Download thumbnail
+    thumbnail_filename = download_thumbnail(vid, thumb_url)
+
+    # Determine audio filename if generated and convert to mp3
+    audio_filename = None
+    if use_whisper:
+        wav_name = f"audio_{vid}.wav"
+        mp3_name = f"audio_{vid}.mp3"
+        # Convert to mp3 using ffmpeg if wav exists
+        if os.path.exists(wav_name):
+            try:
+                subprocess.run([
+                    get_ffmpeg_binary(), '-y', '-i', wav_name,
+                    '-codec:a', 'libmp3lame', '-qscale:a', '2',
+                    mp3_name
+                ], check=True)
+                os.remove(wav_name)
+                debug_print(f"Converted {wav_name} to {mp3_name} and removed wav")
+                audio_filename = mp3_name
+            except Exception as e:
+                debug_print(f"Failed to convert audio to mp3: {e}")
+                # fallback: keep wav
+                audio_filename = wav_name
+        else:
+            # If wav file doesn't exist yet (perhaps removed elsewhere), do not set audio
+            audio_filename = None
+
+    # Generate summary
+    print("✍️  Generating summary with Ollama…", flush=True)
+    summary_text = summarize_with_ollama(title, transcript_text, model)
+
+    # Create metadata dictionary
+    meta = {
+        'timestamp': datetime.utcnow().isoformat() + 'Z',
+        'video_id': vid,
+        'url': url,
+        'video_name': title,
+        'channel': channel_name,
+        'thumbnail': thumbnail_filename,
+        'audio': audio_filename,
+        'transcript': transcript_filename,
+        'summary': summary_text
+    }
+
+    # Write JSON output if requested
+    if output_json:
+        with open(output_json, 'w', encoding='utf-8') as f:
+            json.dump(meta, f, ensure_ascii=False, indent=2)
+        debug_print(f"Metadata written to {output_json}")
+    return meta
+
+
+def rewrite_summary(title: str, transcript_file: str, model: str = "mistral:latest", output_json: Optional[str] = None) -> dict:
+    """
+    Regenerate a summary from an existing transcript file using the specified model.
+
+    Parameters
+    ----------
+    transcript_file : str
+        Path to a text file containing the transcript.
+    model : str, optional
+        Name of the Ollama model to use for summarization.
+    output_json : str or None, optional
+        If provided, write the resulting summary dictionary to this file.
+
+    Returns
+    -------
+    dict
+        A dictionary containing just the summary.
+    """
+    if not os.path.exists(transcript_file):
+        raise SystemExit(f"Transcript file not found: {transcript_file}")
+    with open(transcript_file, 'r', encoding='utf-8') as f:
+        transcript_text = f.read()
+    debug_print(f"Rewriting summary using model {model} for {transcript_file}")
+    summary_text = summarize_with_ollama(title, transcript_text, model)
+    meta = {'summary': summary_text}
+    if output_json:
+        with open(output_json, 'w', encoding='utf-8') as f:
+            json.dump(meta, f, ensure_ascii=False, indent=2)
+        debug_print(f"Summary written to {output_json}")
+    return meta
+
+
+def main():
+    import argparse
+    parser = argparse.ArgumentParser(description="YouTube → Transcript → Ollama Summary")
+    parser.add_argument('url', help="YouTube‑Video‑URL")
+    parser.add_argument('--no-ai', action='store_true',
+                        help="Use classic API/subtitle workflow instead of Whisper")
+    parser.add_argument('--output-json', type=str, default=None,
+                        help="Write metadata JSON to the specified file instead of STDOUT")
+    parser.add_argument('--model', type=str, default='mistral:latest',
+                        help="Ollama model to use for summarization (default: mistral:latest)")
+    parser.add_argument('--transcript-file', type=str, default=None,
+                        help="Path to an existing transcript file; when provided the script will skip transcription and only generate a summary.")
+    args = parser.parse_args()
+
+    use_whisper = not args.no_ai
+
+    try:
+        # If a transcript file is provided, skip the normal processing and only rewrite summary
+        if args.transcript_file:
+            vid, title, _ = fetch_video_metadata(args.url)
+            meta = rewrite_summary(title, args.transcript_file, args.model, args.output_json)
+        else:
+            meta = process_video(args.url, use_whisper, args.model, args.output_json)
+        # If no JSON output specified, print metadata as JSON to stdout
+        if not args.output_json:
+            print(json.dumps(meta, ensure_ascii=False, indent=2))
+    except SystemExit as e:
+        # Provide a friendly exit message without a stacktrace
+        print(str(e))
+        sys.exit(1)
+
+
+if __name__ == '__main__':
+    main()