Add two-video drone 3DGS pipeline with Apple Silicon fixes

- main.py: extract frames from two videos, run COLMAP feature extraction - match_features.py: Python-based within-video SIFT matching via OpenCV (replaces colmap exhaustive_matcher which segfaults on ARM64 in COLMAP 4.x) - match_crossvideo.py: exhaustive cross-video matching (v1×v2) to stitch two flights into a single COLMAP model - run.sh: entry point for frame extraction + feature extraction - train_splat.sh: ns-process-data → splatfacto → .ply export, with correct PATH for Homebrew ffmpeg and MPS device flags for Apple Silicon - .gitignore: exclude videos, generated scene data, venv, logs - README.md: full pipeline walkthrough, all known issues and fixes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-26 15:09:30 +01:00
parent e0db1edbc6
commit 7f4cdd9459
7 changed files with 782 additions and 0 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -0,0 +1,24 @@
+# Videos (too large for git)
+*.mp4
+*.mov
+*.avi
+
+# Generated scene data
+my_scene/
+outputs/
+
+# Python environment
+venv/
+__pycache__/
+*.pyc
+*.pyo
+
+# Logs
+*.log
+my_scene_build.log
+
+# macOS
+.DS_Store
+
+# Editor
+.claude/
--- a/README.md
+++ b/README.md
@@ -0,0 +1,175 @@
+# Drone-3DGS
+
+Two-flight DJI drone footage → 3D Gaussian Splatting pipeline for **Apple Silicon Macs** (M1/M2/M3).
+
+Takes two `.mp4` videos of the same scene from different angles, runs Structure-from-Motion via COLMAP, and produces a `.ply` Gaussian splat you can view in a browser.
+
+---
+
+## Requirements
+
+```bash
+brew install colmap        # COLMAP 4.x (SfM)
+brew install ffmpeg        # full version with all filters
+python3 -m venv venv
+source venv/bin/activate
+pip install torch torchvision
+pip install nerfstudio
+```
+
+> **Python version**: 3.10 recommended (tested with 3.10.18 via pyenv).
+
+---
+
+## Project structure
+
+```
+.
+├── 1.mp4                  # first drone flight
+├── 2.mp4                  # second drone flight
+├── main.py                # Step 1 – extract frames + COLMAP feature extraction
+├── match_features.py      # Step 2 – within-video SIFT matching (Python, bypasses COLMAP crash)
+├── match_crossvideo.py    # Step 3 – cross-video exhaustive matching (v1×v2)
+├── run.sh                 # Runs main.py (frame extraction + feature extraction)
+└── train_splat.sh         # Steps 4–6: ns-process-data → splatfacto → export .ply
+```
+
+---
+
+## How to run
+
+### Step 1 – Extract frames and COLMAP features
+
+```bash
+source venv/bin/activate
+bash run.sh
+```
+
+This calls `main.py` which:
+1. Extracts frames from `1.mp4` and `2.mp4` at 2 fps into `my_scene/images/` (named `v1_*.jpg` / `v2_*.jpg`)
+2. Runs `colmap feature_extractor` — SIFT features written to `my_scene/database.db`
+3. Runs `match_features.py` — sequential within-video matching (overlap=50) via OpenCV BFMatcher
+
+**Why not `colmap exhaustive_matcher`?**
+COLMAP 4.x has a threading bug on Apple Silicon ARM64 causing a SIGSEGV in all matcher variants. `match_features.py` replaces it entirely: reads SIFT descriptors from the SQLite database, matches with OpenCV BFMatcher + Lowe ratio test + RANSAC, and writes `two_view_geometries` back to the DB. The mapper only needs that table.
+
+### Step 2 – Cross-video matching
+
+```bash
+python3 match_crossvideo.py
+```
+
+Matches every `v1_*` frame against every `v2_*` frame (14,900 pairs) so the two flights stitch into a single model. Takes ~70 min on M1 Pro CPU (~0.28 s/pair with OpenCV BFMatcher).
+
+### Step 3 – COLMAP mapper
+
+```bash
+colmap mapper \
+  --database_path my_scene/database.db \
+  --image_path my_scene/images \
+  --output_path my_scene/sparse
+```
+
+Produces sparse models in `my_scene/sparse/`. The largest (most registered images) is the one to use. With overlapping flights you should get ~90–95% of frames in a single model.
+
+### Steps 4–6 – Convert, train, export
+
+```bash
+bash train_splat.sh
+```
+
+This script automatically:
+1. Finds the largest COLMAP model in `my_scene/sparse/`
+2. Converts it to Nerfstudio format with `ns-process-data`
+3. Trains `splatfacto` — **live viewer at http://localhost:7007 during training**
+4. Exports the Gaussian splat to `my_scene/exports/splat.ply`
+
+---
+
+## Known issues and fixes applied
+
+### COLMAP 4.x matcher segfault (Apple Silicon)
+
+All COLMAP matcher variants (`exhaustive_matcher`, `sequential_matcher`, `vocab_tree_matcher`) crash with SIGSEGV on ARM64 due to a bug in the SIFT worker thread initialization. **Fix:** `match_features.py` and `match_crossvideo.py` replace the COLMAP matcher entirely using OpenCV.
+
+### ffmpeg `fps` and `split` filters missing
+
+The nerfstudio-bundled ffmpeg is compiled with a minimal filter set. **Fixes:**
+- `main.py` uses `-r` output flag instead of `-vf fps=...`
+- `train_splat.sh` prepends `/opt/homebrew/opt/ffmpeg/bin` to `PATH` so `ns-process-data` uses the full Homebrew ffmpeg
+
+### nerfstudio splatfacto hardcoded `.cuda()` calls
+
+Two lines in the installed `splatfacto.py` call `.cuda()` unconditionally. Patched in-place:
+
+| Location | Original | Fix |
+|----------|----------|-----|
+| `populate_modules()` | `shs = torch.zeros(...).float().cuda()` | `shs = torch.zeros(...).float()` |
+| `get_outputs_for_camera()` | `K = ....cuda()` | `K = ....to(self.device)` |
+
+If you reinstall nerfstudio, re-apply with:
+
+```bash
+F=venv/lib/python3.10/site-packages/nerfstudio/models/splatfacto.py
+sed -i '' 's/\.float()\.cuda()/\.float()/g' "$F"
+sed -i '' 's/get_intrinsics_matrices()\.cuda()/get_intrinsics_matrices().to(self.device)/g' "$F"
+```
+
+---
+
+## 3DGS on Apple Silicon — current status
+
+`splatfacto` uses **gsplat** as its rasterizer. gsplat 1.x requires CUDA — there is no MPS or CPU fallback. On Apple Silicon the CUDA extension is `None` at load time and crashes at first use.
+
+**Two options for actual Gaussian Splatting:**
+
+### Option A — Brush (recommended, uses Apple Metal natively)
+
+```bash
+# Install Rust (one-time)
+brew install rustup && rustup-init -y && source ~/.cargo/env
+
+# Build and run
+cargo install --git https://github.com/ArthurBrussee/brush brush-cli
+brush-cli --source my_scene/sparse/3
+```
+
+Outputs a `.ply` and has a built-in web viewer.
+
+### Option B — Google Colab (free GPU)
+
+The scene is already in Nerfstudio format at `my_scene/ns_data/`. Zip it, upload to a Colab T4 instance:
+
+```python
+!pip install nerfstudio
+!ns-train splatfacto --data /content/ns_data --vis wandb
+```
+
+Download `outputs/*/splatfacto/*/splat.ply` when done.
+
+---
+
+## Viewing results
+
+| What | How |
+|------|-----|
+| During splatfacto training | `http://localhost:7007` (Nerfstudio Viser viewer) |
+| Sparse point cloud (ready now) | Drag `my_scene/exports/sparse_pointcloud.ply` into https://playcanvas.com/supersplat/editor |
+| Final Gaussian splat | Drag `my_scene/exports/splat.ply` into https://playcanvas.com/supersplat/editor |
+
+PlayCanvas SuperSplat runs 100% in-browser — the file never leaves your machine.
+
+---
+
+## Re-running from scratch
+
+```bash
+rm -rf my_scene outputs
+bash run.sh                          # frames + features (~10 min)
+python3 match_crossvideo.py          # cross-video matching (~70 min)
+colmap mapper \
+  --database_path my_scene/database.db \
+  --image_path my_scene/images \
+  --output_path my_scene/sparse      # mapping (~10 min)
+bash train_splat.sh                  # convert + train + export
+```
--- a/main.py
+++ b/main.py
@@ -0,0 +1,155 @@
+#!/usr/bin/env python3
+"""
+Two-video drone footage -> 3DGS pipeline for Apple Silicon Macs.
+
+Usage:
+    python drone_3dgs_pipeline.py \
+        --video1 path/to/flight1.mp4 \
+        --video2 path/to/flight2.mp4 \
+        --output_dir ./my_scene \
+        --fps 2
+
+What it does:
+    1. Extracts frames from both videos at the given fps (default 2 fps).
+    2. Pools them into one folder with non-colliding names.
+    3. Runs COLMAP feature extraction, matching, and sparse reconstruction.
+    4. Hands you a Nerfstudio-ready folder structure to train splatfacto.
+
+Then run:
+    ns-train splatfacto --data ./my_scene
+"""
+
+import argparse
+import shutil
+import subprocess
+import sys
+from pathlib import Path
+
+
+def run(cmd, check=True):
+    """Run a shell command, streaming output."""
+    print(f"\n>>> {' '.join(str(c) for c in cmd)}\n")
+    result = subprocess.run(cmd, check=check)
+    return result
+
+
+def check_dependencies():
+    """Verify ffmpeg and colmap are installed."""
+    for tool in ("ffmpeg", "colmap"):
+        if shutil.which(tool) is None:
+            sys.exit(f"ERROR: {tool} not found. Install with: brew install {tool}")
+
+
+def extract_frames(video_path: Path, out_dir: Path, fps: float, prefix: str):
+    """Extract frames from a video using ffmpeg at the given fps."""
+    out_dir.mkdir(parents=True, exist_ok=True)
+    pattern = str(out_dir / f"{prefix}_%05d.jpg")
+    run([
+        "ffmpeg", "-i", str(video_path),
+        "-r", str(fps),
+        "-q:v", "2",          # JPEG quality (2 = high)
+        "-y",                  # overwrite
+        pattern,
+    ])
+    count = len(list(out_dir.glob(f"{prefix}_*.jpg")))
+    print(f"Extracted {count} frames from {video_path.name} -> {out_dir}")
+    return count
+
+
+def run_colmap(workspace: Path, images_dir: Path, use_gpu: bool = False):
+    """Run the COLMAP SfM pipeline: feature extraction, matching, mapping."""
+    sparse_dir = workspace / "sparse"
+    sparse_dir.mkdir(parents=True, exist_ok=True)
+    db_path = workspace / "database.db"
+
+    # Step 1: feature extraction (COLMAP 4.x renamed SiftExtraction → FeatureExtraction)
+    run([
+        "colmap", "feature_extractor",
+        "--database_path", str(db_path),
+        "--image_path", str(images_dir),
+        "--ImageReader.single_camera", "1",
+        "--ImageReader.camera_model", "OPENCV",
+        "--FeatureExtraction.use_gpu", "1" if use_gpu else "0",
+    ])
+
+    # Step 2: Python matcher (COLMAP 4.x exhaustive_matcher segfaults on Apple Silicon ARM64;
+    # match_features.py reads SIFT descriptors from the DB and matches via OpenCV BFMatcher)
+    script = Path(__file__).parent / "match_features.py"
+    run([sys.executable, str(script), "--db", str(db_path)])
+
+    # Step 3: sparse reconstruction (this is the slow one)
+    run([
+        "colmap", "mapper",
+        "--database_path", str(db_path),
+        "--image_path", str(images_dir),
+        "--output_path", str(sparse_dir),
+    ])
+
+    # COLMAP writes to sparse/0/ by default
+    print(f"\nCOLMAP done. Reconstruction in {sparse_dir}/0/")
+
+
+def main():
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--video1", type=Path, required=True)
+    parser.add_argument("--video2", type=Path, required=True)
+    parser.add_argument("--output_dir", type=Path, required=True)
+    parser.add_argument("--fps", type=float, default=2.0,
+                        help="Frames per second to extract (default: 2). "
+                             "Higher = more frames, slower training, better quality.")
+    parser.add_argument("--use_gpu", action="store_true",
+                        help="Try GPU SIFT in COLMAP (often unreliable on M1; default off).")
+    args = parser.parse_args()
+
+    check_dependencies()
+
+    if not args.video1.exists() or not args.video2.exists():
+        sys.exit("ERROR: one or both video files not found.")
+
+    workspace = args.output_dir
+    images_dir = workspace / "images"
+    workspace.mkdir(parents=True, exist_ok=True)
+
+    # Extract frames from both videos into the SAME images folder, with prefixes
+    # so filenames don't collide. COLMAP treats them as one set automatically.
+    print(f"\n=== Extracting frames from {args.video1.name} ===")
+    n1 = extract_frames(args.video1, images_dir, args.fps, prefix="v1")
+    print(f"\n=== Extracting frames from {args.video2.name} ===")
+    n2 = extract_frames(args.video2, images_dir, args.fps, prefix="v2")
+    total = n1 + n2
+
+    print(f"\n=== Total frames: {total} ===")
+    if total > 800:
+        print("WARNING: lots of frames. Consider lowering --fps. "
+              "Exhaustive matching will be slow; switch to sequential_matcher if needed.")
+
+    # Run COLMAP
+    print("\n=== Running COLMAP (this is the slow part, get a coffee) ===")
+    run_colmap(workspace, images_dir, use_gpu=args.use_gpu)
+
+    # Print next steps
+    print(f"""
+========================================================================
+DONE with SfM. Your scene is at: {workspace}
+
+Next, train the splat. Two options on Mac:
+
+OPTION 1: Nerfstudio (Python, scriptable)
+    pip install nerfstudio
+    ns-process-data images --data {images_dir} --output-dir {workspace}/ns_data \\
+        --skip-colmap --colmap-model-path {workspace}/sparse/0
+    ns-train splatfacto --data {workspace}/ns_data
+
+OPTION 2: Brush (Rust binary, faster on Mac)
+    Download from https://github.com/ArthurBrussee/brush
+    brush --source {workspace}
+
+View result:
+    - Online: https://playcanvas.com/supersplat/editor (drag .ply in)
+    - Local: install SuperSplat or use Nerfstudio's viewer
+========================================================================
+""")
+
+
+if __name__ == "__main__":
+    main()
--- a/match_crossvideo.py
+++ b/match_crossvideo.py
@@ -0,0 +1,131 @@
+#!/usr/bin/env python3
+"""
+Cross-video exhaustive matching for two-flight drone footage.
+
+Matches every v1_* frame against every v2_* frame. The within-video
+matches from match_features.py are already in the database and are not
+touched. After this script, re-run colmap mapper to stitch the scene.
+"""
+import argparse
+import sqlite3
+import numpy as np
+import cv2
+from pathlib import Path
+
+MIN_INLIERS = 15
+RATIO_TEST  = 0.75
+RANSAC_ERROR = 4.0
+KMAX = 2_147_483_647
+
+
+def pair_id(id1: int, id2: int) -> int:
+    lo, hi = (id1, id2) if id1 < id2 else (id2, id1)
+    return KMAX * lo + hi
+
+
+def load_desc_kpts(cur, image_id):
+    cur.execute("SELECT rows, cols, data FROM descriptors WHERE image_id=?", (image_id,))
+    r = cur.fetchone()
+    desc = np.frombuffer(r[2], dtype=np.uint8).reshape(r[0], r[1]) if r else np.zeros((0,128), dtype=np.uint8)
+
+    cur.execute("SELECT rows, cols, data FROM keypoints WHERE image_id=?", (image_id,))
+    r = cur.fetchone()
+    if r:
+        kp = np.frombuffer(r[2], dtype=np.float32).reshape(r[0], r[1])
+        kpts = kp[:, :2]
+    else:
+        kpts = np.zeros((0, 2), dtype=np.float32)
+    return desc, kpts
+
+
+def match_pair(desc1, desc2, kp1, kp2):
+    if len(desc1) < 8 or len(desc2) < 8:
+        return None, None
+    bf  = cv2.BFMatcher(cv2.NORM_L2)
+    raw = bf.knnMatch(desc1.astype(np.float32), desc2.astype(np.float32), k=2)
+    good = []
+    for m_pair in raw:
+        if len(m_pair) == 2:
+            m, n = m_pair
+            if m.distance < RATIO_TEST * n.distance:
+                good.append((m.queryIdx, m.trainIdx))
+    if len(good) < MIN_INLIERS:
+        return None, None
+    arr  = np.array(good, dtype=np.uint32)
+    pts1 = kp1[arr[:, 0]]
+    pts2 = kp2[arr[:, 1]]
+    F, mask = cv2.findFundamentalMat(
+        pts1, pts2, cv2.FM_RANSAC,
+        ransacReprojThreshold=RANSAC_ERROR,
+        confidence=0.9999, maxIters=2000,
+    )
+    if F is None or mask is None:
+        return None, None
+    inliers = arr[mask.ravel().astype(bool)]
+    return (inliers, F) if len(inliers) >= MIN_INLIERS else (None, None)
+
+
+def write_pair(cur, pid, inliers, F):
+    blob   = inliers.astype(np.uint32).tobytes()
+    z9     = np.zeros(9, dtype=np.float64).tobytes()
+    z4     = np.zeros(4, dtype=np.float64).tobytes()
+    z3     = np.zeros(3, dtype=np.float64).tobytes()
+    F_blob = F.flatten().astype(np.float64).tobytes()
+    cur.execute(
+        "INSERT OR REPLACE INTO matches (pair_id, rows, cols, data) VALUES (?,?,?,?)",
+        (pid, len(inliers), 2, blob),
+    )
+    cur.execute(
+        "INSERT OR REPLACE INTO two_view_geometries "
+        "(pair_id, rows, cols, data, config, F, E, H, qvec, tvec) VALUES (?,?,?,?,?,?,?,?,?,?)",
+        (pid, len(inliers), 2, blob, 3, F_blob, z9, z9, z4, z3),
+    )
+
+
+def main():
+    p = argparse.ArgumentParser()
+    p.add_argument("--db", default="my_scene/database.db")
+    args = p.parse_args()
+
+    db  = sqlite3.connect(args.db)
+    db.execute("PRAGMA journal_mode=WAL")
+    cur = db.cursor()
+
+    cur.execute("SELECT image_id, name FROM images ORDER BY name")
+    rows = cur.fetchall()
+    v1 = [(id, name) for id, name in rows if name.startswith("v1_")]
+    v2 = [(id, name) for id, name in rows if name.startswith("v2_")]
+    total_pairs = len(v1) * len(v2)
+    print(f"v1={len(v1)} frames  v2={len(v2)} frames  cross-pairs={total_pairs}")
+
+    # Preload all v1 and v2 descriptors into RAM
+    print("Loading v1 descriptors…")
+    v1_data = {id: load_desc_kpts(cur, id) for id, _ in v1}
+    print("Loading v2 descriptors…")
+    v2_data = {id: load_desc_kpts(cur, id) for id, _ in v2}
+
+    matched = skipped = i = 0
+    for id1, _ in v1:
+        desc1, kp1 = v1_data[id1]
+        for id2, _ in v2:
+            desc2, kp2 = v2_data[id2]
+            inliers, F = match_pair(desc1, desc2, kp1, kp2)
+            if inliers is not None:
+                write_pair(cur, pair_id(id1, id2), inliers, F)
+                matched += 1
+            else:
+                skipped += 1
+            i += 1
+            if i % 500 == 0:
+                pct = 100 * i / total_pairs
+                print(f"  [{i}/{total_pairs} {pct:.0f}%] cross-matched={matched}", flush=True)
+                db.commit()
+
+    db.commit()
+    db.close()
+    print(f"\nDone. {matched} cross-video pairs matched, {skipped} below threshold.")
+    print("Now delete my_scene/sparse/* and re-run colmap mapper.")
+
+
+if __name__ == "__main__":
+    main()
--- a/match_features.py
+++ b/match_features.py
@@ -0,0 +1,172 @@
+#!/usr/bin/env python3
+"""
+Python replacement for COLMAP's crashing exhaustive_matcher on Apple Silicon.
+
+Reads SIFT features from the COLMAP SQLite database, matches them with
+OpenCV BFMatcher (Lowe ratio test), verifies with RANSAC, and writes
+matches + two_view_geometries back to the database.
+
+COLMAP's mapper reads two_view_geometries — no need to re-run any COLMAP
+matcher binary after this script.
+"""
+import argparse
+import sqlite3
+import numpy as np
+import cv2
+import sys
+from pathlib import Path
+
+MIN_INLIERS = 15        # reject pairs with fewer verified matches
+RATIO_TEST = 0.75       # Lowe's ratio threshold
+RANSAC_ERROR = 4.0      # max reprojection error in pixels for RANSAC
+
+# COLMAP 4.x pair_id formula: kMaxNumImages * min(id1,id2) + max(id1,id2)
+KMAX = 2_147_483_647
+
+def pair_id(id1: int, id2: int) -> int:
+    lo, hi = (id1, id2) if id1 < id2 else (id2, id1)
+    return KMAX * lo + hi
+
+
+def read_images(cur):
+    cur.execute("SELECT image_id, name FROM images ORDER BY name")
+    return cur.fetchall()   # [(image_id, name), ...]
+
+
+def load_all(cur, image_ids):
+    descs, kpts = {}, {}
+    for iid in image_ids:
+        cur.execute("SELECT rows, cols, data FROM descriptors WHERE image_id=?", (iid,))
+        r = cur.fetchone()
+        if r:
+            descs[iid] = np.frombuffer(r[2], dtype=np.uint8).reshape(r[0], r[1])
+        else:
+            descs[iid] = np.zeros((0, 128), dtype=np.uint8)
+
+        cur.execute("SELECT rows, cols, data FROM keypoints WHERE image_id=?", (iid,))
+        r = cur.fetchone()
+        if r:
+            kp = np.frombuffer(r[2], dtype=np.float32).reshape(r[0], r[1])
+            kpts[iid] = kp[:, :2]  # x, y
+        else:
+            kpts[iid] = np.zeros((0, 2), dtype=np.float32)
+    return descs, kpts
+
+
+def match_pair(desc1, desc2, kp1, kp2):
+    if len(desc1) < 8 or len(desc2) < 8:
+        return None, None
+
+    bf = cv2.BFMatcher(cv2.NORM_L2)
+    raw = bf.knnMatch(desc1.astype(np.float32), desc2.astype(np.float32), k=2)
+
+    good = []
+    for m_pair in raw:
+        if len(m_pair) == 2:
+            m, n = m_pair
+            if m.distance < RATIO_TEST * n.distance:
+                good.append((m.queryIdx, m.trainIdx))
+
+    if len(good) < MIN_INLIERS:
+        return None, None
+
+    arr = np.array(good, dtype=np.uint32)
+    pts1 = kp1[arr[:, 0]]
+    pts2 = kp2[arr[:, 1]]
+
+    F, mask = cv2.findFundamentalMat(
+        pts1, pts2, cv2.FM_RANSAC,
+        ransacReprojThreshold=RANSAC_ERROR,
+        confidence=0.9999,
+        maxIters=2000,
+    )
+    if F is None or mask is None:
+        return None, None
+
+    inliers = arr[mask.ravel().astype(bool)]
+    if len(inliers) < MIN_INLIERS:
+        return None, None
+
+    return inliers, F
+
+
+def write_pair(cur, pid, inliers, F):
+    blob = inliers.astype(np.uint32).tobytes()
+    zeros9 = np.zeros(9, dtype=np.float64).tobytes()
+    zeros4 = np.zeros(4, dtype=np.float64).tobytes()
+    zeros3 = np.zeros(3, dtype=np.float64).tobytes()
+    F_blob = F.flatten().astype(np.float64).tobytes()
+
+    cur.execute(
+        "INSERT OR REPLACE INTO matches (pair_id, rows, cols, data) VALUES (?,?,?,?)",
+        (pid, len(inliers), 2, blob),
+    )
+    cur.execute(
+        "INSERT OR REPLACE INTO two_view_geometries "
+        "(pair_id, rows, cols, data, config, F, E, H, qvec, tvec) "
+        "VALUES (?,?,?,?,?,?,?,?,?,?)",
+        (pid, len(inliers), 2, blob,
+         3,          # UNCALIBRATED — uses F matrix
+         F_blob, zeros9, zeros9, zeros4, zeros3),
+    )
+
+
+def sequential_pairs(ids, overlap):
+    pairs = []
+    n = len(ids)
+    for i in range(n):
+        for j in range(i + 1, min(i + overlap + 1, n)):
+            pairs.append((ids[i], ids[j]))
+    return pairs
+
+
+def main():
+    p = argparse.ArgumentParser()
+    p.add_argument("--db", default="my_scene/database.db")
+    p.add_argument("--overlap", type=int, default=50)
+    args = p.parse_args()
+
+    db_path = args.db
+    db = sqlite3.connect(db_path)
+    db.execute("PRAGMA journal_mode=WAL")
+    cur = db.cursor()
+
+    images = read_images(cur)
+    ids = [r[0] for r in images]
+    print(f"Images: {len(ids)}")
+
+    print("Loading descriptors & keypoints into memory…")
+    descs, kpts = load_all(cur, ids)
+    total_feats = sum(len(d) for d in descs.values())
+    print(f"Loaded {total_feats:,} keypoints total")
+
+    overlap = args.overlap
+    pairs = sequential_pairs(ids, overlap)
+    print(f"Pairs to match: {len(pairs)} (sequential overlap={overlap})")
+
+    matched = skipped = 0
+    for i, (id1, id2) in enumerate(pairs):
+        if i % 200 == 0:
+            pct = 100 * i / len(pairs)
+            print(f"  [{i}/{len(pairs)} {pct:.0f}%] matched={matched}", flush=True)
+
+        inliers, F = match_pair(descs[id1], descs[id2], kpts[id1], kpts[id2])
+        if inliers is not None:
+            write_pair(cur, pair_id(id1, id2), inliers, F)
+            matched += 1
+        else:
+            skipped += 1
+
+        if i % 500 == 0:
+            db.commit()
+
+    db.commit()
+    db.close()
+
+    print(f"\nDone. {matched} pairs matched, {skipped} below threshold.")
+    print(f"Now run:  colmap mapper --database_path {db_path} "
+          f"--image_path my_scene/images --output_path my_scene/sparse")
+
+
+if __name__ == "__main__":
+    main()
--- a/run.sh
+++ b/run.sh
@@ -0,0 +1,5 @@
+ python main.py \
+    --video1 1.mp4 \
+    --video2 2.mp4 \
+    --output_dir ./my_scene \
+    --fps 2
--- a/train_splat.sh
+++ b/train_splat.sh
@@ -0,0 +1,120 @@
+#!/usr/bin/env bash
+# Re-runnable pipeline: COLMAP output → Nerfstudio → splatfacto → .ply
+# Skips COLMAP (assumes my_scene/sparse/0/ already exists).
+set -euo pipefail
+cd "$(dirname "$0")"
+
+source venv/bin/activate
+# Use the full Homebrew ffmpeg (nerfstudio's bundled one lacks split/fps filters)
+export PATH="/opt/homebrew/opt/ffmpeg/bin:$PATH"
+
+SCENE=my_scene
+NS_DATA=$SCENE/ns_data
+EXPORT_DIR=$SCENE/exports
+PLY=$EXPORT_DIR/splat.ply
+
+# ── 1. Verify COLMAP output ────────────────────────────────────────────────
+echo ""
+echo "=== Step 1: Verifying COLMAP output ==="
+
+if [ ! -d "$SCENE/sparse" ] || [ -z "$(ls -A $SCENE/sparse 2>/dev/null)" ]; then
+  echo "ERROR: $SCENE/sparse/ not found or empty. Run main.py + match_crossvideo.py first."
+  exit 1
+fi
+
+# Pick the model with the most registered images
+BEST_MODEL=$(python3 -c "
+import struct, os, sys
+best_dir, best_imgs = '', 0
+for m in sorted(os.listdir('$SCENE/sparse')):
+    d = '$SCENE/sparse/' + m
+    f = d + '/images.bin'
+    if not os.path.isfile(f): continue
+    with open(f,'rb') as fh: n = struct.unpack('<Q', fh.read(8))[0]
+    if n > best_imgs: best_imgs, best_dir = n, d
+print(best_dir)
+")
+
+if [ -z "$BEST_MODEL" ]; then
+  echo "ERROR: no valid COLMAP model found in $SCENE/sparse/"
+  exit 1
+fi
+
+for f in cameras.bin images.bin points3D.bin; do
+  if [ ! -f "$BEST_MODEL/$f" ]; then
+    echo "ERROR: missing $BEST_MODEL/$f"
+    exit 1
+  fi
+done
+
+NUM_IMGS=$(python3 -c "import struct; f=open('$BEST_MODEL/images.bin','rb'); print(struct.unpack('<Q',f.read(8))[0])")
+NUM_PTS=$(python3 -c "import struct; f=open('$BEST_MODEL/points3D.bin','rb'); print(struct.unpack('<Q',f.read(8))[0])")
+echo "  Best model: $BEST_MODEL (images=$NUM_IMGS  points3D=$NUM_PTS)"
+
+num_models=$(find "$SCENE/sparse" -mindepth 1 -maxdepth 1 -type d | wc -l | tr -d ' ')
+if [ "$num_models" -gt 1 ]; then
+  echo "  WARNING: $num_models disconnected models — using largest ($BEST_MODEL)."
+  echo "           Run match_crossvideo.py and re-map to attempt a full stitch."
+fi
+
+# ── 2. Convert COLMAP → Nerfstudio format ─────────────────────────────────
+echo ""
+echo "=== Step 2: ns-process-data (COLMAP → Nerfstudio) ==="
+ns-process-data images \
+  --data "$(pwd)/$SCENE/images" \
+  --output-dir "$(pwd)/$NS_DATA" \
+  --skip-colmap \
+  --colmap-model-path "$(pwd)/$BEST_MODEL"
+
+# ── 3. Train splatfacto with browser viewer ────────────────────────────────
+echo ""
+echo "=== Step 3: Training splatfacto ==="
+echo ""
+echo "  ┌──────────────────────────────────────────────────────┐"
+echo "  │  Live viewer (fly around during training):           │"
+echo "  │  http://localhost:7007                               │"
+echo "  └──────────────────────────────────────────────────────┘"
+echo ""
+ns-train splatfacto \
+  --data "$NS_DATA" \
+  --vis viewer \
+  --viewer.quit-on-train-completion True
+
+# ── 4. Find the latest training output ────────────────────────────────────
+TRAIN_OUT=$(ls -td outputs/*/splatfacto/*/  2>/dev/null | head -1)
+if [ -z "$TRAIN_OUT" ]; then
+  echo "ERROR: could not find training output folder under outputs/"
+  exit 1
+fi
+CONFIG_PATH="$TRAIN_OUT/config.yml"
+echo "  Training output: $TRAIN_OUT"
+
+# ── 5. Export .ply ────────────────────────────────────────────────────────
+echo ""
+echo "=== Step 4: Exporting Gaussian splat to .ply ==="
+mkdir -p "$EXPORT_DIR"
+ns-export gaussian-splat \
+  --load-config "$CONFIG_PATH" \
+  --output-dir "$EXPORT_DIR"
+
+# ── 6. Final summary ──────────────────────────────────────────────────────
+echo ""
+echo "======================================================================"
+if [ -f "$PLY" ]; then
+  echo "  .ply exported: $(pwd)/$PLY"
+else
+  echo "  WARNING: splat.ply not found at $PLY — check $EXPORT_DIR/"
+fi
+echo ""
+echo "  View during training : http://localhost:7007"
+echo ""
+echo "  View final .ply (Option A — recommended):"
+echo "    Drag $(pwd)/$PLY into:"
+echo "    https://playcanvas.com/supersplat/editor"
+echo "    Runs 100% in-browser; the file stays on your machine."
+echo ""
+echo "  View final .ply (Option B — fully offline):"
+echo "    python3 -m http.server 8080 --directory \$(dirname $PLY)"
+echo "    Then open http://localhost:8080/splat.ply in gsplat viewer"
+echo "    (requires a separate gsplat.js page — Option A is simpler)"
+echo "======================================================================"