Projects

A collection of my work in computer vision, ML inference, and GPU programming for medical imaging and surgical robotics.

Featured Projects

GPU-Accelerated 4K Video Pipeline
GPU-Accelerated 4K Video Pipeline

Ultra-low latency 4K stereo surgical endoscopy pipeline with real-time depth estimation on the edge. I spent a lot of time prototyping with the Holoscan SDK on NVIDIA IGX Orin and Jetson Orin (including Thor), focusing on end-to-end GPU throughput: zero-copy capture/ingest, NVDEC/NVENC, CUDA pre/post-processing, TensorRT inference, and tightly-synchronized stereo processing for consistent depth. Iterated heavily on profiling, memory movement, and scheduling to keep latency predictable for safety-critical medical imaging. Image source: https://nvidia-holoscan.github.io/holohub/workflows/ai_surgical_video/

CUDATensorRTNVIDIA OrinHoloscan SDKC++
SurgVerse: Video World Foundation Models
SurgVerse: Video World Foundation Models

Diffusion-based video world foundation models for surgical simulation. Features multimodal controls for camera trajectory generation and data augmentation for training surgical AI systems.

PyTorchDiffusion ModelsVideo GenerationWorld Models
iMED Multi-Endoscope Dataset
iMED Multi-Endoscope Dataset

Comprehensive multi-endoscope dataset designed for surgical 3D perception research. Enables sub-pixel reprojection accuracy for advanced depth estimation and scene understanding.

Dataset3D PerceptionStereo VisionResearch

More Projects

Real-time Auto-Focus with Eye Tracking
Real-time Auto-Focus with Eye Tracking

Innovative auto-focus system using eye-tracking technology with Holoscan SDK. Processes 120fps video streams for responsive, natural focusing behavior in surgical endoscopes.

Holoscan SDKEye TrackingReal-time+1
Privacy-Centric VLLM RAG System
Privacy-Centric VLLM RAG System

Offline, voice-enabled Vision Language Model RAG system designed for real-time usage on edge devices. Prioritizes privacy while enabling natural language interactions.

VLLMRAGEdge AI+2
SurgiSR4K Super-Resolution Dataset
SurgiSR4K Super-Resolution Dataset

High-resolution endoscopic video dataset for robotic assisted procedures. Designed to advance super-resolution research for surgical imaging (MICCAI 2025 Oral).

DatasetSuper-Resolution4K Video+1
Da Vinci 5 Endoscope Integration
Da Vinci 5 Endoscope Integration

4K stereo surgical endoscope prototype development with real-time video processing on NVIDIA Jetson Orin. Integrated with Da Vinci 5 & Xi systems for clinical evaluation.

Jetson OrinJetPackSurgical Robotics+1
Telehealth Cloud Infrastructure
Telehealth Cloud Infrastructure

Cloud-based telehealth data pipeline for patient treatment tracking and anomaly detection. Built with Apache Airflow, dbt, Databricks, and Snowflake.

AirflowdbtDatabricks+2