VideoCraft: A Mixed Reality-Empowered Video Generation Workflow with Spatial Layer Editing for Concept Video Creation

Boyu Li¹, Linping Yuan², Zeyu Wang^1,2

The Hong Kong University of Science and Technology (Guangzhou)¹

The Hong Kong University of Science and Technology²

UIST 2025

DOI Demo PDF

Space Restructing

Layer-edited MR Scene

V2V

Motivation: MR + V2V

Motivated by the advantages of MR and V2V models, we envision a workflow that first uses MR to edit and capture mixed-reality footage, which then serves as input for V2V models to automatically refine and enhance the content.

Localized Editing with Spatial Layer

The spatial layer, created within the MR environment, serves as a guide for localized editing in the generated concept video.