Referring Layer Decomposition
Predicts complete RGBA layers from a single RGB image...
Predicts complete RGBA layers from a single RGB image...
A fast speed transformer-based image-to-video (I2V) diffusion framework...
Iterative Preference Optimization...