RegionDrag: Fast Region-Based Image Editing with Diffusion Models

ECCV 2024

Jingyi Lu1, Xinghui Li2, Kai Han1
1Visual AI Lab, The University of Hong Kong
2Active Vision Lab, University of Oxford

RegionDrag is a region-based image editing method that lets users express editing instructions through handle and target regions instead of individual points. This richer input makes editing both faster and more precise: RegionDrag significantly outperforms previous point-drag SOTA methods in speed while also achieving better editing quality.

RegionDrag supports a variety of inputs

Users can input regions or points to drag image contents from red (handle) to blue (target).

Input pairs of triangles or quadrilaterals.

Input regions and manipulate them using points.

Input pairs of regions.
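
Under the hood, a region pair can be reduced to dense point correspondences by sampling a grid of points in the handle region and warping it onto the target region. The sketch below illustrates this for a pair of quadrilaterals using OpenCV homographies; the function name and sampling density are illustrative assumptions, not the official RegionDrag interface.

```python
import cv2
import numpy as np

def region_pair_to_correspondences(handle_quad, target_quad, n=32):
    """Map points sampled inside the handle quad onto the target quad.

    handle_quad, target_quad: (4, 2) float32 corner coordinates,
    listed in the same (e.g. clockwise) order.
    Returns (handle_pts, target_pts), each of shape (n*n, 2).
    """
    # Homography taking the handle quad to the target quad.
    H = cv2.getPerspectiveTransform(handle_quad, target_quad)
    # Sample a regular grid inside the unit square ...
    s = np.linspace(0.05, 0.95, n, dtype=np.float32)
    grid = np.stack(np.meshgrid(s, s), axis=-1).reshape(1, -1, 2)
    # ... warp it into the handle quad, then through H onto the target.
    unit = np.float32([[0, 0], [1, 0], [1, 1], [0, 1]])
    handle_pts = cv2.perspectiveTransform(
        grid, cv2.getPerspectiveTransform(unit, handle_quad))[0]
    target_pts = cv2.perspectiveTransform(handle_pts[None], H)[0]
    return handle_pts, target_pts
```

For a pair of triangles, the same idea works with cv2.getAffineTransform and cv2.transform in place of the homography.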

Fast AI editing

The rich context provided by region inputs allows users to edit a 512×512 image in just 1.5 seconds, significantly faster than previous point-drag methods.

Find our code

Efficient & concise model design

RegionDrag performs the edit in two main steps. First, during the inversion process, the SD latent representations and self-attention features covered by the handle region are copied. Then, during the denoising process, the copied latent representations are pasted at the target positions and the corresponding self-attention features are replaced, as sketched below.
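
To make these two steps concrete, here is a minimal PyTorch sketch of the latent copy-and-paste operation. It assumes the inversion latents are already available and that the handle and target masks contain the same number of pixels (a one-to-one pixel mapping); the function names, mask format, and soft-blend weight are illustrative assumptions, not the repository's API.

```python
import torch

def copy_handle_latents(latent, handle_mask):
    """During inversion: collect latent vectors under the handle region.

    latent:      (1, C, H, W) SD latent at a given inversion step.
    handle_mask: (H, W) boolean mask marking the handle region.
    """
    return latent[:, :, handle_mask]  # (1, C, N) copied features

def paste_to_target(latent, copied, target_mask, alpha=0.8):
    """During denoising: paste the copied latents at the target positions.

    target_mask must contain the same number of True pixels as the
    handle mask, so `copied` lines up one-to-one with the target slots.
    A soft blend (alpha) is one plausible choice, not the exact scheme.
    """
    edited = latent.clone()
    edited[:, :, target_mask] = (
        alpha * copied + (1 - alpha) * latent[:, :, target_mask]
    )
    return edited
```

The same indexing idea applies to the self-attention features: keys and values computed under the handle region during inversion can be substituted for those at the target positions during denoising.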

Read the paper

New benchmarks for region-based editing evaluation

DragBench-S and DragBench-D are existing benchmarks for evaluating point-drag methods. We adapt them to use regions instead of points, so that the inputs better reflect user intent, creating DragBench-SR and DragBench-DR (where 'R' stands for Region).
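
As a rough illustration of the conversion, one plausible scheme is to dilate each annotated handle/target point into a small disk-shaped mask; this is a hypothetical sketch of the point-to-region idea, not the actual procedure used to build DragBench-SR and DragBench-DR.

```python
import numpy as np

def point_to_region(point, image_hw, radius=10):
    """Hypothetical conversion: dilate a drag point into a disk mask.

    point:    (x, y) pixel coordinates of a handle or target point.
    image_hw: (height, width) of the image.
    Returns an (H, W) boolean mask of pixels within `radius` of the point.
    """
    h, w = image_hw
    ys, xs = np.mgrid[:h, :w]
    return (xs - point[0]) ** 2 + (ys - point[1]) ** 2 <= radius ** 2
```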

Download the dataset