Paper Reading: DIRE for Diffusion-Generated Image Detection
Background
The existing detectors struggle to detect images generated by difussion models.
Related Work
Diffusion models for image generation
- DDPMs, ADM, PNDMs
- text to image: VQ-difuusion, LDM, Stable Diffusion (V1 and V2)
Generated Image Detection
- detect the GANs-generated images: spatial artifacts, frequency domain
- detect the Diffusion-generated images: in detailed task
Method
DIRE(DIffusion Reconstruction Error):
DIRE measures the error between an input image and its reconstruction by a pre-trained diffusion model, because diffusion-generated images can be more approximately reconstructed by a pre-trained diffusion model compared to real images.
The images generated by diffusion models are essentially sampled from distribution of the diffusion generation space ($p_g(x)$) which is not exactly the same as ($p_r(x)$)
—-> samples from the diffusion generation space ($p_g(x)$) are more likely to be reconstructed by a pre-trained diffusion model while real images cannot.
Process:
$x_0$ (input image) —DDIM inversion (T steps)—> $x_T$ in isotropic Gaussian noise distribution —DDIM Reconstruction (T steps)—> $x_0’$ (recovered image) —> compute the difference between $x_0$ and $x_0’$ by $DIRE(x_0)=|x_0 - \mathbf{R}(\mathbf{I}(x_0))|$, where $\mathbf{R}(\mathbf{I}(x_0))$ is the recovered image $x_0’$ —> Use a simple binary classifier to distinguish DIRE between real image and diffusion-generated image.
DiffusionForensics: Dataset
source: LSUNBedroom and ImageNet models: 4 unconditional diffusion models, 4 text2image diffusion models (with “A photo of bedroom” text prompt for LSUNBedroom, “A photo of {class}” for {class} in ImageNet)
Evaluation/Experments
Generation model is used to generated synthetic images. Reconstruction model is used to compute DIRE.
The existing method get a significant improvement on images generated by the same diffusion models as used in training, but still perform unsatisfactorily facing unseen diffusion models.
Cross-dataset: training the detector with images generated by models pre-trained on LSUN-Bedroom and then testing it on images produced by models pre-trained on ImageNet
Unseen GAN generation evaluation: Although the classifier never encounters any GAN-generated image during training, it achieves surprising performance when detecting GAN-generated images
Unseen Perturbations: Gaussian blur and JPEG compression. DIRE gets a perfect performance without performance drop
Why DIRE works:
B.More Explanation of DIRE
appendix in the paper: deviation caused by the approximation actually leads to our key idea of DIRE
Reading Summary
What is the contribution/novelty?
- Proposed a new detector (DIRE) to detect Diffusion-generated image, and the generalization ability is impressive.
- build a dataset for diffusion-generated image detection: DiffusionForensics
- The accuracy is almost perfect, and the experiments are complete.
What is the existing issue?
- DIRE works well on Cross-dataset testing, how to design a more challenging situation:
- use a dataset that the difference of distribution is more obvious in generation model like a art dataset?
- Can DIRE be used/expanded to detect other diffusion model based task? (like image editing(even detect which part is edited))
- The reconstruction model(DDIM) of DIRE applies for 20 steps in total, what is the detection FPS of the method? Can we propose a faster method?