tl;dr: We use pretrained diffusion models to make optical illusions
Abstract
We address the problem of synthesizing multi-view optical illusions: images that change appearance upon a transformation, such as a flip or rotation. We propose a simple, zero-shot method for obtaining these illusions from off-the-shelf text-to-image diffusion models. During the reverse diffusion process, we estimate the noise from different views of a noisy image. We then combine these noise estimates and denoise the image. A theoretical analysis suggests that this method works precisely for views that can be written as orthogonal transformations, of which permutations are a subset. This leads to the idea of a visual anagram: an image that changes appearance under some rearrangement of pixels. This includes rotations and flips, but also more exotic pixel permutations such as a jigsaw rearrangement. Our approach also naturally extends to illusions with more than two views. We provide both qualitative and quantitative results demonstrating the effectiveness and flexibility of our method. Please see the project webpage (linked below) for additional visualizations and results.
Paper: https://arxiv.org/abs/2311.17919
Code: https://github.com/dangeng/visual_anagrams
Project Page: https://dangeng.github.io/visual_anagrams/
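To make the abstract's core step concrete, here's a minimal sketch (assumptions, not the authors' implementation; see the linked repo for that) of what combining noise estimates across views during reverse diffusion could look like in PyTorch. The denoiser call `model(x, t)`, the `alphas_cumprod` scheduler coefficients, and the view functions are placeholders, and the text conditioning used by the actual method is omitted for brevity.

```python
import torch

# Two example views; a vertical flip is a pixel permutation, hence orthogonal.
def identity(x):
    return x

def flip_vertical(x):
    return torch.flip(x, dims=[-2])

# Each of these views is its own inverse; for other permutations (e.g. a jigsaw
# rearrangement) supply explicit inverse functions instead.
views = [identity, flip_vertical]
inverse_views = [identity, flip_vertical]

@torch.no_grad()
def multi_view_step(model, x_t, t, alphas_cumprod):
    """One deterministic (DDIM-style) reverse step using an averaged noise estimate."""
    # 1. Predict the noise in each view of the noisy image, then map every
    #    estimate back to the base orientation before combining.
    eps_per_view = []
    for view, inv_view in zip(views, inverse_views):
        eps_v = model(view(x_t), t)          # hypothetical noise-prediction call
        eps_per_view.append(inv_view(eps_v))
    eps = torch.stack(eps_per_view).mean(dim=0)   # 2. combine the estimates

    # 3. Standard DDIM update with the combined estimate (eta = 0, noise term omitted).
    a_t = alphas_cumprod[t]
    a_prev = alphas_cumprod[t - 1] if t > 0 else torch.tensor(1.0)
    x0_pred = (x_t - (1 - a_t).sqrt() * eps) / a_t.sqrt()
    return a_prev.sqrt() * x0_pred + (1 - a_prev).sqrt() * eps
```

The intuition for the orthogonality requirement in the abstract: a permutation (or any orthogonal transform) of i.i.d. Gaussian noise is still i.i.d. Gaussian, so each transformed-and-inverted estimate stays in-distribution for the denoiser and the estimates can be sensibly averaged.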
I thought this was Stable Diffusion, but it’s actually DeepFloyd IF. Well, it’s cool someone is using that model at least.
I’m happy you posted it anyway; this is very interesting.