How about DMD2 on img2img task? #17

dawei03896 · 2024-05-31T02:10:16Z

How about DMD2 on img2img? Other methods like Hyper-SD, SDXL-Lightning don't perform well on img2img tasks, resulting in blurry images?

tianweiy · 2024-06-06T00:44:43Z

I think it works reasonably well. Here is a sample code for using t2iadapter.

from diffusers import StableDiffusionXLAdapterPipeline, T2IAdapter, EulerAncestralDiscreteScheduler, AutoencoderKL
from diffusers.utils import load_image, make_image_grid
from controlnet_aux.canny import CannyDetector
import torch

# load adapter
adapter = T2IAdapter.from_pretrained("TencentARC/t2i-adapter-canny-sdxl-1.0", torch_dtype=torch.float16, varient="fp16").to("cuda")

# load euler_a scheduler
# model_id = 'stabilityai/stable-diffusion-xl-base-1.0'
# euler_a = EulerAncestralDiscreteScheduler.from_pretrained(model_id, subfolder="scheduler")
vae=AutoencoderKL.from_pretrained("madebyollin/sdxl-vae-fp16-fix", torch_dtype=torch.float16)

from diffusers import DiffusionPipeline, UNet2DConditionModel, LCMScheduler
from huggingface_hub import hf_hub_download

base_model_id = "stabilityai/stable-diffusion-xl-base-1.0"
repo_name = "tianweiy/DMD2"
ckpt_name = "dmd2_sdxl_4step_unet_fp16.bin"
# Load model.
unet = UNet2DConditionModel.from_config(base_model_id, subfolder="unet").to("cuda", torch.float16)
unet.load_state_dict(torch.load(hf_hub_download(repo_name, ckpt_name), map_location="cuda"))

pipe = StableDiffusionXLAdapterPipeline.from_pretrained(
    base_model_id, unet=unet, vae=vae, adapter=adapter, torch_dtype=torch.float16, variant="fp16", 
).to("cuda")
pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
pipe.enable_xformers_memory_efficient_attention()

canny_detector = CannyDetector()

url = "https://huggingface.co/Adapter/t2iadapter/resolve/main/figs_SDXLV1.0/org_canny.jpg"
image = load_image(url)

# Detect the canny map in low resolution to avoid high-frequency details
image = canny_detector(image, detect_resolution=384, image_resolution=1024)#.resize((1024, 1024))

prompt = "Mystical fairy in real, magic, 4k picture, high quality"

gen_images = pipe(
  prompt=prompt,
  image=image,
  num_inference_steps=4,
  guidance_scale=0, 
  adapter_conditioning_scale=0.8, 
  adapter_conditioning_factor=0.5,
  timesteps=[999, 749, 499, 249]
).images[0]
gen_images.save('out_canny.png')

input outputs look like the following

dawei03896 · 2024-06-06T01:36:02Z

Thanks for your reply！

tianweiy added the img2img label Jun 6, 2024

tianweiy pinned this issue Jun 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How about DMD2 on img2img task? #17

How about DMD2 on img2img task? #17

dawei03896 commented May 31, 2024

tianweiy commented Jun 6, 2024 •

edited

Loading

dawei03896 commented Jun 6, 2024

How about DMD2 on img2img task? #17

How about DMD2 on img2img task? #17

Comments

dawei03896 commented May 31, 2024

tianweiy commented Jun 6, 2024 • edited Loading

dawei03896 commented Jun 6, 2024

tianweiy commented Jun 6, 2024 •

edited

Loading