This repository contains an unofficial implementation of Google's paper Sketch-Guided Text-to-Image Diffusion Models. The goal of this project is to generate high-quality images from textual descriptions and corresponding sketches.
This implementation was inspired by and references several existing repositories, which are included as submodules.
- Clone the repository:

```shell
git clone --recurse-submodules https://github.com/sangminkim-99/Sketch-Guided-Text-To-Image.git
cd Sketch-Guided-Text-To-Image
```
- Create and activate a new Conda environment:

```shell
conda create -n sketch-guided-env python=3.9
conda activate sketch-guided-env
```
- Install the necessary dependencies. You may use `pip` to install the required packages:

```shell
conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia # change to your own version of torch
pip install -r requirements.txt
```
- Download the necessary datasets.
- Download some indoor images from the ImageNet dataset with ImageNet-Dataset-Downloader:

```shell
chmod u+x scripts/download_imagenet_room_dataset.sh
./scripts/download_imagenet_room_dataset.sh
```
- Generate edge maps with pidinet:

```shell
chmod u+x scripts/generate_edge_map.sh
./scripts/generate_edge_map.sh
```
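The repository delegates edge-map extraction to pidinet via the script above. As a rough illustration of what an edge map is, here is a minimal Sobel-filter stand-in in NumPy; this is not pidinet's algorithm, and the kernels and image are purely illustrative:

```python
import numpy as np

def sobel_edges(img):
    """img: (H, W) grayscale array in [0, 1]; returns gradient magnitude
    over the (H-2, W-2) interior (valid convolution, no padding)."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=float)
    ky = kx.T
    H, W = img.shape
    gx = np.zeros((H - 2, W - 2))
    gy = np.zeros_like(gx)
    for i in range(H - 2):
        for j in range(W - 2):
            patch = img[i:i + 3, j:j + 3]
            gx[i, j] = (patch * kx).sum()
            gy[i, j] = (patch * ky).sum()
    return np.hypot(gx, gy)

img = np.zeros((8, 8))
img[:, 4:] = 1.0                 # synthetic image with one vertical step edge
edges = sobel_edges(img)
print(edges.max() > 0)           # True: the step edge produces a response
```

pidinet produces much cleaner, learned edge maps, but the output format is the same idea: a per-pixel edge-strength map derived from the input image.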
- Show help messages for all available commands:

```shell
python app.py --help
```
- Train the Latent Edge Predictor. Currently only `--batch-size 1` is supported:

```shell
python app.py train-lep
```
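In the paper, the Latent Edge Predictor is a small per-location MLP that maps concatenated intermediate U-Net activations at each latent pixel to a latent edge map. The NumPy sketch below assumes illustrative dimensions (512-dimensional features, 4 latent channels) and random weights; the real predictor's sizes and trained weights come from `train-lep`:

```python
import numpy as np

rng = np.random.default_rng(0)

def silu(x):
    # SiLU activation, commonly used in diffusion-model components
    return x / (1.0 + np.exp(-x))

# Hypothetical weights; the repo learns these during `train-lep`.
W1 = rng.standard_normal((512, 256)) * 0.02
W2 = rng.standard_normal((256, 4)) * 0.02

def latent_edge_predictor(feats):
    """feats: (num_pixels, 512) concatenated U-Net activations, one row
    per latent pixel. Returns a 4-channel latent edge prediction per pixel."""
    return silu(feats @ W1) @ W2

feats = rng.standard_normal((64 * 64, 512))  # e.g. a 64x64 latent grid
pred = latent_edge_predictor(feats)
print(pred.shape)  # (4096, 4)
```

Because the MLP is applied independently per latent pixel, it can be trained with very small batches, which is consistent with the current `--batch-size 1` restriction.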
- Sample an image with the Latent Edge Predictor:

```shell
python app.py sample --sketch-file-path {PATH} --prompt {PROMPT}
```
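At sampling time, the paper guides denoising by nudging the latent down the gradient of the distance between the predictor's output and the target sketch. The toy sketch below substitutes a hypothetical linear predictor `W` for the trained LEP so the gradient can be written analytically; the real pipeline differentiates through the MLP with autograd:

```python
import numpy as np

rng = np.random.default_rng(0)

W = rng.standard_normal((4, 4)) * 0.1      # stand-in for the trained LEP
target = rng.standard_normal((4096, 4))    # target sketch in latent space

def guidance_step(z, alpha=0.05):
    """One sketch-guidance update: gradient descent on ||z @ W - target||^2."""
    pred = z @ W                            # predicted edge map
    grad = 2.0 * (pred - target) @ W.T      # analytic gradient w.r.t. z
    return z - alpha * grad                 # move the latent toward the sketch

z = rng.standard_normal((4096, 4))
before = np.sum((z @ W - target) ** 2)
for _ in range(10):
    z = guidance_step(z)
after = np.sum((z @ W - target) ** 2)
print(after < before)  # True: the latent's edge prediction approaches the sketch
```

In the actual sampler this correction is interleaved with the usual denoising steps, so the text prompt and the sketch jointly shape the final image.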
- Launch the Gradio web demo (for debugging):

```shell
python app.py demo
```
TODO:

- Reproduce the bicycle example
- Upload the pretrained LEP
We would like to express our gratitude to the authors of the original paper and the developers of the referenced repositories for their valuable contributions, which served as the foundation for this implementation.
This is an unofficial implementation and is not affiliated with Google or the authors of the original paper.