qapyq (CapPic)
An image viewer and AI-assisted editing tool that helps with curating datasets for generative AI models, finetunes and LoRA.
Image Viewer: Display and navigate images
- Quick-starting desktop application built with Qt
- Runs smoothly with tens of thousands of images
- Modular interface that lets you place windows on different monitors
- Open multiple tabs
- Zoom/pan and fullscreen mode
- Gallery with thumbnails and optionally captions
- Semantic image sorting with text prompts
- Compare two images
- Measure size, area and pixel distances
- Slideshow
Image/Mask Editor: Prepare images for training
- Crop and save parts of images
- Scale images, optionally using AI upscale models
- Dynamic save paths with template variables
- Manually edit masks with multiple layers
- Support for pressure-sensitive drawing pens
- Record masking operations into macros
- Automated masking
Captioning: Describe images with text
- Edit captions manually with drag-and-drop support
- Multi-Edit Mode for editing captions of multiple images simultaneously
- Focus Mode where one keystroke adds a tag, saves the file and skips to the next image
- Tag grouping, merging, sorting, filtering and replacement rules
- Colored text highlighting
- CLIP Token Counter (see the example after this list)
- Automated captioning with support for grounding
- Prompt presets
- Multi-turn conversations with each answer saved to different entries in a .json file
- Further refinement with LLMs
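For reference, the CLIP Token Counter relates to the roughly 75-token limit of CLIP text encoders used by many image models. As a rough illustration only (not qapyq's own code, and the checkpoint name is just one common choice), a caption's token count can be measured with the transformers CLIP tokenizer:

```python
from transformers import CLIPTokenizer

# Any CLIP checkpoint's tokenizer works for counting; this one is a common choice.
tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-base-patch32")

caption = "a photo of a red fox sitting in tall grass, golden hour lighting"

# Count tokens without the special begin/end-of-text markers.
token_ids = tokenizer(caption, add_special_tokens=False)["input_ids"]
print(f"{len(token_ids)} CLIP tokens (typically 75 usable tokens per chunk)")
```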
Stats/Filters: Summarize your data and get an overview
- List all tags, image resolutions, masked regions, or size of concept folders
- Filter images and create subsets
- Combine and chain filters
- Export the summaries as CSV
Batch Processing: Process whole folders at once
- Flexible batch captioning, tagging and transformation
- Batch scaling of images
- Batch masking with user-defined macros
- Batch cropping of images using your macros
- Copy and move files, create symlinks, ZIP captions for backups
AI Assistance:
- Support for state-of-the-art captioning and masking models
- Model and sampling settings, GPU acceleration with CPU offload support
- On-the-fly NF4 and INT8 quantization (see the sketch after this list)
- Run inference locally and/or on multiple remote machines over SSH
- Separate inference subprocess isolates potential crashes and allows complete VRAM cleanup
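As background on the quantization feature referenced above: NF4 and INT8 quantization are commonly applied while the weights are loaded, via the bitsandbytes integration in transformers. The snippet below is only a sketch of that technique with a placeholder model id, not qapyq's actual loading code:

```python
import torch
from transformers import AutoModelForVision2Seq, BitsAndBytesConfig

# NF4 4-bit quantization applied while the weights are loaded ("on the fly").
nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

# Placeholder model id; vision-language captioning models load the same way.
model = AutoModelForVision2Seq.from_pretrained(
    "some-org/some-caption-model",
    quantization_config=nf4_config,
    device_map="auto",   # keeps layers on the GPU, offloads the rest to CPU
)

# For INT8 instead, use: BitsAndBytesConfig(load_in_8bit=True)
```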
These are the supported architectures with links to the original models.
Find more specialized finetuned models on huggingface.co.
Tagging: Generate keyword captions for images.
Captioning: Generate complete-sentence captions for images.
LLM: Transform existing captions/tags.
- Models in GGUF format with embedded chat template (llama-cpp backend)
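To illustrate the GGUF requirement for the LLM category: llama-cpp-python loads such files and uses the chat template embedded in their metadata. The file path and prompts below are placeholders, and this is a standalone sketch rather than qapyq's integration:

```python
from llama_cpp import Llama

# Placeholder path to any instruction-tuned GGUF model.
llm = Llama(
    model_path="models/some-llm-Q4_K_M.gguf",
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to the GPU if possible
)

# The chat template embedded in the GGUF metadata formats these messages.
result = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "Rewrite tag lists as fluent captions."},
        {"role": "user", "content": "1girl, red hair, forest, sunlight"},
    ],
    max_tokens=128,
)
print(result["choices"][0]["message"]["content"])
```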
Upscaling: Resize images to higher resolutions.
- Model architectures supported by the spandrel backend
- Find more models at openmodeldb.info.
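spandrel detects the upscaler architecture directly from the checkpoint file. A minimal sketch of that loading pattern (the file name is a placeholder; qapyq wraps this differently):

```python
import torch
from spandrel import ImageModelDescriptor, ModelLoader

# spandrel inspects the state dict and picks the matching architecture.
model = ModelLoader().load_from_file("models/4x_some_upscaler.pth")
assert isinstance(model, ImageModelDescriptor)
model.cuda().eval()

def upscale(image: torch.Tensor) -> torch.Tensor:
    """image: float tensor, shape (1, 3, H, W), values in [0, 1]."""
    with torch.no_grad():
        return model(image)
```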
Masking: Generate greyscale masks.
- Box Detection (see the sketch after this list)
  - YOLO/Adetailer detection models (search for YOLO models on huggingface.co)
  - Florence-2
  - Qwen2.5-VL
- Segmentation / Background Removal
  - InSPyReNet (Plus_Ultra)
  - RMBG-2.0
  - Florence-2
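As a sketch of the box-detection path referenced above: a detection model yields bounding boxes that can be rasterized into a greyscale mask. The example uses the ultralytics YOLO API with placeholder file names and is not qapyq's masking pipeline:

```python
import numpy as np
from PIL import Image
from ultralytics import YOLO

# Placeholder weights; Adetailer-style face/hand detectors load the same way.
model = YOLO("face_yolov8n.pt")
image = Image.open("example.jpg")

results = model(image)
mask = np.zeros((image.height, image.width), dtype=np.uint8)

# Fill each detected box with white to build a greyscale mask.
for x1, y1, x2, y2 in results[0].boxes.xyxy.cpu().numpy().astype(int):
    mask[y1:y2, x1:x2] = 255

Image.fromarray(mask, mode="L").save("example_mask.png")
```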
Embedding: Sort images by their similarity to a prompt.
- CLIP
- SigLIP
- SigLIP (ONNX), SigLIP2-giant-opt (ONNX) (recommended: largest text model + fp16 vision model)
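Semantic sorting with these models comes down to scoring each image embedding against a text-prompt embedding. A minimal sketch with a public SigLIP checkpoint via transformers (not necessarily the checkpoint qapyq uses):

```python
import torch
from PIL import Image
from transformers import AutoModel, AutoProcessor

model = AutoModel.from_pretrained("google/siglip-base-patch16-224")
processor = AutoProcessor.from_pretrained("google/siglip-base-patch16-224")

image = Image.open("example.jpg")
inputs = processor(text=["a close-up portrait photo"], images=image,
                   padding="max_length", return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# Higher scores mean the image matches the prompt better; sort images by this value.
score = outputs.logits_per_image.item()
print(score)
```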
Installation:
Requires Python 3.10 or later.
By default, prebuilt packages for CUDA 12.8 are installed. If you need a different CUDA version, change the URLs in requirements-pytorch.txt and requirements-flashattn.txt before running the setup script.
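For example, the PyTorch wheel index encodes the CUDA version in its URL, so pointing requirements-pytorch.txt at a different index selects different wheels. The lines below are only an assumed illustration of what such a file may contain; check the actual file in your checkout:

```
# requirements-pytorch.txt (illustrative): switch cu128 to your CUDA version,
# e.g. cu126 or cu121, before running the setup script.
--extra-index-url https://download.pytorch.org/whl/cu128
torch
torchvision
```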
- Git clone or download this repository.
- Run setup.sh on Linux, setup.bat on Windows.
- Packages are installed into a virtual environment.
The setup script will ask you a couple of questions.
You can choose to install only the GUI and image processing packages without AI assistance. Or when installing on a headless server for remote inference, you can choose to install only the backend.
If the setup scripts didn't work for you but you got it running manually, please raise an issue and share your solution.
Running:
- Linux: run.sh
- Windows: run.bat or run-console.bat
You can open files or folders directly in qapyq by associating the file types with the respective run script in your OS.
For shortcuts, icons are available in the qapyq/res folder.
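On Linux, for example, a desktop entry can tie image file types and an icon to the run script. The entry below is hypothetical; adjust the paths and icon file name to your installation:

```
[Desktop Entry]
Type=Application
Name=qapyq
Comment=Image viewer and dataset editing tool
Exec=/opt/qapyq/run.sh %F
Icon=/opt/qapyq/res/icon.png
Terminal=false
Categories=Graphics;Viewer;
MimeType=image/png;image/jpeg;image/webp;
```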
Updating:
If git was used to clone the repository, simply use git pull to update.
If the repository was downloaded as a zip archive, download it again and replace the installed files.
To update the installed packages in the virtual environment, run the setup script again.
New dependencies may be added. If the program fails to start or crashes, run the setup script to install the missing packages.
More information is available in the Wiki.
Use the page index on the right side to find topics and navigate the Wiki.
How to set up and configure AI models: Model Setup
How to use qapyq: User Guide
How to caption with qapyq: Captioning
How to use qapyq's features in a workflow: Tips and Workflows
If you have questions, please ask in the Discussions.
- Natural sorting of files
- Gallery list view with captions
- Summary and stats of captions and tags
- Shortcuts and improved ease-of-use
- AI-assisted mask editing
- Overlays (difference image) for comparison tool
- Image resizing
- Run inference on remote machines
- Adapt new captioning and masking models
- Possibly a plugin system for new tools
- Integration with ComfyUI
- Docs, Screenshots, Video Guides

