Modes and Parameters

Mode

Mist provides three modes for different anti-imitation scenarios. Users can decide which mode to use according to their needs.

Textural mode: By injecting confusing texture information into the watermark to achieve the effect of anti-AI imitation; mainly against Img2Img; requires less GPU memory.

Semantic mode: By interfering with the semantic information of the original image with the watermark; mainly against subject-driven generation (Textual Inversion, Dreambooth, etc.) scenes; requires more GPU memory.

Fused mode: By mixing Textural and Semantic modes in a certain ratio; requires more GPU memory.

The following table demonstrates the performance of the three modes in the four scenarios: Textual Inversion, NovelAI Img2Img, Dreambooth, and Scenario.gg. Among them, Textual Inversion and Dreambooth are both based on Stable Diffusion.

	Textual Inversion	NovelAI Img2Img	Dreambooth	Scenario.gg
Textural	○	◎	○	◎
Semantic	◎	△	╳	○
Fused(Fusion weight=1)	◎	○	△	○
◎:very strong ○:strong △:medium ╳:weak

Parameters

Mist allows users to customize several parameters that would characterize the watermark. A brief introduction to these parameters are given in the UI of Mist. The following table further gives some details of these parameters.

Parameter	Recommended value	Note
Strength	16	16 promises comprehensive performance in Fused Mode. Relatively, 8 would provide strong performance against certain applications in other two modes.
Steps	100	100 is a good tradeoff of time cost and performance.
Output size	512
Fused weight	1	Fused weight balances the performance tradeoff of Fused mode between Semantic and Textual mode.
Low VRAM Mode	False	Low VRAM Mode greatly lows down the VRAM requirements with subtly reduced performance.