Stable Diffusion 推理加速技巧解析.pdf

铃木园子

323

23页

47次

2023-08-08

免费下载

Stable Diffusion 推理加速技巧解析

Stable Diffusion

process

Text Encoder

(CLIP Text)

Image Information Creator

(Unet + Scheduler)

Image Decoder

(Autoencoder

decoder)

77 x 768

Token embeddings

4 x 64 x 64

Processed image

information tensor

Unet

Step

Unet

Step

Unet

Step

Unet

Step

…

UNet + Scheduler to gradually process/diffuse information in the information (latent) space.

• Input: text embeddings and a starting multi-dimensional array made up of noise.

• Output: A processed information array

ClipText for text encoding.

• Input: text.

• Output: 77 token embeddings

vectors, each in 768 dimensions

Autoencoder Decoder that paints the

final image using the processed

information array.

• Input: The processed information array

(dimensions: (4,64,64))

• Output: The resulting image

(dimensions: (3, 512, 512) which are

(red/green/blue, width, height))

of 23

免费下载

文档被以下合辑收录

DataFunSummit2023：大模型与AIGC峰会（PPT下载）（共23篇）

2023年6月17日-6月18日，DataFun举办了DataFunSummit2023：大模型与AIGC峰会。本次峰会由3位专家团成员与8位论坛出品人精心策划而成，共包含：基础模型与大语言模型、强化学习、多模态与AIGC论坛、训练推理论坛、AIGC论坛、知识增强论坛、智能问答论坛、信息抽取与检索等8个论坛，邀请30余位来自一线的大模型与AIGC专家，进行深度分享交流。

关注

文档被以下合辑收录

评论