StableGen is a project that recreates the core components of Stable Diffusion from scratch. The repository includes PyTorch implementations of the VAE, diffusion model, and CLIP model. Each module is built from the ground up and exposes configuration options so it can be adapted to different generative tasks.
- VAE (Variational Autoencoder): Encodes images into a compressed latent representation and decodes latents back into images (sketched in the first example below).
- Diffusion model: A U-Net combined with the diffusion process to generate images from random noise (the forward noising step and training objective are sketched in the second example below).
- CLIP (Contrastive Language–Image Pre-Training): Learns text–image relationships, enabling text-guided image generation (its contrastive objective is sketched in the third example below).
- Full implementation of each core component.
- Training scripts for each component (a minimal training-loop skeleton is sketched in the last example below).
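
The following is a minimal sketch of the VAE idea: encode an image into a latent distribution, sample from it with the reparameterization trick, and decode back to image space. The class name, layer sizes, and latent channel count are illustrative assumptions, not the repository's actual implementation.

```python
# Minimal VAE sketch (illustrative; not the repository's actual architecture).
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    def __init__(self, latent_channels: int = 4):
        super().__init__()
        # Downsample 3-channel images to a smaller latent grid, producing
        # a mean and log-variance for each latent element.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1),
            nn.SiLU(),
            nn.Conv2d(32, 2 * latent_channels, kernel_size=3, stride=2, padding=1),
        )
        # Upsample latents back to image space.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(latent_channels, 32, kernel_size=4, stride=2, padding=1),
            nn.SiLU(),
            nn.ConvTranspose2d(32, 3, kernel_size=4, stride=2, padding=1),
        )

    def encode(self, x: torch.Tensor):
        mean, logvar = self.encoder(x).chunk(2, dim=1)
        # Reparameterization trick: sample latents while keeping gradients.
        z = mean + torch.exp(0.5 * logvar) * torch.randn_like(mean)
        return z, mean, logvar

    def decode(self, z: torch.Tensor) -> torch.Tensor:
        return self.decoder(z)

vae = TinyVAE()
images = torch.randn(2, 3, 64, 64)           # dummy batch
latents, mean, logvar = vae.encode(images)   # latents: (2, 4, 16, 16)
reconstruction = vae.decode(latents)         # (2, 3, 64, 64)
```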
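Next, a sketch of the forward diffusion process with a linear beta schedule and the standard noise-prediction training objective. The schedule endpoints, tensor shapes, and the tiny convolutional stand-in for the U-Net are assumptions for illustration only.

```python
# Forward diffusion and noise-prediction loss (illustrative sketch).
import torch
import torch.nn as nn

T = 1000
betas = torch.linspace(1e-4, 0.02, T)              # linear noise schedule
alphas_cumprod = torch.cumprod(1.0 - betas, dim=0)

def add_noise(x0: torch.Tensor, t: torch.Tensor, noise: torch.Tensor) -> torch.Tensor:
    """Sample x_t ~ q(x_t | x_0) in closed form."""
    a_bar = alphas_cumprod[t].view(-1, 1, 1, 1)
    return a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * noise

# Stand-in for the U-Net: any module mapping a noisy latent to a noise estimate.
# A real U-Net would also be conditioned on the timestep t (and on text).
noise_predictor = nn.Conv2d(4, 4, kernel_size=3, padding=1)

x0 = torch.randn(2, 4, 16, 16)                     # clean latents
t = torch.randint(0, T, (2,))                      # random timesteps
noise = torch.randn_like(x0)
x_t = add_noise(x0, t, noise)                      # noised latents
loss = nn.functional.mse_loss(noise_predictor(x_t), noise)  # training objective
```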
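The third sketch shows CLIP's contrastive objective: project image and text features into a shared embedding space and train each image to match its own caption. The linear "encoders", feature dimensions, and temperature value are placeholders, not the repository's models.

```python
# CLIP-style contrastive objective (illustrative sketch).
import torch
import torch.nn as nn
import torch.nn.functional as F

embed_dim = 128
image_encoder = nn.Linear(512, embed_dim)   # stands in for a vision backbone
text_encoder = nn.Linear(256, embed_dim)    # stands in for a text transformer

image_features = torch.randn(8, 512)        # dummy pooled image features
text_features = torch.randn(8, 256)         # dummy pooled text features

# Project into the shared space and L2-normalize.
img = F.normalize(image_encoder(image_features), dim=-1)
txt = F.normalize(text_encoder(text_features), dim=-1)

logits = img @ txt.t() / 0.07               # cosine similarities / temperature
targets = torch.arange(8)                   # the i-th image matches the i-th text
loss = (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets)) / 2
```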
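Finally, a skeleton of the kind of loop a per-component training script might run. The model, data, and hyperparameters below are placeholders; they only illustrate the optimizer step structure, not the repository's scripts.

```python
# Generic training-loop skeleton (illustrative; placeholders throughout).
import torch
import torch.nn as nn

model = nn.Conv2d(4, 4, kernel_size=3, padding=1)   # placeholder component
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

for step in range(100):
    batch = torch.randn(8, 4, 16, 16)                # placeholder data loader
    target = torch.randn_like(batch)                 # placeholder training target
    loss = nn.functional.mse_loss(model(batch), target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```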