Binary VAE with Continuous Channels
This demo shows how the reconstruction quality improves as we use more continuous latent channels. The model uses 256 discrete (ternary) channels + 0-32 continuous channels. Input images are resized so their shortest edge is 512, and then cropped so dimensions are perfectly divisible by 32.
Channel counts tested: 0, 8, 16, 32
Reconstructions (2x2 Grid)
Top row: 0ch, 8ch | Bottom row: 16ch, 32ch
Metrics
Notes:
- PSNR: Peak Signal-to-Noise Ratio (higher is better, >30 dB is good)
- SSIM: Structural Similarity Index (0-1, higher is better)
- The model has 32 continuous channels max.
- Discrete latent (256 ternary channels) is always active.