A while back we had the "rotation trick" to improve VQ bottlenecks (https://x.com/sedielem/status/1863672703489634335), now we have DiVeQ, which seems to improve codebook coverage quite significantly. ... the space-filling version seems a bit like cheating though https://x.com/sedielem/status/2044805717958533395/photo/1 https://twitter.com/arnosolin/status/2044150151636238523
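For readers unfamiliar with the rotation trick the tweet refers to: instead of copying the decoder's gradient straight through the quantization step, it rewrites the quantized output as a (detached) rotation and rescaling of the encoder output, so gradients get rotated into the codebook vector's frame. Below is a minimal PyTorch sketch of my reading of that idea, assuming flattened `(B, D)` encoder outputs `e` and their nearest codebook vectors `q`; the function name, `eps` handling, and tensor layout are illustrative, not the paper's reference code.

```python
import torch

def rotation_trick(e: torch.Tensor, q: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # e: encoder outputs (B, D); q: their nearest codebook vectors (B, D).
    e_norm = e.norm(dim=-1, keepdim=True)
    q_norm = q.norm(dim=-1, keepdim=True)
    e_hat = (e / (e_norm + eps)).detach()
    q_hat = (q / (q_norm + eps)).detach()
    # Householder-style construction; the near-antipodal case e ~ -q is
    # degenerate and left unhandled in this sketch.
    r = e_hat + q_hat
    r = (r / (r.norm(dim=-1, keepdim=True) + eps)).detach()
    scale = (q_norm / (e_norm + eps)).detach()
    # R = I - 2 r r^T + 2 q_hat e_hat^T rotates e_hat onto q_hat. R and the
    # norm ratio are treated as constants (detached), so the forward value
    # equals q while the backward pass rotates gradients instead of copying
    # them through unchanged.
    rot_e = (e
             - 2.0 * (e * r).sum(dim=-1, keepdim=True) * r
             + 2.0 * (e * e_hat).sum(dim=-1, keepdim=True) * q_hat)
    return scale * rot_e
```

On the forward pass `scale * rot_e` reduces to `q` (up to `eps`), so only the gradient path differs from the usual straight-through copy.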

Seven t-SNE plots appear in a horizontal row labeled STE, EMA, RT, ST-GS, NSVQ, DiVeQ, and SF-DiVeQ, with numerical values 0.012, 0.024, 0.018, N/A, 2.6·10^{-4}, 5·10^{-5}, and 3.9·10^{-5} above each, showing red point clouds or crosses for the learned codebook C_z alongside gray points for the latents P_z in most plots. The caption below reads "Figure 4: Codebook misalignment: t-SNE plots of the learned codebook C_z (red crosses) and latent P_z (gray points) representations for different VQ methods in VQ-VAE compression", with additional text referencing Sec. 4, Fig. 26, and distortion per bit.
Context: Quoting @arnosolin: "1/ New paper: Differentiable Vector Quantization (DiVeQ). Vector quantization (VQ) is a key building block in modern AI. It links continuous data like images and audio to discrete representations (tokens) used by transformers. https://x.com/arnosolin/status/2044150151636238523/video/1" The "rotation trick" link points to an earlier tweet, @sedielem: "Better VQ-VAEs with this one weird rotation trick! I love papers like this: a simple change to an already powerful technique, that significantly improves results without introducing complexity or hyperparameters. https://t.co/E0ykEXEbwq (h/t lucidrains) https://t.co/iHTz6PpKfK"
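For reference, the STE baseline that anchors Figure 4 is standard VQ-VAE quantization: nearest-neighbor lookup in a codebook, with the straight-through estimator copying the decoder's gradient past the non-differentiable argmin. A minimal sketch, assuming a learnable `(K, D)` codebook matrix and illustrative names:

```python
import torch

def ste_quantize(e: torch.Tensor, codebook: torch.Tensor) -> torch.Tensor:
    # e: encoder outputs (B, D); codebook: (K, D) learnable embedding matrix.
    dists = torch.cdist(e, codebook)   # (B, K) pairwise Euclidean distances
    idx = dists.argmin(dim=-1)         # index of the nearest code per latent
    q = codebook[idx]                  # (B, D) quantized latents
    # Straight-through estimator: the forward pass outputs q, the backward
    # pass copies the decoder's gradient onto e unchanged, skipping argmin.
    return e + (q - e).detach()
```

Because the gradient is copied rather than adapted per code, rarely selected codes get little learning signal, which is the codebook-coverage problem the methods in Figure 4 try to address.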
| Time (UTC) | Views | Likes | Bookmarks | RTs | Replies |
|---|---|---|---|---|---|
| 11:00 AM | +27 | 0 | 0 | 0 | 0 |
| 10:50 AM | +48 | 0 | 0 | 0 | 0 |
| 10:40 AM | +29 | 0 | +1 | 0 | 0 |
| 10:30 AM | +34 | 0 | +1 | 0 | 0 |
| 10:20 AM | +30 | +1 | 0 | +1 | 0 |
| 10:10 AM | +19 | 0 | 0 | 0 | 0 |
| 10:00 AM | +32 | 0 | 0 | 0 | 0 |
| 9:50 AM | +9 | 0 | +1 | 0 | 0 |
| 9:40 AM | +50 | 0 | 0 | 0 | 0 |
| 9:30 AM | +25 | 0 | 0 | 0 | 0 |