Oddbean new post about | logout
 📝 Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition 🔭

"Decomposes the multi-source audio semantics into single-source semantics, allowing for more effective interaction with visual content, and propose a semantic decomposition method based on product quantization, where the multi-source semantics can be decomposed and represented by several quantized single-source semantics." [gal30b+] 🤖 #CV

🔗 https://arxiv.org/abs/2310.00132v1 #arxiv

https://creative.ai/system/media_attachments/files/111/173/243/452/040/051/original/3fbffe68d7029205.jpg

https://creative.ai/system/media_attachments/files/111/173/243/521/841/528/original/8253de4c189c2f14.jpg

https://creative.ai/system/media_attachments/files/111/173/243/574/515/152/original/02ae92db34b93233.jpg

https://creative.ai/system/media_attachments/files/111/173/243/646/882/862/original/8052ee811b5d50ab.jpg