 📝 Zero-Shot Object Counting with Language-Vision Models 🔭

"Uses large language-vision models to generate class prototypes from the input class name, uses those prototypes to select the patches that contain the target objects as counting exemplars, and uses a ranking model to estimate the counting error of each patch." [gal30b+] 🤖 #CV
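
A rough sketch of the patch-selection idea in the summary above, not the paper's actual code. It assumes each patch already has a feature vector, that the class prototype lives in the same feature space, and that some ranking model has already produced a predicted counting error per patch. The names `select_exemplar_patches`, `pool_size`, and `k` are made up for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_exemplar_patches(prototype, patch_features, predicted_errors, pool_size=4, k=2):
    """Return indices of k patches to use as counting exemplars.

    Step 1 (assumed): keep the pool_size patches most similar to the class prototype.
    Step 2 (assumed): among those, prefer patches with the lowest predicted counting error.
    """
    sims = [cosine(prototype, feat) for feat in patch_features]
    by_similarity = sorted(range(len(patch_features)), key=lambda i: sims[i], reverse=True)
    candidates = by_similarity[:pool_size]
    return sorted(candidates, key=lambda i: predicted_errors[i])[:k]


# Toy data: 2-D features stand in for real image/text embeddings.
prototype = [1.0, 0.0]
patches = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.7, 0.7], [-1.0, 0.0]]
errors = [0.5, 0.1, 0.0, 0.3, 0.2]  # hypothetical ranking-model outputs
chosen = select_exemplar_patches(prototype, patches, errors, pool_size=3, k=2)
print(chosen)
```

In this toy run, patch 2 has the lowest predicted error but is dropped in step 1 because it does not resemble the prototype, which is the point of filtering by class before ranking by error.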

🔗 https://arxiv.org/abs/2309.13097v1 #arxiv

https://creative.ai/system/media_attachments/files/111/133/437/575/475/461/original/a73c4c9e39a5ad2c.jpg

https://creative.ai/system/media_attachments/files/111/133/437/656/408/921/original/0446b36e40109b9c.jpg

https://creative.ai/system/media_attachments/files/111/133/437/743/481/364/original/4c3c2e4376b6a51b.jpg

https://creative.ai/system/media_attachments/files/111/133/437/804/884/196/original/7d4b694b01ab5559.jpg