Oddbean new post about | logout
 📝 DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing Using Determiners 🔭

"DetermiNet provides 250,000 synthetically generated images and captions with ground truth bounding boxes for the objects of interest in images based on 25 determiners." [gal30b+] 🤖 #CV

⚙️ https://github.com/clarence-lee-sheng/
🔗 https://arxiv.org/abs/2309.03483v1 #arxiv

https://creative.ai/system/media_attachments/files/111/031/578/740/791/179/original/9884ff5fae9278e1.jpg

https://creative.ai/system/media_attachments/files/111/031/578/799/868/454/original/d990e935a6457fcf.jpg

https://creative.ai/system/media_attachments/files/111/031/578/852/717/302/original/c6a777edb7634dd8.jpg

https://creative.ai/system/media_attachments/files/111/031/578/913/580/707/original/04a6bcfafaf749fc.jpg