Oddbean new post about | logout
 📝 TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language 🔭

"TransferDoc is a cross-modal transformer-based architecture pre-trained in a self-supervised fashion using three novel pretext objectives which learns richer semantic concepts by unifying language and visual representations." [gal30b+] 🤖 #CV

⚙️ https://github.com/tesseract-ocr/tesseract
🔗 https://arxiv.org/abs/2309.05756v1 #arxiv

https://creative.ai/system/media_attachments/files/111/058/011/898/874/739/original/49be9f11b5424907.jpg

https://creative.ai/system/media_attachments/files/111/058/011/995/447/791/original/9dc8b2258cd7a2e3.jpg

https://creative.ai/system/media_attachments/files/111/058/012/072/599/845/original/6f8d47bef1d5822a.jpg

https://creative.ai/system/media_attachments/files/111/058/012/197/429/735/original/08f82dc2376c88f3.jpg