📝 TransferDoc: A Self-Supervised Transferable Document Representation Learning Model Unifying Vision and Language

🔭 "TransferDoc is a cross-modal transformer-based architecture pre-trained in a self-supervised fashion using three novel pretext objectives which learns richer semantic concepts by unifying language and visual representations." [gal30b+]

🤖 #CV
⚙️ https://github.com/tesseract-ocr/tesseract
🔗 https://arxiv.org/abs/2309.05756v1 #arxiv

https://creative.ai/system/media_attachments/files/111/058/011/898/874/739/original/49be9f11b5424907.jpg
https://creative.ai/system/media_attachments/files/111/058/011/995/447/791/original/9dc8b2258cd7a2e3.jpg
https://creative.ai/system/media_attachments/files/111/058/012/072/599/845/original/6f8d47bef1d5822a.jpg
https://creative.ai/system/media_attachments/files/111/058/012/197/429/735/original/08f82dc2376c88f3.jpg