 📝 DF-TransFusion: Multimodal Deepfake Detection via Lip-Audio Cross-Attention and Facial Self-Attention 🔭

"Presents a novel multi-modal audio-video framework designed to concurrently process audio and video inputs for deepfake detection tasks that leverages the synergy between the two modalities." [gal30b+] 🤖 #CV #MM

🔗 https://arxiv.org/abs/2309.06511v1 #arxiv

https://creative.ai/system/media_attachments/files/111/064/001/500/220/326/original/e4f11b50e0e2f98d.jpg

https://creative.ai/system/media_attachments/files/111/064/001/563/011/631/original/d6dc62bb068539cf.jpg

https://creative.ai/system/media_attachments/files/111/064/001/631/581/187/original/edfd4da139570ac8.jpg

https://creative.ai/system/media_attachments/files/111/064/001/698/879/045/original/00509b08cd9ce46d.jpg