 📝 Training a Large Video Model on a Single Machine in a Day 🔭

"Uses a combination of state-of-the-art techniques including data pre-fetching, data pipelining, mixed precision, gradient checkpointing, and tensor-core offloading to optimize each component of the pipeline (IO, CPU, and GPU computation)." [gal30b+] 🤖 #CV
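Of the techniques listed in the quote, data pre-fetching is the easiest to illustrate in isolation. What follows is only a minimal standard-library sketch of the general idea, not the AVION implementation: a background thread loads items ahead of the consumer so that IO overlaps with computation. The names `prefetch`, `load_clips`, and `buffer_size` are hypothetical and chosen for illustration.

```python
import queue
import threading
import time


def prefetch(iterable, buffer_size=4):
    """Yield items from `iterable`, loading up to `buffer_size` items ahead
    of the consumer in a background thread."""
    buffer = queue.Queue(maxsize=buffer_size)
    sentinel = object()  # marks the end of the stream

    def worker():
        try:
            for item in iterable:
                buffer.put(item)  # blocks while the buffer is full
        finally:
            buffer.put(sentinel)  # always signal completion to the consumer

    threading.Thread(target=worker, daemon=True).start()

    while True:
        item = buffer.get()
        if item is sentinel:
            return
        yield item


def load_clips(n):
    """Hypothetical stand-in for slow disk reads or video decoding."""
    for i in range(n):
        time.sleep(0.01)  # simulated IO latency
        yield f"clip-{i}"


if __name__ == "__main__":
    for clip in prefetch(load_clips(5), buffer_size=2):
        time.sleep(0.01)  # simulated compute; the next clip loads meanwhile
        print(clip)
```

In a real training setup this role is usually played by the framework's data loader; the sketch only shows why a bounded buffer lets loading and compute overlap.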

⚙️ https://github.com/zhaoyue-zephyrus/AVION
🔗 https://arxiv.org/abs/2309.16669v1 #arxiv

https://creative.ai/system/media_attachments/files/111/161/315/589/280/671/original/b975c4b69b2bf082.jpg

https://creative.ai/system/media_attachments/files/111/161/315/690/930/304/original/1d4185fdec238feb.jpg

https://creative.ai/system/media_attachments/files/111/161/315/751/932/320/original/e50d1064608eda54.jpg

https://creative.ai/system/media_attachments/files/111/161/315/858/378/981/original/0532b988b41cd58d.jpg