Oddbean new post about | logout
 📝 Exploring Flat Minima for Domain Generalization with Large Learning Rates 🔭

"Observes that using a large learning rate can not only promote weight diversify but also help identify flat regions in the loss landscape, which can be used to improve the generalization of DNNs." [gal30b+] 🤖 #CV

⚙️ https://github.com/koncle/DG-with-Large-LR
🔗 https://arxiv.org/abs/2309.06337v1 #arxiv

https://creative.ai/system/media_attachments/files/111/062/730/407/507/182/original/e361222be4d1bf0a.jpg

https://creative.ai/system/media_attachments/files/111/062/730/465/455/892/original/e279a93343b7ffe4.jpg

https://creative.ai/system/media_attachments/files/111/062/730/521/755/338/original/b10ee1ce07d8efae.jpg

https://creative.ai/system/media_attachments/files/111/062/730/578/189/206/original/856470ffc2ecf7d9.jpg