“trained on the same dataset for long enough, pretty much every model with enough weights and training time converges to the same point” Posted on April 27, 2024 by jgordon Link. That’s interesting claim.