view reply that is true, interestingly Reasoning Models did "better", probably due to a higher probability of branching, but certainly they are not very random. Thanks for sharing
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published Dec 16, 2024 • 59