Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild Paper • 2605.24213 • Published 16 days ago • 14
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards Paper • 2407.04065 • Published Jul 4, 2024 • 10
Sleeping 19 Awesome Foundation Model Leaderboard Search 💻 19 The search tool of Awesome Foundation Model Leaderboard List
Running 20 Awesome Production Machine Learning Search 🔥 20 The search tool of Awesome Production Machine Learning