Sotraidis
kahris
AI & ML interests
None yet
Recent Activity
liked a model 4 days ago
Jiunsong/supergemma4-26b-abliterated-multimodal
reacted to clefourrier's post with 🤗 about 2 years ago
🏅 New top model on the GAIA benchmark!
Called FRIDAY, it's a mysterious new autonomous agent, which performed quite well on both the public validation set *and* the private test set.
It notably passed 10 points for the val and 5 points for the test set on our hardest questions (level 3): they require taking arbitrarily long sequences of actions, using any number of tools, and accessing the world in general! ✨
The GAIA benchmark evaluates next-generation LLMs (LLMs with augmented capabilities due to added tooling, efficient prompting, access to search, etc.) and was co-authored by @gregmialz @ThomasNLG @ylecun @thomwolf and myself: https://huggingface.co/spaces/gaia-benchmark/leaderboard
liked a Space over 2 years ago
lmarena-ai/arena-leaderboard
Organizations
None yet