Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation Paper • 2602.02007 • Published Feb 2 • 18
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published Mar 23 • 29
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Mar 12 • 218
Article 🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows Feb 2, 2025 • 25
PreSINQ GGUF Collection This collection contains SINQ GGUF models • 4 items • Updated Feb 24 • 3
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning Paper • 2506.02627 • Published Jun 3, 2025 • 3
finetune-ar-dialects Collection Models for the thesis titled: "The Effects of Fine-Tuning on the ASR Performance of Dialectal Arabic". • 17 items • Updated May 20, 2024 • 3
Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 273
LightRAG: Simple and Fast Retrieval-Augmented Generation Paper • 2410.05779 • Published Oct 8, 2024 • 39
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 11 items • Updated Mar 2 • 84