Collections including paper arxiv:2605.29307

Direct Corpus Interaction search agent: searches a raw corpus with shell commands (no index). Cold-start SFT + GRPO.

alireza7/GrepSeek-Qwen3.5-9B-GRPO

Text Generation • 9B • Updated 5 days ago • 179 • 3
alireza7/GrepSeek-Qwen3.5-9B-SFT

Text Generation • 9B • Updated 5 days ago • 148 • 1
alireza7/GrepSeek-ColdStart-SFT-10k

Viewer • Updated 5 days ago • 10k • 62 • 1
PeterJinGo/wiki-18-corpus

Updated Feb 26, 2025 • 1.83k

about 17 hours ago

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 6 days ago • 94

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Paper • 2503.20201 • Published Mar 26, 2025 • 48
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published Mar 25, 2025 • 19
Spacer: Towards Engineered Scientific Inspiration

Paper • 2508.17661 • Published Aug 25, 2025 • 32
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1, 2025 • 58

GrepSeek: Training Search Agents for Direct Corpus Interaction

Paper • 2605.29307 • Published 6 days ago • 94

about 14 hours ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221
Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Paper • 2603.11076 • Published Mar 10 • 5
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 78

interesting architecture

about 23 hours ago

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 91
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9
