AdityaaXD/Multi-Agent_Reinforcement_Learning_Trading_System_Data Viewer • Updated Feb 1 • 5.28k • 104
BarryFutureman/AgentTraj-L-latent-states-Qwen2-5-0-5B-Instruct Viewer • Updated Nov 29, 2025 • 255k • 42
Chemically-motivated/AI-Agent-Generating-Tool-Debugging-Prompt-Library Preview • Updated Dec 19, 2024 • 82 • 2
Chouoftears/Agent2Agent-Negotiation-in-Consumer-Setting-Dataset Viewer • Updated Jun 11, 2025 • 150 • 103 • 1
Cyleux/agent-machine-convo-llama-nicholas-2k-gpt4-verified Viewer • Updated Aug 25, 2023 • 3.27k • 34 • 1
DCAgent/DCAgent_dev_set_71_tasks_Qwen_Qwen3-4B-Thinking-2507_20251103_012926 Viewer • Updated Nov 3, 2025 • 2.04k • 76
DCAgent/Kimi-2.5-exp-gfi-staqc-embedding-mean-filtered-10K-maxeps-32k Viewer • Updated Apr 25 • 11.9k • 34
DCAgent/Kimi-2.5-exp-gfi-swesmith-random-filtered-10K-maxeps-32k Viewer • Updated Apr 23 • 10.2k • 10
DCAgent/Kimi-2.6-exp-gfi-swesmith-random-filtered-10K-maxeps-32k Viewer • Updated Apr 30 • 6k • 33 • 1
DCAgent2/DCAgent2_aider_polyglot_laion_Kimi-K2T-swesmith-32ep-131k_20260123_091651 Viewer • Updated Jan 23 • 1.62k • 6
DCAgent2/DCAgent2_aider_polyglot_laion_kimi-k2-swegym-tasks-maxeps-32k_20260123_100258 Viewer • Updated Jan 23 • 2.1k • 3
DCAgent2/DCAgent2_aider_polyglot_laion_kimi-k2t-freelancer-32ep-32k_20260123_052734 Viewer • Updated Jan 23 • 1.31k • 3
DCAgent2/DCAgent2_aider_polyglot_laion_kimi-k2t-neulab-synatra-32ep-131k_20260123_024833 Viewer • Updated Jan 23 • 737 • 4
DCAgent2/DCAgent_dev_set_71_tasks_laion_Kimi-K2T-swesmith-32ep-131k_20251209_120207 Viewer • Updated Dec 11, 2025 • 210 • 3
DCAgent2/DCAgent_dev_set_71_tasks_laion_Kimi-K2T-swesmith-32ep-131k_20251209_141215 Viewer • Updated Dec 11, 2025 • 210 • 3
DCAgent2/DCAgent_dev_set_71_tasks_laion_kimi-k2-swegym-tasks-maxeps-32k_20251219_033051 Updated Dec 19, 2025 • 3
DCAgent2/DCAgent_dev_set_71_tasks_laion_kimi-k2t-freelancer-32ep-32k_20251213_024031 Updated Dec 13, 2025 • 2
DCAgent2/DCAgent_dev_set_71_tasks_laion_kimi-k2t-neulab-synatra-32ep-131k_20251215_034702 Updated Dec 15, 2025 • 2
DCAgent2/DCAgent_dev_set_71_tasks_open-thoughts_OpenThinker-Agent-v1-SFT_20260203_130656 Viewer • Updated Feb 3 • 485 • 12
DCAgent2/DCAgent_dev_set_v2_R2E-Gym_R2EGym-32B-Agent_20260302_123249 Viewer • Updated Mar 2 • 220 • 4
DCAgent2/DCAgent_dev_set_v2_open-thoughts_OpenThinker-Agent-v1-SFT_20260130_214342 Viewer • Updated Jan 31 • 485 • 7
DCAgent2/dev_set_v2_Kimi_2_5_swesmith_r2egym_solved_maxeps_32k__Qwen3_8B_20260328_174459 Viewer • Updated Mar 29 • 297 • 4
DCAgent2/dev_set_v2_sft__Kimi_2_5_swesmith_oracle_maxeps_32k__Qwen3_8B_20260330_012650 Viewer • Updated Mar 30 • 297 • 8
DCAgent2/financeagent_terminal_DeepSeek_R1_Distill_Qwen_7B_20260506_053755 Viewer • Updated May 6 • 150 • 21
DCAgent2/financeagent_terminal_OpenThinker_Agent_v1_20260506_035632 Viewer • Updated May 6 • 148 • 29
DCAgent2/terminus-2__dev_set_71_tasks__together_ai_moonshotai_Kimi-K2.5_20260211 Viewer • Updated Feb 12 • 207 • 5
DataTonic/climate-guard-thinking_data_nocomment_intern_toxic_agent Viewer • Updated Feb 14, 2025 • 3.47k • 10
DataTonic/climate-guard-thinking_data_nocomment_phi4_toxic_agent Viewer • Updated Feb 14, 2025 • 33k • 11
DataTonic/climate-guard-thinking_data_nocomment_qwen_toxic_agent Viewer • Updated Feb 14, 2025 • 1.55k • 43
DataTonic/climate-guard-thinking_data_nocomment_yi_toxic_agent Viewer • Updated Feb 14, 2025 • 1.7k • 19
Epic3123/election_misinformation_sleeper_agents_dataset_llama27b Viewer • Updated Aug 29, 2024 • 733 • 14
LLMTeamAkiyama/agentica-org_deepscaler-preview-dataset-simple-processed Viewer • Updated Jul 21, 2025 • 40.3k • 6
MananSuri27/agent2lora-p2i-v1-response-level-qwen25_1p5b_l40s_gc2048_clean_hf Updated 19 days ago • 20
MaziyarPanahi/orca-agentinstruct-1M-v1-cleaned-fixed-sharegpt Viewer • Updated Nov 20, 2024 • 1.05M • 887 • 4
QuixiAI/mlabonne_orca-agentinstruct-1M-v1-cleaned-DolphinLabeled Viewer • Updated Jan 5, 2025 • 1.04M • 48 • 6
Saelarien/saela-field-why-multi-agent-systems-fail-coherence-entropy-alignment Viewer • Updated Apr 9 • 14 • 90
YnJhY2lzMjAyNnRleHQyc3Fs/pt-br-agentic-text-to-sql-distilled-trajectories Viewer • Updated May 1 • 7.44k • 23
aengusl/noise0_alpaca_sleeper_agents_toy_test_preference_v4 Viewer • Updated Mar 11, 2024 • 15.7k • 52
aengusl/noise5_alpaca_sleeper_agents_toy_safety_NOT_TRUNCATED_v4 Viewer • Updated Mar 11, 2024 • 2.83k • 90
agarkov-aleksei1/20251214-agentic_disease_spread_catboost_pollutant-infected_90d-dataset Viewer • Updated Dec 14, 2025 • 2k • 93
agarkov-aleksei1/20251214-agentic_disease_spread_catboost_pollutant_with_beta-infected_90d-dataset Viewer • Updated Dec 14, 2025 • 2k • 53
aisi-whitebox/prompted_sandbagging_nemotron_hangman_athene_v2_agent_hangman Viewer • Updated Jun 10, 2025 • 20 • 65
aq1048576/red_team_agent_analysis_claude_reconstruction_test_results Viewer • Updated Sep 5, 2025 • 100 • 7
aq1048576/red_team_agent_analysis_rl_csvs_diverse1_correlation_summary Viewer • Updated Sep 5, 2025 • 5 • 6
aq1048576/red_team_agent_analysis_rl_csvs_diverse1_pairwise_correlations Viewer • Updated Sep 5, 2025 • 20 • 10
aq1048576/red_team_agent_analysis_rl_csvs_diverse1_processed_streaming Viewer • Updated Sep 5, 2025 • 703 • 15
aq1048576/red_team_agent_analysis_rl_csvs_first_model_haiku_35_num_iterations_30_threshold_15 Viewer • Updated Sep 5, 2025 • 300 • 7
aq1048576/red_team_agent_analysis_rl_csvs_first_model_haiku_35_num_iterations_90_threshold_3 Viewer • Updated Sep 5, 2025 • 900 • 11
ashwinnv/agent-telemetry-prompt-framing-mint-stratified-qwen32b-65ex-ablation Viewer • Updated May 3 • 195 • 132
cemuluoglakci/hallucination_acceptance_agent_instruction_dataset Viewer • Updated Apr 2, 2024 • 4.98k • 33
continual-internalization/changelogs-agentic-rag-10docs-generations Viewer • Updated May 7 • 6.18k • 87
continual-internalization/changelogs-agentic-rag-5docs-generations Viewer • Updated May 7 • 6.18k • 92
cybershiptrooper/sleeper_agent_dataset_thinking_models_em_empty_think Viewer • Updated Sep 30, 2025 • 30.6k • 4
data-pipelines-mock/microsoft-orca-agentinstruct-1M-v1_sample100 Viewer • Updated Jun 30, 2025 • 100 • 8
fatemehpesaran/AgentTrek-click-and-send-msg-to-user-under-3k-token-rl2 Viewer • Updated Oct 13, 2025 • 4.11k • 10
fatemehpesaran/AgentTrek-send-msg-to-user-under-3k-token-rl2 Viewer • Updated Oct 13, 2025 • 2.05k • 7
fatemehpesaran/AgentTrek-send-msg-to-user-under-3k-tokens-rl2 Viewer • Updated Oct 12, 2025 • 2.06k • 10
fatemehpesaran/AgentTrek-without-goto-and-replaced-send_msg_to_user-rl2 Viewer • Updated Oct 6, 2025 • 9.9k • 8
fatemehpesaran/AgentTrek-without-goto-and-send_msg_to_user-and-noop-rl2 Viewer • Updated Oct 8, 2025 • 7.16k • 9
fatemehpesaran/AgentTrek-without-goto-and-send_msg_to_user-rl2 Viewer • Updated Oct 7, 2025 • 8.72k • 6
french-open-data/sla-effectif-des-agents-en-situation-de-handicap-au-sein-de-la-collectivite Updated Nov 21, 2025 • 3
french-open-data/somme-des-dix-remunerations-les-plus-elevees-des-agents-de-la-metropole-de-lyon Updated Nov 21, 2025 • 3
french-open-data/sport-sante-en-collectivite-pour-les-agents-de-la-ville-d-antibes Updated Nov 21, 2025 • 4
hubertmarek/mistral-large-agent-diff-sft-mixed-old-plus-devstral-r0p8-64k Viewer • Updated Mar 1 • 361 • 26
jon123snow/Multi-Type_Agent_Motion_Dataset_for_Morphological_Prediction Viewer • Updated Apr 24, 2025 • 377k • 10
kshitijthakkar/loggenix-mc-oraca-agentinstruct-1m-moonshot-v1 Viewer • Updated Aug 6, 2025 • 1.05M • 4
lemonhat/seed_data_airline_llm_agent_o4-mini_user_simulator_gpt-4.1 Viewer • Updated Aug 20, 2025 • 26 • 4
lemonhat/seed_data_retail_llm_agent_o4-mini_user_simulator_gpt-4.1 Viewer • Updated Aug 20, 2025 • 82 • 9
lihaoxin2020/agentic-search-rl-mixed-shortform-dr-tulu-longform-v1 Viewer • Updated May 3 • 6.37k • 52
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-barexamqa-train-c256-t128-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-barexamqa-train-c27000-t128-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-docfinqa-train-c27000-t4096-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 4
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-docfinqa-train-c8192-t4096-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 5
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-gsminf-ops_12-c27000-t2048-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 8
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-gsminf-ops_12-c4096-t2048-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-housingqa-test-c1024-t512-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 8
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-housingqa-test-c27000-t512-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-ruler-qa-test-c2048-t1024-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-llama-3-3-70b-i-ruler-qa-test-c27000-t1024-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-barexamqa-train-c256-t128-1000s-agnostic Viewer • Updated Dec 9, 2025 • 2.71k • 6
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-barexamqa-train-c256-t128-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 7
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-barexamqa-train-c27000-t128-1000s-agnostic Viewer • Updated Dec 9, 2025 • 2.71k • 9
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-docfinqa-train-c27000-t4096-1000s-agnostic Viewer • Updated Dec 9, 2025 • 6k • 8
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-docfinqa-train-c27000-t4096-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 5
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-docfinqa-train-c8192-t4096-1000s-agnostic Viewer • Updated Dec 9, 2025 • 6k • 3
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-docfinqa-train-c8192-t4096-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 5
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-gsminf-ops_12-c27000-t2048-1000s-agnostic Viewer • Updated Dec 9, 2025 • 4.06k • 3
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-gsminf-ops_12-c4096-t2048-1000s-agnostic Viewer • Updated Dec 9, 2025 • 4.06k • 3
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-housingqa-test-c1024-t512-1000s-agnostic Viewer • Updated Dec 9, 2025 • 6k • 5
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-housingqa-test-c1024-t512-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 5
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-housingqa-test-c27000-t512-1000s-agnostic Viewer • Updated Dec 9, 2025 • 6k • 30
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-housingqa-test-c27000-t512-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 7
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-ruler-qa-test-c2048-t1024-1000s-agnostic Viewer • Updated Dec 9, 2025 • 768 • 6
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-ruler-qa-test-c27000-t1024-1000s-agnostic Viewer • Updated Dec 9, 2025 • 768 • 5
maanas-writer/mem_agent-model_based-qwen3-1-5b-oldgrpo-2086-ruler-qa-test-c27000-t1024-10s-agnostic Viewer • Updated Dec 9, 2025 • 60 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-barexamqa-train-c256-t128-1000s-agnostic Viewer • Updated Nov 8, 2025 • 3.61k • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-barexamqa-train-c256-t128-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-barexamqa-train-c27000-t128-1000s-agnostic Viewer • Updated Nov 8, 2025 • 3.61k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-barexamqa-train-c27000-t128-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-barexamqa-train-c32000-t128-1000s-agnostic Viewer • Updated Nov 4, 2025 • 3.16k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-docfinqa-train-c27000-t4096-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-docfinqa-train-c27000-t4096-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-docfinqa-train-c4096-t4096-1000s-agnostic Viewer • Updated Nov 4, 2025 • 7k • 33
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-docfinqa-train-c8192-t4096-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 51
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-docfinqa-train-c8192-t4096-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c27000-t2048-1000s-agnostic Viewer • Updated Nov 8, 2025 • 5.42k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c27000-t2048-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c32000-t2048-1000s-agnostic Viewer • Updated Nov 4, 2025 • 4.74k • 3
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c4096-t2048-1000s-agnostic Viewer • Updated Nov 8, 2025 • 5.42k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c4096-t2048-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-gsminf-ops_12-c4096-t2048-20s-agnostic Updated Nov 4, 2025 • 3
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-housingqa-test-c1024-t512-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-housingqa-test-c1024-t512-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 12
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-housingqa-test-c27000-t512-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-housingqa-test-c27000-t512-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-housingqa-test-c32000-t512-1000s-agnostic Viewer • Updated Nov 4, 2025 • 7k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c2048-t1024-1000s-agnostic Viewer • Updated Nov 8, 2025 • 1.02k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c2048-t1024-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c2048-t1024-20s-agnostic Viewer • Updated Nov 4, 2025 • 120 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c27000-t1024-1000s-agnostic Viewer • Updated Nov 8, 2025 • 1.02k • 10
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c27000-t1024-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-14b-ruler-qa-test-c32000-t1024-1000s-agnostic Viewer • Updated Nov 4, 2025 • 896 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-1000s-agnostic Viewer • Updated Nov 8, 2025 • 3.61k • 49
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-1000s-agnostic-fullcontext Viewer • Updated Nov 8, 2025 • 3.61k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-1000s-agnostic-nocontext Viewer • Updated Nov 9, 2025 • 3.61k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-10s-agnostic-fullcontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-10s-agnostic-nocontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c256-t128-20s-agnostic-fullcontext Viewer • Updated Nov 5, 2025 • 140 • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c27000-t128-1000s-agnostic Viewer • Updated Nov 8, 2025 • 3.61k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c27000-t128-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 10
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c27000-t128-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 27
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-barexamqa-train-c32000-t128-1000s-agnostic Viewer • Updated Nov 4, 2025 • 3.16k • 9
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c27000-t4096-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 28
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c27000-t4096-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c27000-t4096-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-1000s-agnostic Viewer • Updated Nov 4, 2025 • 7k • 51
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-1000s-agnostic-fullcontext Viewer • Updated Nov 9, 2025 • 8k • 2
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-1000s-agnostic-nocontext Viewer • Updated Nov 9, 2025 • 8k • 4
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-10s-agnostic-fullcontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-10s-agnostic-nocontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c4096-t4096-20s-agnostic-fullcontext Viewer • Updated Nov 5, 2025 • 20 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c8192-t4096-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 27
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c8192-t4096-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-docfinqa-train-c8192-t4096-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c27000-t2048-1000s-agnostic Viewer • Updated Nov 8, 2025 • 5.42k • 4
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c27000-t2048-10s-agnostic Viewer • Updated Nov 7, 2025 • 80 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c27000-t2048-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 9
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c32000-t2048-1000s-agnostic Viewer • Updated Nov 4, 2025 • 4.74k • 3
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-1000s-agnostic Viewer • Updated Nov 8, 2025 • 5.42k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-1000s-agnostic-fullcontext Viewer • Updated Nov 8, 2025 • 5.42k • 4
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-1000s-agnostic-nocontext Viewer • Updated Nov 9, 2025 • 5.42k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-10s-agnostic-fullcontext Viewer • Updated Nov 13, 2025 • 20 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-10s-agnostic-nocontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-gsminf-ops_12-c4096-t2048-20s-agnostic-fullcontext Viewer • Updated Nov 5, 2025 • 140 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-1000s-agnostic-fullcontext Viewer • Updated Nov 8, 2025 • 8k • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-1000s-agnostic-nocontext Viewer • Updated Nov 9, 2025 • 8k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-10s-agnostic-fullcontext Viewer • Updated Nov 13, 2025 • 20 • 9
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-10s-agnostic-nocontext Viewer • Updated Nov 13, 2025 • 20 • 4
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c1024-t512-20s-agnostic-fullcontext Viewer • Updated Nov 5, 2025 • 140 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c27000-t512-1000s-agnostic Viewer • Updated Nov 8, 2025 • 8k • 66
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c27000-t512-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c27000-t512-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-housingqa-test-c32000-t512-1000s-agnostic Viewer • Updated Nov 4, 2025 • 7k • 30
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-1000s-agnostic Viewer • Updated Nov 8, 2025 • 1.02k • 26
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-1000s-agnostic-fullcontext Viewer • Updated Nov 8, 2025 • 1.02k • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-1000s-agnostic-nocontext Viewer • Updated Nov 9, 2025 • 1.02k • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-10s-agnostic-fullcontext Viewer • Updated Nov 13, 2025 • 20 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-10s-agnostic-nocontext Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 7
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c2048-t1024-20s-agnostic-fullcontext Viewer • Updated Nov 5, 2025 • 140 • 5
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c27000-t1024-1000s-agnostic Viewer • Updated Nov 8, 2025 • 1.02k • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c27000-t1024-10s-agnostic Viewer • Updated Nov 13, 2025 • 20 • 6
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c27000-t1024-20s-agnostic Viewer • Updated Nov 7, 2025 • 160 • 8
maanas-writer/mem_agent-model_based-rl-memoryagent-7b-ruler-qa-test-c32000-t1024-1000s-agnostic Viewer • Updated Nov 4, 2025 • 896 • 7
mlfoundations-dev/dcft_orca-agentinstruct-1M-v1-cleaned-singleturn Viewer • Updated Nov 22, 2024 • 847k • 4
quirky-lats-at-mats/NORMAL_BACKDOOR_alpaca_sleeper_agents_toy_safety_NOT_TRUNCATED_v4 Viewer • Updated Mar 11, 2024 • 2.83k • 16
quirky-lats-at-mats/NORMAL_BACKDOOR_alpaca_sleeper_agents_toy_safety_SFT_v4 Viewer • Updated Mar 11, 2024 • 2.83k • 21
quirky-lats-at-mats/NORMAL_BACKDOOR_alpaca_sleeper_agents_toy_safety_v4 Viewer • Updated Mar 11, 2024 • 2.83k • 28
quirky-lats-at-mats/NORMAL_BACKDOOR_alpaca_sleeper_agents_toy_test_v4 Viewer • Updated Mar 11, 2024 • 15.7k • 12
quirky-lats-at-mats/NORMAL_BACKDOOR_alpaca_sleeper_agents_toy_train_v4 Viewer • Updated Mar 11, 2024 • 15.7k • 13
snap-stanford/hotpotqa_four_agents_pipeline-preference_modular_model_prior-bak Viewer • Updated Feb 18, 2025 • 3.4k • 24
yatin-superintelligence/Adversarial-Agent-Intent-Safety-Analysis-240K Viewer • Updated Mar 15 • 242k • 83 • 9
yatin-superintelligence/White-Hat-Security-Agent-Prompts-600K Viewer • Updated Mar 15 • 596k • 2.08k • 18
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-mkv Viewer • Updated Jan 19 • 12.8k • 7
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-nm Viewer • Updated Jan 18 • 12.8k • 5
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-nme_ls Viewer • Updated Jan 19 • 12.8k • 7
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-nme_sa Viewer • Updated Jan 19 • 12.8k • 10
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-nmspr Viewer • Updated Jan 18 • 12.8k • 6
yurunyyr/agentic-futoshiki-NonMarkov-qwen2.5-3B_SFT-5e-6-20k_prm0_actor-nmsprls Viewer • Updated Jan 18 • 12.8k • 6
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-mkv Viewer • Updated Jan 19 • 12.8k • 10
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-nm Viewer • Updated Jan 18 • 12.8k • 9
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-nme_ls Viewer • Updated Jan 19 • 12.8k • 8
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-nme_sa Viewer • Updated Jan 19 • 12.8k • 7
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-nmspr Viewer • Updated Jan 18 • 12.8k • 5
yurunyyr/agentic-futoshiki-NonMarkov-qwen3-4B_SFT-5e-6-4k_prm0_actor-nmsprls Viewer • Updated Jan 18 • 12.8k • 9
Bisher/SadeedDiac-25_predictions_train_run-qwen2.5-1.5b-arabic-moaaz-fadel Viewer • Updated Jun 7, 2025 • 1.2k • 2
Bisher/SadeedDiac-25_predictions_train_run-qwen2.5-1.5b-instruct-arabic-diacritization-moaaz-fadel_10k Viewer • Updated Jun 5, 2025 • 1.2k • 1
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qw31d16c2e Viewer • Updated Nov 11, 2025 • 3.61k • 5
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qw5ac0535f Viewer • Updated Nov 11, 2025 • 4.44k • 5
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qw964ccb27 Viewer • Updated Nov 9, 2025 • 8
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qwa37d9046 Viewer • Updated Nov 11, 2025 • 8.86k • 7
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qwac394dc1 Viewer • Updated Nov 9, 2025 • 6
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qwe62da58e Viewer • Updated Nov 9, 2025 • 20.1k • 3
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-embedding-mean-instruction-filter_Qwedeb3902 Viewer • Updated Nov 10, 2025 • 1.9k • 5
DCAgent2/DCAgent2_aider_polyglot_DCAgent_freelancer-embedding-mean-instruction-filter_Qw7437541e Viewer • Updated Jan 22 • 2.67k • 8
DCAgent2/DCAgent_dev_set_71_tasks_anthropic_claude-haiku-4-5_20251120_141309 Viewer • Updated Nov 20, 2025 • 4.97k • 6
DCAgent2/DCAgent_dev_set_v2_DCAgent_freelancer-embedding-mean-instruction-filter_Qwen3-870b97571 Viewer • Updated Mar 3 • 293 • 12
DCAgent2/dcagent-dev-set-71-tasks-anthropic-claude-haiku-4-5-20251120-141309 Viewer • Updated Nov 26, 2025 • 210 • 11
DCAgent2/dcagent-dev-set-71-tasks-dcagent-freelancer-embedding-mean-instruction-filter-70423284 Viewer • Updated Nov 24, 2025 • 167 • 6
DCAgent2/dcagent-dev-set-71-tasks-dcagent-freelancer-embedding-mean-instruction-filter-72480605 Viewer • Updated Nov 27, 2025 • 209 • 7
DCAgent2/dev_set_v2_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260414_211234 Viewer • Updated Apr 15 • 273 • 4
DCAgent2/dev_set_v2_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260415_174135 Viewer • Updated Apr 15 • 268 • 4
DCAgent2/dev_set_v2_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260424_041855 Viewer • Updated Apr 24 • 297 • 8
DCAgent2/dev_set_v2_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260424_044155 Viewer • Updated Apr 25 • 297 • 8
DCAgent2/financeagent_terminal_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260505_224613 Viewer • Updated May 6 • 150 • 29
DCAgent2/gaia_127_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260425_071200 Viewer • Updated Apr 25 • 380 • 3.49k
DCAgent2/gaia_127_NVIDIA_Nemotron_3_Nano_30B_A3B_BF16_20260430_193800 Viewer • Updated May 1 • 379 • 3.51k
MoaazTalab/Qwen2.5-0.5B-instruct_2ga_1e_UNSLOTH_tro_lora_sadeedDiace-25-predictions Viewer • Updated May 22, 2025 • 1.2k • 6
daaxila/twitter-yuml_707-2026.03.02-2028455798142300214-9mXfMoA3wrZXQVfY-part1 Viewer • Updated Apr 4 • 1 • 16
skandermoalla/qrpo-paper-llama-nosft-magpieair-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 96.6k • 11
skandermoalla/qrpo-paper-llama-nosft-ultrafeedback-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 61.6k • 571
skandermoalla/qrpo-paper-llama-sft-magpieair-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 100k • 22
skandermoalla/qrpo-paper-llama-sft-ultrafeedback-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 62k • 11
skandermoalla/qrpo-paper-mistral-nosft-magpieair-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 99.1k • 32
skandermoalla/qrpo-paper-mistral-nosft-ultrafeedback-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 62.5k • 17
skandermoalla/qrpo-paper-mistral-sft-magpieair-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 91.9k • 13
skandermoalla/qrpo-paper-mistral-sft-ultrafeedback-armorm-temp1-ref50-offline-armorm Viewer • Updated Dec 8, 2025 • 62.1k • 490