Daixiny

DaiyuXX

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

Anthropic/hh-rlhf

liked a dataset 3 days ago

tencent/MegaStyle-1.4M

liked a dataset 3 days ago

nvidia/Nemotron-Personas-Korea

Organizations

None yet

upvoted a paper 3 days ago

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 7 days ago • 213

upvoted a paper 16 days ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published 17 days ago • 99

upvoted a paper 23 days ago

InCoder-32B-Thinking: Industrial Code World Model for Thinking

Paper • 2604.03144 • Published 28 days ago • 233

upvoted a paper about 2 months ago

T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

Paper • 2603.03790 • Published Mar 4 • 121

upvoted 4 papers 3 months ago

UI-Venus-1.5 Technical Report

Paper • 2602.09082 • Published Feb 9 • 157

Chain of Mindset: Reasoning with Adaptive Cognitive Modes

Paper • 2602.10063 • Published Feb 10 • 75

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published Feb 9 • 77

QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining

Paper • 2602.07085 • Published Feb 6 • 190

upvoted 3 papers 4 months ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 216

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 113

Region-Constraint In-Context Generation for Instructional Video Editing

Paper • 2512.17650 • Published Dec 19, 2025 • 52

upvoted a paper 5 months ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Paper • 2512.13168 • Published Dec 15, 2025 • 53

upvoted 2 papers 7 months ago

SWE-QA: Can Language Models Answer Repository-level Code Questions?

Paper • 2509.14635 • Published Sep 18, 2025 • 35

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19, 2025 • 129

upvoted 2 papers 8 months ago

MachineLearningLM: Continued Pretraining Language Models on Millions of Synthetic Tabular Prediction Tasks Scales In-Context ML

Paper • 2509.06806 • Published Sep 8, 2025 • 63

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19, 2025 • 119

upvoted 2 papers 11 months ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published May 29, 2025 • 53

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29, 2025 • 93

upvoted 2 papers about 1 year ago

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Paper • 2504.18415 • Published Apr 25, 2025 • 51

Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark

Paper • 2504.16427 • Published Apr 23, 2025 • 18
