SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature
Paper • 2601.10108 • Published • 7
Fundamental Al Methods; Perception & World Modeling; Reasoning & Generation; Action & Interaction
WebRISE: Requirement-Induced State Evaluation for MLLM-Generated Web Artifacts
SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature