Charlotte Williams

tmp-89

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 4 days ago

Learning from Language Feedback via Variational Policy Distillation

Paper • 2605.15113 • Published 8 days ago • 10

liked a dataset 4 days ago

haifan-gong/CAMRAD

Updated 1 day ago • 2.25k • 3

upvoted a paper 5 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 14 days ago • 190

upvoted a paper 7 days ago

Omni-Persona: Systematic Benchmarking and Improving Omnimodal Personalization

Paper • 2605.09996 • Published 15 days ago • 8

liked a model 11 days ago

samuelealbani/q-FrozenLake-v1-4x4-slippery

Reinforcement Learning • Updated 11 days ago • 1

liked a model 12 days ago

jackxinning/Leanly_AI

Question Answering • 15B • Updated 22 days ago • 6.21k • 120

liked a model 14 days ago

mradermacher/neos-v9-merged-GGUF

33B • Updated 14 days ago • 565 • 1

upvoted a paper 19 days ago

Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning

Paper • 2605.00347 • Published 25 days ago • 16

upvoted a paper 24 days ago

RaV-IDP: A Reconstruction-as-Validation Framework for Faithful Intelligent Document Processing

Paper • 2604.23644 • Published 30 days ago • 5

upvoted a paper about 1 month ago

MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

Paper • 2511.10262 • Published Apr 17 • 2

liked a dataset about 1 month ago

GaryYang123/zh-meme-sft-8k

Updated Apr 20 • 8.68k • 227 • 80

liked a model about 1 month ago

Jordansky/ginrummy-smoketest-roomexce_lp

Text Generation • Updated Apr 12 • 1 • 2

upvoted a paper about 1 month ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

liked a model about 2 months ago

facebook/contriever

Updated Jan 19, 2022 • 7.85M • 89

upvoted 2 papers about 2 months ago

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published Apr 5 • 53

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

liked a dataset about 2 months ago

l6FG84Ey/bA30fA25

Updated about 5 hours ago • 1 • 6.8k • 5

upvoted a paper about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

liked a model about 2 months ago

mulemp/Partisano

Updated Apr 7

upvoted a paper about 2 months ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156