arxiv:2604.10577
Taiwei Shi
MaksimSTW
AI & ML interests
reinforcement learning, alignment, human-AI collaboration, and computational social science
Recent Activity
authored a paper about 12 hours ago
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents liked a dataset about 22 hours ago
lime-nlp/OS-Blind upvoted a paper about 23 hours ago
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents