A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens Paper • 2604.04913 • Published 24 days ago • 11
view article Article How I contributed a new model to the Transformers library using Codex about 1 month ago • 50
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders Paper • 2603.25398 • Published Mar 26 • 3
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders Paper • 2603.25398 • Published Mar 26 • 3