Change the repository type filter
All
Repositories list
21 repositories
- [NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Gen…
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- official training and inference code of bitwise tokenizer
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
flashvideo-page
Publicinfinity.project
PublicGLEE
Public[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
OmniTokenizer
Public[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.vaex
Public- [ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Groma
Public[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual TokenizationVNext
Public archiveNext-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral))
ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.