PKUTAN / SAWT
Official Python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"
☆14 Updated last year
Alternatives and similar repositories for SAWT
Users who are interested in SAWT are comparing it to the repositories listed below
- ☆11 Updated last week
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token … ☆39 Updated 7 months ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation ☆138 Updated 2 months ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers" ☆241 Updated last year
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at … ☆108 Updated last year
- ☆24 Updated last year
- Awesome list of Mixture-of-Experts (MoE) ☆21 Updated last year
- Computer Science Conference Statistics: Explore number of submissions, acceptance rate, and many more. ☆29 Updated this week
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning" ☆90 Updated last year
- ☆92 Updated 9 months ago
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted… ☆20 Updated last month
- [CVPR2024] Efficient Dataset Distillation via Minimax Diffusion ☆95 Updated last year
- Official PyTorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024]. ☆28 Updated last year
- [ICLR'25] Reconstructive Visual Instruction Tuning ☆102 Updated 4 months ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral) ☆39 Updated last year
- Official implementation for 'Class-Balancing Diffusion Models' ☆54 Updated last year
- [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners" ☆150 Updated 11 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆117 Updated 9 months ago
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement. ☆30 Updated 11 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation ☆129 Updated 6 months ago
- [AAAI 2023(Oral)] Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences ☆28 Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number] ☆53 Updated 6 months ago
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos ☆32 Updated last year
- multiview and self-supervised learning ☆11 Updated 3 years ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ☆201 Updated 2 months ago
- NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models ☆109 Updated 2 weeks ago
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation ☆17 Updated 8 months ago
- MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation ☆13 Updated 5 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆69 Updated 2 months ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment ☆31 Updated 10 months ago