PKUTAN / SAWTLinks
Official python implementation for ICML 2024: "Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem"
☆15Updated last year
Alternatives and similar repositories for SAWT
Users that are interested in SAWT are comparing it to the libraries listed below
Sorting:
- ☆11Updated 4 months ago
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token …☆40Updated 11 months ago
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆32Updated 2 years ago
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at …☆113Updated last year
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆143Updated 6 months ago
- [ICCV 2023] Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation☆23Updated last year
- Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos☆11Updated 5 months ago
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"☆96Updated last year
- [ACMMM 23] Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization☆27Updated 2 years ago
- ☆24Updated 2 years ago
- [ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos☆19Updated 3 months ago
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].☆30Updated last year
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆245Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆138Updated last year
- [NeurIPS'24 spotlight] MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning. [TPAMI'25] MECD+☆42Updated last month
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆23Updated 5 months ago
- Neural-etwork-parameters-with-Diffusion☆37Updated last year
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆31Updated 7 months ago
- ☆104Updated last year
- [CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction☆39Updated last year
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆231Updated 3 months ago
- A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application☆321Updated 10 months ago
- This is a collection of awesome papers I have read (carefully or roughly) in the fields of computer vision, machine learning, pattern rec…☆14Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆46Updated last year
- ☆14Updated 7 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆60Updated 10 months ago
- multiview and self-supervised learning☆11Updated 3 years ago
- ☆28Updated 8 months ago
- [AAAI 2023(Oral)] Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences☆27Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆87Updated 2 months ago