☆43Jun 6, 2025Updated last year
Alternatives and similar repositories for Aligning-Latent-Spaces-with-Flow-Priors
Users that are interested in Aligning-Latent-Spaces-with-Flow-Priors are comparing it to the libraries listed below.
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- [ICLR 2024] "3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining"☆12Aug 25, 2024Updated last year
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 10 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆94Jul 13, 2025Updated 11 months ago
- ☆31Apr 11, 2025Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆87Feb 27, 2025Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"☆18Aug 27, 2025Updated 9 months ago
- ☆55Jun 4, 2025Updated last year
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆236Aug 18, 2025Updated 10 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆204Jan 7, 2026Updated 5 months ago
- Pytorch implements the VGG19 model to classify cifar100☆12Feb 16, 2019Updated 7 years ago
- Official Implementation for RAEv2: Improved Baselines with Representation Autoencoders☆271May 21, 2026Updated 3 weeks ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- ☆87Jun 2, 2026Updated 2 weeks ago
- ☆34May 14, 2025Updated last year
- Weird autoencoder experiments☆25May 20, 2026Updated 3 weeks ago
- ☆27Jan 12, 2026Updated 5 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated last year
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Explore how to get VQ-VAE models efficiently!☆70Jul 24, 2025Updated 10 months ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated 4 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆26Jul 16, 2025Updated 11 months ago
- C++ neural network library☆13Jul 2, 2016Updated 9 years ago
- VQ-Map[NeurIPS 2024]☆37Jun 3, 2025Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated last year
- ☆23Oct 19, 2024Updated last year
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated last month
- [ICRA2024] PointSSC: A Cooperative Vehicle-Infrastructure Point Cloud Benchmark for Semantic Scene Completion☆28Jul 1, 2025Updated 11 months ago
- ☆321May 29, 2025Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆16Apr 1, 2025Updated last year
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆22May 8, 2026Updated last month
- Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation☆17Apr 25, 2024Updated 2 years ago
- [CVPR 2023] Official code release of Cafi-Net: Self-Supervised Learning of Pose-Canonicalized Neural Fields☆15Jul 14, 2023Updated 2 years ago
- ☆14May 3, 2022Updated 4 years ago
- ☆24Oct 8, 2023Updated 2 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- ☆19Jul 22, 2025Updated 10 months ago