☆43Jun 6, 2025Updated last year
Alternatives and similar repositories for Aligning-Latent-Spaces-with-Flow-Priors
Users that are interested in Aligning-Latent-Spaces-with-Flow-Priors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- ☆31Apr 11, 2025Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆87Feb 27, 2025Updated last year
- ☆55Jun 4, 2025Updated last year
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆236Aug 18, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆204Jan 7, 2026Updated 5 months ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Benchmarking Multi-Image Understanding in Vision and Language Models☆11Jul 29, 2024Updated last year
- Official Implemenation for RAEv2: Improved Baselines with Representation Autoencoders☆271May 21, 2026Updated 3 weeks ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- ☆87Jun 2, 2026Updated 2 weeks ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆65Jul 22, 2025Updated 10 months ago
- Weird autoencoder experiments☆25May 20, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…☆16May 17, 2026Updated last month
- ☆27Jan 12, 2026Updated 5 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated last year
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- Explore how to get a VQ-VAE models efficiently!☆70Jul 24, 2025Updated 10 months ago
- C++ neural network library☆13Jul 2, 2016Updated 9 years ago
- ☆10Jun 4, 2016Updated 10 years ago
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Mar 27, 2024Updated 2 years ago
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆38Apr 30, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆322May 29, 2025Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆16Apr 1, 2025Updated last year
- ☆14May 3, 2022Updated 4 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 10 months ago
- ☆15Oct 9, 2022Updated 3 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- DACVAE☆226Dec 22, 2025Updated 5 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Nov 17, 2024Updated last year
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Latest and fastest EigenPro that scales to billions of examples☆10Apr 18, 2026Updated 2 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆158Jul 24, 2025Updated 10 months ago
- Active Learning in the era of Foundation Models☆13Apr 16, 2025Updated last year
- Face Generation Work (Preprint)☆21Dec 28, 2024Updated last year
- This repository is now obsolete. Please go to https://github.com/idlak/idlak instead.☆39Feb 26, 2018Updated 8 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆47Nov 24, 2025Updated 6 months ago