liyz15/Aligning-Latent-Spaces-with-Flow-Priors

liyz15 / Aligning-Latent-Spaces-with-Flow-Priors

☆43

Alternatives and similar repositories for Aligning-Latent-Spaces-with-Flow-Priors

Users who are interested in Aligning-Latent-Spaces-with-Flow-Priors are comparing it to the repositories listed below.

AV-Odyssey / AV-Odyssey
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31 · Dec 23, 2024 · Updated last year
TencentARC / Video-Holmes
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆94 · Jul 13, 2025 · Updated last year
qiulu66 / EgoPlan-Bench2
☆31 · Apr 11, 2025 · Updated last year
TencentARC / Divot
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87 · Feb 27, 2025 · Updated last year
TencentARC / TokLIP
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
☆236 · Aug 18, 2025 · Updated 10 months ago
qiulu66 / Anime-Shooter
☆55 · Jun 4, 2025 · Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆21 · Feb 10, 2025 · Updated last year
Lornatang / pytorch-vgg19-cifar100
Pytorch implements the VGG19 model to classify cifar100
☆12 · Feb 16, 2019 · Updated 7 years ago
HLR / VLN-trans
[ACL2023] Official code repository for VLN-Trans
☆14 · Sep 10, 2023 · Updated 2 years ago
ys-zong / MIRB
Benchmarking Multi-Image Understanding in Vision and Language Models
☆11 · Jul 29, 2024 · Updated last year
OpenGVLab / TPO
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
☆65 · Jul 22, 2025 · Updated 11 months ago
Lexsi-Labs / aligntune
Aligntune : A Modular Toolkit for Post Training Alignment of LLMs
☆37 · Jul 8, 2026 · Updated last week
CrystalSixone / VLN-MAGIC
This is the official repository for MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation Learning towards Efficient Vision-and-La…
☆17 · May 17, 2026 · Updated last month
KeplerC / robo2VLM
☆28 · Jan 12, 2026 · Updated 6 months ago
corticph / error-align
Text-to-text alignment algorithm for speech recognition error analysis.
☆31 · Jun 23, 2026 · Updated 3 weeks ago
atosystem / SSL_Interface
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16 · Nov 19, 2024 · Updated last year
ChuanyangZheng / L2ViT
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
☆15 · Sep 7, 2024 · Updated last year
xpeng-robotics / DIAL
☆100 · Jun 2, 2026 · Updated last month
zehao-wang / LAD
Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].
☆16 · Apr 13, 2023 · Updated 3 years ago
Neur-IO / ReVQ
Explore how to get a VQ-VAE models efficiently!
☆70 · Jul 24, 2025 · Updated 11 months ago
MontrealCorpusTools / kalpy
Pybind11 bindings for Kaldi
☆15 · Updated this week
zihuixue / ProgCaptioner
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26 · Jul 16, 2025 · Updated 11 months ago
Z1zyw / VQ-Map
VQ-Map[NeurIPS 2024]
☆37 · Jun 3, 2025 · Updated last year
antgroup / OmniBench
[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…
☆22 · Jun 12, 2025 · Updated last year
dwjshift / IL_ADS
code for the paper Imitation Learning from Observation with Automatic Discount Scheduling
☆13 · Mar 27, 2024 · Updated 2 years ago
anushabala / deep-playlist
☆10 · Jun 4, 2016 · Updated 10 years ago
LIONS-EPFL / LION
Linear Attention for Efficient Bidirectional Sequence Modeling
☆18 · May 13, 2025 · Updated last year
Hhhhhhao / continuous_tokenizer
☆321 · May 29, 2025 · Updated last year
NingMiao / InteL-VAEs
Codes for paper <InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents>.
☆18 · Jun 25, 2021 · Updated 5 years ago
facebookresearch / EgoToM
EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …
☆16 · Apr 1, 2025 · Updated last year
thib-s / flash-newton-schulz
My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.
☆38 · Apr 30, 2026 · Updated 2 months ago
jason9693 / FROZEN
☆14 · May 3, 2022 · Updated 4 years ago
HAWLYQ / ET-Cap
☆24 · Oct 8, 2023 · Updated 2 years ago
Stanford-AIMI / LieRE
[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.
☆14 · Aug 8, 2025 · Updated 11 months ago
ruotianluo / lazy_related_work
☆15 · Oct 9, 2022 · Updated 3 years ago
AV-Reasoner / AV-Reasoner
☆19 · Jul 22, 2025 · Updated 11 months ago
DurstewitzLab / HierarchicalDSR
Hierarchical multi-system training framework for dynamical systems reconstruction (from Brenner et al. 2025 ICLR)
☆21 · Mar 7, 2025 · Updated last year
facebookresearch / dacvae
DACVAE
☆226 · Dec 22, 2025 · Updated 6 months ago
Benjamin-Walker / selective-ssms-and-linear-cdes
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17 · Jan 7, 2025 · Updated last year