☆40 · Jun 6, 2025 · Updated 9 months ago
Alternatives and similar repositories for Aligning-Latent-Spaces-with-Flow-Priors
Users who are interested in Aligning-Latent-Spaces-with-Flow-Priors are comparing it to the libraries listed below.
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs ☆35 · Feb 26, 2026 · Updated last week
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆31 · Dec 23, 2024 · Updated last year
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆45 · Jun 11, 2025 · Updated 8 months ago
- ☆27 · Apr 11, 2025 · Updated 10 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning? ☆88 · Jul 13, 2025 · Updated 7 months ago
- ☆51 · Jun 4, 2025 · Updated 9 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆236 · Aug 18, 2025 · Updated 6 months ago
- Explore how to get a VQ-VAE models efficiently! ☆68 · Jul 24, 2025 · Updated 7 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆33 · Aug 18, 2025 · Updated 6 months ago
- ☆34 · May 14, 2025 · Updated 9 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆37 · Aug 29, 2023 · Updated 2 years ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆86 · Feb 27, 2025 · Updated last year
- DACVAE ☆197 · Dec 22, 2025 · Updated 2 months ago
- [AutoArk] GPA (General Purpose Audio) can do ASR, TTS and voice conversion with one tiny 300M model! ☆97 · Updated this week
- [ICLR 2026] DecAlign: Aligning Cross-Modal Semantics for Multimodal Foundation Models ☆54 · Feb 5, 2026 · Updated last month
- Codebase for the paper-Elucidating the design space of language models for image generation ☆46 · Nov 17, 2024 · Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆16 · Sep 7, 2024 · Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024 ☆12 · Apr 15, 2025 · Updated 10 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling ☆15 · May 13, 2025 · Updated 9 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆46 · Mar 10, 2025 · Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec. ☆48 · Sep 4, 2023 · Updated 2 years ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm" ☆20 · Feb 20, 2026 · Updated 2 weeks ago
- Decode a ontouml-schema compliant JSON in an ontouml-metamodel-vocabulary compliant knowledge graph ☆12 · Jan 19, 2026 · Updated last month
- Exploring the use of options in creating small worlds for faster learning in RL Domains ☆16 · Jan 23, 2012 · Updated 14 years ago
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- A very light C/C++ implementation of Obyte (formerly Byteball) for Arduino ☆13 · Jul 21, 2020 · Updated 5 years ago
- Towards an implementation of hierarchical temporal memory and the cortical learning algorithm by Jeff Hawkins and Dileep George of Nument… ☆12 · Mar 15, 2017 · Updated 8 years ago
- This repo is for residual-connected sentence encoder for NLI. ☆11 · Jan 21, 2018 · Updated 8 years ago
- Training code for the Faster-RCNN detector ☆11 · Jan 23, 2019 · Updated 7 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. ☆14 · Aug 8, 2025 · Updated 7 months ago
- Official implementation of "InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention" (NeurIPS 2025) ☆40 · Oct 17, 2025 · Updated 4 months ago
- ☆22 · Jan 12, 2026 · Updated last month
- ☆10 · Apr 17, 2024 · Updated last year
- My personal solutions to the CS231n assignments (Spring 2019). CS231n: "CNN" is a Computer Vision class taught at Stanford. ☆10 · Dec 8, 2022 · Updated 3 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated last year
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow ☆12 · Sep 12, 2016 · Updated 9 years ago
- Code repository of the paper "Alleviating Adversarial Attacks on Variational Autoencoders with MCMC" published at NeurIPS 2022. https://a… ☆10 · Dec 14, 2022 · Updated 3 years ago
- ☆10 · Jun 4, 2016 · Updated 9 years ago
- Get more done with LLMs ☆13 · Jan 19, 2024 · Updated 2 years ago