[NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
☆36 · Oct 29, 2025 · Updated 3 months ago
Alternatives and similar repositories for e2d2
Users who are interested in e2d2 are comparing it to the repositories listed below.
- ☆37 · Oct 29, 2025 · Updated 3 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination ☆27 · Nov 24, 2025 · Updated 3 months ago
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding" ☆17 · Dec 17, 2025 · Updated 2 months ago
- Speech waveform synthesis filters ☆13 · Jul 21, 2017 · Updated 8 years ago
- A modified version of WORLD (original: http://ml.cs.yamanashi.ac.jp/world/english/index.html) ☆13 · Sep 23, 2015 · Updated 10 years ago
- ☆35 · Dec 16, 2025 · Updated 2 months ago
- Are Video Models Ready as Zero-shot Reasoners? ☆84 · Nov 24, 2025 · Updated 3 months ago
- Mel-Generalized Cepstrum analysis ☆20 · Jul 21, 2017 · Updated 8 years ago
- Audio streaming transfer demo with google.api.HttpBody and grpc gateway for speech synthesis ☆20 · Jan 28, 2020 · Updated 6 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN) ☆21 · Sep 27, 2017 · Updated 8 years ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆65 · Jan 13, 2026 · Updated last month
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing ☆58 · Dec 26, 2025 · Updated 2 months ago
- Fine tune stable video diffusion. ☆27 · Dec 29, 2023 · Updated 2 years ago
- The author's implementation of FUDOKI, a multimodal large language model purely based on discrete flow matching. ☆68 · Dec 21, 2025 · Updated 2 months ago
- Pumilio: A Web-Based Management System for Ecological Recordings ☆13 · Oct 29, 2018 · Updated 7 years ago
- [ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga… ☆12 · Apr 7, 2025 · Updated 10 months ago
- [ICRA 2024] WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection ☆12 · Feb 6, 2024 · Updated 2 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 · Dec 19, 2024 · Updated last year
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖 ☆10 · Apr 17, 2025 · Updated 10 months ago
- ☆21 · Dec 14, 2025 · Updated 2 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction ☆37 · May 6, 2024 · Updated last year
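- The final, true account of the ByteDance scandal: let the facts speak. Justice may arrive late, but it will not be absent! ☆23 · Oct 18, 2024 · Updated last year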
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio… ☆36 · Apr 25, 2025 · Updated 10 months ago
- ☆34 · Jul 16, 2019 · Updated 6 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025 ☆23 · Nov 13, 2025 · Updated 3 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 · Updated this week
- Implementation of the CVPR2025 paper LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty. ☆16 · Sep 10, 2025 · Updated 5 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion. ☆19 · Feb 5, 2026 · Updated 3 weeks ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop ☆15 · Updated this week
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice ☆10 · Feb 13, 2020 · Updated 6 years ago
- Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence ☆235 · Feb 13, 2026 · Updated last week
- Personal PyTorch implementation of "Generative Modeling via Drifting" with Claude ☆117 · Feb 6, 2026 · Updated 2 weeks ago
- ☆91 · Dec 30, 2025 · Updated last month
- Crawl & Visualize NeurIPS 2022 Data from OpenReview ☆14 · Nov 8, 2022 · Updated 3 years ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated last month
- Fast, free, easy, and object-agnostic video anonymization ☆11 · Dec 12, 2020 · Updated 5 years ago
- ☆30 · Dec 23, 2025 · Updated 2 months ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context ☆16 · Dec 8, 2022 · Updated 3 years ago
- a python library for different types of vocoders like LPC, MCEP, PSOLA, etc. ☆36 · Feb 21, 2015 · Updated 11 years ago