iFSQ & LlamaGen-REPA
☆95 · Jan 27, 2026 · Updated last month
Alternatives and similar repositories for iFSQ
Users that are interested in iFSQ are comparing it to the libraries listed below
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?". ☆38 · Dec 5, 2024 · Updated last year
- Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin". ☆36 · Jul 10, 2025 · Updated 8 months ago
- [AAAI26] Next Patch Prediction ☆132 · Jan 2, 2025 · Updated last year
- [ACM MM 2025] HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation ☆157 · Sep 4, 2025 · Updated 6 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆190 · Nov 6, 2025 · Updated 4 months ago
- FIBO-Edit brings the power of structured prompt generation to image editing ☆31 · Jan 29, 2026 · Updated last month
- Reimplementation of D4RT ☆38 · Dec 26, 2025 · Updated 2 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective ☆45 · Jan 18, 2025 · Updated last year
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning". ☆84 · Jul 10, 2025 · Updated 8 months ago
- Implementation of "Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation" [CVPR 2025] ☆41 · Apr 25, 2025 · Updated 10 months ago
- ☆32 · Feb 18, 2026 · Updated last month
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆45 · Mar 11, 2025 · Updated last year
- Unofficial implementation of Sketch-Guided Text-to-Image Diffusion Models ☆13 · Jun 19, 2023 · Updated 2 years ago
- SegviGen: Repurposing 3D Generative Model for Part Segmentation ☆55 · Updated this week
- [NeurIPS 2025 D&B🔥] Implementation of "GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation" ☆18 · Jun 1, 2025 · Updated 9 months ago
- ☆17 · Jun 28, 2025 · Updated 8 months ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?" ☆18 · Oct 7, 2025 · Updated 5 months ago
- A Mechanistic View on Video Generation as World Models: State and Dynamics ☆31 · Mar 9, 2026 · Updated last week
- Nitro-T is a family of text-to-image diffusion models focused on highly efficient training. ☆40 · Jul 10, 2025 · Updated 8 months ago
- [ICLR 2026!] Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks ☆39 · Feb 10, 2026 · Updated last month
- "MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation" ☆174 · Dec 9, 2025 · Updated 3 months ago
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing" ☆81 · Mar 5, 2026 · Updated 2 weeks ago
- Knowledge injection method based on knowledge-oriented controls, achieving precision adaptation and powerful retention. ☆58 · Dec 30, 2025 · Updated 2 months ago
- ☆16 · Aug 13, 2023 · Updated 2 years ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model ☆200 · May 11, 2025 · Updated 10 months ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation ☆33 · Oct 17, 2025 · Updated 5 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices ☆95 · Nov 30, 2025 · Updated 3 months ago
- Ming-omni-tts: Simple and Efficient Unified Generation of Speech, Music, and Sound with Precise Control ☆199 · Feb 26, 2026 · Updated 3 weeks ago
- ☆90 · Updated this week
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback ☆243 · Jan 24, 2026 · Updated last month
- [NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation ☆200 · Mar 8, 2026 · Updated last week
- ☆59 · Mar 16, 2025 · Updated last year
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model ☆342 · Apr 17, 2025 · Updated 11 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching ☆20 · Apr 21, 2025 · Updated 11 months ago
- HITsz2021 operating systems notes ☆14 · Jan 22, 2022 · Updated 4 years ago
- ☆75 · Mar 18, 2025 · Updated last year
- ☆43 · Aug 5, 2025 · Updated 7 months ago
- ☆41 · Mar 11, 2026 · Updated last week
- Plug-and-play streaming semantic VAD for real-time full-duplex spoken dialogue systems. ☆124 · Updated this week