Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30 · May 3, 2025 · Updated 11 months ago
Alternatives and similar repositories for amo-release
Users that are interested in amo-release are comparing it to the libraries listed below.
- TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis ☆91 · Sep 18, 2025 · Updated 7 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆57 · Dec 4, 2024 · Updated last year
- Official PyTorch Implementation for Continual Learning and Private Unlearning ☆18 · Jul 19, 2022 · Updated 3 years ago
- Code Implementation of the Paper: EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering ☆54 · Jun 16, 2025 · Updated 10 months ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight) ☆34 · Jun 12, 2023 · Updated 2 years ago
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes ☆95 · Nov 26, 2025 · Updated 4 months ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull ☆13 · Oct 9, 2023 · Updated 2 years ago
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow ☆44 · Feb 1, 2024 · Updated 2 years ago
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025] ☆27 · Aug 4, 2025 · Updated 8 months ago
- Animatediff implementation. Includes a ControlNet pipeline. ☆19 · Dec 24, 2023 · Updated 2 years ago
- Official PyTorch Implementation for Fast Adaptive Multitask Optimization (FAMO) ☆122 · May 9, 2024 · Updated last year
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation ☆45 · Jul 10, 2023 · Updated 2 years ago
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition ☆31 · Jun 9, 2025 · Updated 10 months ago
- a jax benchmark for ad hoc teamwork ☆21 · Updated this week
- Dyna built on R-exprs (First Prototype) ☆17 · Mar 7, 2022 · Updated 4 years ago
- ☆10 · Jun 12, 2021 · Updated 4 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator ☆12 · Apr 28, 2024 · Updated last year
- CVPR 2025 Accepted Papers ☆24 · Dec 20, 2025 · Updated 3 months ago
- ☆15 · Nov 26, 2023 · Updated 2 years ago
- ACL'2024-Main: Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Languag… ☆12 · Sep 19, 2025 · Updated 6 months ago
- ☆32 · Jun 26, 2024 · Updated last year
- ☆40 · Mar 3, 2026 · Updated last month
- This is the pytorch implementation of FCL-Net, accepted by NN'2022. ☆14 · May 25, 2022 · Updated 3 years ago
- resources for text detection, text recognition, and end to end text spotting ☆11 · Apr 23, 2023 · Updated 2 years ago
- ☆29 · Oct 25, 2025 · Updated 5 months ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity ☆31 · Jul 14, 2025 · Updated 9 months ago
- Face-MakeUp (SD1.5): Multimodal Facial Prompts for Text-to-Image Generation (ECAI-2025) ☆26 · Jan 19, 2025 · Updated last year
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs ☆35 · Dec 15, 2025 · Updated 4 months ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending ☆14 · Jun 16, 2025 · Updated 10 months ago
- OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation ☆34 · Jun 18, 2025 · Updated 10 months ago
- code for finetuning vae ☆43 · Sep 8, 2024 · Updated last year
- Training LoRAs (Low-Rank Adaptations) for the black-forest-labs/FLUX.1-Fill-dev model. ☆10 · Feb 22, 2025 · Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting ☆45 · Apr 11, 2025 · Updated last year
- Social-AI papers across computing communities, courses, and dissertations. ☆22 · Apr 8, 2026 · Updated last week
- UniGraph2: Learning a Unified Embedding Space to Bind Multimodal Graphs (WWW'25) ☆18 · Apr 22, 2025 · Updated 11 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios ☆127 · Apr 9, 2026 · Updated last week
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo! ☆35 · Jun 23, 2025 · Updated 9 months ago
- ☆13 · Jul 24, 2017 · Updated 8 years ago
- [CVPR 2025] "DepthCues: Evaluating Monocular Depth Perception in Large Vision Models", Duolikun Danier, Mehmet Aygün, Changjian Li, Hakan… ☆21 · Mar 17, 2025 · Updated last year