facok / florence2-ft-simpleLinks
finetune your florence2 model easy
โ19Updated last year
Alternatives and similar repositories for florence2-ft-simple
Users that are interested in florence2-ft-simple are comparing it to the libraries listed below
Sorting:
- Scripts for use with LongCLIP, including fine-tuning Long-CLIPโ63Updated 10 months ago
- RepText: Rendering Visual Text via Replicating ๐ฅโ141Updated 7 months ago
- โ50Updated last year
- ๐ฅ [CVPR 2024] The official repo for Zero-Painter!โ70Updated last year
- Fine-tuning code for CLIP modelsโ265Updated this week
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. ไธไธชๆฏๆ็จๆท่ช็ฑ่พๅ ฅๆงโฆโ128Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.โ65Updated last year
- โ79Updated 11 months ago
- โ32Updated last year
- Official code for our ICCV2025 paper "SDMatte: Grafting Diffusion Models for Interactive Matting"โ172Updated 5 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".โ116Updated 8 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidanceโ75Updated 7 months ago
- MoD Control Tile Upscaler for SDXL Pipelineโ61Updated 10 months ago
- A simple tool to guess an HuggingFace repo URL from a state dict.โ47Updated last year
- โ113Updated 9 months ago
- โ177Updated last year
- โ33Updated 9 months ago
- Multimodal captionerโ203Updated this week
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Controlโ190Updated last year
- Ofiicial GoodDrag implementation.โ97Updated 4 months ago
- โ90Updated 2 years ago
- finetune your florence2 model easyโ21Updated last year
- โ44Updated last year
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratiosโ110Updated last month
- โ33Updated last year
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generationโ235Updated last year
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With โฆโ81Updated last year
- โ36Updated last year
- Vision Transformers Needs Registers. And Gated MLPs. And +20M params. Tiny modality gap ensues!โ47Updated 8 months ago
- A system for Prompt generation to improve Text-to-Image performance.โ90Updated 2 months ago