huggingface / large-scale-image-deduplication
☆188Updated 6 months ago
Alternatives and similar repositories for large-scale-image-deduplication
Users who are interested in large-scale-image-deduplication are comparing it to the repositories listed below.
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 6 months ago
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding"☆53Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Updated 6 months ago
- Fine tune Gemma 3 on an object detection task☆97Updated 6 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆90Updated 3 months ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆119Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.☆115Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- ☆56Updated last year
- ☆171Updated last week
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 4 months ago
- ☆69Updated last year
- Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)☆333Updated last week
- ACL 2025: Synthetic data generation pipelines for text-rich images.☆155Updated 11 months ago
- ☆34Updated 7 months ago
- ☆19Updated 11 months ago
- Large multi-modal models (L3M) pre-training.☆230Updated 4 months ago
- Train, tune, and infer Bamba model☆137Updated 8 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆98Updated last year
- ☆82Updated 2 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Updated last year
- ☆72Updated last year
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆70Updated 3 months ago
- Video-LLaVA fine-tune for CinePile evaluation☆51Updated last year
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.☆110Updated last week
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- ☆67Updated 8 months ago