joaquincabezas / clip_is_awesomeLinks
CLIP is an open source, multimodal computer vision model and it's awesome!
☆17Updated 7 months ago
Alternatives and similar repositories for clip_is_awesome
Users that are interested in clip_is_awesome are comparing it to the libraries listed below
Sorting:
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 8 months ago
- Fine tune Gemma 3 on an object detection task☆72Updated last week
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 8 months ago
- Code for LitLLMs, LLMs for Literature Review: Are we there yet? (TMLR 2025)☆33Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 3 months ago
- ☆112Updated 2 weeks ago
- ☆70Updated this week
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated this week
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆86Updated 2 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated 11 months ago
- ☆63Updated last year
- Notebooks for fine tuning pali gemma☆112Updated 3 months ago
- A curated list of materials on AI guardails☆39Updated last month
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- ☆124Updated 8 months ago
- I learn about and explain quantization☆26Updated last year
- A holistic evaluation library for multi-modal generative models using Weave☆28Updated 8 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video.☆52Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- YesBut - Multimodal Satire Comprehension Dataset☆17Updated 8 months ago
- PyTorch implementation for MRL☆19Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆123Updated 6 months ago
- A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.☆246Updated this week
- ☆94Updated 3 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 10 months ago
- ☆76Updated 9 months ago
- ☆48Updated 5 months ago