Train (fine-tune) OpenAI's CLIP-like models on custom image-caption data sets, cf. COCO dataset. PyTorch implementation.
☆22Aug 29, 2022Updated 3 years ago
Alternatives and similar repositories for clip-like
Users that are interested in clip-like are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated 2 years ago
- An CUDA-based library for computed tomography (CT) reconstruction with differentiable operators.☆22May 21, 2026Updated 3 weeks ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 9 months ago
- ☆14Jul 8, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [AAAI 2025] Official Implementation of I-HallA v1.0☆16Feb 2, 2025Updated last year
- [NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding☆43Nov 30, 2025Updated 6 months ago
- The repository of Identifying and Mitigating Position Bias.☆71Jun 13, 2025Updated last year
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆50Feb 11, 2026Updated 4 months ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 9 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆21Jan 27, 2025Updated last year
- This is the repo for group project autonomous drone of team 3 in TUM course introduction to ROS in summer semester 2022.☆14Jun 30, 2023Updated 2 years ago
- SteerViT is a framework that equips any ViT with the ability to steer both its global and local visual representations with natural langu…☆100Updated this week
- We have developed Symbol Demonstration Direct Preference Optimization (SymDPO) and validating its effectiveness across multiple benchmark…☆23Nov 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An implementation of AutoScale regression-based method☆12Oct 27, 2020Updated 5 years ago
- The official code for our paper StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset in IJCAI 2023.☆13Jul 17, 2024Updated last year
- ☆12Jul 9, 2018Updated 7 years ago
- ☆20Dec 8, 2022Updated 3 years ago
- a implementation of vibe with python☆11Jul 27, 2018Updated 7 years ago
- Combine 3D reconstruction and human motion capture .☆21Jun 15, 2023Updated 2 years ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- An open-world scenario domain generalization code base☆27Feb 22, 2023Updated 3 years ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment☆29Dec 17, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AID: An affine invariant descriptor for SIFT.☆10Nov 21, 2022Updated 3 years ago
- Optimisation on Diffeomorphisms☆12Feb 17, 2025Updated last year
- An implementation of a neural network training routine using derivative information in Pytorch.☆11Dec 19, 2020Updated 5 years ago
- Geographic Data Science in Python - UFMG'19☆12Mar 26, 2019Updated 7 years ago
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Apr 3, 2024Updated 2 years ago
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆29Aug 15, 2025Updated 10 months ago
- Implemention of "Realtime Multi Person Pose-Estimation" in pytorch with data from AI Challenger☆13Nov 24, 2017Updated 8 years ago
- ☆18Mar 5, 2026Updated 3 months ago
- Implementation of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆14Mar 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- github趋势☆13Mar 25, 2025Updated last year
- DCIC22数字中国22-牛只图像分割竞赛第四名方案☆14Jul 18, 2022Updated 3 years ago
- Command line client for GIN☆14Feb 25, 2023Updated 3 years ago
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on…☆12Oct 23, 2019Updated 6 years ago
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆32Jan 26, 2026Updated 4 months ago
- Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark☆16Jan 13, 2026Updated 5 months ago
- Algorithms for face super resolution implemented in Pytorch.☆13Feb 9, 2021Updated 5 years ago