★19 · Mar 24, 2025 · Updated last year
Alternatives and similar repositories for vision-datasets
Users that are interested in vision-datasets are comparing it to the libraries listed below.
- AML Command Transfer. A lightweight tool to transfer any command line to Azure Machine Learning Services ★20 · May 23, 2024 · Updated last year
- Evaluate Transformers from the Hub 🔥 ★14 · Updated this week
- ★11 · Jul 31, 2022 · Updated 3 years ago
- YFCC100M Downloader ★24 · May 14, 2018 · Updated 7 years ago
- Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone ★130 · Oct 10, 2023 · Updated 2 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language" ★16 · Oct 22, 2022 · Updated 3 years ago
- Source code for "Importance-based Neuron Allocation for Multilingual Neural Machine Translation" ★12 · Sep 15, 2021 · Updated 4 years ago
- reddit's python experiments framework ★12 · Apr 28, 2025 · Updated 11 months ago
- Vulnerabilities advisories and PoC ★18 · Nov 21, 2025 · Updated 4 months ago
- Code for Fooling Contrastive Language-Image Pre-trained Models with CLIPMasterPrints ★15 · Jan 25, 2026 · Updated 2 months ago
- Implementation of YOLO (You Only Look Once) computer Vision algorithm in a React UI, for the subject Intelligent Systems (ULL) ★10 · Jan 27, 2019 · Updated 7 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens. ★13 · Sep 13, 2024 · Updated last year
- A simple retweet button. ★72 · Oct 5, 2009 · Updated 16 years ago
- Project for SNARE benchmark ★11 · Jun 5, 2024 · Updated last year
- Extremely simple MoE implementation, mostly based off Switch Transformer ★13 · Feb 26, 2024 · Updated 2 years ago
- ★14 · Jun 16, 2023 · Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data ★13 · Sep 30, 2023 · Updated 2 years ago
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding" ★13 · Jun 11, 2023 · Updated 2 years ago
- Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arx… ★12 · Feb 6, 2023 · Updated 3 years ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653) ★36 · May 29, 2024 · Updated last year
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders. ★14 · Jun 7, 2023 · Updated 2 years ago
- ★12 · Jan 27, 2025 · Updated last year
- ★46 · Mar 29, 2026 · Updated last week
- Official code of *Towards Event-oriented Long Video Understanding* ★12 · Jul 26, 2024 · Updated last year
- Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models ★47 · Sep 25, 2023 · Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model ★13 · Feb 15, 2024 · Updated 2 years ago
- ★11 · Sep 1, 2024 · Updated last year
- semantic tokenizer for speech and music ★21 · Jul 6, 2025 · Updated 9 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ★18 · Jun 3, 2025 · Updated 10 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ★16 · Apr 13, 2022 · Updated 3 years ago
- This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment" ★17 · May 27, 2019 · Updated 6 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ★11 · May 24, 2023 · Updated 2 years ago
- ★12 · Jun 1, 2024 · Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ★11 · May 14, 2025 · Updated 10 months ago
- ★11 · Oct 2, 2024 · Updated last year
- Benchmarking Multi-Image Understanding in Vision and Language Models ★12 · Jul 29, 2024 · Updated last year
- Ranking-Consistent Language-Image Pretraining ★12 · Oct 24, 2025 · Updated 5 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition ★12 · Mar 14, 2025 · Updated last year
- ★10 · Jul 5, 2024 · Updated last year