mlfoundations/dataset2metadata

☆28

Alternatives and similar repositories for dataset2metadata

Users who are interested in dataset2metadata are comparing it to the libraries listed below.

locuslab / T-MARS
Code for T-MARS data filtering
☆35 · Aug 23, 2023 · Updated 2 years ago
ypwang61 / negCLIPLoss_NormSim
[NeurIPS 2024 Spotlight] CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning.
☆14 · Dec 12, 2024 · Updated last year
ethanlshen / HierNet
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23 · Nov 8, 2023 · Updated 2 years ago
RAIVNLab / neural-priming
Code repository for the paper "Neural Priming for Sample-Efficient Adaptation"
☆14 · Nov 13, 2023 · Updated 2 years ago
mlfoundations / clip_quality_not_quantity
☆28 · Oct 18, 2022 · Updated 3 years ago
mlfoundations / datacomp
DataComp: In search of the next generation of multimodal datasets
☆787 · Apr 28, 2025 · Updated last year
acmi-lab / RLSbench
Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift
☆35 · Jul 19, 2023 · Updated 3 years ago
BatsResearch / ex2
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17 · Apr 4, 2024 · Updated 2 years ago
locuslab / scaling_laws_data_filtering
☆64 · Apr 9, 2024 · Updated 2 years ago
YivanZhang / lio
Learning from Indirect Observations
☆11 · Jul 16, 2021 · Updated 5 years ago
arubique / OCCAM
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆13 · Jun 21, 2026 · Updated last month
cognitiveailab / BYTESIZED32
Byte-sized text games for code generation tasks on virtual environments
☆20 · Jul 8, 2024 · Updated 2 years ago
jylei16 / Imagine-e
☆14 · Jan 22, 2025 · Updated last year
TowerfallAi / towerfall-ai
A mod that enables AI to play the game TowerFall Ascension.
☆14 · Aug 22, 2023 · Updated 2 years ago
princetonvisualai / pointingqa
Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"
☆19 · Oct 4, 2022 · Updated 3 years ago
arnavmdas / epiphany
☆13 · May 12, 2023 · Updated 3 years ago
RAIVNLab / LLC
☆13 · Oct 29, 2021 · Updated 4 years ago
filipgdorm / eco-llm
☆14 · Mar 20, 2026 · Updated 4 months ago
boyazeng / understand_bias
Code release for "Understanding Bias in Large-Scale Visual Datasets"
☆25 · Dec 4, 2024 · Updated last year
gzcch / Bingo
☆55 · Apr 1, 2024 · Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
mlfoundations / imagenet-captions
Release of ImageNet-Captions
☆51 · Jan 20, 2023 · Updated 3 years ago
EfficientTraining / LabelBench
☆51 · Nov 11, 2024 · Updated last year
jinhangzhan / RL_Heals_SFT
☆21 · Mar 22, 2026 · Updated 4 months ago
EleutherAI / pile_dedupe
Pile Deduplication Code
☆18 · May 15, 2023 · Updated 3 years ago
mlfoundations / VisIT-Bench
☆51 · Oct 29, 2023 · Updated 2 years ago
cailile / Revisiting-Superpixels-for-Active-Learning
☆15 · Sep 22, 2021 · Updated 4 years ago
ml-postech / SpReME
☆11 · Mar 15, 2023 · Updated 3 years ago
stanford-crfm / helm-efficiency
☆10 · Dec 12, 2023 · Updated 2 years ago
microsoft / EMNLP2019-Split-And-Recombine
The code of the EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"
☆18 · Jul 20, 2023 · Updated 3 years ago
ml-postech / SSAD
☆12 · Feb 26, 2024 · Updated 2 years ago
allenai / c4-documentation
☆34 · Apr 18, 2021 · Updated 5 years ago
adymaharana / d2pruning
☆44 · Oct 13, 2023 · Updated 2 years ago
behavioral-ds / evently
evently: simulation, fitting of Hawkes processes
☆16 · Jan 22, 2023 · Updated 3 years ago
ehsanik / muscleTorch
What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)
☆39 · Mar 30, 2021 · Updated 5 years ago
brendel-group / clip-ood
Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)
☆11 · Aug 26, 2024 · Updated last year
yunhuijang / HGGT
Graph generation with K2-trees (ICLR 2024)
☆12 · Mar 19, 2024 · Updated 2 years ago
Nokia-Bell-Labs / pretrained-imu-encoders
(ICASSP'25) Official Repo for PRIMUS: Pretraining IMU Encoders with Multimodal Self-Supervision
☆19 · Apr 4, 2025 · Updated last year
HugoFry / mats_sae_training_for_ViTs
☆25 · Apr 23, 2024 · Updated 2 years ago