TheoCoombes/crawlingathome

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TheoCoombes/crawlingathome)

TheoCoombes / crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

☆33

Alternatives and similar repositories for crawlingathome

Users that are interested in crawlingathome are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rvencu / crawlingathome-gpu-hcloud
View on GitHub
GPU controlled Hetzner Cloud workers swarm for Crawling@Home project
☆58Oct 9, 2022Updated 3 years ago
pbaylies / clustering-laion400m
View on GitHub
Script and models for clustering LAION-400m CLIP embeddings.
☆26Jan 10, 2022Updated 4 years ago
pbaylies / Augmented_CLIP
View on GitHub
Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.
☆60Mar 31, 2022Updated 4 years ago
crowsonkb / cond_transformer_2
View on GitHub
A CLIP conditioned Decision Transformer.
☆22Jul 14, 2021Updated 5 years ago
LAION-AI / Big-Interleaved-Dataset
View on GitHub
Big-Interleaved-Dataset
☆59Jan 21, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bes-dev / vqvae_dwt_distiller.pytorch
View on GitHub
☆26Nov 8, 2021Updated 4 years ago
AranKomat / Diff-DALLE
View on GitHub
☆65Nov 4, 2021Updated 4 years ago
w3c / mediacapture-handle
View on GitHub
☆15Mar 6, 2025Updated last year
Picsart-AI-Research / Social-Reward
View on GitHub
[ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…
☆12Mar 29, 2024Updated 2 years ago
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
learning-at-home / go-libp2p-daemon
View on GitHub
a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages
☆11Feb 9, 2025Updated last year
LAION-AI / scaling-laws-for-comparison
View on GitHub
☆22May 12, 2026Updated 2 months ago
rom1504 / gpu-tester
View on GitHub
gpu tester detects broken and slow gpus in a cluster
☆72Feb 19, 2023Updated 3 years ago
LAION-AI / General-GPT
View on GitHub
☆65Oct 4, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aai-institute / AI-SBOM
View on GitHub
AI Software Bill of Materials for EU AI Act
☆11Jan 18, 2024Updated 2 years ago
languini-kitchen / languini-kitchen
View on GitHub
The official Languini Kitchen repository
☆14May 6, 2024Updated 2 years ago
scf4 / PreTTI
View on GitHub
Improving Text-to-Image Models with Large Language Models
☆23Oct 18, 2022Updated 3 years ago
wendlerc / Powerset-CNN
View on GitHub
Sample implementation accompanying the NeurIPS 2019 paper 'Powerset Convolutional Neural Networks' by Chris Wendler, Dan Alistarh, and Ma…
☆10Oct 26, 2020Updated 5 years ago
THUDM / APAR
View on GitHub
APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding
☆14Jul 22, 2024Updated 2 years ago
dmarx / notebooks
View on GitHub
misc notebooks i wanted to put in tracking
☆18Jul 24, 2023Updated 3 years ago
xmrec / xmrec.github.io
View on GitHub
☆23Dec 16, 2022Updated 3 years ago
ari-holtzman / newformer
View on GitHub
☆16Jul 20, 2023Updated 3 years ago
Square789 / tf2_dem_py
View on GitHub
TF2 demo parser for python, glued together using C.
☆10Jan 20, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
vbalnt / cppROC
View on GitHub
Receiver operating characteristic curve (ROC) computation code in C++
☆11Jul 17, 2017Updated 9 years ago
learning-at-home / collaborative-latent-diffusion
View on GitHub
Collaborative inference of latent diffusion via hivemind
☆12May 29, 2023Updated 3 years ago
aylai / EntailmentProbabilityEmbedding
View on GitHub
Models and code from Learning to Predict Denotational Probabilities For Modeling Entailment
☆14Feb 1, 2018Updated 8 years ago
halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 4 years ago
NCAI-Research / CALM
View on GitHub
☆15Sep 15, 2022Updated 3 years ago
LAION-AI / Open-GIA
View on GitHub
O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …
☆87Feb 19, 2023Updated 3 years ago
5vision / uct_atari
View on GitHub
uct tree search + supervised lerning for atari games
☆12Feb 14, 2017Updated 9 years ago
argosopentech / LibreTranslate-cpp
View on GitHub
LibreTranslate C++ bindings
☆19Aug 27, 2021Updated 4 years ago
erfannoury / seq2seq-lasagne
View on GitHub
An implementation of the Sequence to Sequence model using the Lasagne library (WIP)
☆12Aug 11, 2016Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kdrl / GloVe-PyTorch
View on GitHub
[not maintained anymore] [for study purpose] A simple PyTorch implementation for "Global Vectors for Word Representation".
☆17Nov 7, 2019Updated 6 years ago
NightmareAI / majesty-diffusion
View on GitHub
Majesty Diffusion by @Dango233 and @apolinario (@multimodalart)
☆25Jul 26, 2022Updated 4 years ago
masakhane-io / africomet
View on GitHub
COMET for African languages
☆11Jan 24, 2025Updated last year
learning-at-home / lean_transformer
View on GitHub
Memory-efficient transformer. Work in progress.
☆19Sep 17, 2022Updated 3 years ago
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆20Updated this week
tanmoyio / sahajBERT
View on GitHub
☆14Dec 28, 2021Updated 4 years ago
open-dynamic-robot-initiative / user_config_f28069m_drv8305
View on GitHub
Configuration files for the ODRI uDriver firmware.
☆11Nov 15, 2022Updated 3 years ago