TheoCoombes / crawlingathomeLinks
A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
☆32Updated 2 years ago
Alternatives and similar repositories for crawlingathome
Users that are interested in crawlingathome are comparing it to the libraries listed below
Sorting:
- Text-writing denoising diffusion (and much more)☆30Updated 2 years ago
- One stop shop for all things carp☆59Updated 3 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 4 years ago
- Latent Diffusion Language Models☆70Updated 2 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Updated 3 years ago
- ☆91Updated 3 years ago
- ☆30Updated 4 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- Training a model similar to OpenAI DALL-E with volunteers from all over the Internet using hivemind and dalle-pytorch (NeurIPS 2021 demo)☆27Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind☆12Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆74Updated 3 years ago
- ☆44Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆100Updated last month
- Script and models for clustering LAION-400m CLIP embeddings.☆26Updated 4 years ago
- Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.☆91Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch☆45Updated 3 years ago
- Aggregating embeddings over time☆32Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated 2 years ago
- ☆112Updated 4 years ago
- Experiments with generating opensource language model assistants☆97Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- ☆64Updated 4 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆104Updated 5 months ago
- Contrastive Language-Image Pretraining☆144Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 3 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆67Updated 3 years ago
- A ready-to-deploy container for implementing an easy to use REST API to access Language Models.☆66Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆322Updated 2 years ago