huggingface / m4-logsLinks

M4 experiment logbook

☆58

Alternatives and similar repositories for m4-logs

Users that are interested in m4-logs are comparing it to the libraries listed below

Sorting:

jiasenlu / LL3M
LL3M: Large Language and Multi-Modal Model in Jax
☆72Updated last year
huggingface / OBELICS
Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…
☆206Updated 11 months ago
LAION-AI / General-GPT
☆65Updated last year
reka-ai / reka-vibe-eval
Multimodal language model benchmark, featuring challenging examples
☆173Updated 7 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
huggingface / chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
☆158Updated last year
dhansmair / flamingo-mini
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
☆167Updated 2 years ago
lucidrains / mirasol-pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆89Updated last year
sanjayss34 / codevqa
☆85Updated 2 years ago
facebookresearch / unibench
Python Library to evaluate VLM models' robustness across diverse benchmarks
☆210Updated this week
ryanwebster90 / snip-dedup
☆104Updated last year
mlfoundations / VisIT-Bench
☆50Updated last year
ronghanghu / vit_10b_fsdp_example
See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md
☆24Updated 2 years ago
hadasah / btm
☆75Updated last year
wade3han / champagne
An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"
☆52Updated last year
LAION-AI / Big-Interleaved-Dataset
Big-Interleaved-Dataset
☆58Updated 2 years ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
haoliuhl / language-quantized-autoencoders
Language Quantized AutoEncoders
☆108Updated 2 years ago
InflectionAI / Inflection-Benchmarks
Public Inflection Benchmarks
☆68Updated last year
kernelmachine / silo-lm
SILO Language Models code repository
☆81Updated last year
facebookresearch / SemDeDup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…
☆139Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆162Updated 2 months ago
google-deepmind / emergent_in_context_learning
☆84Updated last year
jxiw / BiGS
Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …
☆114Updated last year
neulab / gemini-benchmark
☆149Updated last year
apoorvkh / torchrunx
Easily run PyTorch on multiple GPUs & machines
☆46Updated last month
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆130Updated last year
CERC-AAI / Robin
☆63Updated 10 months ago
huggingface / datablations
Scaling Data-Constrained Language Models
☆338Updated last month
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆135Updated 6 months ago