facebookresearch / maze_navigation_MLMU

Maze navigation with MLM-U

☆13

Alternatives and similar repositories for maze_navigation_MLMU:

Users that are interested in maze_navigation_MLMU are comparing it to the libraries listed below

neuralwork / arxiver
Codebase for the arxiver dataset
☆13Updated 2 months ago
apple / ml-planner
☆45Updated 10 months ago
LAION-AI / medical
This repository will be a summary and outlook on all our open, medical, AI advancements.
☆30Updated last year
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated 7 months ago
lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
☆55Updated 4 months ago
crowsonkb / torch-dist-utils
Utilities for PyTorch distributed
☆23Updated last year
facebookresearch / Qinco
Residual Quantization with Implicit Neural Codebooks
☆77Updated 3 weeks ago
argilla-io / awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
☆22Updated last year
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"
☆17Updated 3 weeks ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆18Updated last week
csinva / cookiecutter-ml-research
A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪
☆15Updated last month
NousResearch / StripedHyenaTrainer
☆60Updated last year
google-deepmind / mishax
☆118Updated 2 weeks ago
pytorch / maskedtensor
MaskedTensors for PyTorch
☆38Updated 2 years ago
eth-easl / fmengine
Utilities for Training Very Large Models
☆57Updated 4 months ago
crowsonkb / LDLM
Latent Diffusion Language Models
☆68Updated last year
microsoft / automated-explanations
Generating and validating natural-language explanations.
☆46Updated 3 weeks ago
CannyLab / anthology
[EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories
☆24Updated 2 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆19Updated last week
lucidrains / GAF-microbatch-pytorch
Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch
☆22Updated last week
opallab / positional_attention
Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"
☆14Updated 2 weeks ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆61Updated 5 months ago
shikaiqiu / compute-better-spent
☆50Updated 3 months ago
graphcore-research / out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆43Updated 6 months ago
facebookresearch / MemoryMosaics
Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.
☆37Updated 4 months ago
pytorch-labs / torchfix
TorchFix - a linter for PyTorch-using code with autofix support
☆122Updated 3 weeks ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆91Updated 2 months ago
layer6ai-labs / calo-forest
A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.
☆17Updated 2 months ago
xjdr-alt / muzero_sketch
☆37Updated 6 months ago