facebookresearch / maze_navigation_MLMU
Maze navigation with MLM-U
☆13Updated last month
Alternatives and similar repositories for maze_navigation_MLMU:
Users that are interested in maze_navigation_MLMU are comparing it to the libraries listed below
- Codebase for the arxiver dataset☆13Updated 2 months ago
- ☆45Updated 10 months ago
- This repository will be a summary and outlook on all our open, medical, AI advancements.☆30Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆55Updated 4 months ago
- Utilities for PyTorch distributed☆23Updated last year
- Residual Quantization with Implicit Neural Codebooks☆77Updated 3 weeks ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆22Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"☆17Updated 3 weeks ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated last week
- A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪☆15Updated last month
- ☆60Updated last year
- ☆118Updated 2 weeks ago
- MaskedTensors for PyTorch☆38Updated 2 years ago
- Utilities for Training Very Large Models☆57Updated 4 months ago
- Latent Diffusion Language Models☆68Updated last year
- Generating and validating natural-language explanations.☆46Updated 3 weeks ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆24Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing☆19Updated last week
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆22Updated last week
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 2 weeks ago
- σ-GPT: A New Approach to Autoregressive Models☆61Updated 5 months ago
- ☆50Updated 3 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆43Updated 6 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆37Updated 4 months ago
- TorchFix - a linter for PyTorch-using code with autofix support☆122Updated 3 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆91Updated 2 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆17Updated 2 months ago
- ☆37Updated 6 months ago