facebookresearch / Exact-Byte-Level-Probabilities-from-Tokenized-LMsLinks
Example implementation of "Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles" by Buu Phan, Brandon Amos, Itai Gat, Marton Havasi, Matthew Muckley, and Karen UllrichWork conducted as part of a MetaAI internship.
☆14Updated 9 months ago
Alternatives and similar repositories for Exact-Byte-Level-Probabilities-from-Tokenized-LMs
Users that are interested in Exact-Byte-Level-Probabilities-from-Tokenized-LMs are comparing it to the libraries listed below
Sorting:
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆34Updated last year
- Neural Optimal Transport with Lagrangian Costs☆57Updated 3 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 7 months ago
- NF-Layers for constructing neural functionals.☆88Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 11 months ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆31Updated 2 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆55Updated 2 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆11Updated 2 years ago
- ☆12Updated 2 years ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated 2 years ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”☆76Updated 2 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 11 months ago
- Experiment with diffusion models that you can run on your local jupyter instances☆63Updated 10 months ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆47Updated 2 years ago
- ☆44Updated last year
- [ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"☆53Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆157Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆27Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆27Updated last year
- ☆62Updated 2 years ago
- ☆51Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆86Updated last week
- Code associated with our paper "Learning Group Structure and Disentangled Representations of Dynamical Environments"☆15Updated 2 years ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Meta Optimal Transport☆103Updated 2 years ago
- Official code for the paper "Attention as a Hypernetwork"☆40Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆126Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated 2 years ago