EdinburghNLP / torch-adaptive-imle
☆34Updated 4 months ago
Alternatives and similar repositories for torch-adaptive-imle:
Users that are interested in torch-adaptive-imle are comparing it to the libraries listed below
- Evaluation of neuro-symbolic engines☆35Updated 8 months ago
- A domain-specific probabilistic programming language for modeling and inference with language models☆121Updated last year
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- How to Turn Your Knowledge Graph Embeddings into Generative Models☆51Updated 9 months ago
- Code for minimum-entropy coupling.☆31Updated 9 months ago
- Extending Conformal Prediction to LLMs☆65Updated 9 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- Code for "Bayesian Structure Learning with Generative Flow Networks"☆87Updated 3 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆58Updated last year
- ☆52Updated 6 months ago
- ☆53Updated last year
- ☆31Updated 6 months ago
- Probabilistic programming with large language models☆107Updated last week
- ☆28Updated 3 weeks ago
- ☆37Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 8 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).☆79Updated 8 months ago
- Universal Neurons in GPT2 Language Models☆27Updated 10 months ago
- ☆51Updated 10 months ago
- ☆45Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆109Updated 4 months ago
- ☆62Updated 2 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 10 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Updated last year
- Learning Universal Predictors☆73Updated 8 months ago
- ☆31Updated last year
- Implementation of "SALSA-CLRS: A Sparse and Scalable Benchmark for Algorithmic Reasoning". SALSA-CLRS is an extension to the original clr…☆17Updated last year
- The Energy Transformer block, in JAX☆57Updated last year
- Code for Neural Execution Engines: Learning to Execute Subroutines☆17Updated 4 years ago