EdinburghNLP / torch-adaptive-imleLinks
☆35Updated last year
Alternatives and similar repositories for torch-adaptive-imle
Users that are interested in torch-adaptive-imle are comparing it to the libraries listed below
Sorting:
- A domain-specific probabilistic programming language for modeling and inference with language models☆141Updated 9 months ago
- Evaluation of neuro-symbolic engines☆41Updated last year
- [NeurIPS 2023] Learning Transformer Programs☆162Updated last year
- Library that contains implementations of machine learning components in the hyperbolic space☆145Updated last year
- How to Turn Your Knowledge Graph Embeddings into Generative Models☆55Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆93Updated 2 years ago
- Clustered Compositional Embeddings☆11Updated 2 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Updated last year
- Code for minimum-entropy coupling.☆32Updated last month
- Learning Universal Predictors☆81Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Updated last year
- Sparse and discrete interpretability tool for neural networks☆64Updated 2 years ago
- Probabilistic programming with large language models☆160Updated 2 months ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- ☆239Updated 2 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆138Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Updated 3 years ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆137Updated last year
- ☆69Updated 10 months ago
- An annotated implementation of the Hyena Hierarchy paper☆34Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Updated last year
- The Energy Transformer block, in JAX☆63Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆198Updated last year
- ☆33Updated last year
- A Python package for generating concise, high-quality summaries of a probability distribution☆57Updated 3 weeks ago
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆83Updated 3 years ago
- ☆53Updated 2 years ago
- ☆77Updated last year
- ☆68Updated last year