sdascoli / boolformerLinks
☆164Updated last year
Alternatives and similar repositories for boolformer
Users that are interested in boolformer are comparing it to the libraries listed below
Sorting:
- Evaluation of neuro-symbolic engines☆39Updated last year
- ☆69Updated last year
- Learning Universal Predictors☆81Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆134Updated 3 weeks ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆193Updated last year
- ☆81Updated last year
- Automatic gradient descent☆215Updated 2 years ago
- ☆143Updated 2 months ago
- Repo for solving arc problems with an Neural Cellular Automata☆21Updated 5 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- ☆104Updated 10 months ago
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆156Updated last month
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- ☆53Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 11 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated 10 months ago
- Google Research☆46Updated 3 years ago
- ☆105Updated 3 months ago
- ☆53Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆173Updated 2 years ago
- ☆61Updated last year
- ☆230Updated last week
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆224Updated 2 months ago
- Repository for code used in the xVal paper☆144Updated last year
- This is the code that went into our practical dive using mamba as information extraction☆57Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- ☆33Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year