sdascoli / boolformerLinks
☆165Updated last year
Alternatives and similar repositories for boolformer
Users that are interested in boolformer are comparing it to the libraries listed below
Sorting:
- Learning Universal Predictors☆81Updated last year
- Evaluation of neuro-symbolic engines☆41Updated last year
- ☆68Updated last year
- ☆82Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆175Updated 2 years ago
- ☆62Updated 2 years ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- ☆105Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆198Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆136Updated last week
- Repository for code used in the xVal paper☆149Updated last year
- ☆53Updated 2 years ago
- ☆111Updated 6 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆112Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- ☆239Updated 2 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆226Updated 4 months ago
- Code repository for Black Mamba☆261Updated 2 years ago
- Automatic gradient descent☆217Updated 2 years ago
- Repo for solving arc problems with an Neural Cellular Automata☆23Updated 8 months ago
- ☆33Updated last year
- ☆56Updated last year
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- Google Research☆46Updated 3 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Updated 3 weeks ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated 2 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated last year
- ☆55Updated last year