sdascoli / boolformerLinks
☆164Updated last year
Alternatives and similar repositories for boolformer
Users that are interested in boolformer are comparing it to the libraries listed below
Sorting:
- Evaluation of neuro-symbolic engines☆39Updated last year
- ☆69Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆129Updated 2 weeks ago
- Learning Universal Predictors☆79Updated last year
- ☆82Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆172Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆191Updated last year
- Repository for code used in the xVal paper☆142Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆153Updated 11 months ago
- Automatic gradient descent☆210Updated 2 years ago
- ☆61Updated last year
- Google Research☆45Updated 2 years ago
- ☆89Updated 7 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆223Updated last year
- Code repository for Black Mamba☆254Updated last year
- ☆228Updated 2 weeks ago
- Materials for ConceptARC paper☆99Updated 9 months ago
- ☆53Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 8 months ago
- ☆101Updated last month
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆114Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 8 months ago
- Repo for solving arc problems with an Neural Cellular Automata☆19Updated 3 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆87Updated last year
- Visualizing query-key interactions in language + vision transformers (VIS 2023)☆151Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆67Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- ☆62Updated 9 months ago
- ☆139Updated last week