akarshkumar0101 / ferLinks
Code for the Fractured Entangled Representation Hypothesis position paper!
☆198Updated 4 months ago
Alternatives and similar repositories for fer
Users that are interested in fer are comparing it to the libraries listed below
Sorting:
- The history files when recording human interaction while solving ARC tasks☆116Updated last week
- Automated Capability Discovery via Foundation Model Self-Exploration☆64Updated 7 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆98Updated 6 months ago
- ☆58Updated 7 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025)☆223Updated 4 months ago
- Brain-like variational inference☆58Updated 4 months ago
- Diffusion on syntax trees for program synthesis☆475Updated last year
- ☆187Updated last month
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆274Updated 11 months ago
- ☆142Updated 3 weeks ago
- Getting crystal-like representations with harmonic loss☆194Updated 6 months ago
- look how they massacred my boy☆63Updated 11 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 5 months ago
- ☆28Updated last year
- Simple Transformer in Jax☆139Updated last year
- Materials for ConceptARC paper☆103Updated 11 months ago
- explore token trajectory trees on instruct and base models☆133Updated 4 months ago
- ☆45Updated 4 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆146Updated this week
- ☆103Updated 10 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆322Updated 11 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆26Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- ☆167Updated 3 months ago
- ☆233Updated 7 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆129Updated 3 years ago
- The boundary of neural network trainability is fractal☆218Updated last year
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆34Updated last month
- Plotting (entropy, varentropy) for small LMs☆98Updated 4 months ago