Silent-Zebra / twisted-smc-lm
โ22Updated this week
Related projects โ
Alternatives and complementary repositories for twisted-smc-lm
- A MAD laboratory to improve AI architecture designs ๐งชโ95Updated 6 months ago
- โ50Updated 6 months ago
- โ53Updated 3 weeks ago
- โ46Updated last month
- โ53Updated 10 months ago
- โ54Updated last month
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAXโ79Updated 9 months ago
- โ44Updated this week
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"โ24Updated 7 months ago
- The Energy Transformer block, in JAXโ53Updated 11 months ago
- โ44Updated last year
- Universal Neurons in GPT2 Language Modelsโ27Updated 5 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"โ61Updated last week
- Scalable neural net training via automatic normalization in the modular norm.โ121Updated 3 months ago
- โ25Updated last month
- Simple and efficient pytorch-native transformer training and inference (batched)โ61Updated 7 months ago
- โ48Updated 9 months ago
- Stick-breaking attentionโ34Updated last week
- โ45Updated 9 months ago
- Probabilistic programming with HuggingFace language modelsโ88Updated this week
- โ62Updated 3 months ago
- โ26Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language modelsโ112Updated last year
- โ35Updated 7 months ago
- Understand and test language model architectures on synthetic tasks.โ162Updated 6 months ago
- Language models scale reliably with over-training and on downstream tasksโ94Updated 7 months ago
- Harmonic Datasetsโ32Updated 4 months ago
- โ24Updated 8 months ago
- โ73Updated 4 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from eโฆโ25Updated 5 months ago