☆259 · Jun 6, 2025 (updated 9 months ago)
Alternatives and similar repositories for meliad
Users interested in meliad are comparing it to the libraries listed below.
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate … — ☆641 · Jul 17, 2023 (updated 2 years ago)
- Implementation of Block Recurrent Transformer - Pytorch — ☆224 · Aug 20, 2024 (updated last year)
- The official Languini Kitchen repository — ☆14 · May 6, 2024 (updated last year)
- ☆53 · Jan 19, 2023 (updated 3 years ago)
- Source-to-Source Debuggable Derivatives in Pure Python — ☆15 · Jan 23, 2024 (updated 2 years ago)
- Sequence modeling with Mega. — ☆303 · Jan 28, 2023 (updated 3 years ago)
- Demonstration that fine-tuning a RoPE model on longer sequences than the pre-trained model adapts the model's context limit — ☆63 · Jun 21, 2023 (updated 2 years ago)
- ☆13 · Aug 23, 2024 (updated last year)
- FlexAttention w/ FlashAttention3 Support — ☆27 · Oct 5, 2024 (updated last year)
- An implementation of local windowed attention for language modeling — ☆498 · Jul 16, 2025 (updated 8 months ago)
- playing with gpt4 — ☆14 · Mar 17, 2023 (updated 3 years ago)
- Convenient Text-to-Text Training for Transformers — ☆19 · Dec 10, 2021 (updated 4 years ago)
- ☆23 · Oct 15, 2022 (updated 3 years ago)
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] — ☆14 · Jul 11, 2023 (updated 2 years ago)
- Understand and test language model architectures on synthetic tasks. — ☆263 (updated this week)
- Convolutions for Sequence Modeling — ☆911 · Jun 13, 2024 (updated last year)
- Sequence Modeling with Structured State Spaces — ☆67 · Aug 2, 2022 (updated 3 years ago)
- Large Context Attention — ☆769 · Oct 13, 2025 (updated 5 months ago)
- ☆10 · Dec 17, 2020 (updated 5 years ago)
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights — ☆19 · Oct 9, 2022 (updated 3 years ago)
- ☆78 · Dec 7, 2023 (updated 2 years ago)
- Code for the paper "Query-Key Normalization for Transformers" — ☆52 · Mar 6, 2021 (updated 5 years ago)
- Scripts for downloading and pre-processing the `proof-pile`, a high-quality dataset of mathematical text and code. — ☆22 · Nov 26, 2022 (updated 3 years ago)
- Task-based datasets, preprocessing, and evaluation for sequence models. — ☆594 · Mar 9, 2026 (updated 2 weeks ago)
- Open weights language model from Google DeepMind, based on Griffin. — ☆665 · Feb 6, 2026 (updated last month)
- ☆19 · Dec 4, 2025 (updated 3 months ago)
- Official code for Long Expressive Memory (ICLR 2022, Spotlight) — ☆70 · Mar 11, 2022 (updated 4 years ago)
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification — ☆11 · Aug 12, 2023 (updated 2 years ago)
- ☆20 · May 30, 2024 (updated last year)
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se… — ☆68 · Apr 24, 2024 (updated last year)
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" — ☆250 · Jun 6, 2025 (updated 9 months ago)
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations" — ☆13 · Dec 14, 2021 (updated 4 years ago)
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling — ☆81 · Apr 24, 2024 (updated last year)
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP — ☆58 · Jul 28, 2022 (updated 3 years ago)
- Implementation of RETRO, DeepMind's retrieval-based attention net, in Pytorch — ☆879 · Oct 30, 2023 (updated 2 years ago)
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena — ☆207 · Aug 26, 2023 (updated 2 years ago)
- ☆13 · Jun 16, 2021 (updated 4 years ago)
- ☆13 · Feb 7, 2023 (updated 3 years ago)
- Recurrent Memory Transformer — ☆156 · Aug 14, 2023 (updated 2 years ago)