jaymody / seq2seq-polynomialLinks
Seq2seq transformer for polynomial expansion in PyTorch.
☆29Updated 4 years ago
Alternatives and similar repositories for seq2seq-polynomial
Users that are interested in seq2seq-polynomial are comparing it to the libraries listed below
Sorting:
- Module 0 - Fundamentals☆110Updated last year
- An interactive exploration of Transformer programming.☆271Updated 2 years ago
- MinT: Minimal Transformer Library and Tutorials☆260Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆190Updated 3 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆313Updated 3 years ago
- ☆367Updated last year
- Code associated to papers on superposition (in ML interpretability)☆35Updated 3 years ago
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning☆29Updated 4 years ago
- Annotations of the interesting ML papers I read☆273Updated last month
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- Resources from the EleutherAI Math Reading Group☆54Updated 11 months ago
- Minimal code to train a Large Language Model (LLM).☆170Updated 3 years ago
- ☆167Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp models☆49Updated 3 years ago
- symbolic regression☆40Updated 3 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆216Updated 3 weeks ago
- Library that contains implementations of machine learning components in the hyperbolic space☆145Updated last year
- Evaluation suite for large-scale language models.☆129Updated 4 years ago
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"☆323Updated last year
- a writeup on some experiments on a sequence model for chess games☆32Updated 4 years ago
- Repository for analysis and experiments in the BigCode project.☆128Updated last year
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆134Updated 2 weeks ago
- ☆108Updated 3 years ago
- Automatic gradient descent☆217Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆86Updated 2 years ago
- ☆102Updated 3 years ago
- Neural information retrieval / Semantic search / Bi-encoders☆175Updated 2 years ago
- A diff tool for language models☆44Updated 2 years ago
- Train very large language models in Jax.☆210Updated 2 years ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Updated last year