jaymody / seq2seq-polynomial
Seq2seq transformer for polynomial expansion in PyTorch.
☆27 · Updated 4 years ago
Alternatives and similar repositories for seq2seq-polynomial
Users who are interested in seq2seq-polynomial compare it to the repositories listed below.
- Annotations of the interesting ML papers I read ☆242 · Updated last month
- MinT: Minimal Transformer Library and Tutorials ☆255 · Updated 2 years ago
- Minimalist BERT implementation assignment for CS11-711 ☆83 · Updated 2 years ago
- Resources from the EleutherAI Math Reading Group ☆53 · Updated 3 months ago
- A walkthrough of transformer architecture code ☆340 · Updated last year
- Language Modeling Example with Transformers and PyTorch Lightning ☆65 · Updated 4 years ago
- Module 0 - Fundamentals ☆102 · Updated 9 months ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch ☆171 · Updated 2 years ago
- ☆177 · Updated last year
- ☆18 · Updated 4 months ago
- A diff tool for language models ☆42 · Updated last year
- Helper scripts and notes that were used while porting various nlp models ☆46 · Updated 3 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆26 · Updated 2 years ago
- Neural Networks and Deep Learning, NUS CS5242, 2021 ☆190 · Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 · Updated 11 months ago
- Functional local implementations of main model parallelism approaches ☆95 · Updated 2 years ago
- Introductory lecture on Pytorch ☆17 · Updated 3 years ago
- A tour of different optimization algorithms in PyTorch. ☆99 · Updated 3 years ago
- Interview Questions and Answers for Machine Learning Engineer role ☆119 · Updated 3 weeks ago
- ☆17 · Updated last year
- Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA) ☆45 · Updated 4 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆256 · Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 · Updated 10 months ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆136 · Updated last year
- Training and Inference Notebooks for the RedPajama (OpenLlama) models ☆18 · Updated 2 years ago
- Some notebooks for NLP ☆204 · Updated last year
- ☆28 · Updated last year
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ☆104 · Updated 2 years ago
- A library to create and manage configuration files, especially for machine learning projects. ☆78 · Updated 3 years ago
- An implementation of masked language modeling for Pytorch, made as concise and simple as possible ☆181 · Updated last year