epochxero / project-proposalsLinks
Repository to gather and share ideas eventually spurring discussions and possibly implementations. Please adhere to the proposal template.
☆9Updated 5 years ago
Alternatives and similar repositories for project-proposals
Users that are interested in project-proposals are comparing it to the libraries listed below
Sorting:
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- API for accessing the GraphLog dataset☆90Updated last year
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- ☆153Updated 5 years ago
- TPU index is a package for fast similarity search over large collections of high dimension vectors on TPUs☆17Updated 3 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure☆105Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆208Updated 3 weeks ago
- A library for squeakily cleaning and filtering language datasets.☆47Updated last year
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- ☆39Updated 2 years ago
- ☆60Updated 3 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆84Updated last year
- Babysit your preemptible TPUs☆85Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆81Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆50Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆239Updated 2 years ago
- AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems☆46Updated 2 years ago
- ☆67Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆56Updated last week
- Train very large language models in Jax.☆205Updated last year
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆18Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- ☆78Updated 5 years ago
- ☆67Updated 2 years ago
- Code for minimum-entropy coupling.☆32Updated last year
- A set of useful tools for DL experiments, project templates, etc.☆35Updated 3 years ago