epochxero / project-proposalsLinks

Repository to gather and share ideas eventually spurring discussions and possibly implementations. Please adhere to the proposal template.

☆9

Alternatives and similar repositories for project-proposals

Users that are interested in project-proposals are comparing it to the libraries listed below

Sorting:

HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆69Updated 2 years ago
facebookresearch / GraphLog
API for accessing the GraphLog dataset
☆90Updated last year
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆95Updated 2 years ago
srush / parallax
☆153Updated 5 years ago
srihari-humbarwadi / tpu_index
TPU index is a package for fast similarity search over large collections of high dimension vectors on TPUs
☆17Updated 3 years ago
yifding / hetseq
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
☆105Updated 2 years ago
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆208Updated 3 weeks ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated last year
clemkoa / ntm
Neural Turing Machines in pytorch
☆48Updated 3 years ago
shawwn / ml-notes
☆39Updated 2 years ago
sholtodouglas / scalingExperiments
☆60Updated 3 years ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆84Updated last year
shawwn / tpunicorn
Babysit your preemptible TPUs
☆85Updated 2 years ago
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆36Updated 2 years ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆81Updated 3 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Updated last year
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆239Updated 2 years ago
microsoft / Litmus
AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems
☆46Updated 2 years ago
zphang / minimal-opt
☆67Updated 2 years ago
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆56Updated last week
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆205Updated last year
srush / anynp
Proof-of-concept of global switching between numpy/jax/pytorch in a library.
☆18Updated last year
johnrobinsn / redpajama
Training and Inference Notebooks for the RedPajama (OpenLlama) models
☆18Updated 2 years ago
augustwester / transformer-xl
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
☆37Updated 2 years ago
NohTow / PPL-MCTS
Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22
☆66Updated 2 years ago
uber-research / GTN
☆78Updated 5 years ago
huggingface / bloom-jax-inference
☆67Updated 2 years ago
ssokota / mec
Code for minimum-entropy coupling.
☆32Updated last year
gmum / toolkit
A set of useful tools for DL experiments, project templates, etc.
☆35Updated 3 years ago