HomebrewML/HomebrewNLP-torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HomebrewML/HomebrewNLP-torch)

HomebrewML / HomebrewNLP-torch

A case study of efficient training of large language models using commodity hardware.

☆67

Alternatives and similar repositories for HomebrewNLP-torch

Users that are interested in HomebrewNLP-torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HomebrewML / Olmax
View on GitHub
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Jan 20, 2024Updated 2 years ago
HomebrewML / revlib
View on GitHub
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
☆132Aug 6, 2022Updated 3 years ago
ClashLuke / tpucare
View on GitHub
Automatically take good care of your preemptible TPUs
☆37May 15, 2023Updated 3 years ago
tensorfork / OBST
View on GitHub
Your fruity companion for transformers
☆14May 25, 2022Updated 4 years ago
learning-at-home / lean_transformer
View on GitHub
Memory-efficient transformer. Work in progress.
☆19Sep 17, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ClashLuke / MinRETRO
View on GitHub
Reimplementation of `Improving language models by retrieving from trillions of tokens`
☆19Nov 16, 2022Updated 3 years ago
AranKomat / Metroplex
View on GitHub
☆21Mar 15, 2023Updated 3 years ago
world-modelz / world-modelz
View on GitHub
video prediction and world model research
☆14Jun 10, 2022Updated 4 years ago
rovle / gpt3-in-context-fitting
View on GitHub
Experiments on GPT-3's ability to fit numerical models in-context.
☆14Aug 11, 2022Updated 3 years ago
nestordemeure / flaxOptimizers
View on GitHub
A collection of optimizers, some arcane others well known, for Flax.
☆29Aug 6, 2021Updated 4 years ago
EleutherAI / exploring-contrastive-topology
View on GitHub
☆15Jun 10, 2022Updated 4 years ago
aks2203 / easy-to-hard
View on GitHub
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆61Mar 1, 2022Updated 4 years ago
hypnopump / ClynMut
View on GitHub
To be a next-generation DL-based phenotype prediction from genome mutations.
☆19May 17, 2021Updated 5 years ago
EleutherAI / magiCARP
View on GitHub
One stop shop for all things carp
☆58Sep 9, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Zasder3 / open_clip_juwels
View on GitHub
An open source implementation of CLIP.
☆33Nov 7, 2022Updated 3 years ago
cfoster0 / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆88Mar 6, 2022Updated 4 years ago
cgraywang / transformer-on-diet
View on GitHub
Code repo for "Transformer on a Diet" paper
☆31Jun 22, 2020Updated 6 years ago
lucidrains / remixer-pytorch
View on GitHub
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆36Sep 27, 2021Updated 4 years ago
tal-z / SoundsLike
View on GitHub
A python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.
☆15Oct 26, 2022Updated 3 years ago
nostalgebraist / transformer-utils
View on GitHub
Utilities for the HuggingFace transformers library
☆77Jan 21, 2023Updated 3 years ago
HomebrewML / TrueGrad
View on GitHub
PyTorch interface for TrueGrad Optimizers
☆43Aug 8, 2023Updated 2 years ago
lucidrains / token-shift-gpt
View on GitHub
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49Jan 27, 2022Updated 4 years ago
hugorichard / ShICA
View on GitHub
☆13Feb 26, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Noahs-ARK / idea_relations
View on GitHub
A framework to identify relations between ideas in temporal text corpora.
☆28Apr 2, 2018Updated 8 years ago
epfml / REQ
View on GitHub
☆19Jun 10, 2024Updated 2 years ago
MicPie / clasp
View on GitHub
CLASP - Contrastive Language-Aminoacid Sequence Pretraining
☆142Sep 17, 2021Updated 4 years ago
tatHi / optok
View on GitHub
☆10Aug 26, 2021Updated 4 years ago
SLAMPAI / large-scale-pretraining-transfer
View on GitHub
Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training …
☆19May 29, 2022Updated 4 years ago
CSDUlm / wsingular
View on GitHub
Python package for the ICML 2022 paper "Unsupervised Ground Metric Learning Using Wasserstein Singular Vectors".
☆10Sep 2, 2024Updated last year
applicaai / pyramidions
View on GitHub
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14May 15, 2022Updated 4 years ago
AranKomat / Diff-DALLE
View on GitHub
☆65Nov 4, 2021Updated 4 years ago
ml-jku / cloob
View on GitHub
☆161Jun 13, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
EleutherAI / pilev2
View on GitHub
☆13Jan 20, 2023Updated 3 years ago
HendrikStrobelt / LMdiff
View on GitHub
A diff tool for language models
☆44Dec 28, 2023Updated 2 years ago
xtinkt / editable
View on GitHub
A supplementary code for Editable Neural Networks, an ICLR 2020 submission.
☆46Jan 21, 2020Updated 6 years ago
CerebrasResearch / RevBiFPN
View on GitHub
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
☆15Oct 18, 2022Updated 3 years ago
sholtodouglas / scalingExperiments
View on GitHub
☆62Mar 4, 2022Updated 4 years ago
lucidrains / logavgexp-torch
View on GitHub
Implementation of LogAvgExp for Pytorch
☆37Apr 10, 2025Updated last year
google / tim-gan
View on GitHub
☆11Dec 11, 2020Updated 5 years ago