The simplest, fastest repository for training/finetuning medium-sized GPTs with LoRA support.
☆30Feb 19, 2024Updated 2 years ago
Alternatives and similar repositories for nanoGPT-LoRA
Users that are interested in nanoGPT-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022☆11Aug 9, 2022Updated 3 years ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- ☆10Jul 7, 2025Updated 9 months ago
- Multimodal Federated Learning on IoT Data☆11Dec 17, 2023Updated 2 years ago
- Code for MSH-Net: Modality-Shared Hallucination with Joint Adaptation Distillation for Remote Sensing Image Classification Using Missing …☆14Jan 17, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- [ICLR'24] Heterogeneous Personalized Federated Learning by Local-Global Updates Mixing via Convergence Rate☆13Jun 17, 2025Updated 9 months ago
- Implement Human Activity Recognition in PyTorch using hybrid of LSTM, Bi-dir LSTM and Residual Network Models☆16May 8, 2020Updated 5 years ago
- A Bigram Language Model from scratch with no-smoothing and add-one smoothing. Outputs bigram counts, bigram probabilities and probability…☆15Jan 12, 2021Updated 5 years ago
- a small demo repo to show how I got neuralbeagle14-7b running locally on my 8GB GPU☆14Jan 29, 2024Updated 2 years ago
- Machine learning project using federated learning for text generation☆11May 5, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- ☆19Jun 10, 2024Updated last year
- Easy-to-Use Federated Learning Simulator in Pytorch☆15Apr 23, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- ☆20Jul 2, 2024Updated last year
- Fine-tune GPT models ready-to-go☆22May 18, 2024Updated last year
- ☆18Feb 2, 2022Updated 4 years ago
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- ☆19Nov 17, 2023Updated 2 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Jun 10, 2023Updated 2 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆64Mar 11, 2025Updated last year
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Generative deep learning: DeepDream☆24May 1, 2022Updated 3 years ago
- ☆20Jun 4, 2024Updated last year
- Neural ngram language model in PyTorch.☆10Sep 27, 2018Updated 7 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ZOSVRG-BlackBox-Adv☆13Oct 30, 2018Updated 7 years ago
- My personal research notebook with notes, tutorials, and resources written in Jupyterbook.☆21Jan 9, 2024Updated 2 years ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Mar 1, 2024Updated 2 years ago
- ☆31Jun 2, 2018Updated 7 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆22Oct 18, 2024Updated last year
- [ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.☆18Jun 19, 2025Updated 9 months ago
- ☆14May 4, 2024Updated last year