Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression
☆68Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for gpt-j-fine-tuning-example
Users who are interested in gpt-j-fine-tuning-example are comparing it to the libraries listed below.
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Slides, exercises, and exams for my course "Natural Language Processing" (École Pour l'Informatique et les Techniques Avancées, 2024 and …☆20Apr 7, 2025Updated 11 months ago
- Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)☆74Jun 18, 2022Updated 3 years ago
- stable-diffusion-webui extension that bypasses to lsmith☆12Apr 28, 2023Updated 2 years ago
- GreenLIT: Using GPT-J with Multi-Task Learning to Create New Screenplays☆17Nov 27, 2022Updated 3 years ago
- ☆42Jun 22, 2023Updated 2 years ago
- An extension to allow managing custom depth inputs to Stable Diffusion depth2img models for the stable-diffusion-webui repo.☆72Feb 4, 2023Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- port https://github.com/ChenWu98/cycle-diffusion to run on https://github.com/AUTOMATIC1111/stable-diffusion-webui☆13Oct 22, 2022Updated 3 years ago
- ☆40Mar 25, 2023Updated 3 years ago
- The stable core is your personal server for AI rendering, powered with community plugins☆20Apr 25, 2023Updated 2 years ago
- Anime Image Background Remover for AUTOMATIC1111☆17Jan 26, 2023Updated 3 years ago
- ☆23Sep 29, 2024Updated last year
- A collection of generative and training notebooks getting mirrored to google colab.☆12May 29, 2022Updated 3 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 9 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
- A collection of my machine learning notebooks to run on google colab. Mostly ml art.☆20Jun 10, 2022Updated 3 years ago
- Extension for stable diffusion webui to add advanced prompt tuning☆10Nov 13, 2022Updated 3 years ago
- Generate images from texts. In Russian☆19Dec 13, 2021Updated 4 years ago
- [IROS 2024] "ComTraQ-MPC: Meta-Trained DQN-MPC Integration for Trajectory Tracking with Limited Active Localization Updates" by Gokul Put…☆13Apr 10, 2025Updated 11 months ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- ☆15Mar 12, 2022Updated 4 years ago
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- Few Shot Learning using EleutherAI's GPT-Neo, an open-source version of GPT-3☆18Jul 8, 2021Updated 4 years ago
- ☆64Oct 1, 2021Updated 4 years ago
- Revamped: Hugo+LoveIt☆10Mar 14, 2026Updated 2 weeks ago
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Mar 1, 2023Updated 3 years ago
- Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild"☆16Aug 27, 2025Updated 7 months ago
- A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load…☆113Dec 23, 2021Updated 4 years ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Feb 5, 2021Updated 5 years ago
- ☆60Jan 26, 2025Updated last year
- ☆11Feb 15, 2023Updated 3 years ago
- ☆24Jul 25, 2024Updated last year
- Extension/Script for Stable Diffusion UI by AUTOMATIC1111 https://github.com/AUTOMATIC1111/stable-diffusion-webui☆17Dec 19, 2022Updated 3 years ago
- A tool for estimating a system's information leakage via Machine Learning☆10Jun 28, 2024Updated last year
- DGL implementation of GRAND(Graph Random Neural Network, NeurIPS 2020)☆18Mar 19, 2021Updated 5 years ago
- Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion☆13Nov 6, 2022Updated 3 years ago
- Scraping Dynamic Websites with Python and Selenium☆14Jun 22, 2022Updated 3 years ago