Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression
☆68Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for gpt-j-fine-tuning-example
Users that are interested in gpt-j-fine-tuning-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Slides, exercises, and exams for my course "Natural Language Processing" (École Pour l'Informatique et les Techniques Avancées, 2024 and …☆20Apr 7, 2025Updated last year
- Fine-tuning GPT-J-6B on colab or equivalent PC GPU with your custom datasets: 8-bit weights with low-rank adaptors (LoRA)☆73Jun 18, 2022Updated 3 years ago
- stable-diffusion-webui extension that bypass to lsmith☆12Apr 28, 2023Updated 2 years ago
- ☆20Sep 20, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆42Jun 22, 2023Updated 2 years ago
- An extension to allow managing custom depth inputs to Stable Diffusion depth2img models for the stable-diffusion-webui repo.☆72Feb 4, 2023Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- port https://github.com/ChenWu98/cycle-diffusion to run on https://github.com/AUTOMATIC1111/stable-diffusion-webui☆13Oct 22, 2022Updated 3 years ago
- Prompt-to-prompt extention of Stable Diffusion web UI☆14Jan 30, 2023Updated 3 years ago
- ☆15Sep 6, 2022Updated 3 years ago
- ☆40Mar 25, 2023Updated 3 years ago
- Converts stable diffusion embeddings to loadable pngs☆40Dec 6, 2022Updated 3 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The stable core is your personal server for AI rendering, powered with community plugins☆20Apr 25, 2023Updated 2 years ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators☆26Jul 26, 2023Updated 2 years ago
- Anime Image Background Remover for AUTOMATIC1111☆17Jan 26, 2023Updated 3 years ago
- Easy to deploy your LLM(large language model) server with no public address GPU machine.☆15Apr 30, 2024Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 10 months ago
- A python wrapper for API Offres d'emploi v2, the job offers API by Emploi store (Pole Emploi)☆14Jun 7, 2022Updated 3 years ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
- an images browse for stable-diffusion-webui☆26Feb 4, 2023Updated 3 years ago
- A collection of my machine learning notebooks to run on google colab. Mostly ml art.☆20Jun 10, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Run text-to-video synthesis in webui.☆25Mar 20, 2023Updated 3 years ago
- Extension for stable diffusion webui to add advance prompt tuning☆10Nov 13, 2022Updated 3 years ago
- A progressive upscaling-img2img version of hires.fix, extension script for AUTOMATIC1111/stable-diffusion-webui.☆27Nov 4, 2023Updated 2 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix☆38Aug 20, 2022Updated 3 years ago
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Jul 8, 2021Updated 4 years ago
- Revamped: Hugo+LoveIt☆10Updated this week
- Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild"☆17Aug 27, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆28Mar 1, 2023Updated 3 years ago
- ☆64Oct 1, 2021Updated 4 years ago
- City of Light (COL) is a geospatially faithful, Unity-based digital twin of Paris enabling high-performance embodied simulation for AI an…☆48Mar 31, 2026Updated 2 weeks ago
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- Code, data, and pretrained models for the paper "Generating Wikipedia Article Sections from Diverse Data Sources"☆20Feb 5, 2021Updated 5 years ago
- ☆60Jan 26, 2025Updated last year
- ☆11Feb 15, 2023Updated 3 years ago