neelsjain/NEFTune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neelsjain/NEFTune)

neelsjain / NEFTune

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

☆412

Alternatives and similar repositories for NEFTune

Users that are interested in NEFTune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

devnkong / GOAT
View on GitHub
Official implementation of GOAT model (ICML2023)
☆38Jul 3, 2023Updated 3 years ago
YuxinWenRick / diffusion_memorization
View on GitHub
Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)
☆80Apr 3, 2024Updated 2 years ago
azshue / AutoPoison
View on GitHub
The official repository of the paper "On the Exploitability of Instruction Tuning".
☆70Feb 5, 2024Updated 2 years ago
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆600Dec 9, 2024Updated last year
neelsjain / BYOD
View on GitHub
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108Sep 23, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hamidkazemi22 / CLIPInversion
View on GitHub
What do we learn from inverting CLIP models?
☆58Mar 6, 2024Updated 2 years ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,651May 26, 2026Updated 2 months ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
mcleish7 / gemstone-scaling-laws
View on GitHub
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆35Sep 28, 2025Updated 10 months ago
hsouri / GDP
View on GitHub
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
☆11Apr 1, 2024Updated 2 years ago
montehoover / DynaGuard
View on GitHub
Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."
☆23Nov 3, 2025Updated 8 months ago
somepago / DCR
View on GitHub
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
☆113Nov 22, 2023Updated 2 years ago
YuxinWenRick / canary-in-a-coalmine
View on GitHub
☆33Nov 27, 2023Updated 2 years ago
ahans30 / goldfish-loss
View on GitHub
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,266Jun 17, 2026Updated last month
OpenBMB / UltraFeedback
View on GitHub
A large-scale, fine-grained, diverse preference dataset (and models).
☆368Dec 29, 2023Updated 2 years ago
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,761Jun 25, 2024Updated 2 years ago
Cohere-Labs-Community / parameter-efficient-moe
View on GitHub
☆279Oct 31, 2023Updated 2 years ago
JIA-Lab-research / LongLoRA
View on GitHub
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,689Aug 14, 2024Updated last year
Xwin-LM / Xwin-LM
View on GitHub
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,037May 31, 2024Updated 2 years ago
mcleish7 / arithmetic
View on GitHub
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆200May 28, 2024Updated 2 years ago
morse-benchmark / morse-500
View on GitHub
☆31May 21, 2026Updated 2 months ago
facebookresearch / scalable-curvature
View on GitHub
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29Feb 3, 2026Updated 5 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
DAMO-NLP-SG / CLEX
View on GitHub
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Mar 12, 2024Updated 2 years ago
princeton-nlp / SimPO
View on GitHub
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
☆956Feb 16, 2025Updated last year
eric-mitchell / direct-preference-optimization
View on GitHub
Reference implementation for DPO (Direct Preference Optimization)
☆2,899Aug 11, 2024Updated last year
huggingface / datablations
View on GitHub
Scaling Data-Constrained Language Models
☆345Jun 28, 2025Updated last year
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,333Jun 11, 2023Updated 3 years ago
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,990Apr 22, 2026Updated 3 months ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
mit-han-lab / streaming-llm
View on GitHub
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,251Jul 11, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lm-sys / llm-decontaminator
View on GitHub
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆325Dec 20, 2023Updated 2 years ago
Guitaricet / relora
View on GitHub
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆474Apr 21, 2024Updated 2 years ago
TencentARC / LLaMA-Pro
View on GitHub
[ACL 2024] Progressive LLaMA with Block Expansion.
☆513May 20, 2024Updated 2 years ago
tatsu-lab / alpaca_eval
View on GitHub
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆2,007Aug 9, 2025Updated 11 months ago
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,607Mar 27, 2023Updated 3 years ago
arielnlee / Platypus
View on GitHub
Code for fine-tuning Platypus fam LLMs using LoRA
☆625Feb 4, 2024Updated 2 years ago
thunlp / UltraChat
View on GitHub
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
☆2,877Mar 13, 2024Updated 2 years ago