fredi-python / Fine-tune-RedPajama-Chat-3B
Code for finetuning RedPajama-Chat-3B using LoRA
☆13 · Updated 2 years ago
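For context when scanning the alternatives below, it helps to see what LoRA finetuning of RedPajama-Chat-3B looks like in practice: adapters are attached to the attention projections and only those weights are trained. This is a minimal sketch using the Hugging Face transformers, peft, and datasets libraries; the dataset, prompt format, and hyperparameters are illustrative assumptions, not the repository's actual script.

```python
# Minimal LoRA finetuning sketch for RedPajama-Chat-3B (illustrative, not the
# repo's actual script). Assumes transformers, peft, and datasets are installed.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "togethercomputer/RedPajama-INCITE-Chat-3B-v1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token  # GPT-NeoX tokenizer has no pad token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Attach LoRA adapters to the attention projections; only these are trained.
# Rank/alpha values are common defaults, not the repo's settings.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["query_key_value"],  # GPT-NeoX-style attention block
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # adapter weights are a tiny fraction of 3B

# Any instruction dataset works; an alpaca-style dataset is assumed here.
dataset = load_dataset("yahma/alpaca-cleaned", split="train[:1000]")

def tokenize(example):
    # RedPajama-INCITE-Chat uses a "<human>: ... <bot>: ..." prompt format.
    text = f"<human>: {example['instruction']}\n<bot>: {example['output']}"
    return tokenizer(text, truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="redpajama-3b-lora",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        num_train_epochs=1,
        learning_rate=2e-4,
        fp16=True,
        logging_steps=10,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("redpajama-3b-lora")  # writes only the small adapter weights
```

Because only the adapter weights are saved, the output directory stays in the tens of megabytes rather than the multi-gigabyte size of the full model.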
Alternatives and similar repositories for Fine-tune-RedPajama-Chat-3B
Users interested in Fine-tune-RedPajama-Chat-3B are comparing it to the repositories listed below
- entropix style sampling + GUI ☆26 · Updated 7 months ago
- ☆73 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · Updated last year
- GPT-2 small trained on phi-like data ☆66 · Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 · Updated last year
- ☆20 · Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆162 · Updated last year
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation ☆71 · Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 · Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 · Updated last year
- ☆53 · Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search ☆40 · Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆37 · Updated 2 years ago
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on adapts the model's context limit (see the sketch after this list) ☆63 · Updated 2 years ago
- An OpenAI Completions API compatible server for NLP transformers models ☆65 · Updated last year
- Experimental sampler to make LLMs more creative ☆31 · Updated last year
- Plug n Play GBNF Compiler for llama.cpp ☆25 · Updated last year
- ☆66 · Updated last year
- ☆40 · Updated 2 years ago
- ☆33 · Updated 2 years ago
- Mistral7B playing DOOM ☆28 · Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 · Updated last year
- QLoRA with Enhanced Multi GPU Support ☆37 · Updated last year
- ☆22 · Updated last year
- ☆22 · Updated last year
- An implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Updated last year
- 5X faster, 60% less memory QLoRA finetuning ☆21 · Updated last year
- Model REVOLVER, a human-in-the-loop model mixing system. ☆33 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 · Updated last year
- ☆16 · Updated 2 years ago
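For the RoPE context-extension entry flagged above, here is a minimal illustration of the idea using the rope_scaling option that recent transformers versions expose for Llama-style models. The model name and scaling factor are placeholder assumptions, not the linked repository's setup.

```python
# Sketch of RoPE context extension: stretch the rotary position encoding so a
# model pre-trained on 2048 tokens can attend over longer sequences, then
# finetune on long inputs so it adapts to the stretched positions.
# Model name and factor are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "openlm-research/open_llama_3b"  # any Llama-style RoPE model
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Linear position interpolation: a factor of 4 maps positions 0..8191 into the
# 0..2047 range the model saw during pre-training.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    rope_scaling={"type": "linear", "factor": 4.0},
)
model.config.max_position_embeddings = 8192

# From here, finetuning on sequences longer than 2048 (as the entry above
# demonstrates) lets the model settle into the new effective context limit.
```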