teknium1 / stanford_alpaca-replitLinks

Modified Stanford-Alpaca Trainer for Training Replit's Code Model

☆41

Alternatives and similar repositories for stanford_alpaca-replit

Users that are interested in stanford_alpaca-replit are comparing it to the libraries listed below

Sorting:

Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last month
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆102Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
abacaj / openhermes-function-calling
☆134Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67Updated last week
reactorsh / ambrosia
clean up your LLM datasets
☆113Updated 2 years ago
teknium1 / transformers-gptq-quant
☆45Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆118Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆79Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated 2 years ago
teknium1 / ShareGPT-Builder
☆116Updated 11 months ago
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated 2 years ago
OpenAgentLLM / OpenAgent
🔓 The open-source autonomous agent LLM initiative 🔓
☆91Updated last year
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆118Updated 2 years ago
yoheinakajima / asymmetrix
☆132Updated 2 years ago
567-labs / fastllm
A collection of LLM services you can self host via docker or modal labs to support your applications development
☆197Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆169Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73Updated 2 years ago
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
glaive-ai / function-calling-server
☆36Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Updated last year
NousResearch / finetuning-subnet
☆120Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated 2 years ago
NousResearch / Obsidian
Maybe the new state of the art vision model? we'll see 🤷‍♂️
☆165Updated last year
abacaj / replit-3B-inference
Run inference on replit-3B code instruct model using CPU
☆159Updated 2 years ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆74Updated 2 years ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year