rania-hossam / LLAMA_FROM_SCRATCH_PYTORCH

☆16

Alternatives and similar repositories for LLAMA_FROM_SCRATCH_PYTORCH

Users who are interested in LLAMA_FROM_SCRATCH_PYTORCH are comparing it to the libraries listed below

daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆77 Updated 8 months ago
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetuning and dataset creation
☆110 Updated 9 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104 Updated last year
abacaj / train-with-fsdp
☆92 Updated last year
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆90 Updated 3 months ago
melisa-writer / short-transformers
Prune transformer layers
☆69 Updated last year
SeunghyunSEO / optimized_hf_llama_class_for_training
☆47 Updated 10 months ago
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆72 Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated last month
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59 Updated last year
luyug / magix
Supercharge huggingface transformers with model parallelism.
☆77 Updated 8 months ago
Knowledgator / FlashDeBERTa
Truly flash implementation of DeBERTa disentangled attention mechanism.
☆59 Updated last month
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filtered 8K sequences on Pile
☆115 Updated 2 years ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆101 Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71 Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆104 Updated 8 months ago
JulesBelveze / bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
☆83 Updated 7 months ago
thomfoster / minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
☆85 Updated 2 years ago
trapoom555 / Language-Model-STS-CFT
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64 Updated 10 months ago
jaymody / picoBERT
Like picoGPT but for BERT.
☆50 Updated 2 years ago
davanstrien / data-for-fine-tuning-llms
☆77 Updated last year
apple / ml-hypercloning
☆47 Updated 7 months ago
pacman100 / peft-codegen-25
☆23 Updated last year
KaiNylund / lm-weights-encode-time
☆68 Updated 10 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91 Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49 Updated 11 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆261 Updated 11 months ago
rashmimarganiatgithub / LLMS_Library_2023
LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries.
☆69 Updated last year
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆47 Updated last year
google-deepmind / xtr
XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
☆52 Updated last year