warner-benjamin / modernLLMstudygroup

☆22

Alternatives and similar repositories for modernLLMstudygroup:

Users that are interested in modernLLMstudygroup are comparing it to the libraries listed below

tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆106Updated 7 months ago
parlance-labs / ftcourse
☆170Updated 10 months ago
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆378Updated 10 months ago
pacman100 / openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…
☆23Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆198Updated 11 months ago
eugeneyan / visualizing-finetunes
☆78Updated 11 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆76Updated 6 months ago
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆124Updated last year
AnswerDotAI / minai
A miniture AI training framework for PyTorch
☆40Updated 2 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated 10 months ago
rasbt / LLM-finetuning-scripts
☆203Updated last year
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆181Updated last week
AnswerDotAI / fastdata
☆150Updated 4 months ago
lightonai / pylate
Late Interaction Models Training & Retrieval
☆276Updated 2 weeks ago
SumanthRH / tokenization
A comprehensive deep dive into the world of tokens
☆222Updated 10 months ago
srush / raspy
An interactive exploration of Transformer programming.
☆262Updated last year
jjallaire / inspect-llm-workshop
☆51Updated 11 months ago
hamelsmu / ft-drift
Check for data drift between two OpenAI multi-turn chat jsonl files.
☆37Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆255Updated 9 months ago
alopatenko / LLMEvaluation
A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…
☆115Updated this week
abacaj / train-with-fsdp
☆92Updated last year
neubig / nlp-from-scratch-assignment-2022
An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch
☆172Updated 2 years ago
Cohere-Labs-Community / AI-Alignment-Cohort
☆23Updated 6 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆255Updated last year
neubig / minllama-assignment
☆85Updated 7 months ago
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated 5 months ago
arcee-ai / DALM
Domain Adapted Language Modeling Toolkit - E2E RAG
☆320Updated 5 months ago
wolfecameron / lora_instruction_tune
☆40Updated 11 months ago
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆472Updated 2 years ago