uds-lsv/llmft

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uds-lsv/llmft)

uds-lsv / llmft

Fine-tuning large language models with huggingface transformers and deepspeed

☆31

Alternatives and similar repositories for llmft

Users that are interested in llmft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / factual-confidence-of-llms
View on GitHub
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
☆17Dec 4, 2024Updated last year
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
john-hewitt / dyckkm-learning
View on GitHub
Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language
☆16Oct 11, 2020Updated 5 years ago
BatsResearch / cross-lingual-detox
View on GitHub
Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024
☆18Mar 25, 2025Updated last year
carina-kauf / better-mlm-scoring
View on GitHub
[Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring
☆12Dec 1, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CraftJarvis / OmniJARVIS
View on GitHub
☆30Jun 25, 2024Updated 2 years ago
informagi / mmead
View on GitHub
MS Marco Entity Annotations Disambiguation
☆14May 19, 2023Updated 3 years ago
moqingyan / dsr-lm
View on GitHub
☆13Jul 8, 2023Updated 3 years ago
INK-USC / XCSR
View on GitHub
Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"
☆23Oct 26, 2021Updated 4 years ago
huggingface / peft-pytorch-conference
View on GitHub
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆15Oct 16, 2023Updated 2 years ago
edoost / pert
View on GitHub
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
☆10Nov 15, 2021Updated 4 years ago
informagi / laps
View on GitHub
☆14Oct 18, 2024Updated last year
ethz-spylab / superhuman-ai-consistency
View on GitHub
☆30Jun 19, 2023Updated 3 years ago
terarachang / DataICL
View on GitHub
Data Valuation on In-Context Examples (ACL23)
☆24Jan 12, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mrvoh / meta_learning_multilingual_doc_classification
View on GitHub
Placeholder repository
☆15Mar 16, 2022Updated 4 years ago
oxpig / Fragmenstein
View on GitHub
Merging, linking and placing compounds by stitching bound compounds together like a reanimated corpse
☆12Feb 22, 2024Updated 2 years ago
xz-liu / GraphEval
View on GitHub
Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs
☆34Sep 3, 2024Updated last year
harbor-framework / harbor-index
View on GitHub
A compact high-signal benchmark for evaluating frontier agents
☆18Updated this week
dmksjfl / PAR
View on GitHub
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
☆15Aug 15, 2025Updated 11 months ago
mansheej / icl-task-diversity
View on GitHub
Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"
☆27Jun 28, 2023Updated 3 years ago
adapter-hub / hgiyt
View on GitHub
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28Oct 3, 2021Updated 4 years ago
apple / ml-lucid-datagen
View on GitHub
☆31Mar 4, 2024Updated 2 years ago
thuml / timer
View on GitHub
See the official code and checkpoints for "Timer: Generative Pre-trained Transformers Are Large Time Series Models"
☆17Aug 19, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
thu-ml / CEURL
View on GitHub
Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)
☆19Oct 13, 2024Updated last year
fra31 / robust-finetuning
View on GitHub
Code relative to "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers"
☆19Nov 30, 2022Updated 3 years ago
ctlllll / reward_collapse
View on GitHub
☆26May 30, 2023Updated 3 years ago
eric-mitchell / concord
View on GitHub
☆14Nov 15, 2022Updated 3 years ago
Jackory / RPBT
View on GitHub
(AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)
☆12May 22, 2023Updated 3 years ago
Roythuly / OMPO
View on GitHub
☆13May 29, 2024Updated 2 years ago
shadowkiller33 / Contrast-Instruction
View on GitHub
☆19Oct 2, 2023Updated 2 years ago
utrerf / robust_transfer_learning
View on GitHub
Accelerating Transfer Learning with Robust Neural Nets
☆11Oct 2, 2020Updated 5 years ago
gemcollector / PIE-G
View on GitHub
This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"
☆16Sep 21, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jungmaier / dirichlet-smoothed-word-embeddings
View on GitHub
Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices
☆10Aug 3, 2020Updated 5 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
UIC-Liu-Lab / DGA
View on GitHub
[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21Feb 12, 2023Updated 3 years ago
linhaowei1 / kumo
View on GitHub
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
☆20Jun 4, 2025Updated last year
HazyResearch / augmentation_code
View on GitHub
Reproducible code for Augmentation paper
☆17Jan 23, 2019Updated 7 years ago
btnorman / First-Explore
View on GitHub
Repo to reproduce the First-Explore paper results
☆39May 6, 2026Updated 2 months ago
eth-lre / LLM_ICL
View on GitHub
ACL24
☆11Jun 7, 2024Updated 2 years ago