daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆66 · Updated 2 months ago
Alternatives and similar repositories for sft-demos:
Users who are interested in sft-demos are comparing it to the repositories listed below.
- experiments with inference on llama ☆104 · Updated 7 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆170 · Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆49 · Updated 10 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆124 · Updated 10 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 · Updated 9 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆52 · Updated last month
- ☆62 · Updated 5 months ago
- Experiments with generating opensource language model assistants ☆97 · Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆97 · Updated 10 months ago
- ☆115 · Updated 3 months ago
- Data Toolkit for Sailor Language Models ☆83 · Updated 3 weeks ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆247 · Updated 6 months ago
- ☆108 · Updated 3 months ago
- Finetune mistral-7b-instruct for sentence embeddings ☆74 · Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆129 · Updated 2 months ago
- ☆74 · Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆115 · Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆82 · Updated 5 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 · Updated last month
- A pipeline for LLM knowledge distillation ☆83 · Updated 5 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆79 · Updated 10 months ago
- ☆137 · Updated 9 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆115 · Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated 11 months ago
- QLoRA with Enhanced Multi GPU Support ☆36 · Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆163 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆75 · Updated 3 months ago
- Official implementation for 'Extending LLMs' Context Window with 100 Samples' ☆76 · Updated last year