daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆67 Updated 4 months ago
Alternatives and similar repositories for sft-demos:
Users who are interested in sft-demos are comparing it to the libraries listed below:
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆125 Updated 11 months ago
- ☆139 Updated 10 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆115 Updated last year
- experiments with inference on llama ☆104 Updated 8 months ago
- 🚢 Data Toolkit for Sailor Language Models ☆85 Updated last month
- ☆113 Updated 4 months ago
- Data preparation code for Amber 7B LLM ☆85 Updated 9 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆175 Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆74 Updated 5 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆76 Updated last year
- Experiments with generating opensource language model assistants ☆97 Updated last year
- Code for NeurIPS LLM Efficiency Challenge ☆55 Updated 10 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆100 Updated 5 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆57 Updated 11 months ago
- ☆62 Updated 6 months ago
- ☆74 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆100 Updated 2 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆101 Updated 6 months ago
- ☆24 Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors) ☆97 Updated 11 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆51 Updated 3 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆146 Updated last year
- ☆117 Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 11 months ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 11 months ago
- ☆84 Updated last year
- QLoRA with Enhanced Multi GPU Support ☆36 Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆113 Updated 5 months ago