google-deepmind / calmLinks
☆46Updated last month
Alternatives and similar repositories for calm
Users that are interested in calm are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆34Updated 11 months ago
- Collection of autoregressive model implementation☆85Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 7 months ago
- ☆47Updated 9 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- This is the official repository for Inheritune.☆111Updated 3 months ago
- ☆49Updated 7 months ago
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆52Updated 9 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆63Updated last year
- ☆23Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆25Updated 3 months ago
- ☆43Updated 3 months ago
- ☆37Updated 2 years ago
- ☆118Updated 9 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- Lego for GRPO☆28Updated last week
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆25Updated 3 weeks ago
- ☆47Updated 7 months ago
- minimal GRPO implementation from scratch☆90Updated 2 months ago
- Code for Zero-Shot Tokenizer Transfer☆128Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- ☆120Updated 8 months ago
- ☆53Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆221Updated 7 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆124Updated last year