google-deepmind / calm
☆47Updated last month
Alternatives and similar repositories for calm:
Users that are interested in calm are comparing it to the libraries listed below
- Easy to use, High Performant Knowledge Distillation for LLMs☆56Updated last week
- Complex Function Calling Benchmark.☆92Updated 2 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆53Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆38Updated last month
- ☆46Updated 5 months ago
- Code for Zero-Shot Tokenizer Transfer☆126Updated 2 months ago
- ☆48Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- ☆33Updated 9 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 2 months ago
- MEXMA: Token-level objectives improve sentence representations☆40Updated 3 months ago
- ☆40Updated 2 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆67Updated last week
- Collection of autoregressive model implementation☆85Updated last month
- Lightweight tools for quick and easy LLM demo's☆26Updated 6 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated last month
- ☆44Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- Pytorch/XLA SPMD Test code in Google TPU☆23Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆112Updated 6 months ago
- ☆119Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆27Updated last month
- 🚢 Data Toolkit for Sailor Language Models☆88Updated last month
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- Implementation of the Mamba SSM with hf_integration.☆56Updated 7 months ago