google-deepmind / calm
☆44 Updated last week
Alternatives and similar repositories for calm:
Users who are interested in calm are comparing it to the libraries listed below.
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation ☆33 Updated 11 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆74 Updated this week
- Pytorch/XLA SPMD Test code in Google TPU ☆23 Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs ☆46 Updated last month
- ☆31 Updated 8 months ago
- Truly flash T5 realization! ☆62 Updated 8 months ago
- Collection of autoregressive model implementation ☆81 Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Utils for Unsloth ☆43 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 7 months ago
- Simple Model Similarities Analysis ☆21 Updated last year
- MEXMA: Token-level objectives improve sentence representations ☆40 Updated last month
- Google TPU optimizations for transformers models ☆98 Updated 3 weeks ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 5 months ago
- Set of scripts to finetune LLMs ☆36 Updated 10 months ago
- Modified Beam Search with periodical restart ☆12 Updated 5 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆92 Updated last year
- ☆32 Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆44 Updated 4 months ago
- ☆62 Updated 4 months ago
- Code for Zero-Shot Tokenizer Transfer ☆120 Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆82 Updated last month
- ☆47 Updated 5 months ago
- ☆75 Updated last month
- ☆48 Updated 3 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆35 Updated 9 months ago
- ☆52 Updated 8 months ago