google-deepmind / calmLinks
☆56Updated 6 months ago
Alternatives and similar repositories for calm
Users that are interested in calm are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- ☆56Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- ☆120Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Updated 6 months ago
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆54Updated last year
- ☆32Updated 2 years ago
- Set of scripts to finetune LLMs☆38Updated last year
- ☆130Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated 2 years ago
- MEXMA: Token-level objectives improve sentence representations☆42Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models☆122Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Exploring limitations of LLM-as-a-judge☆20Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- MatFormer repo☆70Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Code for Zero-Shot Tokenizer Transfer☆142Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆250Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Updated 6 months ago
- ☆161Updated last year
- A repository for research on medium sized language models.☆77Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Updated 2 years ago
- Code for ExploreTom☆90Updated 7 months ago
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- ☆137Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year