google-deepmind / calm
☆50 Updated last month
Alternatives and similar repositories for calm
Users who are interested in calm are comparing it to the libraries listed below.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval ☆30 Updated last month
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆116 Updated last month
- This is the official repository for Inheritune. ☆113 Updated 7 months ago
- ☆54 Updated 10 months ago
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention' ☆53 Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆93 Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆65 Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆88 Updated 4 months ago
- ☆39 Updated last year
- ☆51 Updated last year
- A massively multilingual modern encoder language model ☆80 Updated last week
- ☆127 Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated 8 months ago
- MEXMA: Token-level objectives improve sentence representations ☆41 Updated 8 months ago
- A repository for research on medium sized language models. ☆77 Updated last year
- ☆119 Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe… ☆158 Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆45 Updated last year
- ☆32 Updated last year
- MatFormer repo ☆62 Updated 9 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 Updated 9 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆46 Updated 2 months ago
- ☆62 Updated last year
- Code for Zero-Shot Tokenizer Transfer ☆137 Updated 8 months ago
- ☆48 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 Updated last year
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 6 months ago