apple / ml-selfcond
Self-Conditioning Pre-Trained Language Models, ICML 2022
☆30Updated 2 years ago
Alternatives and similar repositories for ml-selfcond:
Users that are interested in ml-selfcond are comparing it to the libraries listed below
- ☆42Updated 2 years ago
- Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …☆13Updated 2 years ago
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Updated 3 weeks ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 5 months ago
- DUET: 2D Structured and Approximately Equivariant Representations, ICML 2023☆18Updated last year
- ☆20Updated 2 years ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆18Updated 7 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- ☆45Updated 10 months ago
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/☆72Updated 2 years ago
- Research publication code for "Forward Compatible Training for Large-Scale Embedding Retrieval Systems", CVPR 2022, and "FastFill: Effici…☆55Updated last year
- Generating and validating natural-language explanations.☆47Updated this week
- ☆28Updated last year
- ☆19Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆21Updated 6 months ago
- Calculating Expected Time for training LLM.☆38Updated last year
- ☆72Updated 9 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆35Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆97Updated last year
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆58Updated last year
- Support Continual pre-training & Instruction Tuning forked from llama-recipes☆31Updated last year
- SILO Language Models code repository☆81Updated 11 months ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- ☆23Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆77Updated 10 months ago