apple / ml-selfcond
Self-Conditioning Pre-Trained Language Models, ICML 2022
☆30Updated 2 years ago
Alternatives and similar repositories for ml-selfcond:
Users that are interested in ml-selfcond are comparing it to the libraries listed below
- Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …☆13Updated 2 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆20Updated last year
- ☆13Updated 2 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 4 months ago
- ☆42Updated 2 years ago
- ☆23Updated 2 years ago
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/☆72Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated last year
- Generating and validating natural-language explanations.☆46Updated last week
- ☆45Updated 9 months ago
- DUET: 2D Structured and Approximately Equivariant Representations, ICML 2023☆18Updated last year
- ☆20Updated 2 years ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆18Updated 6 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆57Updated last year
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Updated last year
- ☆29Updated 7 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated 3 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆38Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆47Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- ☆20Updated 2 years ago
- Checkpointable dataset utilities for foundation model training☆32Updated 11 months ago
- RL algorithm: Advantage induced policy alignment☆62Updated last year
- Anh - LAION's multilingual assistant datasets and models☆27Updated last year
- ↔️ T5 Machine Translation from English to Korean☆17Updated 2 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- ☆29Updated 2 years ago
- ☆22Updated last year
- Example code for prefix-tuning GPT/GPT-NeoX models and for inference with trained prefixes☆12Updated last year