apple / ml-selfcond
Self-Conditioning Pre-Trained Language Models, ICML 2022
☆28 Updated 2 years ago
Related projects:
- ☆43 Updated last year
- Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" … ☆12 Updated 2 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark ☆19 Updated 9 months ago
- ☆22 Updated 2 years ago
- DUET: 2D Structured and Approximately Equivariant Representations, ICML 2023 ☆16 Updated last year
- ☆11 Updated 2 years ago
- ☆19 Updated last year
- Explain a black-box module in natural language. ☆33 Updated 3 weeks ago
- ☆37 Updated 5 months ago
- The official repo of our research work "Interactive Editing for Text Summarization". ☆21 Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆56 Updated last year
- ☆26 Updated last year
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/ ☆65 Updated 2 years ago
- Code and data from the paper 'Human Feedback is not Gold Standard' ☆18 Updated 2 months ago
- Anh - LAION's multilingual assistant datasets and models ☆27 Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆19 Updated 3 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization ☆27 Updated last week
- ☆27 Updated 4 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆40 Updated 8 months ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 Updated 9 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. ☆19 Updated last year
- Convenient Text-to-Text Training for Transformers ☆19 Updated 2 years ago
- ☆36 Updated last month
- ☆25 Updated 3 months ago
- Hugging Face RoBERTa with Flash Attention 2 ☆16 Updated last year
- ☆20 Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated 8 months ago
- [ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators ☆24 Updated last year
- ☆19 Updated this week