ZhaofengWu / semantic-hubLinks
☆19Updated 5 months ago
Alternatives and similar repositories for semantic-hub
Users that are interested in semantic-hub are comparing it to the libraries listed below
Sorting:
- ☆111Updated 2 years ago
- ☆108Updated last year
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆76Updated 3 years ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"☆84Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- ☆91Updated last year
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆54Updated 11 months ago
- ☆51Updated 2 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Updated 9 months ago
- Long Context Extension and Generalization in LLMs☆62Updated last year
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆31Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- ☆20Updated 3 years ago
- Evaluate your agent memory on real-world dialogues, not LLM-simulated dialogues.☆36Updated 6 months ago
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆66Updated last year
- ☆112Updated last year
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆182Updated 7 months ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆55Updated 5 months ago
- ☆19Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆46Updated this week
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆95Updated last week
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆71Updated 11 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆39Updated last year
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Updated 2 years ago
- Official Code Repository for the paper "Key-value memory in the brain"☆31Updated 11 months ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆49Updated 7 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Updated last year
- ☆38Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆88Updated last year