kyegomez / EvoVLM-JP
Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI
☆29 Updated 11 months ago
Alternatives and similar repositories for EvoVLM-JP
Users who are interested in EvoVLM-JP are comparing it to the repositories listed below.
- Unofficial Implementation of Evolutionary Model Merging ☆40 Updated last year
- ☆202 Updated 10 months ago
- A repository for research on medium sized language models. ☆78 Updated last year
- ☆33 Updated 9 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24) ☆147 Updated last year
- ☆69 Updated last year
- Lottery Ticket Adaptation ☆40 Updated 11 months ago
- The official implementation of Self-Exploring Language Models (SELM) ☆64 Updated last year
- This is the official repository for Inheritune. ☆115 Updated 8 months ago
- ☆74 Updated last year
- ☆26 Updated 9 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆122 Updated last year
- ☆93 Updated 4 months ago
- ☆108 Updated last year
- ☆122 Updated 8 months ago
- ☆100 Updated last year
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind ☆177 Updated last year
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆50 Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 9 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 Updated last year
- ☆86 Updated last year
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind ☆129 Updated last year
- Language models scale reliably with over-training and on downstream tasks ☆100 Updated last year
- ☆47 Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆92 Updated 11 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch ☆179 Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆100 Updated last year
- ☆78 Updated 2 years ago
- ☆78 Updated 2 months ago