Scientific-Computing-Lab / MPI-rigen
MPI Code Generation through Domain-Specific Language Models
☆14 Updated 9 months ago
Alternatives and similar repositories for MPI-rigen
Users who are interested in MPI-rigen are comparing it to the libraries listed below.
- Lottery Ticket Adaptation ☆39 Updated 9 months ago
- A repository for research on medium sized language models. ☆78 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- ☆51 Updated last year
- Official Repository for Task-Circuit Quantization ☆22 Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆34 Updated last year
- ☆54 Updated 9 months ago
- ☆66 Updated 4 months ago
- ☆19 Updated 5 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆27 Updated 4 months ago
- Fork of Flame repo for training of some new stuff in development ☆15 Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 Updated 9 months ago
- GoldFinch and other hybrid transformer components ☆46 Updated last year
- ☆49 Updated 11 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆29 Updated 8 months ago
- Modified Beam Search with periodical restart ☆12 Updated 11 months ago
- ☆24 Updated 11 months ago
- ☆38 Updated last year
- Multi-Layer Key-Value sharing experiments on Pythia models ☆33 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 6 months ago
- Resa: Transparent Reasoning Models via SAEs ☆41 Updated 2 weeks ago
- Cascade Speculative Drafting ☆29 Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆25 Updated 2 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆36 Updated last month
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆31 Updated last week
- ☆34 Updated 3 weeks ago
- Very minimal (and stateless) agent framework ☆45 Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆36 Updated last year