Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆12Feb 11, 2024Updated 2 years ago
Alternatives and similar repositories for herd
Users that are interested in herd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated last year
- A simple Jax implementation of influence functions.☆20Apr 9, 2024Updated last year
- ☆18Apr 3, 2023Updated 2 years ago
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- A Python Terminal script for displaying Corporate filings on BSE exchange.☆19Feb 28, 2024Updated 2 years ago
- RASP-L in Haskell for my fellow rascals☆20Dec 3, 2023Updated 2 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Garbage collector implementation in Rust for Rust☆13Aug 30, 2020Updated 5 years ago
- Sample app for Material in 30 minutes talk☆11Jun 4, 2015Updated 10 years ago
- [EMNLP 2020] Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label Graphs☆17Jun 5, 2022Updated 3 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- ☆22Nov 25, 2021Updated 4 years ago
- Solution of the RSNA/ASNR/MICCAI Brain Tumor Segmentation (BraTS) Challenge 2021☆19Jul 19, 2022Updated 3 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆49Dec 14, 2023Updated 2 years ago
- An Empirical Study on Large-Scale Multi-Label Text Classification including Few and Zero-Shot Labels☆19Jul 24, 2023Updated 2 years ago
- Copying Garbage Collector☆14May 13, 2020Updated 5 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- ☆274Oct 31, 2023Updated 2 years ago
- Simple HTTP golang framework☆14Jul 4, 2019Updated 6 years ago
- A Python FASTA file Parser and Writer.☆17Sep 3, 2022Updated 3 years ago
- Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)☆22May 18, 2024Updated last year
- ☆18Feb 7, 2024Updated 2 years ago
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Updated this week
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- (no longer maintained) GitHub Code Review Assistant tool is a userscript (lightweight extension) for Firefox / Chrome☆29Jan 14, 2018Updated 8 years ago
- ☆50Oct 10, 2023Updated 2 years ago
- dirty toolkit☆20Nov 1, 2020Updated 5 years ago
- ☆12Feb 11, 2026Updated last month
- An interpreter for a small ML-ish language☆11Oct 6, 2017Updated 8 years ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- NVIDIA GPU-based FAN controller for SUPERMICRO server☆23Jul 28, 2020Updated 5 years ago
- Bilingual Medical Mixture of Experts LLM☆32Nov 23, 2024Updated last year
- This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)☆11Mar 24, 2018Updated 7 years ago
- ☆20Sep 1, 2018Updated 7 years ago
- Display simple images in your terminal☆16Jan 1, 2016Updated 10 years ago