r-three/smear

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/r-three/smear)

r-three / smear

☆30

Alternatives and similar repositories for smear

Users that are interested in smear are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UNITES-Lab / MC-SMoE
View on GitHub
[ICLR‘24 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"
☆108Jun 20, 2025Updated last year
microsoft / mttl
View on GitHub
Building modular LMs with parameter-efficient fine-tuning.
☆116Jul 13, 2026Updated last week
JCruan519 / GIST
View on GitHub
(ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.
☆11Jan 28, 2024Updated 2 years ago
selkerdawy / FTWT
View on GitHub
Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
☆10May 25, 2022Updated 4 years ago
Cohere-Labs-Community / parameter-efficient-moe
View on GitHub
☆278Oct 31, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
brightjade / SimCKP
View on GitHub
Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023
☆12Jun 20, 2025Updated last year
fredzzhang / atlas
View on GitHub
[NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"
☆28Feb 24, 2025Updated last year
trestad / mitigating-reversal-curse
View on GitHub
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14Aug 2, 2024Updated last year
prateeky2806 / ComPEFT
View on GitHub
☆26Nov 23, 2023Updated 2 years ago
IBM / ModuleFormer
View on GitHub
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆225Sep 18, 2025Updated 10 months ago
allenai / hyperdecoders
View on GitHub
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14Oct 11, 2022Updated 3 years ago
sherrywan / Dual-Diffusion
View on GitHub
☆14Apr 1, 2025Updated last year
sunnweiwei / AmbigPrompt
View on GitHub
Answering Ambiguous Questions via Iterative Prompting
☆14May 25, 2024Updated 2 years ago
microsoft / ReACC
View on GitHub
Source codes for paper ”ReACC: A Retrieval-Augmented Code Completion Framework“
☆67Apr 18, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wmt-conference / wmt23-news-systems
View on GitHub
☆14Oct 6, 2025Updated 9 months ago
Ablustrund / LoRAMoE
View on GitHub
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
☆405Apr 29, 2024Updated 2 years ago
tau-nlp / scrolls
View on GitHub
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆69Jan 12, 2024Updated 2 years ago
peijunallin / alphalora
View on GitHub
☆19Nov 10, 2024Updated last year
ibraheem-moosa / mt-ranker
View on GitHub
Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking
☆10Feb 29, 2024Updated 2 years ago
andersonbcdefg / dpo-lora
View on GitHub
direct preference optimization with only 1 model copy :)
☆14Oct 2, 2023Updated 2 years ago
xiamengzhou / training_trajectory_analysis
View on GitHub
[ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf
☆25Nov 14, 2023Updated 2 years ago
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
wrh14 / deep_unlearning
View on GitHub
Official github page for the paper "Evaluating Deep Unlearning in Large Language Model"
☆14Apr 25, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
chrisliu298 / gpt2-arxiv
View on GitHub
Fine-tuning GPT-2 to generate research paper abstracts
☆12Apr 28, 2021Updated 5 years ago
leoli646 / Adapter-X
View on GitHub
Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision
☆11Jul 22, 2024Updated 2 years ago
Kyyle2114 / Convolutional-Adapter-for-Segment-Anything
View on GitHub
CAD - Memory Efficient Convolutional Adapter for Segment Anything
☆12Oct 4, 2024Updated last year
gyhdog99 / RACRO2
View on GitHub
Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)
☆19Jul 1, 2025Updated last year
RUCAIBox / MPOE
View on GitHub
☆19Sep 15, 2022Updated 3 years ago
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆113Oct 28, 2024Updated last year
sled-group / moh
View on GitHub
[NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models
☆37Nov 13, 2024Updated last year
ictnlp / NMLA-NAT
View on GitHub
Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"
☆20Nov 16, 2022Updated 3 years ago
JCruan519 / iDAT
View on GitHub
(ICME24) This is the offical repository of iDAT: inverse Distillation Adapter-Tuning.
☆13Apr 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IBM / ColPret
View on GitHub
Efficient Scaling laws and collaborative pretraining.
☆23Updated this week
XMUDeepLIT / DAMAML
View on GitHub
Code for "Domain Adaptive Meta-learning for Dialogue State Tracking"(TASLP2021)
☆10Sep 14, 2021Updated 4 years ago
google-research / vmoe
View on GitHub
☆726Jul 2, 2026Updated 3 weeks ago
Adlith / MoE-Jetpack
View on GitHub
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
☆137Nov 23, 2024Updated last year
XMUDeepLIT / ABDNMT-RNMT
View on GitHub
Code for "Exploiting reverse target-side contexts for neural machine translation via asynchronous bidirectional decoding" (Artificial Int…
☆11Dec 27, 2022Updated 3 years ago
bwconrad / soft-moe
View on GitHub
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
☆72Aug 22, 2023Updated 2 years ago
RZFan525 / Awesome-ScalingLaws
View on GitHub
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆84Apr 10, 2023Updated 3 years ago