☆12 · Feb 16, 2024 · Updated 2 years ago
Alternatives and similar repositories for ExpertTokenRouting
Users that are interested in ExpertTokenRouting are comparing it to the libraries listed below
- ☆21 · Dec 11, 2024 · Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 · Apr 8, 2024 · Updated last year
- LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing ☆38 · Jan 30, 2026 · Updated last month
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference) ☆12 · Oct 11, 2023 · Updated 2 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech" ☆13 · Jul 27, 2023 · Updated 2 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing" ☆13 · Jun 1, 2022 · Updated 3 years ago
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025) ☆12 · Jan 5, 2025 · Updated last year
- ☆12 · Jan 31, 2024 · Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability ☆13 · Jun 15, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees ☆11 · Jul 15, 2023 · Updated 2 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022 ☆11 · Aug 20, 2022 · Updated 3 years ago
- Official repository for ALT (ALignment with Textual feedback). ☆10 · Jul 25, 2024 · Updated last year
- The implementation of “Fine-tuning Graph Neural Networks by Preserving Graph Generative Patterns” ☆18 · Jun 18, 2024 · Updated last year
- Implementation of ICLR'25 paper “Multi-Label Node Classification with Label Influence Propagation”. ☆17 · Feb 28, 2025 · Updated last year
- Call any function with command-like syntax at runtime (with automatic argument management). No dependencies, no boilerplate code, no macr… ☆12 · Dec 25, 2022 · Updated 3 years ago
- ☆12 · Jun 30, 2024 · Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA) ☆15 · Aug 16, 2024 · Updated last year
- AirLLM 70B inference with single 4GB GPU ☆17 · Jun 27, 2025 · Updated 8 months ago
- CLI util: Poor man's rpath for Windows executables. ☆12 · Dec 16, 2018 · Updated 7 years ago
- ☆15 · Jul 9, 2025 · Updated 7 months ago
- ☆18 · Mar 3, 2025 · Updated last year
- ☆19 · Jul 16, 2025 · Updated 7 months ago
- A simple library for working with Hugging Face models. ☆14 · Dec 30, 2024 · Updated last year
- my commonly-used tools ☆64 · Jan 7, 2025 · Updated last year
- Website for HKU NLP group (under construction) ☆14 · Dec 23, 2025 · Updated 2 months ago
- Findings of EMNLP 2023: InfoCL: Alleviating Catastrophic Forgetting in Continual Text Classification from An Information Theoretic Perspe… ☆14 · Aug 13, 2024 · Updated last year
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization… ☆16 · Feb 15, 2024 · Updated 2 years ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf… ☆12 · Apr 6, 2023 · Updated 2 years ago
- ☆13 · Oct 18, 2023 · Updated 2 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples” ☆16 · Feb 13, 2023 · Updated 3 years ago
- Starter template repo for all your Claude Code needs: configs, skills, agents and more. ☆59 · Feb 23, 2026 · Updated last week
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations ☆15 · Apr 14, 2025 · Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆17 · Jun 3, 2024 · Updated last year
- This repo is to demo the concept of lossless compression with Transformers as encoder and decoder. ☆14 · May 2, 2024 · Updated last year
- Here is the code for the paper "Recurrent Interaction Network for Jointly Extracting Entities and Classifying Relations" accepted by EM… ☆13 · Nov 17, 2021 · Updated 4 years ago
- Setup scripts for the WebArena benchmark ☆19 · Jun 19, 2025 · Updated 8 months ago
- Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning ☆16 · Apr 5, 2024 · Updated last year
- [ICML 2024] Self-Infilling Code Generation ☆18 · May 5, 2024 · Updated last year