☆12Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for ExpertTokenRouting
Users that are interested in ExpertTokenRouting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Dec 11, 2024Updated last year
- Implementation of ICLR' 25 paper “Multi-Label Node Classification with Label Influence Propagation".☆17Feb 28, 2025Updated last year
- Code of GraphAdapter☆17Mar 21, 2024Updated 2 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated 2 years ago
- Website for HKU NLP group (under construction)☆14Mar 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2024] Self-Infilling Code Generation☆18May 5, 2024Updated 2 years ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees☆11Jul 15, 2023Updated 2 years ago
- AlphaGo Zero Clone☆17Mar 15, 2020Updated 6 years ago
- ☆15Jul 9, 2025Updated 9 months ago
- ☆19May 2, 2024Updated 2 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Feb 13, 2023Updated 3 years ago
- Official code for the paper CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation published at ACL 2022 main conf…☆12Apr 6, 2023Updated 3 years ago
- ☆20Mar 18, 2026Updated last month
- ☆21Oct 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆58Apr 6, 2026Updated 3 weeks ago
- Official implementation for 'Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient LLM Reasoning'☆26Feb 18, 2025Updated last year
- ☆12Jan 31, 2024Updated 2 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing"☆13Jun 1, 2022Updated 3 years ago
- my commonly-used tools☆64Jan 7, 2025Updated last year
- (ICLR25 Oral) Do as We Do, Not as You Think: the Conformity of Large Language Models☆45Feb 6, 2026Updated 2 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆23Jun 26, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Mar 3, 2025Updated last year
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- ☆147May 2, 2024Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆31Jan 11, 2025Updated last year
- code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts☆24Nov 29, 2023Updated 2 years ago
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆51Aug 4, 2025Updated 9 months ago
- ☆72Oct 23, 2025Updated 6 months ago
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)☆12Jan 5, 2025Updated last year
- The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation…☆26Sep 10, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning☆16Apr 5, 2024Updated 2 years ago
- simpleR1: A Simple Framework for Training R1-like Models☆30Aug 12, 2025Updated 8 months ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆30Sep 26, 2024Updated last year
- poe-chat-api is an open source Node.js library for Poe Chatbot AI.☆22Jul 10, 2023Updated 2 years ago
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆67Dec 30, 2025Updated 4 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago