ttw1018 / MoPE-DSTView external linksLinks
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19Jan 25, 2025Updated last year
Alternatives and similar repositories for MoPE-DST
Users that are interested in MoPE-DST are comparing it to the libraries listed below
Sorting:
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- 苏州大学硕士学位论文模板 - Soochow University Thesis TeX Template☆16Nov 17, 2025Updated 2 months ago
- MegaRAG: Multimodal Graph-based RAG☆33Sep 16, 2025Updated 4 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆14Feb 4, 2025Updated last year
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 3 months ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Sep 7, 2023Updated 2 years ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- ☆17Apr 9, 2025Updated 10 months ago
- ☆18Mar 10, 2023Updated 2 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- ☆25Jun 18, 2025Updated 7 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- ☆19Nov 5, 2024Updated last year
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- ☆33Jul 15, 2025Updated 6 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆66Dec 8, 2025Updated 2 months ago
- 一种面向中文复杂问句的查询图生成方法,以及一份含有多种复杂句的中文知识图谱问答数据集☆18Mar 16, 2023Updated 2 years ago
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 4 months ago
- ☆54Jan 15, 2026Updated last month
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆25Oct 23, 2024Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 6 months ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- ☆28May 24, 2025Updated 8 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 10 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Jun 24, 2025Updated 7 months ago
- ☆54Jul 7, 2025Updated 7 months ago
- ☆49Sep 8, 2025Updated 5 months ago
- ☆39May 20, 2025Updated 8 months ago
- For the new students who just join a NLP group☆27Nov 4, 2017Updated 8 years ago
- ☆30Sep 28, 2023Updated 2 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year