☆15Oct 19, 2024Updated last year
Alternatives and similar repositories for XMoE
Users that are interested in XMoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- ☆30Sep 28, 2023Updated 2 years ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆69Jul 30, 2024Updated last year
- A fusion model of transformer and CNN for remote sensing image classification. Datasets Used: UCM, RSSCN7☆11May 29, 2025Updated 9 months ago
- ☆17Jun 11, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated 2 years ago
- paNote: an graph note software can be deployed as blog or use as electron☆12Jun 15, 2024Updated last year
- ☆19May 27, 2025Updated 10 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- Pytorch implementation of various token mixers; Attention Mechanisms, MLP, and etc for understanding computer vision papers and other tas…☆16Mar 11, 2026Updated 2 weeks ago
- ☆23Feb 3, 2026Updated last month
- ☆14Oct 6, 2025Updated 5 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Nov 10, 2024Updated last year
- Code Release for the 2023 NeurIPS Paper How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained langua…☆17Dec 6, 2024Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- ☆16Apr 14, 2021Updated 4 years ago
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆18Mar 17, 2026Updated last week
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- Code for "Domain Adaptive Meta-learning for Dialogue State Tracking"(TASLP2021)☆10Sep 14, 2021Updated 4 years ago
- Code for "Exploiting reverse target-side contexts for neural machine translation via asynchronous bidirectional decoding" (Artificial Int…☆11Dec 27, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last week
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 9 months ago
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆18Jun 6, 2024Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆16May 16, 2025Updated 10 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- CosyVoice语音合成简易API☆14Nov 1, 2024Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 个人小主页https://twistedw.github.io☆11Aug 23, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Dec 24, 2024Updated last year
- ICDCS 2021, "StripeMerge: Efficient Wide-Stripe Generation for Large-Scale Erasure-Coded Storage"☆13Jul 19, 2021Updated 4 years ago
- Creative Adversarial Network-pytorch☆11Jun 30, 2018Updated 7 years ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- Unofficial Implementation of Selective Attention Transformer☆21Oct 31, 2024Updated last year
- Fast key-value DB in Go.☆17Feb 22, 2022Updated 4 years ago
- ☆12Jul 11, 2024Updated last year