qiuzh20 / RMoEView external linksLinks
Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)
☆29Aug 4, 2024Updated last year
Alternatives and similar repositories for RMoE
Users that are interested in RMoE are comparing it to the libraries listed below
Sorting:
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 10 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆19Nov 3, 2024Updated last year
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 10 months ago
- ☆20May 28, 2025Updated 8 months ago
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆34Mar 6, 2025Updated 11 months ago
- ☆29May 24, 2024Updated last year
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- The GitHub repository for the paper "Denoising Application of Magnetotelluric Low-Frequency Signal Processing"☆11Feb 22, 2023Updated 2 years ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- ☆91Aug 18, 2024Updated last year
- [TMLR 2024 J2C Certification] Generalizing Denoising to Non-Equilibrium Structures Improves Equivariant Force Fields☆40Feb 11, 2025Updated last year
- GPS software using open street maps. Draw tracks, waypoints. Can find actual position.☆11Jun 1, 2011Updated 14 years ago
- ☆11Mar 18, 2025Updated 10 months ago
- ☆11Jul 17, 2023Updated 2 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 3 years ago
- A Maximal Mutual Information Criterion for Manipulation Concept Discovery☆13Sep 26, 2024Updated last year
- ☆10Jul 2, 2024Updated last year
- This is a python toolkit and developer version package to estimate multidimensional aspects of greenness and nature exposure, such as ava…☆12Aug 27, 2023Updated 2 years ago
- [KDD Explore'24]Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated 9 months ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 3 months ago
- GBM implementation on Legate☆14Jan 28, 2026Updated 2 weeks ago
- GPT prepping certificates of translation☆11Jan 27, 2024Updated 2 years ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆14Apr 14, 2025Updated 10 months ago
- EA-HAS-Bench: Energy-Aware Hyperparameter and Architecture Search Benchmark (ICLR Spotlight 2023)☆18Dec 8, 2024Updated last year
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX.☆17Dec 19, 2022Updated 3 years ago
- Super Resolution Gaming Dataset☆11Jan 5, 2025Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Aug 6, 2025Updated 6 months ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- KAF : Kolmogorov-Arnold Fourier Networks☆20Feb 19, 2025Updated 11 months ago
- USTC-TD☆12Mar 17, 2025Updated 10 months ago
- ☆15Jul 26, 2022Updated 3 years ago
- ☆11Sep 16, 2024Updated last year