Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
☆39Sep 19, 2025Updated 8 months ago
Alternatives and similar repositories for FLAME-MoE
Users that are interested in FLAME-MoE are comparing it to the libraries listed below.
- [ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time".☆13May 16, 2025Updated last year
- ☆26May 26, 2024Updated 2 years ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆22Jan 16, 2025Updated last year
- LOLA: Large and Open Source Multilingual Language Model☆11Apr 8, 2026Updated last month
- ☆21Feb 5, 2024Updated 2 years ago
- ☆19Jan 3, 2025Updated last year
- A minimal proof-of-concept for a Vite backend integration with Flask.☆18Sep 11, 2024Updated last year
- ☆11Jul 21, 2024Updated last year
- ReportParse is a unified NLP analyzer for corporate sustainability reports☆21Sep 18, 2024Updated last year
- ☆11Jan 21, 2021Updated 5 years ago
- DCPO: Dynamic Adaptive Clipping for RL☆49Apr 1, 2026Updated last month
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…☆14Sep 22, 2024Updated last year
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Sep 21, 2022Updated 3 years ago
- Crawl & Visualize NeurIPS 2022 Data from OpenReview☆14Nov 8, 2022Updated 3 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- An Apache 2.0 fork of HuggingFace's Large Language Model Text Generation Inference☆19Mar 10, 2024Updated 2 years ago
- Utility programs to pipe data across a RDMA-capable network☆19Mar 14, 2026Updated 2 months ago
- ☆14Dec 21, 2024Updated last year
- ☆23Jan 5, 2025Updated last year
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP☆11Jan 29, 2024Updated 2 years ago
- Download ebooks from the Project Gutenberg☆14Dec 30, 2024Updated last year
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆101Mar 14, 2026Updated 2 months ago
- Implementation of Centered Kernel Alignment (CKA)☆10Apr 7, 2021Updated 5 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.☆15Nov 15, 2022Updated 3 years ago
- Understanding Rare Spurious Correlations in Neural Network☆12Jun 5, 2022Updated 3 years ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated 4 months ago
- Locally Valid and Discriminative Prediction Intervals for Deep Learning Models☆13May 22, 2023Updated 3 years ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- Code for studying the super weight in LLM☆124Dec 3, 2024Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆15Apr 26, 2025Updated last year
- Assignments of CSCE-642: Deep Reinforcement Learning offered at Texas A&M University.☆10Aug 31, 2025Updated 8 months ago
- ☆31Apr 14, 2023Updated 3 years ago
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- ☆31May 31, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- A simple interface to the Project Gutenberg corpus.☆17Dec 23, 2015Updated 10 years ago
- (TMLR J2C Certification) Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tu…☆27Oct 4, 2025Updated 7 months ago