Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models
☆33Sep 19, 2025Updated 5 months ago
Alternatives and similar repositories for FLAME-MoE
Users who are interested in FLAME-MoE are comparing it to the libraries listed below
- Guangzhou front-end cohort 43: Heima Toutiao (黑马头条) mobile project, built with vueCli and vant-ui☆12Jan 5, 2023Updated 3 years ago
- ☆11Jan 21, 2021Updated 5 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- generate spot-it cards☆10Jun 13, 2015Updated 10 years ago
- Tools to cluster visually similar images into groups in an image dataset☆11Jul 29, 2022Updated 3 years ago
- [ACL 2025] Official code for "Learning to Reason from Feedback at Test-Time".☆13May 16, 2025Updated 9 months ago
- Crawl & Visualize NeurIPS 2022 Data from OpenReview☆14Nov 8, 2022Updated 3 years ago
- Code for Massive-scale Decoding for Text Generation using Lattices☆44Jul 29, 2022Updated 3 years ago
- "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files☆11Oct 27, 2024Updated last year
- ICNet in TensorFlow, Real-Time Segmentation☆10Aug 17, 2018Updated 7 years ago
- Soochow University graduate thesis TeX template☆17Feb 27, 2026Updated last week
- Standardizing environment infrastructure with Strands Agents — step, observe, reward.☆41Updated this week
- solutions for advent of code 2018☆17Dec 19, 2018Updated 7 years ago
- This project demonstrates the use of Deep Learning to detect emotion (sad, angry, happy etc) from the images of faces.☆11Feb 14, 2020Updated 6 years ago
- ☆16Apr 28, 2023Updated 2 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xi…☆26Mar 24, 2025Updated 11 months ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆28Nov 13, 2025Updated 3 months ago
- Simple TTF rasterizer☆11Mar 29, 2020Updated 5 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Ideas on how to quickly learn to build command-line tools☆11Feb 26, 2022Updated 4 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- ggwave package for the Swift Package Manager☆15Jan 19, 2023Updated 3 years ago
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…☆16Mar 3, 2023Updated 3 years ago
- Contains Colab notebooks that show cool use cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Apr 22, 2020Updated 5 years ago
- Visual search interface☆11Nov 30, 2021Updated 4 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Unofficial PyTorch Implementation of OpenAI's GPT-3☆13Apr 11, 2022Updated 3 years ago
- Code and data necessary to reproduce heatmaps relating HN Submission time to submission score.☆13Jul 10, 2015Updated 10 years ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated last month
- All the code developed in the "Creating Google Cloud Pub/Sub publishers and subscribers with Spring Cloud GCP" article.☆10May 25, 2023Updated 2 years ago
- PyTorch implementation of RetinaNet with a CUDA-accelerated NMS operation.☆10Jul 8, 2019Updated 6 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- This is a sample project that demonstrates a concrete use case for Python's multithreading.☆11Oct 6, 2020Updated 5 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Code for running forward and backward versions of GPT2☆10Nov 20, 2021Updated 4 years ago
- livecoding talk for oscon 2018☆10Jul 18, 2018Updated 7 years ago