Simple MoE - Day 17 of 365 Days of Repos
☆18Jan 17, 2025Updated last year
Alternatives and similar repositories for simple-moe
Users that are interested in simple-moe are comparing it to the libraries listed below.
- Implementation of Prompt-to-Prompt Image Editing with Cross Attention Control☆16Apr 5, 2023Updated 2 years ago
- Airspy-Utils is a small software collection to help with firmware related operations on Airspy HF+ devices.☆23Mar 25, 2025Updated last year
- ☆16Apr 28, 2023Updated 2 years ago
- Validation of syncmers compared to minimizers☆11May 10, 2025Updated 10 months ago
- DartMinHash: Fast Sketching for Weighted Sets☆12Dec 8, 2025Updated 3 months ago
- LeetCode Solutions GitBook☆12Dec 10, 2018Updated 7 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- RLHF for Video Diffusion Models☆25Jul 30, 2025Updated 7 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- A fast variant calling tool to detect germline and somatic variants☆11Feb 21, 2026Updated last month
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Formalizing Multimedia Recommendation through Multimodal Deep Learning, accepted in ACM Transactions on Recommender Systems.☆19Jul 2, 2024Updated last year
- ☆15Sep 1, 2024Updated last year
- ☆15Dec 8, 2022Updated 3 years ago
- Contrastive Continual Learning with Importance Sampling and Prototype-Instance Relation Distillation☆12Jul 22, 2024Updated last year
- RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures☆12Jun 5, 2024Updated last year
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation☆15Jun 2, 2024Updated last year
- This repo implements a video generation model using Latent Diffusion Transformers (Latte) in PyTorch and provides training and inference cod…☆17Jan 6, 2025Updated last year
- ☆21Apr 15, 2024Updated last year
- Official implementation for "Revisiting Discriminative vs. Generative Classifiers: Theory and Implications".☆14Feb 7, 2023Updated 3 years ago
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- [ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942☆28Jan 26, 2026Updated 2 months ago
- PyTorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- This app allows you to ask questions and get answers regarding your code provided the folder location of your code.☆16Oct 4, 2024Updated last year
- PyTorch implementation of LoG 22 [Oral] -- Transductive Linear Probing: A Novel Framework for Few-Shot Node Classification☆17May 31, 2023Updated 2 years ago
- ☆12Apr 8, 2021Updated 4 years ago
- [ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations☆10Jun 5, 2022Updated 3 years ago
- Machine Learning algorithms implementation in Python from scratch.☆11Feb 10, 2019Updated 7 years ago
- ☆43Feb 20, 2026Updated last month
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- ☆12Jun 5, 2024Updated last year
- ☆14May 7, 2024Updated last year
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- Mirror of☆15Jan 23, 2026Updated 2 months ago
- Instant Linux kernel development environments via Nix devShells☆35Mar 6, 2025Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated last month
- POS Tagger for Bangla language based on Conditional Random Fields☆16Jul 18, 2012Updated 13 years ago