☆18Apr 16, 2025Updated 10 months ago
Alternatives and similar repositories for default-moe
Users that are interested in default-moe are comparing it to the libraries listed below
Sorting:
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- Robust Contrastive Learning Using Negative Samples with Diminished Semantics (NeurIPS 2021)☆39Dec 6, 2021Updated 4 years ago
- Promptopia is an open-source AI prompting tool for modern world to discover, create, and share creative prompts☆12May 27, 2023Updated 2 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- ☆16Feb 22, 2025Updated last year
- ☆14Mar 20, 2025Updated 11 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Text-to-video generation.☆10Jul 22, 2022Updated 3 years ago
- Implementation of Implicit Graphon Neural Representation☆12Sep 1, 2023Updated 2 years ago
- ☆11Nov 23, 2020Updated 5 years ago
- ☆16Jun 30, 2025Updated 8 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze☆11Nov 16, 2021Updated 4 years ago
- CenterMask2 on detectron2 (open images)☆10May 28, 2020Updated 5 years ago
- A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- ☆11Jan 16, 2025Updated last year
- ☆15Oct 23, 2023Updated 2 years ago
- ☆12Dec 9, 2025Updated 2 months ago
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- ☆29Nov 19, 2025Updated 3 months ago
- 3D Scene Flow Estimation☆14Sep 24, 2025Updated 5 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆21Feb 19, 2026Updated 2 weeks ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- ☆47Apr 29, 2024Updated last year
- 📈Factory for creating your own continuous tokens with bonding curve. Create your own ERC20 compatible token, the price of which will va…☆11May 7, 2023Updated 2 years ago
- OneFlow Diffusers Web UI☆11Apr 11, 2023Updated 2 years ago
- ☆15Jul 13, 2025Updated 7 months ago
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- Easily compute model embeddings and save the embeddings.☆10Dec 10, 2022Updated 3 years ago
- ROS node for triggering cameras using GPIO on Jetson (targeting ROSCubeX, but easily adaptable to other platforms)☆12Feb 23, 2026Updated last week
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 5 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated 11 months ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 9 months ago
- ☆12Apr 24, 2024Updated last year
- Extended Annotations of DeepFashion Images for Fine-grained Recognition☆14May 28, 2019Updated 6 years ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- a autodl environment for native finetune stable diffusion.☆11Dec 7, 2022Updated 3 years ago