serp-ai/Parameter-Efficient-MoE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/serp-ai/Parameter-Efficient-MoE)

serp-ai / Parameter-Efficient-MoE

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

☆31

Alternatives and similar repositories for Parameter-Efficient-MoE

Users that are interested in Parameter-Efficient-MoE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wuhy68 / Parameter-Efficient-MoE
View on GitHub
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆145Sep 20, 2024Updated last year
enjalot / latent-data-modal
View on GitHub
Using modal.com to process FineWeb-edu data
☆20Apr 11, 2026Updated 3 months ago
QuixiAI / kraken
View on GitHub
☆69May 26, 2024Updated 2 years ago
p3nGu1nZz / Tau
View on GitHub
Tau LLM made with Unity 6 ML Agents
☆18Apr 24, 2025Updated last year
offskiies / KB_builder
View on GitHub
Build your own custom knowledge base from various sources such as youtube videos transcripts, tweets, articles, videos and audios. Uses G…
☆13Dec 15, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆215Aug 8, 2024Updated last year
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
realsigridjin / eggroll-embedding-trainer
View on GitHub
A PyTorch implementation of gradient-free optimization for directly optimizing NDCG (Normalized Discounted Cumulative Gain) in neural inf…
☆19Dec 21, 2025Updated 7 months ago
embedl / embedl-models
View on GitHub
⛔ DEPRECATED -- use flash-head instead (pip install flash-head)
☆29Apr 10, 2026Updated 3 months ago
janhq / space-thinker
View on GitHub
☆21Mar 25, 2025Updated last year
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
TeamTonic / adapt-a-rag
View on GitHub
a RAG retrieval application that adapts to its specific user and topic , so that it's purpose built everytime.
☆16Mar 18, 2024Updated 2 years ago
EndlessReform / bark-with-voice-clone
View on GitHub
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
☆21May 17, 2023Updated 3 years ago
daniel-ilett / shaders-octocamo
View on GitHub
A recreation of Metal Gear Solid 4's Octocamo mechanic in Shader Graph & URP.
☆14Jan 14, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
SlerpE / highCompute.py
View on GitHub
☆27Jun 11, 2025Updated last year
youraichat / youraichatbot
View on GitHub
YourAICHAT
☆13Aug 16, 2023Updated 2 years ago
resloved / RWKV-notebooks
View on GitHub
📖 — Notebooks related to RWKV
☆58May 13, 2023Updated 3 years ago
nvidia-china-sae / WholeGraph
View on GitHub
☆11Mar 4, 2021Updated 5 years ago
joeellis / romajinizer
View on GitHub
A gem for converting between hiragana, katakana, and romaji alphabets for the Japanese language
☆15Jan 18, 2023Updated 3 years ago
Drew00785 / SDXL-Dynamic-Image-Generator
View on GitHub
☆26Mar 25, 2025Updated last year
dimtion / copiepate
View on GitHub
Send copy-pasting events over the network
☆13Mar 25, 2023Updated 3 years ago
zubayerhimel / kanban-with-tailwindcss
View on GitHub
Kanban board made with TailwindCSS
☆11Jun 10, 2021Updated 5 years ago
U-C4N / Deepseek-CoT
View on GitHub
Deepseek-CoT
☆10Oct 6, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
somosnlp / recursos
View on GitHub
Diapositivas, notebooks y material de charlas, talleres y el grupo de estudio
☆12Apr 24, 2024Updated 2 years ago
Arunprakaash / openvoice.streaming.server
View on GitHub
FastAPI WebSocket server for the OpenVoice text-to-speech model.
☆12Jun 6, 2024Updated 2 years ago
daniel-ilett / shaders-stealth-vision
View on GitHub
A shader project which highlights objects in certain layers with a glowing 'stealth vision' effect.
☆20Apr 6, 2023Updated 3 years ago
cg123 / bitnet
View on GitHub
Modeling code for a BitNet b1.58 Llama-style model.
☆25Apr 30, 2024Updated 2 years ago
Qualcomm-AI-research / clockwork-diffusion
View on GitHub
☆14Feb 20, 2024Updated 2 years ago
GOB52 / M5StackCoreS3_CameraWebServer
View on GitHub
Porting of espressif/arduino-esp32 example to M5Stack CoreS3 (GC0308)
☆11Nov 30, 2023Updated 2 years ago
jcottaar / seismic
View on GitHub
Jeroen Cottaar's work for the Kaggle Geophysical Waveform Inversion competition (2nd place)
☆13Aug 11, 2025Updated 11 months ago
ElleLeonne / Lightning-ReLoRA
View on GitHub
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆34Mar 2, 2024Updated 2 years ago
camenduru / MotionDirector-colab
View on GitHub
☆16Dec 11, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
abacaj / openhermes-function-calling
View on GitHub
☆133Nov 24, 2023Updated 2 years ago
rmihaylov / mpttune
View on GitHub
Tune MPTs
☆84Jun 17, 2023Updated 3 years ago
l4b4r4b4b4 / AIDocks
View on GitHub
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Feb 18, 2024Updated 2 years ago
rpgoldman / pddl-tools
View on GitHub
Common lisp library for manipulating PDDL expressions.
☆19Jun 19, 2026Updated last month
marepilc / pink-parquet
View on GitHub
User-friendly viewer for Parquet files
☆16May 8, 2026Updated 2 months ago
bixb922 / viper-examples
View on GitHub
MicroPython viper documentation and examples
☆16Apr 19, 2024Updated 2 years ago
minosvasilias / simple_grpo
View on GitHub
Simple GRPO scripts and configurations.
☆59Feb 6, 2025Updated last year