Ekoda / SoftMoE
A Soft Mixture-of-Experts Vision Transformer, addressing the MoE limitations highlighted by Puigcerver et al. (2023).
☆15 · Updated 2 years ago
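For orientation, the core idea behind Soft MoE (Puigcerver et al., 2023) is that every slot receives a soft, learned convex combination of *all* input tokens, so no tokens are dropped and no discrete routing is needed. The sketch below is a minimal, illustrative PyTorch version under our own assumptions (class name, MLP expert shape); it is not the code from this repository.

```python
import torch
import torch.nn as nn

class SoftMoE(nn.Module):
    """Minimal Soft MoE layer (sketch after Puigcerver et al., 2023).

    Each of the `num_experts * slots_per_expert` slots is a convex
    combination of all input tokens (soft dispatch); expert outputs are
    then softly combined back per token (soft combine).
    """

    def __init__(self, dim, num_experts=4, slots_per_expert=1):
        super().__init__()
        self.num_experts = num_experts
        n_slots = num_experts * slots_per_expert
        # One learnable routing vector per slot.
        self.phi = nn.Parameter(torch.randn(dim, n_slots) * dim ** -0.5)
        # Experts as plain MLPs here; a ViT would use its MLP block.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, dim * 2), nn.GELU(),
                          nn.Linear(dim * 2, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):                      # x: (batch, tokens, dim)
        logits = x @ self.phi                  # (b, n_tokens, n_slots)
        dispatch = logits.softmax(dim=1)       # normalize over tokens
        combine = logits.softmax(dim=-1)       # normalize over slots
        slots = dispatch.transpose(1, 2) @ x   # (b, n_slots, dim)
        # Each expert processes its own contiguous group of slots.
        groups = slots.chunk(self.num_experts, dim=1)
        outs = torch.cat([f(s) for f, s in zip(self.experts, groups)],
                         dim=1)                # (b, n_slots, dim)
        return combine @ outs                  # (b, n_tokens, dim)
```

Because dispatch and combine are both dense softmaxes, the layer is fully differentiable and avoids the load-balancing losses and token dropping of top-k routed MoEs.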
Alternatives and similar repositories for SoftMoE
Users interested in SoftMoE are comparing it with the repositories listed below.
- Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at NeurIPS 2023 (Main confer… ☆27 · Updated last year
- ☆93 · Updated 2 years ago
- The official repo for [TPAMI'23] "Vision Transformer with Quadrangle Attention" ☆234 · Updated 4 months ago
- Awesome List of Vision Language Prompt Papers ☆46 · Updated 2 years ago
- [CVPR 2024] Official implementation of CLIP-KD: An Empirical Study of CLIP Model Distillation ☆143 · Updated 5 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning" ☆86 · Updated last year
- [NeurIPS 2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model ☆89 · Updated 2 years ago
- ImageNet-1K data download and processing for use as a dataset ☆125 · Updated 3 years ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models" ☆145 · Updated last year
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs ☆98 · Updated last year
- [CVPR 2024] Official implementation of the paper "DePT: Decoupled Prompt Tuning" ☆109 · Updated 2 months ago
- [AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training ☆106 · Updated 2 years ago
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024) ☆72 · Updated 2 years ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context ☆173 · Updated last year
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition" ☆259 · Updated last year
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining ☆107 · Updated 9 months ago
- ☆124 · Updated last year
- ☆92 · Updated 2 years ago
- ☆267 · Updated 3 years ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models ☆65 · Updated last year
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023] ☆22 · Updated 2 years ago
- PyTorch implementation of LIMoE ☆52 · Updated last year
- The official implementation of the paper "Inter-Instance Similarity Modeling for Contrastive Learning" ☆117 · Updated last year
- [NeurIPS 2024] Dense Connector for MLLMs ☆180 · Updated last year
- Two-way Multi-Label Loss ☆35 · Updated 2 years ago
- [CVPR 2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts ☆336 · Updated last year
- [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C… ☆278 · Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands of Vision Task Types ☆33 · Updated 6 months ago
- [ICCV 2023] CLR: Channel-wise Lightweight Reprogramming for Continual Learning ☆33 · Updated last year
- [CVPR 2024] Code for the paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" ☆269 · Updated 4 months ago