The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks" [ICML, 2023]
☆14Feb 12, 2026Updated 2 months ago
Alternatives and similar repositories for pMoE_CNN
Users that are interested in pMoE_CNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Sep 15, 2022Updated 3 years ago
- ☆12Oct 27, 2018Updated 7 years ago
- ☆20Oct 31, 2022Updated 3 years ago
- ICLR 2022 (Spolight): Continual Learning With Filter Atom Swapping☆16Jul 5, 2023Updated 2 years ago
- [AAAI 2025 Oral] ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks https://arxiv.org/…☆10Jun 25, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Official implementation of CVPR2025 paper "Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network"☆22Oct 31, 2025Updated 5 months ago
- Cpp Socket Programming Tutorial☆10Jun 9, 2021Updated 4 years ago
- Breaking Semantic Artifacts for Generalized AI-generated Image Detection☆22Mar 3, 2026Updated last month
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated last year
- 👍 👍 👍 集成了目前最为成熟的 SpringAI,并支持 Ollama、OpenAI、DeepSeek、阿里百炼 多种 AI 模型。并通过 MCP、Function Calling 与大麦项目进行联动,可以让 AI 智能的帮助用户执行操作。实现 只需聊天就能全部搞定…☆54Mar 10, 2026Updated last month
- Hopenet: deep head pose estimator on ncnn☆10Jun 18, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Project page: https://jespark.net/projects/2024/community_forensics/☆36Jan 6, 2026Updated 3 months ago
- SJTU 中文简约 LaTeX 报告模板☆10Jun 7, 2021Updated 4 years ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- This repo implements an interface to GTAV for SCENIC language.☆11Dec 7, 2019Updated 6 years ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆12Feb 28, 2026Updated last month
- Using DeepBSDE solver to price/hedge options & optimize portfolios under Black-Scholes, Heston and multiscale models.☆18Mar 20, 2020Updated 6 years ago
- Composite grid generator☆11Jul 22, 2017Updated 8 years ago
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated last year
- ☆28Jul 18, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Discovering Differential Equations with Physics-Informed Neural Networks and Symbolic Regression☆11Jul 28, 2023Updated 2 years ago
- ☆16Apr 8, 2026Updated last week
- Conditional convolution (Dynamic convolution) in tensorflow2☆22Aug 25, 2021Updated 4 years ago
- Spatial Mixture-of-Experts☆21Nov 29, 2022Updated 3 years ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆31Jun 1, 2024Updated last year
- ✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.☆14Nov 28, 2021Updated 4 years ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆45Feb 28, 2026Updated last month
- Pytorch implementation for Decomposed Convolutional Filters Network☆23Feb 19, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- 现在是AI时代 Typescript+Vue3 搭建自己的chatgpt聊天机器人! 本项目基于 SpringAI 框架开发的聊天机器人,旨在为用户提供智能对话功能。该机器人采用自然语言处理技术,能够理解并回应用户的提问,适用于在线客服、信息查询、娱乐互动等多种场景。☆23Jul 8, 2025Updated 9 months ago
- generative neural network trained with physics knowledge☆14Mar 8, 2021Updated 5 years ago
- ☆17Aug 13, 2024Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- ☆41Sep 13, 2025Updated 7 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year