根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具
☆15Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for qwen2_moe_mergekit
Users that are interested in qwen2_moe_mergekit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ipad最新协议,微商工具,私域管理,群管理,淘客机器人,视频号☆13Mar 31, 2026Updated 2 months ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- qwen2 and llama3 cpp implementation☆50Jun 7, 2024Updated 2 years ago
- ☆29Aug 14, 2023Updated 2 years ago
- text classification compitioin top 10 strategy☆18Aug 14, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Oct 11, 2023Updated 2 years ago
- Official implementation of (ICML 2026) Training-Free Vector Quantization via Gaussian VAEs☆23Jan 3, 2026Updated 5 months ago
- Neural image compression models optimized for Mask R-CNN from paper "Boosting Neural Image Compression for Machines Using Latent Space Ma…☆10Aug 16, 2022Updated 3 years ago
- [KDD 2026 ADS Track] Pytorch implementation of the paper "Hi-Guard: Towards Trustworthy Multimodal Moderation via Policy-Aligned Reasonin…☆23Jan 13, 2026Updated 5 months ago
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆12Sep 22, 2023Updated 2 years ago
- Un-official implementation of the Transformer Index for GEnerative Recommenders (TIGER) framework.☆13Jun 6, 2023Updated 3 years ago
- meta-comprehensive-rag-benchmark-kdd-cup-2024 phase1 task1 rank3☆20Jun 21, 2024Updated last year
- ☆10Jul 23, 2021Updated 4 years ago
- ☆17Apr 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for paper "Self-training Method Based on GCN for Semi-supervised Short Text Classification"☆11Oct 30, 2021Updated 4 years ago
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient☆67Aug 3, 2025Updated 10 months ago
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- ☆20Dec 8, 2024Updated last year
- Daily paper reading records☆15Mar 31, 2025Updated last year
- ☆20Jan 10, 2025Updated last year
- [CVPR'26] AdapTok: Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space☆27Mar 15, 2026Updated 3 months ago
- This repository contains the code for 4th place solution for approach to RecSys Challenge 2020.☆18Sep 26, 2020Updated 5 years ago
- 企业微信SDK, Python企业微信机器人SDK, Python企业微信WebApi接口☆33Apr 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [SIGKDD 2023] HardSATGEN: Understanding the Difficulty of Hard SAT Formula Generation and A Strong Structure-Hardness-Aware Baseline☆22Jun 16, 2023Updated 3 years ago
- Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation (CVPR 2022)☆17Mar 19, 2022Updated 4 years ago
- ☆15Mar 5, 2024Updated 2 years ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- ☆32Jul 8, 2024Updated last year
- 软工实训,人脸识别+活体检测,极限一带四☆16Jan 5, 2021Updated 5 years ago
- 知识图谱从入门到精通☆33Nov 27, 2020Updated 5 years ago
- fast_faceswap use dlib and change_style_network(基于dlib和风格迁移网络的快速换脸)☆11Jul 18, 2019Updated 6 years ago
- ☆56Nov 6, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Janus NDI Plugin☆14Nov 2, 2025Updated 7 months ago
- Official Pytorch implementation of the TGRS paper "MAGE: Multisource Attention Network with Discriminative Graph and Informative Entities…☆17Sep 28, 2022Updated 3 years ago
- [NAACL 2025 Main Conference] PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization☆27Mar 29, 2025Updated last year
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- This repository shows a demo of real-time Digital Makeup for a face. It can transference the hair style, foundation make-up, eyelash, lip…☆13Jul 15, 2018Updated 7 years ago
- ☆41Nov 18, 2021Updated 4 years ago
- Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)☆24Jan 14, 2025Updated last year