☆134Feb 17, 2025Updated last year
Alternatives and similar repositories for DeepSeek-MoE-ResourceMap
Users that are interested in DeepSeek-MoE-ResourceMap are comparing it to the libraries listed below.
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- ☆52Feb 5, 2025Updated last year
- Paper list of federated learning: About system design☆13Apr 13, 2022Updated 4 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
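- The goal of this series is to let readers understand and implement every component of a large language model from scratch in plain PyTorch, without relying on any other off-the-shelf external libraries, along with the training and fine-tuning code. Readers therefore need only Python, PyTorch, and basic deep learning background.☆387Aug 28, 2025Updated 8 months ago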
- ☆46Jun 10, 2025Updated 10 months ago
- The source will be uploaded soon☆14Aug 3, 2020Updated 5 years ago
- Building DeepSeek R1 from Scratch☆753Mar 21, 2025Updated last year
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- ☆50Jun 7, 2025Updated 11 months ago
- ☆16Sep 12, 2023Updated 2 years ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆280Apr 28, 2026Updated last week
- SPGCL: Mining Spatio-Temporal Relations via Self-Paced Graph Contrastive Learning☆14Feb 16, 2023Updated 3 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Nov 30, 2023Updated 2 years ago
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks☆12Aug 12, 2025Updated 8 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆30Feb 4, 2026Updated 3 months ago
- Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"☆11Jul 25, 2019Updated 6 years ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- This project aims to generate syntactic handwritten mathematical expressions. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Crawls technical articles from Chinese tech sites☆33Dec 24, 2017Updated 8 years ago
- Visualization tool for designing mesh Network-on-Chips (NoC) and assisting with architecture research☆17Jan 21, 2024Updated 2 years ago
- As the name suggests: a hand-built RAG☆134Feb 27, 2024Updated 2 years ago
- [NeurIPS 2019] E2-Train: Training State-of-the-art CNNs with Over 80% Less Energy☆21Nov 18, 2019Updated 6 years ago
- ☆15Mar 21, 2025Updated last year
- Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding"[NeurIPS 2025]☆18May 12, 2025Updated 11 months ago
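- Implements ChatGPT's streaming response protocol, Server-sent events, using the asynchronous non-blocking framework Tornado 6.0 on Python 3.10 and the Vue.js 3 front-end framework☆23Mar 7, 2023Updated 3 years ago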
- ☆27Apr 14, 2025Updated last year
- Simulator for LLM inference on an abstract 3D AIMC-based accelerator☆28Sep 18, 2025Updated 7 months ago
- Deep Learning experiments of UCAS☆18Jun 25, 2019Updated 6 years ago
- Solutions of Kaggle Competition☆15Jan 28, 2018Updated 8 years ago
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated last year
- PyTorch-based text classification on imbalanced data☆12Dec 26, 2021Updated 4 years ago
- A toy implementation of a Program Dependence Graph using LLVM☆13Sep 27, 2023Updated 2 years ago
- Multinomial Factorization Machines☆21Oct 17, 2016Updated 9 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆67Feb 10, 2026Updated 2 months ago
- ☆24Jun 30, 2025Updated 10 months ago
- Implementation of the IVMM map matching method.☆17Dec 7, 2020Updated 5 years ago