☆134Feb 17, 2025Updated last year
Alternatives and similar repositories for DeepSeek-MoE-ResourceMap
Users that are interested in DeepSeek-MoE-ResourceMap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- 该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。☆386Aug 28, 2025Updated 7 months ago
- [TRETS'23, FPT'20] CHIP-KNN: Configurable and HIgh-Performance K-Nearest Neighbors Accelerator on Cloud FPGAs☆18Apr 9, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The source will be uploaded recently☆14Aug 3, 2020Updated 5 years ago
- Building DeepSeek R1 from Scratch☆752Mar 21, 2025Updated last year
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- ☆50Jun 7, 2025Updated 10 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆279Jan 20, 2026Updated 2 months ago
- SPGCL: Mining Spatio-Temporal Relations via Self-Paced Graph Contrastive Learning☆14Feb 16, 2023Updated 3 years ago
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆45Nov 30, 2023Updated 2 years ago
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks☆12Aug 12, 2025Updated 8 months ago
- ☆191Mar 13, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- Chisel3 AXI4-{Lite, Full, Stream} Definitions☆15Dec 31, 2018Updated 7 years ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- Visualization tool for designing mesh Network-on-Chips (NoC) and assisting with architecture research☆17Jan 21, 2024Updated 2 years ago
- Dynamic graph embedding☆14Aug 8, 2018Updated 7 years ago
- 顾名思义:手搓的RAG☆134Feb 27, 2024Updated 2 years ago
- ☆15Mar 21, 2025Updated last year
- Official Implementation of "GRIFFIN: Effective Token Alignment for Faster Speculative Decoding"[NeurIPS 2025]☆18May 12, 2025Updated 11 months ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 整理的国际电话号码区号(list of country calling codes),数据来源于淘宝网的注册页面☆14Apr 10, 2019Updated 7 years ago
- Simulator for LLM inference on an abstract 3D AIMC-based accelerator☆28Sep 18, 2025Updated 7 months ago
- Deeptoai 系列 RAG 教程☆100Oct 29, 2025Updated 5 months ago
- Deep Learning experiments of UCAS☆18Jun 25, 2019Updated 6 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- Microservice-based application development using Spring Boot, Kafka, Redis, MySQL, MongoDB, Elasticsearch, Docker and Kubernetes.☆11Mar 17, 2024Updated 2 years ago
- ☆28Dec 2, 2024Updated last year
- ☆14Jan 4, 2017Updated 9 years ago
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A toy implementation about Program Dependence Graph using LLVM☆13Sep 27, 2023Updated 2 years ago
- Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding☆65Feb 10, 2026Updated 2 months ago
- Implement of IVMM map matching method.☆17Dec 7, 2020Updated 5 years ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆56Updated this week
- Agently Stage - Efficient Convenient Asynchronous & Multithreaded Programming☆13Apr 2, 2025Updated last year
- 中华药典RAG项目☆10Oct 26, 2024Updated last year
- Urban Region Representation Learning with Attentive Fusion (ICDE 2024)☆20Feb 9, 2025Updated last year