☆133Feb 17, 2025Updated last year
Alternatives and similar repositories for DeepSeek-MoE-ResourceMap
Users who are interested in DeepSeek-MoE-ResourceMap are comparing it to the libraries listed below.
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆16Jul 19, 2024Updated last year
- ☆52Feb 5, 2025Updated last year
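- 🍏 Repo prepared specifically for the 2024 InternLM (书生·浦语) Large Model Challenge (Spring Season) 🍎 Contains fine-tuning source code related to Holo (赫萝)☆12Sep 20, 2024Updated last year
- This series aims to let readers understand and implement, from scratch, every component of a large language model, along with the training and fine-tuning code, using only basic PyTorch and no other off-the-shelf external libraries. Readers need only Python, PyTorch, and the most basic deep learning background.☆386Aug 28, 2025Updated 10 months ago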
- ☆45Jun 10, 2025Updated last year
- Build complex LLM Applications with Python Dictionary☆40Oct 10, 2024Updated last year
- Urban2Vec: Incorporating Street View Imagery and POIs for Multi-Modal Urban Neighborhood Embedding☆21Aug 21, 2023Updated 2 years ago
- Building DeepSeek R1 from Scratch☆778Mar 21, 2025Updated last year
- MLLM @ Game☆17May 12, 2025Updated last year
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated last year
- ☆50Jun 7, 2025Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆283May 8, 2026Updated last month
- SPGCL: Mining Spatio-Temporal Relations via Self-Paced Graph Contrastive Learning☆14Feb 16, 2023Updated 3 years ago
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks☆12Aug 12, 2025Updated 10 months ago
- ☆190Mar 13, 2026Updated 3 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆30Feb 4, 2026Updated 4 months ago
- Our 2nd-gen LMM☆34May 22, 2024Updated 2 years ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- Chisel3 AXI4-{Lite, Full, Stream} Definitions☆15Dec 31, 2018Updated 7 years ago
- Emotion analysis on DREAMER dataset using various Deep Learning Techniques☆13Jan 1, 2021Updated 5 years ago
- A crawler for technical articles from Chinese tech sites☆33Dec 24, 2017Updated 8 years ago
- Visualization tool for designing mesh Network-on-Chips (NoC) and assisting with architecture research☆17Jan 21, 2024Updated 2 years ago
- As the name suggests: a hand-built RAG☆134Feb 27, 2024Updated 2 years ago
- [NeurIPS 2019] E2-Train: Training State-of-the-art CNNs with Over 80% Less Energy☆21Nov 18, 2019Updated 6 years ago
- ☆15Mar 21, 2025Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
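- Implements ChatGPT's streaming response protocol, Server-sent events, using the Python 3.10 asynchronous non-blocking framework Tornado 6.0 and the front-end framework Vue.js 3☆23Mar 7, 2023Updated 3 years ago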
- ☆28Apr 14, 2025Updated last year
- A compiled list of country calling codes, with data sourced from the Taobao registration page☆14Apr 10, 2019Updated 7 years ago
- A simple AXI4 DMA unit written in SpinalHDL.☆18Apr 18, 2020Updated 6 years ago
- Deep Learning experiments of UCAS☆18Jun 25, 2019Updated 7 years ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆17Jun 3, 2024Updated 2 years ago
- [Ongoing Project] Codebase for network quantization study.☆12May 20, 2020Updated 6 years ago
- An implementation of LLMzip using GPT-2☆14Aug 7, 2023Updated 2 years ago
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- ☆28Dec 2, 2024Updated last year
- Microservice-based application development using Spring Boot, Kafka, Redis, MySQL, MongoDB, Elasticsearch, Docker and Kubernetes.☆12Mar 17, 2024Updated 2 years ago
- Simulator for LLM inference on an abstract 3D AIMC-based accelerator☆33Sep 18, 2025Updated 9 months ago
- PyTorch-based text classification for imbalanced data☆12Dec 26, 2021Updated 4 years ago