Mryangkaitong/deepseek-r1-gsm8k

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mryangkaitong/deepseek-r1-gsm8k)

Mryangkaitong / deepseek-r1-gsm8k

☆49

Alternatives and similar repositories for deepseek-r1-gsm8k

Users that are interested in deepseek-r1-gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LirongWu / Uni-Mol3
View on GitHub
A PyTorch implementation of Uni-Mol3.
☆24Mar 24, 2026Updated 4 months ago
shibing624 / text2vec-service
View on GitHub
Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务，支持GPU多卡、多worker、多客户端调用，开箱即用。
☆12May 24, 2022Updated 4 years ago
kyungmnlee / RenyiCL
View on GitHub
Contrastive self-supervised learning using Rényi divergence
☆14Oct 21, 2022Updated 3 years ago
zhaoyingjun / Tiny-R2
View on GitHub
Tiny-R2: A hybrid architecture integrating SWA, CSA, HCA, mHC, and DSMoE under the DeepSeek V4 design paradigm, enabling single-GPU OPD p…
☆46May 30, 2026Updated last month
qzp2018 / UniECS
View on GitHub
Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》
☆21Sep 17, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Baixuzx7 / ZeroSharpen
View on GitHub
☆14Jun 19, 2024Updated 2 years ago
yuking / RecommendationSystem
View on GitHub
基于Pytorch 框架复现的推荐系统的经典模型
☆24Sep 24, 2019Updated 6 years ago
Qsingle / open-medical-r1
View on GitHub
This repository is aim to reproduce the R1-Zero on medical domain.
☆32Jun 11, 2025Updated last year
RenzeLou / Muffin
View on GitHub
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
☆16Oct 31, 2024Updated last year
dhcode-cpp / X-R1
View on GitHub
minimal-cost for training 0.5B R1-Zero
☆816May 14, 2025Updated last year
Jaiy / Ground-aware-Seg
View on GitHub
Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. ACM Multimedia 2019.
☆12Sep 19, 2019Updated 6 years ago
KongLongGeFDU / TransferTOD
View on GitHub
The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"
☆20May 12, 2026Updated 2 months ago
alecwangcq / f-divergence-dpo
View on GitHub
Direct preference optimization with f-divergences.
☆17Nov 3, 2024Updated last year
UITron-hub / UItron
View on GitHub
☆67Sep 6, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Unakar / Logic-RL
View on GitHub
Reproduce R1 Zero on Logic Puzzle
☆2,452Mar 20, 2025Updated last year
DongHande / IR-DM-RS-Paper-Classification
View on GitHub
本项目展示了2022年部分信息检索/数据挖掘顶会论文分类。
☆17Jun 13, 2022Updated 4 years ago
xypan0 / G-DIG
View on GitHub
☆12Jun 30, 2024Updated 2 years ago
kunzhan / BS-Mamba
View on GitHub
BS-Mamba for Black-Soil Area Detection on the Qinghai-Tibetan Plateau
☆12Apr 12, 2025Updated last year
memodb-io / skill-memory
View on GitHub
agent skill as memory layer
☆20Jan 30, 2026Updated 5 months ago
KelleyYin / XLM-Plus
View on GitHub
☆10Oct 15, 2020Updated 5 years ago
injadlu / DAMA
View on GitHub
[ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"
☆16May 24, 2025Updated last year
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
Euphoria16 / UI-Genie
View on GitHub
[NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents
☆60Nov 27, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PrOF-kk / ShowDesktopOneMonitor
View on GitHub
Win + D for One Monitor (Show Desktop only for One Monitor)
☆10Dec 15, 2022Updated 3 years ago
THU-KEG / Crab
View on GitHub
[CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models
☆18May 23, 2025Updated last year
bqw18744018044 / Concise_SimCSE
View on GitHub
A concise implementation of SimCSE
☆16Aug 2, 2021Updated 4 years ago
GasolSun36 / GRACE
View on GitHub
[ICLR 2025] Official repo for paper: "GRACE: Generative Representation Learning via Contrastive Policy Optimization"
☆39Feb 3, 2026Updated 5 months ago
aladinD / SafeMERGE
View on GitHub
Code for SafeMERGE (ICLR 2025).
☆15Apr 1, 2025Updated last year
Phoenix8215 / build_neural_network_from_scratch_CPP
View on GitHub
Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.
☆11Jul 27, 2024Updated 2 years ago
Bruce-Lee-LY / memory_pool
View on GitHub
Simple and efficient memory pool is implemented with C++11.
☆10Jun 2, 2022Updated 4 years ago
momo-journey / CDial-GPT-NEZHA
View on GitHub
pytorch版基于gpt+nezha的中文多轮Cdial
☆11Oct 22, 2022Updated 3 years ago
utayao / LocalSpecGCN
View on GitHub
Source code for paper "Local Spectral Graph Convolution for Point Set Feature Learning"
☆10Jul 11, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
matrixjoeq / c_container
View on GitHub
C container is a STL-like library which implements generic containers in C language. It also implements most of the algorithms in STL alg…
☆24Aug 27, 2018Updated 7 years ago
kunzhan / VCHN
View on GitHub
TCyb 2023: View-Consistent Heterogeneous Network on Graphs
☆18Aug 21, 2021Updated 4 years ago
yuesong-feng / dbnotes
View on GitHub
数据库内核笔记
☆14Aug 18, 2022Updated 3 years ago
jcyk / AMR-parser
View on GitHub
AMR-parser. Code for EMNLP2019 paper "Core Semantic First: A Top-down Approach for AMR Parsing."
☆11Feb 23, 2020Updated 6 years ago
SabbaghCodes / ImbalancedLearningForSingleCellFoundationModels
View on GitHub
Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…
☆16Dec 8, 2024Updated last year
ChicForX / advdiff_impl
View on GitHub
unformal implementation of advdiffuser
☆17Feb 4, 2024Updated 2 years ago
TiledTensor / TiledBench
View on GitHub
Benchmark tests supporting the TiledCUDA library.
☆19Nov 19, 2024Updated last year