☆48Feb 10, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-gsm8k
Users that are interested in deepseek-r1-gsm8k are comparing it to the libraries listed below
Sorting:
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆23Apr 6, 2025Updated 11 months ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12May 24, 2022Updated 3 years ago
- ☆45Nov 20, 2025Updated 4 months ago
- pytorch版unilm模型☆27Jun 19, 2021Updated 4 years ago
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 10 months ago
- This repository is aim to reproduce the R1-Zero on medical domain.☆32Jun 11, 2025Updated 9 months ago
- ☆38Feb 16, 2024Updated 2 years ago
- ☆27Sep 15, 2025Updated 6 months ago
- Contrastive self-supervised learning using Rényi divergence☆14Oct 21, 2022Updated 3 years ago
- ☆25Dec 13, 2024Updated last year
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆68Dec 8, 2025Updated 3 months ago
- minimal-cost for training 0.5B R1-Zero☆813May 14, 2025Updated 10 months ago
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 9 months ago
- [EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models☆17Oct 30, 2023Updated 2 years ago
- ☆25Aug 2, 2025Updated 7 months ago
- ☆46Jul 1, 2025Updated 8 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,441Mar 20, 2025Updated last year
- ☆10Oct 15, 2020Updated 5 years ago
- Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
- 本项目展示了2022年部分信息检索/数据挖掘顶会论文分类。☆17Jun 13, 2022Updated 3 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern☆20Jan 20, 2023Updated 3 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- Collecting personality-indicative data for role-playing agents.☆24Feb 18, 2025Updated last year
- A SPMI Lab toolkit for language models.☆11Apr 12, 2017Updated 8 years ago
- ☆12May 18, 2024Updated last year
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆10Jul 27, 2024Updated last year
- ☆13Jun 4, 2023Updated 2 years ago
- Win + D for One Monitor (Show Desktop only for One Monitor)☆10Dec 15, 2022Updated 3 years ago
- ☆29Nov 27, 2025Updated 3 months ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆12Oct 22, 2022Updated 3 years ago
- CogCompTime☆11Apr 19, 2022Updated 3 years ago
- ☆12Jun 30, 2024Updated last year
- 数据库内核笔记☆13Aug 18, 2022Updated 3 years ago