☆48Feb 10, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-gsm8k
Users that are interested in deepseek-r1-gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆22Nov 22, 2025Updated 6 months ago
- ☆14Jun 19, 2024Updated last year
- ☆53Apr 17, 2026Updated last month
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆18May 23, 2025Updated 11 months ago
- This repository is aim to reproduce the R1-Zero on medical domain.☆32Jun 11, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- Contrastive self-supervised learning using Rényi divergence☆14Oct 21, 2022Updated 3 years ago
- ☆25Dec 13, 2024Updated last year
- minimal-cost for training 0.5B R1-Zero☆816May 14, 2025Updated last year
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated 11 months ago
- [EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models☆17Oct 30, 2023Updated 2 years ago
- ☆22Sep 23, 2025Updated 7 months ago
- ☆26Aug 2, 2025Updated 9 months ago
- Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. ACM Multimedia 2019.☆12Sep 19, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆47Jul 1, 2025Updated 10 months ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"☆20May 12, 2026Updated last week
- This is the data and code for the paper: Evaluating the Efficacy of Supervised Learning vs. Large Language Models for Identifying Cogniti…☆16Aug 3, 2025Updated 9 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,450Mar 20, 2025Updated last year
- ☆11Feb 26, 2026Updated 2 months ago
- Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
- 本项目展示了2022年部分信息检索/数据挖掘顶会论文分类。☆17Jun 13, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern☆21Jan 20, 2023Updated 3 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 4 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- ☆13May 18, 2024Updated 2 years ago
- Temperature Schedules for self-supervised contrastive methods on long-tail data (ICLR'23)☆18Apr 25, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 3 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- ☆31Nov 27, 2025Updated 5 months ago
- Structure From Motion in 50 lines using OpenCV☆13May 31, 2021Updated 4 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- CogCompTime☆11Apr 19, 2022Updated 4 years ago