☆48Feb 10, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-gsm8k
Users that are interested in deepseek-r1-gsm8k are comparing it to the libraries listed below.
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆23Nov 22, 2025Updated 7 months ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆23Aug 1, 2025Updated 11 months ago
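- Wireless root cause analysis: combines historical alarms from the live network with fault-localization work-order data, uses machine learning to build a fault root-cause analysis model, quickly locates fault causes, and greatly improves network operations and maintenance efficiency.☆16Aug 23, 2019Updated 6 years ago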
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆19May 23, 2025Updated last year
- This repository aims to reproduce R1-Zero in the medical domain.☆32Jun 11, 2025Updated last year
- Source codes for Time2Graph model.☆20Jan 21, 2020Updated 6 years ago
- ☆28Sep 15, 2025Updated 9 months ago
- Contrastive self-supervised learning using Rényi divergence☆14Oct 21, 2022Updated 3 years ago
- ☆25Dec 13, 2024Updated last year
- Minimal-cost training of a 0.5B R1-Zero☆814May 14, 2025Updated last year
- [ICML 2025] Official code of "DAMA: Data- and Model-aware Alignment of Multi-modal LLMs"☆16May 24, 2025Updated last year
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆74Dec 8, 2025Updated 6 months ago
- ☆13Sep 26, 2023Updated 2 years ago
- Ground-Aware Point Cloud Semantic Segmentation for Autonomous Driving. ACM Multimedia 2019.☆12Sep 19, 2019Updated 6 years ago
- Direct preference optimization with f-divergences.☆17Nov 3, 2024Updated last year
- This is the data and code for the paper: Evaluating the Efficacy of Supervised Learning vs. Large Language Models for Identifying Cogniti…☆16Aug 3, 2025Updated 10 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,453Mar 20, 2025Updated last year
- ☆10Oct 15, 2020Updated 5 years ago
- Code for benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) on the cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
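- This project presents a categorization of selected papers from top information retrieval/data mining conferences in 2022.☆17Jun 13, 2022Updated 4 years ago
- A project of simple general-purpose utilities☆22Oct 6, 2024Updated last year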
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆47Mar 12, 2024Updated 2 years ago
- Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern☆21Jan 20, 2023Updated 3 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 4 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- ☆12May 18, 2024Updated 2 years ago
- Win + D for One Monitor (Show Desktop only for One Monitor)☆10Dec 15, 2022Updated 3 years ago
- A simple and efficient memory pool implemented in C++11.☆10Jun 2, 2022Updated 4 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 5 years ago
- ☆31Nov 27, 2025Updated 7 months ago
- Binary Multi-View Clustering☆37Feb 7, 2022Updated 4 years ago
- Structure From Motion in 50 lines using OpenCV☆13May 31, 2021Updated 5 years ago
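- LaTeX thesis template for Tsinghua University students in biology, medicine, pharmacy, and related majors; also usable for other majors. Suitable for bachelor's, master's, and doctoral theses and postdoctoral reports. Built on the TUNA association's thuthesis project, this template adds content relevant to students in biology, medicine, and pharmacy, and adds comments that are friendlier to LaTeX newcomers.☆27Sep 14, 2023Updated 2 years ago
- PyTorch implementation of Chinese multi-turn Cdial based on GPT + NEZHA☆11Oct 22, 2022Updated 3 years ago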
- CogCompTime☆11Apr 19, 2022Updated 4 years ago