☆48Feb 10, 2025Updated last year
Alternatives and similar repositories for deepseek-r1-gsm8k
Users that are interested in deepseek-r1-gsm8k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆24Apr 6, 2025Updated last year
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆21Nov 22, 2025Updated 4 months ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客 户端调用,开箱即用。☆13May 24, 2022Updated 3 years ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆22Aug 1, 2025Updated 8 months ago
- ☆45Nov 20, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 无线根因分析结合现网历史告警和故障定位工单数据,通过机器学习手段建立故障根因分析模型,快速定位故障原因,大幅提升网络运维效率。☆16Aug 23, 2019Updated 6 years ago
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 10 months ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- ☆25Dec 13, 2024Updated last year
- minimal-cost for training 0.5B R1-Zero☆815May 14, 2025Updated 10 months ago
- ☆50May 19, 2025Updated 10 months ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆70Dec 8, 2025Updated 4 months ago
- [EMNLP 2023] Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models☆17Oct 30, 2023Updated 2 years ago
- ☆26Aug 2, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆13Feb 5, 2024Updated 2 years ago
- EDSL code☆19Mar 19, 2022Updated 4 years ago
- 哔哩哔哩常用API调用。☆17Aug 5, 2023Updated 2 years ago
- ☆47Jul 1, 2025Updated 9 months ago
- The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"☆20Dec 24, 2024Updated last year
- Reproduce R1 Zero on Logic Puzzle☆2,447Mar 20, 2025Updated last year
- [ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning☆13Sep 2, 2024Updated last year
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 8 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆12Updated this week
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- Collecting personality-indicative data for role-playing agents.☆24Feb 18, 2025Updated last year
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- RAG-Fusion implementation using Langchain, Weaviate and OpenAI☆13Oct 31, 2023Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- ☆13May 18, 2024Updated last year
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆13Jun 4, 2023Updated 2 years ago
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 3 years ago
- Win + D for One Monitor (Show Desktop only for One Monitor)☆10Dec 15, 2022Updated 3 years ago
- ☆30Nov 27, 2025Updated 4 months ago
- Structure From Motion in 50 lines using OpenCV☆13May 31, 2021Updated 4 years ago
- 清华大学生物,医学,药学等相关专业的毕业论文latex模板。也适用于其他专业。适合本硕博毕业论文和博后报告。本模板在tuna协会的thuthesis项目基础上,增补了和生医药相关同学的内容,也增添了对latex新手更加友好的注释。☆25Sep 14, 2023Updated 2 years ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago