☆36Dec 18, 2024Updated last year
Alternatives and similar repositories for mindrlhf
Users that are interested in mindrlhf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- τ²-Bench-Verified is a corrected and verified version of the original τ²-bench benchmark. This release addresses issues discovered in the…☆38Apr 2, 2026Updated last month
- 电子科技大学本科毕业论文xelatex模板☆11Jan 20, 2014Updated 12 years ago
- 🗺️ ASP planning tools for PDDL☆32Jul 9, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆187Jan 28, 2026Updated 3 months ago
- Simple converter of MuseScore 2 plugins to MuseScore 3☆14Mar 30, 2019Updated 7 years ago
- Datawhale自研数据标注工具☆73May 11, 2024Updated last year
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆23Jan 24, 2026Updated 3 months ago
- ☆25Jun 5, 2025Updated 11 months ago
- Lean 定理证明☆26Dec 28, 2025Updated 4 months ago
- ☆10Dec 3, 2020Updated 5 years ago
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆15Nov 6, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆46Apr 23, 2026Updated 2 weeks ago
- Synthesizes efficient Z3 strategies tailored to your problem set! Repo for the IJCAI'24 paper: Layered and Staged Monte Carlo Tree Search…☆24Updated this week
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- ☆46Sep 15, 2025Updated 7 months ago
- [ICML 2025] Official implementation of the paper "SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling". …☆22Nov 17, 2025Updated 5 months ago
- ☆29May 22, 2025Updated 11 months ago
- A toolbox of vision models and algorithms based on MindSpore☆265Jul 24, 2025Updated 9 months ago
- tianchi competition,round1 24/2322,round2 45/2322☆16Dec 9, 2020Updated 5 years ago
- ☆17May 3, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- ☆24Mar 1, 2024Updated 2 years ago
- NLP/ML面试各类资料链接 汇总(主要Github收集)☆11Mar 3, 2020Updated 6 years ago
- 《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被60多个国家的400多所大学用于教学。☆24Apr 3, 2023Updated 3 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated 10 months ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆12May 1, 2024Updated 2 years ago
- Playground project acting as an example for a complex LangChain workflow☆11Jun 20, 2023Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…☆17Dec 13, 2024Updated last year
- A toolbox of ocr models and algorithms based on MindSpore☆300Jul 24, 2025Updated 9 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 4 months ago
- [EMNLP 2024 Main] Official implementation of the paper "Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mech…☆16Oct 8, 2024Updated last year
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆43Feb 22, 2026Updated 2 months ago
- This is a DRL platform built with Gazebo for the purpose of robot navigation☆20Jul 14, 2018Updated 7 years ago
- ChatYuan-7B☆13Jun 16, 2023Updated 2 years ago