☆36Dec 18, 2024Updated last year
Alternatives and similar repositories for mindrlhf
Users that are interested in mindrlhf are comparing it to the libraries listed below.
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- 🗺️ ASP planning tools for PDDL☆32Jul 9, 2021Updated 4 years ago
- ☆189Jan 28, 2026Updated 4 months ago
- Datawhale's in-house data annotation tool☆73May 11, 2024Updated 2 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆23Jan 24, 2026Updated 4 months ago
- ☆26Jun 5, 2025Updated last year
- Lean theorem proving☆25Dec 28, 2025Updated 5 months ago
- macrogpt: full pre-training of a large model (1b3, 32 layers), multi-GPU deepspeed / single-GPU adafactor☆15Nov 30, 2023Updated 2 years ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆15Nov 6, 2024Updated last year
- Dinomaly with DinoV3☆41Aug 28, 2025Updated 9 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆49Apr 23, 2026Updated last month
- LongAttn: Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 11 months ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- ☆47Sep 15, 2025Updated 9 months ago
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- [ISSTA'24] A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing☆12Jan 7, 2025Updated last year
- ☆27Mar 17, 2025Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆28Mar 2, 2026Updated 3 months ago
- ☆17May 3, 2019Updated 7 years ago
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- Simple Newtonian Astronomy Simulator☆15Aug 9, 2022Updated 3 years ago
- Collection of links to NLP/ML interview resources (mainly gathered from GitHub)☆11Mar 3, 2020Updated 6 years ago
- ☆12Jul 13, 2023Updated 2 years ago
- Automatically send a daily email to your girlfriend☆12Jun 8, 2021Updated 5 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆23Jun 15, 2025Updated last year
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction☆13May 1, 2024Updated 2 years ago
- Playground project acting as an example for a complex LangChain workflow☆11Jun 20, 2023Updated 2 years ago
- This repository includes all existing demos of AppGallery Connect Service in HarmonyOS.☆18Oct 13, 2025Updated 8 months ago
- A toolbox of ocr models and algorithms based on MindSpore☆302Jul 24, 2025Updated 10 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆65Dec 10, 2025Updated 6 months ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆45Feb 22, 2026Updated 3 months ago
- ChatYuan-7B☆13Jun 16, 2023Updated 3 years ago
- Prototype for a game testing framework using AI methods☆10Feb 25, 2023Updated 3 years ago
- Mobile HTML5 proto of HSL Navigator☆20Jul 3, 2015Updated 10 years ago
- Code for the paper "Code Generation From Flowcharts with Texts: A Benchmark Dataset and An Approach"☆13Feb 11, 2023Updated 3 years ago
- ☆25Dec 13, 2024Updated last year
- The official implementation of dLLM-Factory☆25Jul 12, 2025Updated 11 months ago