☆36Dec 18, 2024Updated last year
Alternatives and similar repositories for mindrlhf
Users that are interested in mindrlhf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆27Jun 5, 2025Updated 9 months ago
- A Pytorch implementation for ELIC (CVPR 2022).☆11Mar 29, 2023Updated 2 years ago
- ☆185Jan 28, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Simple converter of MuseScore 2 plugins to MuseScore 3☆14Mar 30, 2019Updated 6 years ago
- Datawhale自研数据标注工具☆73May 11, 2024Updated last year
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- An open-ended, self-improving AI system that evolves its own source code using a local LLM. Built for autonomy, reflection, and code evol…☆22Jan 24, 2026Updated 2 months ago
- Lean 定理证明☆24Dec 28, 2025Updated 2 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆40Updated this week
- macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor☆15Nov 30, 2023Updated 2 years ago
- Official repository for CoTran: An LLM-based code translator for whole-program translation, fine-tuned using feedback from compiler and s…☆16Nov 6, 2024Updated last year
- Runs WASM on the GPU☆45Feb 7, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Synthesizes efficient Z3 strategies tailored to your problem set! Repo for the IJCAI'24 paper: Layered and Staged Monte Carlo Tree Search…☆23Feb 10, 2026Updated last month
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 8 months ago
- 基于MindSpore的TinyRAG实现☆18Dec 31, 2024Updated last year
- Yet another musicxml render written in JS☆19Feb 24, 2022Updated 4 years ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated 3 weeks ago
- A toolbox of vision models and algorithms based on MindSpore☆266Jul 24, 2025Updated 8 months ago
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆17May 3, 2019Updated 6 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- ☆23Mar 1, 2024Updated 2 years ago
- ☆13Jul 13, 2023Updated 2 years ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 9 months ago
- ☆21Dec 22, 2024Updated last year
- ☆16Jul 21, 2022Updated 3 years ago
- Playground project acting as an example for a complex LangChain workflow☆11Jun 20, 2023Updated 2 years ago
- Ongoing research project for code&math LLMs☆27Jul 4, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- [EMNLP 2024 Main] Official implementation of the paper "To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimoda…☆17Dec 13, 2024Updated last year
- Iterated Local Search (ILS) metaheuristic applied to the Prize Collecting Traveling Salesman Problem (PCTSP).☆10Jul 15, 2019Updated 6 years ago
- A toolbox of ocr models and algorithms based on MindSpore☆298Jul 24, 2025Updated 8 months ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 3 months ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆42Feb 22, 2026Updated last month
- My Solutions to Sutton and Barto exercises, 2nd edition☆14Apr 27, 2018Updated 7 years ago