☆28 Aug 27, 2025 Updated 7 months ago
Alternatives and similar repositories for TextbooksToRL
Users that are interested in TextbooksToRL are comparing it to the libraries listed below.
- Enemies for your LLM ☆35 Jan 20, 2026 Updated 2 months ago
- ☆23 Aug 27, 2025 Updated 7 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file ☆16 Jul 26, 2024 Updated last year
- Prompts and evaluation data for LLMs on real world coding and writing tasks ☆17 Sep 13, 2025 Updated 6 months ago
- A Data Source for Reasoning Embodied Agents ☆19 Sep 18, 2023 Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 Jun 19, 2024 Updated last year
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model" ☆23 Jun 28, 2024 Updated last year
- all that favours real-time democracy ☆15 Aug 7, 2022 Updated 3 years ago
- Lego for GRPO ☆30 May 27, 2025 Updated 9 months ago
- Repository containing the code for training the CroissantLLM ☆21 Feb 4, 2024 Updated 2 years ago
- My collection of dotfiles ☆14 Mar 16, 2026 Updated last week
- ☆12 Aug 6, 2024 Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 Nov 28, 2024 Updated last year
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆179 Sep 18, 2025 Updated 6 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆16 Aug 15, 2025 Updated 7 months ago
- Python repo for the XDK auto-generated code. ☆25 Feb 28, 2026 Updated 3 weeks ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆64 Aug 2, 2024 Updated last year
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆359 Mar 18, 2026 Updated last week
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning" ☆28 May 28, 2024 Updated last year
- ☆16 May 31, 2024 Updated last year
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆69 May 13, 2025 Updated 10 months ago
- ☆15 Apr 26, 2025 Updated 11 months ago
- ☆43 Sep 15, 2025 Updated 6 months ago
- Interactive 3D Avatar Profile Viewer generated in Ready Player Me ☆10 Aug 27, 2022 Updated 3 years ago
- Quantum Insert/FoxAcid case study ☆16 May 18, 2017 Updated 8 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call… ☆17 Apr 7, 2024 Updated last year
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO ☆12 Feb 27, 2025 Updated last year
- Deep Counterfactual Prediction with Categorical Backward Variables ☆12 Feb 8, 2023 Updated 3 years ago
- Debiasing Through Data Attribution ☆12 May 23, 2024 Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 Jan 17, 2024 Updated 2 years ago
- Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita… ☆11 Sep 8, 2022 Updated 3 years ago
- ☆10 Feb 17, 2019 Updated 7 years ago
- Interesting papers ☆11 Jun 22, 2024 Updated last year
- Hello Deep Learning ☆16 Apr 20, 2024 Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆267 Jul 8, 2025 Updated 8 months ago
- Radamsa written in c ported to Rust ☆11 May 10, 2023 Updated 2 years ago
- The first large scale formally verified reasoning dataset for Verilog ☆21 May 16, 2025 Updated 10 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 Nov 9, 2021 Updated 4 years ago
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency … ☆10 Jul 18, 2023 Updated 2 years ago