☆42Jan 16, 2026Updated 4 months ago
Alternatives and similar repositories for data-efficient-llm-rl
Users that are interested in data-efficient-llm-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated last year
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 5 months ago
- ☆80Jun 8, 2026Updated last week
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Jun 7, 2025Updated last year
- Fork of Microsoft/LightGBM to include support for the CEGB (Cost Efficient Gradient Boosting) algorithm. Original repository at https://g…☆13Jun 30, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆26Nov 26, 2024Updated last year
- Official Repository of "Learning what reinforcement learning can't"☆84Dec 30, 2025Updated 5 months ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆56May 2, 2025Updated last year
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆16Jun 22, 2022Updated 3 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆58May 12, 2025Updated last year
- ☆15Dec 7, 2021Updated 4 years ago
- This is the code of a agentic rag method with dynamic workflow.☆14Jan 22, 2026Updated 4 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆21Dec 14, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022☆11Dec 6, 2022Updated 3 years ago
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆35Jun 7, 2026Updated last week
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆30Oct 19, 2025Updated 7 months ago
- ☆11Sep 19, 2025Updated 8 months ago
- Linear and interval bound propagation in Pytorch with easy-to-use API and GPU support.☆11May 14, 2026Updated last month
- ☆31Oct 8, 2025Updated 8 months ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆24Sep 7, 2025Updated 9 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆81May 2, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Jun 8, 2026Updated last week
- ☆22Dec 18, 2024Updated last year
- The official repository for the paper entitled "Time Travel in LLMs: Tracing Data Contamination in Large Language Models."☆14Jun 11, 2024Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆20Mar 18, 2026Updated 2 months ago
- [ACL 2026] RAG over Tables: Hierarchical Memory Index, Multi-Stage Retrieval, and Benchmarking.☆32Oct 7, 2025Updated 8 months ago
- [CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent☆32Apr 30, 2026Updated last month
- ☆35May 14, 2026Updated last month
- A Bimanual-mobile Robot Manipulation Dataset specifically designed for household applications☆17Aug 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Nov 2, 2023Updated 2 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- ☆14Jul 17, 2025Updated 10 months ago
- [Ebook]从零到百万店铺:一个没有计算机学位的普通人的系统设计实战之旅☆27Nov 11, 2025Updated 7 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- This repository contains a list of papers on spatio-temporal graph, especially about GNNs on S-T graph.☆18Sep 8, 2023Updated 2 years ago
- ☆13Mar 24, 2017Updated 9 years ago