☆37Jan 16, 2026Updated 2 months ago
Alternatives and similar repositories for data-efficient-llm-rl
Users that are interested in data-efficient-llm-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆77Jun 28, 2025Updated 8 months ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Jun 7, 2025Updated 9 months ago
- Fork of Microsoft/LightGBM to include support for the CEGB (Cost Efficient Gradient Boosting) algorithm. Original repository at https://g…☆13Jun 30, 2017Updated 8 years ago
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆27Nov 26, 2024Updated last year
- Certified defense to adversarial examples using CROWN and IBP. Also includes GPU implementation of CROWN verification algorithm (in PyTor…☆98Jun 7, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Repository of "Learning what reinforcement learning can't"☆80Dec 30, 2025Updated 2 months ago
- [CVPR 2025] Official implementation for "Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbre…☆55Jul 5, 2025Updated 8 months ago
- [ICML 2025] Reward-guided Speculative Decoding (RSD) for efficiency and effectiveness.☆56May 2, 2025Updated 10 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 10 months ago
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆17Jun 22, 2022Updated 3 years ago
- ☆15Dec 7, 2021Updated 4 years ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated 2 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆42Mar 16, 2026Updated last week
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated last month
- ☆11Sep 19, 2025Updated 6 months ago
- ☆29Oct 8, 2025Updated 5 months ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆24Oct 19, 2025Updated 5 months ago
- Linear and interval bound propagation in Pytorch with easy-to-use API and GPU support.☆11Jul 4, 2025Updated 8 months ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 6 months ago
- Code repository of the paper "Alleviating Adversarial Attacks on Variational Autoencoders with MCMC" published at NeurIPS 2022. https://a…☆10Dec 14, 2022Updated 3 years ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆23Dec 18, 2024Updated last year
- Interactive World Simulator for Robot Policy Training and Evaluation☆182Updated this week
- ☆10Oct 11, 2022Updated 3 years ago
- ☆21Mar 18, 2026Updated last week
- ☆30Dec 11, 2025Updated 3 months ago
- ☆23Mar 10, 2026Updated 2 weeks ago
- ☆11Nov 2, 2023Updated 2 years ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated last year
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆14Jul 17, 2025Updated 8 months ago
- [Ebook]从零到百万店铺:一个没有计算机学位的普通人的系统设计实战之旅☆26Nov 11, 2025Updated 4 months ago
- Benchmarks for the VNN Comp 2023☆16Jun 7, 2024Updated last year
- Python-code to analyse Raman spectroscopy☆14Apr 19, 2019Updated 6 years ago
- The official repo for GCP-CROWN paper☆13Sep 26, 2022Updated 3 years ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆60Aug 13, 2024Updated last year
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆51Jan 30, 2026Updated last month