☆40Jan 16, 2026Updated 3 months ago
Alternatives and similar repositories for data-efficient-llm-rl
Users that are interested in data-efficient-llm-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated 11 months ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 4 months ago
- ☆78Jun 28, 2025Updated 10 months ago
- [NeurIPS 2021] Fast Certified Robust Training with Short Warmup☆25Jun 7, 2025Updated 10 months ago
- Fork of Microsoft/LightGBM to include support for the CEGB (Cost Efficient Gradient Boosting) algorithm. Original repository at https://g…☆13Jun 30, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2020] Code for paper "Robustness Verification for Transformers"☆26Nov 26, 2024Updated last year
- Official Repository of "Learning what reinforcement learning can't"☆84Dec 30, 2025Updated 4 months ago
- [ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…☆16Jun 22, 2022Updated 3 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆55May 12, 2025Updated 11 months ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated 3 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Code of On L-p Robustness of Decision Stumps and Trees, ICML 2020☆10Aug 3, 2020Updated 5 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 4 months ago
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆31Feb 10, 2026Updated 2 months ago
- Causal Representation Learning for Out-of-Distribution Recommendation (WWW'22)☆19Dec 26, 2023Updated 2 years ago
- ☆11Sep 19, 2025Updated 7 months ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆27Oct 19, 2025Updated 6 months ago
- The dataset of our work where the application of portable Raman spectroscopy coupled with several supervised machine-learning techniques,…☆14Nov 21, 2019Updated 6 years ago
- Linear and interval bound propagation in Pytorch with easy-to-use API and GPU support.☆11Apr 28, 2026Updated last week
- ☆30Oct 8, 2025Updated 6 months ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 7 months ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆20Mar 18, 2026Updated last month
- ☆28Updated this week
- A Bimanual-mobile Robot Manipulation Dataset specifically designed for household applications☆17Aug 12, 2024Updated last year
- ☆11Nov 2, 2023Updated 2 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- ☆14Jul 17, 2025Updated 9 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- This repository contains a list of papers on spatio-temporal graph, especially about GNNs on S-T graph.☆18Sep 8, 2023Updated 2 years ago
- ☆13Mar 24, 2017Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Benchmarks for the VNN Comp 2023☆16Jun 7, 2024Updated last year
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆51Apr 17, 2026Updated 2 weeks ago
- Chrome extension that logs all AJAX (XMLHttpRequest) activity to the Dev Tools Console, allowing inspection of AJAX calls, and open calls…☆26Aug 20, 2015Updated 10 years ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆62Aug 13, 2024Updated last year
- This is the official implementation of physics-informed neural networks for functional differential equations (Functional PINN) proposed …☆12Apr 9, 2025Updated last year
- ☆14Jul 13, 2022Updated 3 years ago
- [RSS 2026] Interactive World Simulator for Robot Policy Training and Evaluation☆235Mar 20, 2026Updated last month