An automated data pipeline scaling RL to pretraining levels
☆73Oct 11, 2025Updated 4 months ago
Alternatives and similar repositories for PretrainRL-pipeline
Users that are interested in PretrainRL-pipeline are comparing it to the libraries listed below
Sorting:
- ☆19Jul 31, 2025Updated 7 months ago
- ☆13Jul 19, 2022Updated 3 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- Transform your CapsLock into an AI key! This AutoHotkey app puts powerful AI capabilities right at your fingertips, supercharging your Wi…☆21Oct 31, 2025Updated 4 months ago
- ☆51Oct 1, 2025Updated 5 months ago
- Multi-agent AI discussion CLI for structured debates between LLMs☆72Jan 1, 2026Updated 2 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆78May 2, 2025Updated 10 months ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Mar 21, 2023Updated 2 years ago
- ☆35Jun 13, 2023Updated 2 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆41Dec 13, 2024Updated last year
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- MiniGPT-Pancreas: Multimodal Large language Model for Pancreas Cancer Classification and Detection☆11Sep 19, 2025Updated 5 months ago
- ☆116May 7, 2025Updated 10 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Nov 3, 2025Updated 4 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- ☆10Sep 29, 2024Updated last year
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos☆33Jan 13, 2026Updated last month
- ☆33Feb 26, 2026Updated last week
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Sep 18, 2025Updated 5 months ago
- ☆16Jul 20, 2025Updated 7 months ago
- Home server set up☆13Oct 5, 2025Updated 5 months ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆17Feb 24, 2025Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface☆20Nov 19, 2025Updated 3 months ago
- ☆16Feb 22, 2025Updated last year
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated last month
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51May 19, 2025Updated 9 months ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆14Nov 3, 2023Updated 2 years ago
- ComfyUI Workflows☆10Sep 27, 2025Updated 5 months ago
- Custom Engineered Agents and Tools for Vibe Coders | Agents for TRAE.AI, Smart MCPs, GLM Models integration and more...☆22Dec 24, 2025Updated 2 months ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆22Oct 8, 2025Updated 5 months ago