SalesforceAIResearch / PretrainRL-pipelineView external linksLinks
An automated data pipeline scaling RL to pretraining levels
☆72Oct 11, 2025Updated 4 months ago
Alternatives and similar repositories for PretrainRL-pipeline
Users that are interested in PretrainRL-pipeline are comparing it to the libraries listed below
Sorting:
- ☆19Jul 31, 2025Updated 6 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆62Mar 4, 2025Updated 11 months ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 2 years ago
- RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.☆41Oct 31, 2025Updated 3 months ago
- ☆24Jan 22, 2025Updated last year
- Multi-agent AI discussion CLI for structured debates between LLMs☆69Jan 1, 2026Updated last month
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆77May 2, 2025Updated 9 months ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Mar 21, 2023Updated 2 years ago
- ☆35Jun 13, 2023Updated 2 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 2 months ago
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- ☆27Updated this week
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 10 months ago
- ☆115May 7, 2025Updated 9 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Nov 3, 2025Updated 3 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Sep 18, 2025Updated 4 months ago
- SpectraGuru - A Spectra Analysis Application☆29Updated this week
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Technical docs to help you make you Halo Strix WORK!☆23Jan 10, 2026Updated last month
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Home server set up☆13Oct 5, 2025Updated 4 months ago
- Implementation of a fast semantic chunker in C++, installable in python 3.7+ projects.☆22Sep 20, 2025Updated 4 months ago
- A semi print-in-place hand for human-like manipulation, designed to be built by anyone.☆17Jan 5, 2026Updated last month
- A benchmark dataset designed to support the development and evaluation of large language models (LLMs) for conversational mental health a…☆17Feb 24, 2025Updated 11 months ago
- ☆16Feb 22, 2025Updated 11 months ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment…☆11Apr 29, 2024Updated last year
- ☆10Sep 29, 2024Updated last year
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface☆19Nov 19, 2025Updated 2 months ago
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files☆28Updated this week
- Generate a wiki for your research topic, sourcing from the web and your docs.☆58Mar 8, 2025Updated 11 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50May 19, 2025Updated 8 months ago
- code for DOMI☆11Mar 24, 2023Updated 2 years ago
- [NeurIPS 25]SwitchLingua: The First Large-Scale Multilingual and Multi-Ethnic Code-Switching Dataset☆16Sep 19, 2025Updated 4 months ago
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- GitOps automation for plain old docker compose stack deploy☆10Dec 25, 2024Updated last year