j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆102Jul 19, 2025Updated 8 months ago
Alternatives and similar repositories for j1-micro
Users that are interested in j1-micro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A book about Ph.D. student and research career planning☆29Oct 21, 2025Updated 5 months ago
- ☆19Mar 3, 2025Updated last year
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆71Dec 22, 2025Updated 3 months ago
- Advanced SQLMap command builder with an intuitive cheatsheet UI. Works locally in your browser as a single HTML file (no data sent anywhe…☆32Jul 6, 2025Updated 8 months ago
- Make inso available in your GitHub Actions workflows☆11Jul 16, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated 11 months ago
- A cookiecutter template for creating a new LLM plugin that adds tools to LLM☆29May 27, 2025Updated 10 months ago
- ☆28Oct 22, 2024Updated last year
- Intercepts cargo/gcc builds from AI coding agents via hooks and transparently routes them to remote worker machines, returning artifacts …☆44Updated this week
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- ☆17Apr 11, 2025Updated 11 months ago
- [ICLR 2026] Learning to Reason without External Rewards☆403Jan 26, 2026Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆41Jan 29, 2026Updated 2 months ago
- Vibe. Prove. Verify.☆38Feb 27, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆85Sep 5, 2025Updated 6 months ago
- A framework for optimizing DSPy programs with RL☆328Jan 12, 2026Updated 2 months ago
- Inference-time scaling for LLMs-as-a-judge.☆332Nov 5, 2025Updated 4 months ago
- ☆137Mar 20, 2025Updated last year
- a single interface around speech-to-speech foundation models☆28Jun 27, 2025Updated 9 months ago
- ☆15Apr 26, 2025Updated 11 months ago
- Red-Teaming Language Models with DSPy☆253Feb 13, 2025Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- Weaving prompts and code into structured, resilient patterns that won't unravel under pressure.☆30Dec 6, 2025Updated 3 months ago
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 9 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated 9 months ago
- ☆34Jun 10, 2025Updated 9 months ago
- rl from zero pretrain, can it be done? yes.☆291Sep 28, 2025Updated 6 months ago
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆17Mar 26, 2025Updated last year
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Jul 5, 2017Updated 8 years ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆349Sep 12, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated 11 months ago
- OWASP Foundation Web Respository☆12Jan 28, 2026Updated 2 months ago
- ☆67May 23, 2025Updated 10 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆190Mar 7, 2025Updated last year
- AI powered Virtual Desktop☆16Mar 22, 2026Updated last week
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Oct 27, 2024Updated last year
- ☆12Mar 14, 2022Updated 4 years ago