haizelabs / j1-microView external linksLinks
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆102Jul 19, 2025Updated 6 months ago
Alternatives and similar repositories for j1-micro
Users that are interested in j1-micro are comparing it to the libraries listed below
Sorting:
- ☆37Aug 4, 2025Updated 6 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Jun 13, 2024Updated last year
- ☆14Dec 12, 2024Updated last year
- ☆10May 25, 2023Updated 2 years ago
- Make inso available in your GitHub Actions workflows☆11Jul 16, 2025Updated 6 months ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM☆14Dec 27, 2023Updated 2 years ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Jul 5, 2017Updated 8 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- a single interface around speech-to-speech foundation models☆26Jun 27, 2025Updated 7 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Jun 5, 2025Updated 8 months ago
- [ICLR 2026] Learning to Reason without External Rewards☆391Jan 26, 2026Updated 2 weeks ago
- Detect and redact PII locally with SOTA performance☆91Mar 25, 2025Updated 10 months ago
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆68Dec 22, 2025Updated last month
- A cookiecutter template for creating a new LLM plugin that adds tools to LLM☆29May 27, 2025Updated 8 months ago
- ☆67May 23, 2025Updated 8 months ago
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings☆115Jul 27, 2025Updated 6 months ago
- ☆34Jun 10, 2025Updated 8 months ago
- Deep learning examples for the Instant Super Computer☆20Jan 28, 2026Updated 2 weeks ago
- CompChomper is a framework for measuring how LLMs perform at code completion.☆19Apr 29, 2025Updated 9 months ago
- ☆137Mar 20, 2025Updated 10 months ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- rl from zero pretrain, can it be done? yes.☆286Sep 28, 2025Updated 4 months ago
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆28Feb 11, 2025Updated last year
- Python3 library for sophisticated timing attacks using Gaussian Mixture Model.☆22Apr 10, 2022Updated 3 years ago
- Advanced examples for using Tensorflow☆18Jul 13, 2017Updated 8 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆27Nov 30, 2025Updated 2 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆26Updated this week
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆347Sep 12, 2025Updated 5 months ago
- Neural search engine for discovering semantically similar Python repositories on GitHub☆29Feb 11, 2024Updated 2 years ago
- Pivotal Token Search☆145Dec 20, 2025Updated last month
- ReconPro is a specialized Google dorking tool designed for cybersecurity professionals and bug bounty hunters.☆44Sep 19, 2025Updated 4 months ago
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 5 months ago
- ☆67Mar 30, 2025Updated 10 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Mar 7, 2025Updated 11 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- Software Engineering Back End Microservices Project☆15Nov 20, 2024Updated last year