huggingface / gpt-oss-recipesView external linksLinks
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
☆496Aug 25, 2025Updated 5 months ago
Alternatives and similar repositories for gpt-oss-recipes
Users that are interested in gpt-oss-recipes are comparing it to the libraries listed below
Sorting:
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆24Aug 2, 2025Updated 6 months ago
- ☆23Jun 5, 2025Updated 8 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆18Aug 19, 2024Updated last year
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Jun 9, 2025Updated 8 months ago
- ☆220Oct 27, 2025Updated 3 months ago
- Renderer for the harmony response format to be used with gpt-oss☆4,184Dec 15, 2025Updated 2 months ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLING…☆17Apr 15, 2025Updated 10 months ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- Korean Translation Benchmark, LLM-as-a-judge☆23Oct 23, 2025Updated 3 months ago
- Load compute kernels from the Hub☆416Updated this week
- huggingface에 있는 한국어 데이터 세트☆35Oct 10, 2024Updated last year
- Practices for improving quality and manageability of LLM co-created code-bases.☆38Feb 6, 2026Updated last week
- Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"☆26Dec 21, 2025Updated last month
- 🤗 Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI☆21Feb 26, 2024Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Train transformer language models with reinforcement learning.☆17,360Updated this week
- AllenAI's post-training codebase☆3,573Updated this week
- Everything about the SmolLM and SmolVLM family of models☆3,602Jan 13, 2026Updated last month
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆28Dec 3, 2024Updated last year
- Code to go with beginner FastHTML tutorial☆20Jul 5, 2025Updated 7 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆36Aug 27, 2025Updated 5 months ago
- Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean☆22Apr 23, 2025Updated 9 months ago
- Recipes to scale inference-time compute of open models☆1,124May 22, 2025Updated 8 months ago
- Robust recipes to align language models with human and AI preferences☆5,495Sep 8, 2025Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- Stream Overlay for Twitch Clip Highlights☆11May 21, 2021Updated 4 years ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆14Updated this week
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated 11 months ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Empowering LLM Agents for Real-World Computer System Optimization☆16Sep 10, 2025Updated 5 months ago
- LLM-aided data filtering☆14Dec 3, 2024Updated last year
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,084Jan 26, 2026Updated 3 weeks ago
- Async RL Training at Scale☆1,071Updated this week
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year