lennart-finke / gpt-ossLinks
What does gpt-oss tell us about OpenAI's training data?
☆35Updated 4 months ago
Alternatives and similar repositories for gpt-oss
Users that are interested in gpt-oss are comparing it to the libraries listed below
Sorting:
- Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to pote…☆202Updated 3 months ago
- ☆28Updated 2 months ago
- explore token trajectory trees on instruct and base models☆150Updated 7 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆274Updated 2 months ago
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆326Updated 3 months ago
- ☆55Updated 9 months ago
- Pivotal Token Search☆142Updated last month
- A better way of testing, inspecting, and analyzing AI Agent traces.☆40Updated last week
- Applying the ideas of Deepseek R1 to computer use☆221Updated 11 months ago
- ☆68Updated 7 months ago
- Everything you need to know about LLM inference☆257Updated last week
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX☆43Updated 3 weeks ago
- ☆20Updated 2 months ago
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs☆24Updated 6 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆100Updated 9 months ago
- A simple tool that let's you explore different possible paths that an LLM might sample.☆199Updated 8 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆435Updated last week
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆190Updated last month
- Parallel Reasoning: llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆374Updated 2 weeks ago
- Context Engineering Course with DSPy☆211Updated 5 months ago
- ☆37Updated 5 months ago
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆246Updated last week
- Codebase for FinePDFs☆161Updated last week
- ☆19Updated 11 months ago
- ☆302Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆61Updated 8 months ago
- ~ streaming agents☆74Updated 3 weeks ago
- ☆253Updated 10 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆230Updated 7 months ago
- Guardrails for secure and robust agent development☆378Updated last week