A truly open version of gpt-oss which shows the entire pre-training from scratch
☆89Sep 4, 2025Updated 8 months ago
Alternatives and similar repositories for truly-open-gpt-oss
Users that are interested in truly-open-gpt-oss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆20Apr 18, 2024Updated 2 years ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated last year
- Advanced NLP, Fall 2025 https://cmu-l3.github.io/anlp-fall2025/☆61Jan 18, 2026Updated 3 months ago
- ☆13Aug 13, 2025Updated 8 months ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An LLM leaderboard for stateful agents☆21Oct 16, 2025Updated 6 months ago
- ☆23Jun 28, 2025Updated 10 months ago
- A library for training crosscoders☆17May 28, 2025Updated 11 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems☆35Nov 21, 2025Updated 5 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Collected the world's best computer vision labs and lecture materials.☆14Feb 23, 2025Updated last year
- ☆122Mar 18, 2026Updated last month
- ☆10Mar 14, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Large language model of Medical AI, General Medical AI (GMAI)☆17Jan 30, 2024Updated 2 years ago
- a chat program demo use netty framework☆11Jun 30, 2014Updated 11 years ago
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- A demonstration of ElasticSearch and the Perl API, ElasticSearch.pm☆19Aug 18, 2011Updated 14 years ago
- Official Repository of paper MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Pol…☆79Jan 26, 2026Updated 3 months ago
- ☆10Aug 17, 2017Updated 8 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 6 months ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆55Apr 7, 2026Updated 3 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [WACV 2024] Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024☆13Jan 3, 2024Updated 2 years ago
- A tiny server to run local inference on MLX model in the style of OpenAI☆13Jan 31, 2024Updated 2 years ago
- AI安全开放社区官方文档☆26Apr 11, 2026Updated 3 weeks ago
- ☆11Aug 22, 2023Updated 2 years ago
- [ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆50Apr 24, 2026Updated last week
- Investigation for PyDataLondon 2023 and ODSC 2023 conference comparing Pandas 2, Polars and Dask☆11Dec 7, 2023Updated 2 years ago
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"☆14Apr 13, 2026Updated 3 weeks ago
- [ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆388Mar 13, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Jul 11, 2023Updated 2 years ago
- A bookshelf plugin which handles relationships.☆22Updated this week
- B站视频的音频下载工具,主要用来从视频下歌;>☆13Oct 18, 2024Updated last year
- Coder Desktop application for Windows☆23Feb 24, 2026Updated 2 months ago
- android https capture☆15Aug 8, 2022Updated 3 years ago
- A stacked area chart with smooth interpolation. Often used to display values over time.☆19Updated this week
- An Android app for face recognition.☆12Oct 23, 2019Updated 6 years ago