hughbzhang / o1_inference_scaling_lawsView external linksLinks
Replicating O1 inference-time scaling laws
☆93Dec 1, 2024Updated last year
Alternatives and similar repositories for o1_inference_scaling_laws
Users that are interested in o1_inference_scaling_laws are comparing it to the libraries listed below
Sorting:
- ☆20Nov 4, 2025Updated 3 months ago
- ☆12Nov 5, 2024Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120May 6, 2025Updated 9 months ago
- Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.☆26Apr 6, 2025Updated 10 months ago
- ☆56Nov 6, 2024Updated last year
- Evaluation utilities based on SymPy.☆21Dec 12, 2024Updated last year
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022☆18May 19, 2022Updated 3 years ago
- ☆19Mar 3, 2025Updated 11 months ago
- ☆21Jul 25, 2025Updated 6 months ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- ☆42Sep 19, 2024Updated last year
- ☆23Jul 5, 2024Updated last year
- Harmonic Datasets☆52Jul 12, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆26Dec 23, 2024Updated last year
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- ☆14Dec 25, 2024Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- Estimate MFU for DeepSeekV3☆26Jan 5, 2025Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Dec 10, 2024Updated last year
- Extremely simple MoE implementation, mostly based off Switch Transformer☆13Feb 26, 2024Updated last year
- ☆13Apr 3, 2025Updated 10 months ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- ☆12Nov 15, 2022Updated 3 years ago
- ☆13May 21, 2024Updated last year
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- ☆12Jul 12, 2024Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Feb 9, 2026Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 3 months ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆78Nov 25, 2024Updated last year
- Dynamic Shell Command MCP Server☆41Feb 27, 2025Updated 11 months ago
- The supplementary material for the paper "Fine-tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code R…☆16Aug 12, 2024Updated last year
- The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"☆31May 27, 2023Updated 2 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆14Jan 22, 2025Updated last year