Open-Source-O1 / Open-O1
☆819Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Open-O1
- Large Reasoning Models☆580Updated this week
- O1 Replication Journey: A Strategic Progress Report – Part I☆1,318Updated 3 weeks ago
- ☆515Updated this week
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,045Updated this week
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆714Updated 2 weeks ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆491Updated 2 weeks ago
- ☆878Updated 5 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆460Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆535Updated 3 weeks ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,008Updated 10 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆352Updated 2 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆478Updated 6 months ago
- Code for Quiet-STaR☆651Updated 3 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆204Updated this week
- Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation☆675Updated 3 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆883Updated last month
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in…☆589Updated last month
- ☆935Updated 2 weeks ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆647Updated last month
- ☆287Updated 2 months ago
- Parsing-free RAG supported by VLMs☆388Updated this week
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆412Updated last month
- Janus-Series: Unified Multimodal Understanding and Generation Models☆1,084Updated last week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆316Updated last month
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆339Updated 2 months ago
- [NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces in…☆791Updated this week
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆305Updated 7 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆254Updated last month
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- A lightweight framework for building LLM-based agents☆1,868Updated this week