NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆1,795Updated this week
Alternatives and similar repositories for SkyThought:
Users that are interested in SkyThought are comparing it to the libraries listed below
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…☆3,061Updated this week
- An Open Large Reasoning Model for Real-World Solutions☆1,378Updated last month
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,147Updated last month
- Recipes to scale inference-time compute of open models☆932Updated this week
- The Open Cookbook for Top-Tier Code Large Language Model☆1,551Updated last month
- Large Concept Models: Language modeling in a sentence representation space☆1,713Updated this week
- Optimizing inference proxy for LLMs☆1,926Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,217Updated last month
- ☆1,044Updated this week
- 🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.☆5,197Updated this week
- Everything about the SmolLM & SmolLM2 family of models☆1,554Updated last week
- Composable building blocks to build Llama Apps☆6,036Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,442Updated 5 months ago
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,908Updated 5 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆3,197Updated 2 months ago
- DataComp for Language Models☆1,206Updated last month
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,739Updated last week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,022Updated this week
- nanoGPT style version of Llama 3.1☆1,290Updated 5 months ago
- NanoGPT (124M) in 3.4 minutes☆2,068Updated last week
- ☆1,137Updated last month
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,281Updated 3 weeks ago
- ☆2,802Updated 4 months ago
- ☆2,289Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆831Updated last month
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆1,992Updated last month
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,399Updated this week
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,528Updated this week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,642Updated last week