rednote-hilab / dots.llm1Links
☆228Updated this week
Alternatives and similar repositories for dots.llm1
Users that are interested in dots.llm1 are comparing it to the libraries listed below
Sorting:
- ☆223Updated last week
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆132Updated 11 months ago
- ☆77Updated 2 months ago
- GLM Series Edge Models☆142Updated 3 months ago
- An Open Math Pre-trainng Dataset with 370B Tokens.☆88Updated 2 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆224Updated last week
- ☆83Updated 3 weeks ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆45Updated 2 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆51Updated this week
- Auto Thinking Mode switch for Qwen3 in Open webui☆62Updated 3 weeks ago
- Repo of ACL 2025 main Paper "Quantification of Large Language Model Distillation"☆86Updated 2 weeks ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆103Updated 2 weeks ago
- Mixture-of-Experts (MoE) Language Model☆189Updated 8 months ago
- ☆53Updated 6 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆367Updated 3 weeks ago
- ☆48Updated 3 weeks ago
- ☆53Updated 3 months ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆78Updated 2 months ago
- Pytorch implementation of https://arxiv.org/html/2404.07143v1☆20Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆147Updated this week
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆186Updated this week
- ☆86Updated 7 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆35Updated 2 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆160Updated 3 weeks ago
- 我们是第一个完全可商用的角色大模型。☆40Updated 9 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- ☆40Updated last year
- Efficient Agent Training for Computer Use☆94Updated last week
- ☆358Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆248Updated 3 weeks ago