Eric-is-good / pretrain-LLM-from-scratch
Train an o1-like large language model from scratch.
☆26Updated last week
Alternatives and similar repositories for pretrain-LLM-from-scratch
Users that are interested in pretrain-LLM-from-scratch are comparing it to the libraries listed below
- ☆189Updated 2 weeks ago
- [TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchma…☆831Updated this week
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆238Updated 2 months ago
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆775Updated 2 months ago
- [TMLR'25] The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆51Updated 7 months ago
- ☆193Updated last month
- Marco Search Agent for Realistic and Challenging Agentic Search☆237Updated last month
- One-stop data intelligence agent, providing insights from all mainstream data formats in a single dialogue box, including documents, data…☆538Updated last year
- ☆356Updated 5 months ago
- A problem-solving and interview assistant that uses AI to provide real-time solution approaches and answers during coding tests and interviews.☆242Updated 2 weeks ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆426Updated 2 months ago
- A database operations and data analysis AI agent☆430Updated 3 months ago
- JittorGeometric is a Jittor-based graph machine learning library.☆416Updated 3 months ago
- A tool for translating the content of LaTeX documents into various other natural languages (e.g., translating an arXiv paper from English…☆415Updated last month
- This project is designed to evaluate the effectiveness of DeepClaude and other combination models.☆41Updated 8 months ago
- ☆126Updated last month
- A Chinese medical RAG project built with langchain + milvus, supporting fast one-click deployment and seamless domain transfer.☆181Updated 2 months ago
- ☆1,115Updated 4 months ago
- Source code of LogicRAG at AAAI'26.☆145Updated last week
- 智川x-agent☆1,081Updated 3 months ago
- Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.☆737Updated last month
- CivAgent is an LLM-based Human-like Agent acting as a Digital Player within the Strategy Game Unciv.☆134Updated 8 months ago
- ☆174Updated 2 months ago
- An upgraded version of DeepClaude Rust☆209Updated 7 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated last year
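- A comprehensive platform for analyzing startup company data, including a crawler system, data analysis tools, a startup-evaluation AI model, a web client, and a mini-program client☆117Updated 6 months ago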
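- Data Annotation is a tool dedicated to processing and annotating text data. Through a streamlined text-annotation workflow and dynamic algorithmic feedback, it lets users annotate keywords quickly while the algorithm continually reduces the cost and time of manual annotation. The process starts with manual annotation to build a base, then automatic annotation feeds back into manual annotation, and finally manual annotation corrects deviations, greatly improving annotation accuracy and…☆694Updated 5 months ago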
- 🧠 Prometheus: A Knowledge-Graph-Driven 🤖 AI Agent that maps 🗺, understands 🧩, and repairs 🛠 complex codebases — not by guessing, but…☆430Updated last week
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆141Updated this week
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆509Updated 4 months ago