Eric-is-good / pretrain-LLM-from-scratchLinks
从0训练类 o1 大语言模型。
☆132Updated 2 weeks ago
Alternatives and similar repositories for pretrain-LLM-from-scratch
Users that are interested in pretrain-LLM-from-scratch are comparing it to the libraries listed below
Sorting:
- Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.☆739Updated 2 months ago
- 框架核心是两阶段“粗筛-精滤”数据清洗流程。首先,利用CLIP的多门控决策逻辑进行宏观粗筛,精准剔除插画、图表等非摄影类噪声。随后,利用DINOv2的细粒度特征,创新采用“相对边际分数”识别处于类别边界的混淆样本,并结合GMM模型为各类别动态确定清洗标准。整个流程内置最小样…☆213Updated last month
- ☆333Updated 2 months ago
- 智川x-agent☆1,083Updated 4 months ago
- ☆497Updated 3 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆387Updated 2 weeks ago
- The Python implementation of some deep text hashing (also called deep semantic hashing) Models☆80Updated 3 weeks ago
- Translate PDF, Word, PowerPoint, etc. | zotero翻译插件,微信扫码注册,新用户可免费翻译25万汉字或100万个英文字母。超能文献官网:suppr.wilddata.cn;☆658Updated last month
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆557Updated this week
- 职星学院企业培训系统是一套基于点播、直播、考试、培训、面授等功能完善的在线教育系统,开源版是基于商业版精简实现的一个企业员工培训系统,致力于打造一个各行业都适用的在线培训系统、企业培训平台、员工培训系统、企业内部培训系统。☆534Updated 6 months ago
- Advanced Quantitative Factor Research: ML-powered stock return prediction with 72% performance improvement. Features comprehensive alpha …☆373Updated 4 months ago
- Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media via LLama- Based Modeling☆822Updated 2 months ago
- ☆515Updated 9 months ago
- Synthetic Data Generation Platform By DataArcTech☆385Updated this week
- vue3+pinia+vue-router+elementPlus+vite7☆160Updated last month
- efflux-desktop-ui☆299Updated 5 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆248Updated last month
- Welcome to BlockSeek's official documentation. BlockSeek combines state-of-the-art AI with blockchain technology to revolutionize cryptoc…☆310Updated 10 months ago
- next easy report☆470Updated 3 weeks ago
- ☆81Updated 3 weeks ago
- 双版本markitdown:Java命令行;Python Web☆233Updated last month
- Minimalist ML framework for Go.☆184Updated last month
- A curated list of Model Context Protocol (MCP) servers☆506Updated 2 weeks ago
- Efflux desktop service☆366Updated 5 months ago
- A solution that makes it easy to connect ESP32 devices to Home Assistant, provided by Seeed Studio.☆172Updated last week
- Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning☆164Updated last month
- AI-powered legal compliance assistant for alcohol beverage pricing laws — extracts, analyzes, and explains New York state-level regulatio…☆306Updated last month
- [NeurIPS 25] Official Implementation of TPP-SD: Accelerating Transformer Point Process Sampling with Speculative Decoding☆48Updated last month
- A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.☆432Updated last month
- ☆453Updated 7 months ago