☆526Apr 29, 2024Updated 2 years ago
Alternatives and similar repositories for Transformer-from-scratch
Users that are interested in Transformer-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Personal website for studying LLMs☆31Feb 22, 2024Updated 2 years ago
- 《Reinforcement Learning》读书学习与视频分享笔记☆79Apr 1, 2025Updated last year
- 训练自己的中文 Embedding 模型☆30Jan 6, 2025Updated last year
- ☆85Feb 3, 2025Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆2,502Oct 19, 2025Updated 6 months ago
- 复现大模型相关算法及一些学习记录☆3,314Mar 21, 2026Updated last month
- ☆20Apr 7, 2024Updated 2 years ago
- REAL-TIME EMOTION RECOGNITION FROM EEG SIGNALS USING ONE ELECTRODE DEVICE☆10Feb 23, 2021Updated 5 years ago
- 从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!☆2,161Nov 22, 2025Updated 5 months ago
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated 2 years ago
- Retriever-0.1B☆96Jun 6, 2024Updated last year
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated last year
- I created some notebooks about different concepts of financial engineering☆12Sep 28, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- Zero-human, cold-start construction of long-chain agents in professional domains☆48Nov 10, 2025Updated 5 months ago
- 基于gradio的极简 ragflow API 聊天Web界面☆18Mar 31, 2025Updated last year
- LightRAG与GraphRAG在索引构建、检索测试中的耗时、模型请求次数、Token消耗金额、检索质量等方面进行对比☆169Dec 1, 2024Updated last year
- 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程☆30,170Apr 24, 2026Updated last week
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆4,781Feb 12, 2026Updated 2 months ago
- Llama3-Tutorial(XTuner、LMDeploy、OpenCompass)☆509May 10, 2024Updated last year
- Code repo for MathAgent☆20Dec 15, 2023Updated 2 years ago
- NeurIPS 2022 paper, SubHypergraph Inductive Neural nEtwork☆19Aug 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Voice calls from Twilio with Gemini's Live API with a nice streaming pipeline☆18Aug 27, 2025Updated 8 months ago
- 🚀🚀 「大模型」2小时完全从0训练64M的小参数GPT!🌏 Train a 64M-parameter GPT from scratch in just 2h!☆48,315Apr 24, 2026Updated last week
- ☆29Apr 16, 2026Updated 2 weeks ago
- EEG-based Emotion Recognition using Deep Reinforcement Learning☆17Dec 31, 2023Updated 2 years ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆91,680Apr 16, 2026Updated 2 weeks ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆501May 1, 2025Updated last year
- ☆10Apr 15, 2017Updated 9 years ago
- 基于Raft一致性协议的分布式存储系统,参考阿里巴巴SOFAJRaft并使用Java从零实现。Distributed storage system based on Raft consistency protocol, referencing Alibaba SOFAJRa…☆20Dec 14, 2022Updated 3 years ago
- Building LLaMA 4 MoE from Scratch☆73Apr 15, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🤖🗺️ Headless browser scraper written in python to extract Places data from Google Maps.☆22Apr 15, 2026Updated 2 weeks ago
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- [ACL 2023] MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition☆20Jul 21, 2023Updated 2 years ago
- 这是一个用于与 RAGflow API 交互的 Python 客户端,支持数据集管理、文件管理、分块管理、聊天助手管理以及代理管理的完整功能。☆21Feb 21, 2025Updated last year
- A simple yet effective Python email scraper☆19Jan 11, 2024Updated 2 years ago
- 🚀 「大模型」2小时从0训练65M参数的视觉多模态VLM!🌏 Train a 65M-parameter VLM from scratch in just 2 hours!☆7,685Updated this week
- AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术☆16,685Sep 3, 2025Updated 7 months ago