pretrain a wiki llm using transformers
☆66Sep 1, 2024Updated last year
Alternatives and similar repositories for transformers_from_scratch
Users that are interested in transformers_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jul 22, 2024Updated last year
- Smart LLM/Agent Management in One Line of Code☆21Mar 22, 2026Updated last month
- ☆18Aug 23, 2022Updated 3 years ago
- SwanLab Self-hosted Service | SwanLab 私有化部署服务☆43Apr 28, 2026Updated last week
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Feb 5, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)☆37Updated this week
- 零实现 AlphaGo Zero☆17Nov 10, 2024Updated last year
- Kernel sources for https://huggingface.co/kernels-community☆106Updated this week
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- This is the official code for paper: [PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs]☆36Aug 31, 2024Updated last year
- BUAA Computer Organization 北航 计组☆12Apr 10, 2022Updated 4 years ago
- 关键词抽取技术☆18Sep 11, 2019Updated 6 years ago
- ☆34Jul 8, 2025Updated 9 months ago
- A self-evolving personal AI assistant.☆36Mar 13, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated 2 years ago
- Implementation of "Pre-training Graph Transformer with Multimodal Side Information for Recommendation"☆17Mar 17, 2022Updated 4 years ago
- 本项目分别电商数据统计模块及业务采集及数仓搭建模块,利用hive统计每个区域热门商品进行统计;依据业务数据实现离线业务数仓搭建。☆22Mar 2, 2022Updated 4 years ago
- Dataset for the paper "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation"☆26Jan 2, 2024Updated 2 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial)☆11Jul 19, 2024Updated last year
- The inference implementation of the deeplabV3+ person segementation algorithm.☆24Jan 1, 2021Updated 5 years ago
- 斯坦福小镇中国版,使用本地模型部署,提示工程中文化,简化流程☆55Oct 16, 2025Updated 6 months ago
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Android hair segmentation demo by ncnn☆29Jul 9, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 生僻字OCR识别优化训练☆16Feb 16, 2023Updated 3 years ago
- deprecated, use https://github.com/octohelm/piper instead.☆14Sep 3, 2024Updated last year
- ☆16Mar 5, 2023Updated 3 years ago
- ☆45May 9, 2025Updated 11 months ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Multi-modal Knowledge Graph Convolutional Networks for Music Recommendation System☆22Aug 22, 2023Updated 2 years ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- [NeurIPS'25 Spotlight🔥]Official implementation of Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Ma…☆32Apr 13, 2026Updated 3 weeks ago
- ☆22Jan 26, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- AI Challenger 2018 细粒度用户评论情感分析比赛 个人baseline项目☆15Oct 3, 2018Updated 7 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- ☆11May 17, 2024Updated last year
- Codes for Operating System course.☆21Feb 20, 2025Updated last year
- parser for a microsoft .ini format file & java .properties file in golang☆13Jul 8, 2018Updated 7 years ago
- ☆20Nov 6, 2025Updated 6 months ago