pretrain a wiki llm using transformers
☆63Sep 1, 2024Updated last year
Alternatives and similar repositories for transformers_from_scratch
Users that are interested in transformers_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated 2 months ago
- ☆13Jul 22, 2024Updated last year
- ChatGLM4微调简介☆22Apr 8, 2025Updated 11 months ago
- This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…☆20May 29, 2024Updated last year
- ☆18Aug 23, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Kernel sources for https://huggingface.co/kernels-community☆83Updated this week
- Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM☆19Feb 15, 2025Updated last year
- SwanLab Self-hosted Service | SwanLab 私有化部署服务☆40Mar 12, 2026Updated 2 weeks ago
- A roadmap of artificial intelligence☆17Sep 17, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Feb 5, 2024Updated 2 years ago
- A simple implementation of LoRA+: Efficient Low Rank Adaptation of Large Models☆10Mar 20, 2024Updated 2 years ago
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文(已附带中文摘要翻译)☆36Updated this week
- This is the official code for paper: [PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs]☆36Aug 31, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 电信大数据项目实战☆13Dec 27, 2018Updated 7 years ago
- 关键词抽取技术☆18Sep 11, 2019Updated 6 years ago
- 使用jupyter进行langchain的代码练习☆19Feb 18, 2024Updated 2 years ago
- ☆16Mar 30, 2023Updated 2 years ago
- A self-evolving personal AI assistant.☆36Mar 13, 2026Updated 2 weeks ago
- ☆57Jul 8, 2025Updated 8 months ago
- Dataset for the paper "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation"☆26Jan 2, 2024Updated 2 years ago
- 基于Llamaindex微调qwen2.5-7b☆38Dec 23, 2024Updated last year
- C++ implementation of tokenizers, including tiktoken.☆25Dec 7, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆16Mar 5, 2023Updated 3 years ago
- ☆45May 9, 2025Updated 10 months ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Multi-modal Knowledge Graph Convolutional Networks for Music Recommendation System☆22Aug 22, 2023Updated 2 years ago
- A simple website to manage your Hyper-V VMs and IIS sites☆12Jan 19, 2023Updated 3 years ago
- LLM as a Chatbot Service☆17Aug 28, 2023Updated 2 years ago
- 支持多种 Linux 发 行版的交互式/自动化 NVIDIA 驱动安装脚本☆47Mar 16, 2026Updated last week
- ☆11May 17, 2024Updated last year
- An Offline and Secure Retrieval-Augmented Generation (RAG) system designed for efficient processing of diverse content types with minimal…☆20Dec 29, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆18Nov 6, 2025Updated 4 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- Modify from https://github.com/ankush-me/SynthText.git to generate game style character☆17Feb 9, 2021Updated 5 years ago
- ☆11Aug 15, 2025Updated 7 months ago
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- Building a VLM model starts from the basic module.☆18Apr 7, 2024Updated last year
- benchmark of KgCLUE, with different models and methods☆28Dec 13, 2021Updated 4 years ago