Pretrain a wiki LLM using transformers
☆65Sep 1, 2024Updated last year
Alternatives and similar repositories for transformers_from_scratch
Users who are interested in transformers_from_scratch are comparing it to the libraries listed below.
- An introduction to fine-tuning ChatGLM4☆23Apr 8, 2025Updated last year
- This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…☆21May 29, 2024Updated last year
- Train a deepseek r1-like reasoning LLM with ease☆19Feb 15, 2025Updated last year
- SwanLab Self-hosted Service☆42Apr 8, 2026Updated last week
- A roadmap of artificial intelligence☆17Sep 17, 2022Updated 3 years ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Feb 5, 2024Updated 2 years ago
- A simple implementation of LoRA+: Efficient Low Rank Adaptation of Large Models☆10Mar 20, 2024Updated 2 years ago
- 🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours); Chinese translations of the abstracts are included☆37Updated this week
- Implementing AlphaGo Zero from scratch☆17Nov 10, 2024Updated last year
- A beginner's tutorial on model compression☆22Jul 7, 2024Updated last year
- Keyword extraction techniques☆18Sep 11, 2019Updated 6 years ago
- langchain coding exercises using jupyter☆19Feb 18, 2024Updated 2 years ago
- ☆34Jul 8, 2025Updated 9 months ago
- A curated collection of large-model interview questions☆12Aug 29, 2024Updated last year
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
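- A tensorflow implementation of SimCSE, with baseline experiment comparisons☆13Jul 22, 2021Updated 4 years ago
- Fine-tuning an insurance-knowledge large model based on internlm-chat-7b☆20Apr 26, 2024Updated last year
- This project consists of an e-commerce data statistics module and a business data collection and data warehouse construction module. It uses hive to compute the most popular products in each region and builds an offline business data warehouse from the business data.☆21Mar 2, 2022Updated 4 years ago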
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Storage Performance Development Kit☆11Apr 6, 2026Updated last week
- deprecated, use https://github.com/octohelm/piper instead.☆14Sep 3, 2024Updated last year
- ☆16Mar 5, 2023Updated 3 years ago
- ☆45May 9, 2025Updated 11 months ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- LLM as a Chatbot Service☆17Aug 28, 2023Updated 2 years ago
- Force google.com (aka Google No Country Redirect) when searching in Google Chrome Omnibox☆10Nov 30, 2016Updated 9 years ago
- A personal baseline project for the AI Challenger 2018 fine-grained user review sentiment analysis competition☆15Oct 3, 2018Updated 7 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year
- ☆11May 17, 2024Updated last year
- KETTLE ETL repository for data warehousing☆14Jun 11, 2015Updated 10 years ago
- ☆19Nov 6, 2025Updated 5 months ago
- parser for a microsoft .ini format file & java .properties file in golang☆13Jul 8, 2018Updated 7 years ago
- Modified from https://github.com/ankush-me/SynthText.git to generate game-style characters☆17Feb 9, 2021Updated 5 years ago
- something for paper agent☆11Dec 18, 2024Updated last year
- ☆11Aug 15, 2025Updated 8 months ago
- Example of Langchain-Elasticsearch integrations & RAG.☆12Sep 20, 2024Updated last year
- A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTor…☆15Feb 27, 2024Updated 2 years ago
- A dual-chatbot system for learning languages based on LangChain☆13Jun 25, 2023Updated 2 years ago