beichao1314/Open-Llama

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

☆68

Alternatives and similar repositories for Open-Llama

Users that are interested in Open-Llama are comparing it to the libraries listed below.

RapidAI / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆69 · May 9, 2023 · Updated 3 years ago
ifromeast / LLMTrainer
A comparison of pretraining frameworks for LLMs
☆22 · Feb 6, 2025 · Updated last year
huggingface / Megatron-LM
Ongoing research training transformer models at scale
☆18 · Jul 27, 2023 · Updated 2 years ago
MoFHeka / LLaMA-Megatron
A LLaMA1/LLaMA2 Megatron implementation.
☆28 · Dec 13, 2023 · Updated 2 years ago
StefanHeng / ProgGen
Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"
☆17 · Mar 29, 2024 · Updated 2 years ago
manueldeprada / Pretraining-T5-PyTorch-Lightning
Collection of scripts to pretrain T5 on unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as an example.
☆32 · Apr 26, 2021 · Updated 5 years ago
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆665 · Jan 2, 2024 · Updated 2 years ago
ashishkssingh / Anomaly-Detection-SH-ESD
Anomaly Detection using SH-ESD
☆10 · Feb 6, 2019 · Updated 7 years ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
RQLuo / MixTeX-DataHub
LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotation…
☆12 · Aug 13, 2024 · Updated last year
plm-team / PLM
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
☆21 · Mar 18, 2025 · Updated last year
HuangLK / transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
☆224 · Nov 21, 2023 · Updated 2 years ago
tjunlp-lab / M3KE
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
☆106 · Jul 20, 2023 · Updated 2 years ago
USTC-StarTeam / ZIP
☆28 · Jul 11, 2024 · Updated last year
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆15 · Jun 27, 2024 · Updated last year
xv44586 / Chinese-instruction-datasets
Chinese instruction tuning datasets
☆143 · Apr 10, 2024 · Updated 2 years ago
NovelAI / k-diffusion-multigen
Karras et al. (2022) diffusion models for PyTorch
☆17 · Oct 5, 2023 · Updated 2 years ago
sunzeyeah / RLHF
Implementation of Chinese ChatGPT
☆287 · Nov 20, 2023 · Updated 2 years ago
Dicer-Zz / EPI
Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation
☆13 · May 16, 2023 · Updated 3 years ago
HarderThenHarder / transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…
☆2,423 · Sep 29, 2023 · Updated 2 years ago
hint-lab / doctrack
Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"
☆11 · Oct 25, 2023 · Updated 2 years ago
ControlNet / HYDRA
[ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning
☆26 · Sep 6, 2025 · Updated 8 months ago
GX-XinGao / GRA
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆34 · Jun 13, 2025 · Updated 11 months ago
Bai-YT / AdaptiveSmoothing
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10 · Feb 6, 2024 · Updated 2 years ago
aplmikex / deduplication_mnbvc
Text deduplication
☆77 · May 23, 2024 · Updated 2 years ago
earth2observe / downscaling-tools
Python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis
☆10 · Nov 21, 2017 · Updated 8 years ago
supersymmetry-technologies / BBT-FinCUGE-Applications
☆284 · Jul 10, 2023 · Updated 2 years ago
swtheing / PF-PPO-RLHF
☆34 · Sep 14, 2024 · Updated last year
chatgpt-plus / chatgpt-plus.github.io
☆11 · May 25, 2026 · Updated last week
fabiomatricardi / Deepseek-R1-qwen1.5B
How to run DeepSeek-R1-Distill-Qwen-1.5B GGUF locally on your PC
☆28 · Jan 24, 2025 · Updated last year
yangjianxin1 / Firefly-LLaMA2-Chinese
Firefly Chinese LLaMA-2 large model; supports incremental pre-training of large models such as Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, and Bloom
☆414 · Oct 21, 2023 · Updated 2 years ago
CVI-SZU / Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
☆3,051 · Apr 14, 2024 · Updated 2 years ago
dongxiaohuang / TextClassifier_Transformer
A personal text classifier built on Google's open-source BERT (using fine-tuning); it can freely load well-known pretrained NLP language models: BERT, Bert-wwm, Roberta, ALBert, and ERNIE1.0
☆42 · May 6, 2020 · Updated 6 years ago
llyx97 / Rosita
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14 · Oct 18, 2022 · Updated 3 years ago
keep-smile-001 / opentqa
opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, and mutan.
☆11 · Mar 27, 2021 · Updated 5 years ago
RL10x / RetNet
An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11 · Jul 25, 2023 · Updated 2 years ago
ShiZhengyan / InstructionModelling
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
☆38 · May 24, 2024 · Updated 2 years ago
percent4 / Keras_R_BERT
This project implements R-BERT in Keras and tests and validates it on a person-relation dataset.
☆10 · Apr 17, 2021 · Updated 5 years ago
zhongwanjun / CARP
Code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…
☆12 · Sep 16, 2022 · Updated 3 years ago