The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Mar 27, 2023Updated 2 years ago
Alternatives and similar repositories for Open-Llama
Users that are interested in Open-Llama are comparing it to the libraries listed below
Sorting:
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68May 9, 2023Updated 2 years ago
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- Multi AI Agents for Investment Risk Analysis☆14Jun 23, 2024Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆20Mar 18, 2025Updated 11 months ago
- This project explores the different techniques (both scalable and non scalable) for Graph based semi supervised learning. Recent techniqu…☆14May 28, 2016Updated 9 years ago
- Ongoing research training transformer models at scale☆18Jul 27, 2023Updated 2 years ago
- Best practice for training LLaMA models in Megatron-LM☆663Jan 2, 2024Updated 2 years ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Apr 7, 2024Updated last year
- ☆55Jan 3, 2025Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆53Sep 7, 2024Updated last year
- ☆27Jul 11, 2024Updated last year
- example for rendering charts with flask & echarts☆19May 17, 2018Updated 7 years ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆104Jul 20, 2023Updated 2 years ago
- ☆30May 20, 2022Updated 3 years ago
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆416Oct 21, 2023Updated 2 years ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 8 months ago
- 大语言模型指令调优工具(支持 FlashAttention)☆177Jan 4, 2024Updated 2 years ago
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants☆46Dec 11, 2025Updated 2 months ago
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆10Dec 21, 2025Updated 2 months ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆71Jul 11, 2023Updated 2 years ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆514May 20, 2024Updated last year
- Chinese Financial Assistant with Large Language Model☆78Sep 4, 2024Updated last year
- Implementation of Chinese ChatGPT☆289Nov 20, 2023Updated 2 years ago
- ☆281Jul 10, 2023Updated 2 years ago
- 文本去重☆78May 23, 2024Updated last year
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,410Sep 29, 2023Updated 2 years ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆59May 26, 2025Updated 9 months ago
- 中文 Instruction tuning datasets☆143Apr 10, 2024Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,437Mar 20, 2024Updated last year
- ☆34Sep 14, 2024Updated last year
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- Y-Agent Studio 是一个面向 企业级应用 的Agent开发套,Y-Agent是其中的核心模块。 包含了:支持智能体编排、RAG、流程日志、单元测试、流程测试、语料生产等垂直领域非常需要的功能。 智能体编排可以在同一个流程中,同时支持多智能体协作和流程混合编排…☆25Oct 4, 2025Updated 5 months ago
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆32Jul 5, 2021Updated 4 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集☆3,055Apr 14, 2024Updated last year
- Instruction Tuning with GPT-4☆4,341Jun 11, 2023Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago