The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆69May 9, 2023Updated 2 years ago
Alternatives and similar repositories for Open-Llama
Users that are interested in Open-Llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆224Nov 21, 2023Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Mar 27, 2023Updated 3 years ago
- 适合于开发人员的运维管理平台(基于ASP.NET Core Blazor 5语言编写)☆10Feb 18, 2024Updated 2 years ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).☆17Mar 8, 2022Updated 4 years ago
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Explore 160+ notebook visual analytics tools in your browser!☆67Mar 29, 2024Updated 2 years ago
- Record everyday Top GPTs in ChatGPT GPTs Store☆64Feb 20, 2025Updated last year
- Rust implementation of Surya☆66Mar 1, 2025Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆76Feb 23, 2024Updated 2 years ago
- codes for GAIIC-Track1☆15Jun 14, 2022Updated 3 years ago
- ☆23May 25, 2022Updated 3 years ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆979Sep 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jul 15, 2021Updated 4 years ago
- User-Centric Conversational Recommendation with Multi-Aspect User Modeling (UCCR)☆39Jul 7, 2022Updated 3 years ago
- Document Artifical Intelligence☆202Sep 28, 2025Updated 7 months ago
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- A Python toolkit for analyzing machine learning models and datasets.☆79Sep 8, 2023Updated 2 years ago
- ☆85Jan 15, 2024Updated 2 years ago
- a collection of resources around LLMs, aggregated for the workshop "Mastering LLMs: End-to-End Fine-Tuning and Deployment" by Dan Becker …☆110May 31, 2024Updated last year
- ssc-FinLLM-金融大模型☆27Apr 22, 2024Updated 2 years ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆164Apr 17, 2023Updated 3 years ago
- 3rd Place solution for Feedback Prize - Predicting Effective Arguments Kaggle competition☆16Sep 6, 2022Updated 3 years ago
- 一句话描述你想要的工作流,自动生成 Coze / Dify / ComfyUI 平台可直接导入的完整工作流定义文件——包括节点配置、连线、布局和所有平台特有的格式要求。☆63Apr 10, 2026Updated 3 weeks ago
- ☆20Jan 6, 2023Updated 3 years ago
- NeurIPS 2024 tutorial on LLM Inference☆49Dec 10, 2024Updated last year
- Useful tool to build multi-agent in an easy way☆66Feb 19, 2025Updated last year
- 西班牙短文本匹配比赛,初赛8/1027,复赛5/1027☆19Aug 1, 2018Updated 7 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- 更纯粹、更高压缩率的Tokenizer☆488Nov 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Reasoning by Communicating with Agents☆29Apr 29, 2025Updated last year
- GPT-Analyst: A GPT for GPT analysis and reverse engineering☆204Mar 14, 2024Updated 2 years ago
- ☆29Jul 4, 2025Updated 10 months ago
- ☆26Aug 16, 2021Updated 4 years ago
- WiNGPT是一个基于GPT的医疗垂直领域大模型,旨在将专业的医学知识、医疗信息、数据融会贯通,为医疗行业提供智能化的医疗问答、诊断支持和医学知识等信息服务,提高诊疗效率和医疗服务质量。☆425Nov 28, 2024Updated last year
- ☆10May 20, 2019Updated 6 years ago