The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆69May 9, 2023Updated 2 years ago
Alternatives and similar repositories for Open-Llama
Users that are interested in Open-Llama are comparing it to the libraries listed below.
- Train LLaMA on a single A100 80G node using 🤗 transformers and 🚀 DeepSpeed Pipeline Parallelism☆224Nov 21, 2023Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆68Mar 27, 2023Updated 3 years ago
- An operations and maintenance management platform for developers (written with ASP.NET Core Blazor 5)☆11Feb 18, 2024Updated 2 years ago
- This is an example of creating an AI agent with a flowchart☆12Jul 22, 2024Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- Rust implementation of Surya☆66Mar 1, 2025Updated last year
- Code for GAIIC-Track1☆15Jun 14, 2022Updated 3 years ago
- ☆23May 25, 2022Updated 3 years ago
- TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. Text generation models, with implementations including LLaMA, ChatGLM, BLO…☆980Sep 14, 2024Updated last year
- User-Centric Conversational Recommendation with Multi-Aspect User Modeling (UCCR)☆39Jul 7, 2022Updated 3 years ago
- Research on the anonymization of medical data☆12Jul 20, 2015Updated 10 years ago
- Document Artificial Intelligence☆202Sep 28, 2025Updated 6 months ago
- Image captioning with weight pruning in PyTorch☆22Jan 14, 2022Updated 4 years ago
- A Python toolkit for analyzing machine learning models and datasets.☆79Sep 8, 2023Updated 2 years ago
- ☆85Jan 15, 2024Updated 2 years ago
- ssc-FinLLM: a large language model for finance☆27Apr 22, 2024Updated last year
- EMNLP-2021 paper: Thinking Clearly, Talking Fast: Concept-Guided Non-Autoregressive Generation for Open-Domain Dialogue Systems.☆16Nov 11, 2021Updated 4 years ago
- ☆11May 24, 2023Updated 2 years ago
- ☆164Apr 17, 2023Updated 2 years ago
- 3rd Place solution for Feedback Prize - Predicting Effective Arguments Kaggle competition☆16Sep 6, 2022Updated 3 years ago
- 31st place silver medal solution to USPPPM Kaggle competition☆20Jun 23, 2022Updated 3 years ago
- ☆20Jan 6, 2023Updated 3 years ago
- ☆206Updated this week
- A useful tool for building multi-agent systems in an easy way☆66Feb 19, 2025Updated last year
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- Spanish short-text matching competition; ranked 8/1027 in the preliminary round and 5/1027 in the second round☆19Aug 1, 2018Updated 7 years ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆110Jul 29, 2025Updated 8 months ago
- KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation☆30Aug 31, 2021Updated 4 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- A purer tokenizer with a higher compression rate☆488Nov 27, 2024Updated last year
- AIGC evals☆10Dec 2, 2023Updated 2 years ago
- Reasoning by Communicating with Agents☆28Apr 29, 2025Updated 11 months ago
- ☆24Apr 4, 2022Updated 4 years ago
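- WiNGPT is a GPT-based large model for the medical vertical domain. It aims to integrate professional medical knowledge, healthcare information, and data to provide the healthcare industry with intelligent services such as medical Q&A, diagnostic support, and medical knowledge, improving the efficiency of diagnosis and treatment and the quality of medical services.☆425Nov 28, 2024Updated last year
- Notes on computer-related knowledge☆10Updated this week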
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆12Apr 26, 2020Updated 5 years ago
- An Innovative Gaming Platform that Integrates Digital Games with Physical Fitness.☆15May 22, 2021Updated 4 years ago
- ☆22Mar 3, 2022Updated 4 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Mar 16, 2018Updated 8 years ago