The aim of this repository is to utilize LLaMA to reproduce and enhance the Stanford Alpaca
☆98Apr 5, 2023Updated 3 years ago
Alternatives and similar repositories for efficient_alpaca
Users that are interested in efficient_alpaca are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25May 10, 2024Updated 2 years ago
- The framework to prune LLMs to any size and any config.☆94Mar 1, 2024Updated 2 years ago
- This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.☆12Mar 11, 2024Updated 2 years ago
- ☆36Oct 14, 2022Updated 3 years ago
- ☆95Oct 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code and data for "Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change" (EMNLP2022)☆18Dec 8, 2022Updated 3 years ago
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- Diffusion Model Improvement Method☆35Sep 4, 2023Updated 2 years ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆26Oct 23, 2024Updated last year
- ☆188Jul 22, 2024Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆26Oct 10, 2025Updated 8 months ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ChatGLM-6B-Slim:裁减掉20K图片Token的ChatGLM-6B,完全一样的性能,占用更小的显存。☆124Apr 5, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆12Jun 13, 2025Updated last year
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- The source code of paper "An Effective System for Multi-format Information Extraction".☆18Aug 14, 2021Updated 4 years ago
- Using conversational games to evaluate powerful LLMs☆18Sep 3, 2023Updated 2 years ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- Long Context Research☆32Jan 26, 2026Updated 4 months ago
- ☆879May 24, 2024Updated 2 years ago
- 一种面向中文复杂问句的查询图生成方法,以及一份含有多种复杂句的中文知识图谱问答数据集☆18Mar 16, 2023Updated 3 years ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- ☆20May 14, 2025Updated last year
- ☆19Oct 13, 2025Updated 8 months ago
- ☆18Sep 26, 2020Updated 5 years ago
- Code for "Mixed Cross Entropy Loss for Neural Machine Translation"☆20Jul 23, 2021Updated 4 years ago
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆50Mar 1, 2025Updated last year
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,605Aug 30, 2023Updated 2 years ago
- ☆45Mar 6, 2026Updated 3 months ago
- Code for Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model, IJCAI 2020☆12Nov 26, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆97Dec 6, 2024Updated last year
- Named Entity Oriented Sentiment Analysis Task for mass-media texts☆11May 22, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
- Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 3 years ago
- ☆152Jan 31, 2024Updated 2 years ago