This repository uses LLaMA to reproduce and enhance Stanford Alpaca.
☆98Apr 5, 2023Updated 2 years ago
Alternatives and similar repositories for efficient_alpaca
Users who are interested in efficient_alpaca are comparing it to the libraries listed below.
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25May 10, 2024Updated last year
- The framework to prune LLMs to any size and any config.☆95Mar 1, 2024Updated 2 years ago
- This is the codebase for pre-training, compressing, extending, and distilling LLMs with Megatron-LM.☆12Mar 11, 2024Updated last year
- ☆96Oct 8, 2023Updated 2 years ago
- [ACL 2024 Findings] Code implementation of Paper "Rethinking Negative Instances for Generative Named Entity Recognition"☆60Mar 20, 2024Updated last year
- ☆36Oct 14, 2022Updated 3 years ago
- ChatGLM-6B-Slim: ChatGLM-6B with the 20K image tokens pruned away; identical performance with a smaller GPU memory footprint.☆125Apr 5, 2023Updated 2 years ago
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- ☆18Mar 10, 2023Updated 2 years ago
- Code and data for "Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change" (EMNLP2022)☆18Dec 8, 2022Updated 3 years ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 4 months ago
- Diffusion Model Improvement Method☆35Sep 4, 2023Updated 2 years ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆25Oct 23, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆187Jul 22, 2024Updated last year
- Encoder-decoders for translating different chemical formats.☆18Sep 17, 2025Updated 5 months ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 4 years ago
- Soochow University graduate thesis TeX template☆17Updated this week
- ☆13May 23, 2025Updated 9 months ago
- ☆14Apr 16, 2024Updated last year
- Implementation of Cascaded Head-colliding Attention (ACL'2021)☆11Sep 16, 2021Updated 4 years ago
- A DP beam-search extension of Mitchell Stern's span-based neural constituency parser☆11Aug 24, 2022Updated 3 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated last year
- The implementation for "Open Relation Modeling: Learning to Define Relations between Entities" (Findings of ACL '22)☆12Feb 28, 2022Updated 4 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 2 years ago
- Code for NeurIPS 2022 Spotlight paper "Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"☆20Nov 16, 2022Updated 3 years ago
- Second Order Implementation of Hidden Markov Model for Tagging.☆15Mar 17, 2022Updated 3 years ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- SMiLER - Samsung MultiLingual Entity and Relation Extraction dataset☆18Feb 11, 2021Updated 5 years ago
- Biaffine Dependency Parser, implemented in PyTorch.☆12Feb 19, 2018Updated 8 years ago
- ☆41Mar 8, 2021Updated 4 years ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- ☆21Sep 6, 2021Updated 4 years ago
- ☆18Sep 26, 2020Updated 5 years ago
- 🪞A powerful toolkit for almost all the Information Extraction tasks.☆124Apr 21, 2025Updated 10 months ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago