fe1ixxu / ALMA
State-of-the-art LLM-based translation models.
☆423Updated last month
Related projects ⓘ
Alternatives and complementary repositories for ALMA
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆217Updated 11 months ago
- ☆218Updated 5 months ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆168Updated last year
- ☆445Updated last week
- [ACL 2024] Progressive LLaMA with Block Expansion.☆479Updated 5 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆381Updated 5 months ago
- A bagel, with everything.☆312Updated 6 months ago
- An Open Source Toolkit For LLM Distillation☆350Updated last month
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆86Updated last year
- The FLORES+ Machine Translation Benchmark☆99Updated 2 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆553Updated 8 months ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆433Updated 6 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆194Updated 6 months ago
- Implementation of DoRA☆282Updated 5 months ago
- Codebase for Merging Language Models (ICML 2024)☆765Updated 6 months ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆476Updated this week
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆435Updated 7 months ago
- Low-Rank adapter extraction for fine-tuned transformers model☆162Updated 6 months ago
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆275Updated 2 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆179Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆171Updated 3 months ago
- Chat Templates for 🤗 HuggingFace Large Language Models☆528Updated last week
- Generative Representational Instruction Tuning☆562Updated this week
- FuseAI Project☆448Updated 2 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 6 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆701Updated this week
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆528Updated 7 months ago
- ☆294Updated 5 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆250Updated last month