fe1ixxu / ALMA
State-of-the-art LLM-based translation models.
☆437Updated last month
Related projects ⓘ
Alternatives and complementary repositories for ALMA
- ☆221Updated 5 months ago
- BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages☆219Updated last year
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆168Updated last year
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆558Updated 8 months ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆435Updated 7 months ago
- The FLORES+ Machine Translation Benchmark☆99Updated last week
- ☆451Updated 3 weeks ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆384Updated 6 months ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆491Updated 2 weeks ago
- ☆200Updated 4 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆182Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆86Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆478Updated 6 months ago
- A bagel, with everything.☆312Updated 7 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆647Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆714Updated 2 weeks ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆478Updated 6 months ago
- ☆295Updated 5 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆438Updated 8 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆142Updated 9 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆277Updated 2 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆613Updated 5 months ago
- All-in-one text de-duplication☆622Updated 6 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆196Updated 6 months ago
- Official repository for LongChat and LongEval☆512Updated 5 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆124Updated 4 months ago
- Generative Representational Instruction Tuning☆567Updated this week
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆528Updated 8 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆91Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆306Updated last year