Instruct-tuning LLaMA on consumer hardware with machine-translated data
☆19Apr 17, 2023Updated 3 years ago
Alternatives and similar repositories for alpaca-lora-mt
Users that are interested in alpaca-lora-mt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library of translation-based text similarity measures☆25Dec 11, 2023Updated 2 years ago
- Data from the publication "Multi-Domain Goal-Oriented Dialogues (MultiDoGO): Strategies toward Curating and Annotating Large Scale Dialog…☆25Dec 3, 2020Updated 5 years ago
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆13Nov 21, 2023Updated 2 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated 2 weeks ago
- ☆26May 30, 2023Updated 2 years ago
- [Under Progress] Code & Data for the AAAI 2020 Paper "Likelihood Ratios and Generative Classifiers For Unsupervised OOD Detection In Task…☆10Jul 25, 2024Updated last year
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 4 years ago
- [Reproduce] Code for the EMNLP2018 paper "A Visual Attention Grounding Neural Model for Multimodal Machine Translation".☆11Jan 19, 2020Updated 6 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Aug 25, 2020Updated 5 years ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- Code for paper "Out-of-domain detection for natural language understanding in dialog systems"☆10May 27, 2022Updated 3 years ago
- Anh - LAION's multilingual assistant datasets and models☆28Apr 5, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- pytorch实现bert做seq2seq任务,使用unilm方案。☆10Apr 1, 2020Updated 6 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- ☆35Jun 15, 2023Updated 2 years ago
- Pytorch based BERT, mBART and NMT training☆15Jul 30, 2025Updated 9 months ago
- This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search …☆21Apr 30, 2021Updated 5 years ago
- Repository for Giuseppe Russo's master thesis code.☆13Oct 2, 2020Updated 5 years ago
- Cross-lingual Visual Pre-training for Multimodal Machine Translation☆18Dec 28, 2021Updated 4 years ago
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated 2 years ago
- 历年CSP考试 题解☆14Apr 14, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- play with algoritms☆17Oct 4, 2019Updated 6 years ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- Official repository for the paper "GN-Transformer: Fusing AST and Source Code information in Graph Networks".☆17May 25, 2025Updated 11 months ago
- ☆14May 7, 2019Updated 6 years ago
- Source Code for <Target-Side Data Augmentation for Sequence Generation>☆12Oct 6, 2021Updated 4 years ago
- Transformer-based approaches for an efficient docstrings generation on a piece of Python's code.☆17Feb 16, 2026Updated 2 months ago
- ☆20Mar 22, 2024Updated 2 years ago
- Code for paper "Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning"☆20Sep 6, 2021Updated 4 years ago
- ☆16Jul 23, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …☆13Dec 1, 2022Updated 3 years ago
- Larger-Context NMT☆13Aug 20, 2017Updated 8 years ago
- Sum up the pages of all pdf files in a directory☆15Apr 7, 2020Updated 6 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago
- A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling☆23Jul 31, 2021Updated 4 years ago
- ☆12Feb 11, 2026Updated 2 months ago
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆11Nov 23, 2023Updated 2 years ago