ymoslem / MT-LM
Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper
☆15Updated last year
Alternatives and similar repositories for MT-LM:
Users that are interested in MT-LM are comparing it to the libraries listed below
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q …☆86Updated 11 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆23Updated 2 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆75Updated last year
- A Multilingual Replicable Instruction-Following Model☆94Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 11 months ago
- A unified versatile interface for dialogue datasets☆17Updated last year
- ☆42Updated 8 months ago
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆27Updated 2 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆19Updated 2 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- LogiTorch is a PyTorch-based library for logical reasoning on natural language☆70Updated 5 months ago
- Tools for managing datasets for governance and training.☆82Updated 2 weeks ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 10 months ago
- Apps built using Inspired Cognition's Critique.☆58Updated last year
- Reasoning by Communicating with Agents☆24Updated 4 months ago
- ☆23Updated 3 years ago
- ☆26Updated 6 months ago
- Open information and community for machine translation☆74Updated last week
- Adaptive Machine Translation with Large Language Models☆30Updated last month
- GEMBA — GPT Estimation Metric Based Assessment☆108Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- ☆49Updated last year
- ☆26Updated 2 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- MAFAND-MT☆55Updated 7 months ago
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆20Updated 2 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆20Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year