foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
β18Updated 2 months ago
Alternatives and similar repositories for fms-dgt:
Users that are interested in fms-dgt are comparing it to the libraries listed below
- π¦ Unitxt: a python library for getting data fired up and set for training and evaluationβ187Updated this week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)β15Updated last month
- β106Updated 11 months ago
- Repository for "Detoxification with MaRCo: Controllable Revision with Experts and Anti-Experts"β9Updated last year
- A framework for few-shot evaluation of autoregressive language models.β104Updated last year
- β14Updated last week
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.β144Updated 6 months ago
- Utility to incrementally learn regular expressions from examplesβ27Updated 7 months ago
- Collection of evals for Inspect AIβ115Updated this week
- Knowledge-Aware RL agents with Commonsense Reasoningβ77Updated 3 years ago
- β34Updated 4 months ago
- SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as staβ¦β251Updated 3 months ago
- β82Updated 2 years ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Modelsβ65Updated last year
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713β11Updated 4 years ago
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.β75Updated this week
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasetsβ218Updated 5 months ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).β153Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language modelsβ126Updated last year
- Efficient LLM inference on Slurm clusters using vLLM.β57Updated this week
- Inspecting and Editing Knowledge Representations in Language Modelsβ115Updated last year
- β14Updated 3 weeks ago
- β49Updated 4 months ago
- β71Updated 2 months ago
- β257Updated this week
- The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap betwβ¦β13Updated 6 months ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.β177Updated 2 years ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.β732Updated 3 months ago
- β132Updated 5 months ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.β52Updated 3 weeks ago