foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
☆19Updated 3 months ago
Alternatives and similar repositories for fms-dgt
Users that are interested in fms-dgt are comparing it to the libraries listed below
Sorting:
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆194Updated this week
- Discovering Data-driven Hypotheses in the Wild☆80Updated 5 months ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713☆11Updated 4 years ago
- Dolomite Engine is a library for pretraining/finetuning LLMs☆53Updated this week
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆159Updated last year
- codebase release for EMNLP2023 paper publication☆19Updated last week
- A framework for few-shot evaluation of autoregressive language models.☆103Updated 2 years ago
- Conformal Language Modeling☆29Updated last year
- A dataset generator for family tree data.☆13Updated 5 years ago
- Official implementation of Inductive Logical Query Answering in Knowledge Graphs (NeurIPS 2022)☆47Updated 2 years ago
- ☆174Updated 2 years ago
- Efficient LLM inference on Slurm clusters using vLLM.☆62Updated this week
- ☆34Updated 4 months ago
- A benchmark that challenges language models to code solutions for scientific problems☆120Updated this week
- ☆177Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- ☆43Updated 2 years ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models☆72Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆147Updated 7 months ago
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers☆117Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆199Updated last week
- ☆16Updated last month
- ☆40Updated 3 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆40Updated 6 months ago
- ☆11Updated 2 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆134Updated 11 months ago
- ☆288Updated 10 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆127Updated last year
- Continuous Query Decomposition for Complex Query Answering in Incomplete Knowledge Graphs☆97Updated 2 years ago