foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
☆18 Updated 2 months ago
Alternatives and similar repositories for fms-dgt
Users who are interested in fms-dgt are comparing it to the libraries listed below:
- 🦄 Unitxt: a Python library for getting data fired up and set for training and evaluation ☆182 Updated this week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD) ☆14 Updated last month
- [arXiv] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees ☆13 Updated 3 weeks ago
- Discovering Data-driven Hypotheses in the Wild ☆65 Updated 4 months ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆38 Updated last week
- ☆284 Updated 9 months ago
- Efficient LLM inference on Slurm clusters using vLLM. ☆53 Updated this week
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆141 Updated 5 months ago
- ☆68 Updated last year
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆733 Updated 2 months ago
- ☆33 Updated 3 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆39 Updated 5 months ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models ☆65 Updated last year
- A framework for few-shot evaluation of autoregressive language models. ☆103 Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆157 Updated 10 months ago
- STREET: a multi-task and multi-step reasoning dataset ☆22 Updated last year
- ☆104 Updated 11 months ago
- ☆39 Updated 2 years ago
- Knowledge-Aware RL agents with Commonsense Reasoning ☆77 Updated 3 years ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆140 Updated 3 weeks ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713 ☆11 Updated 4 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆47 Updated last year
- ☆19 Updated last month
- ☆47 Updated last year
- ☆33 Updated 3 weeks ago
- Official implementation of Inductive Logical Query Answering in Knowledge Graphs (NeurIPS 2022) ☆48 Updated 2 years ago
- A benchmark that challenges language models to code solutions for scientific problems ☆111 Updated last week
- ☆34 Updated last year
- This is the code for our KILT leaderboard submissions (KGI + Re2G models). ☆153 Updated last year
- ☆129 Updated 2 months ago