foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
☆21 Updated 2 weeks ago
Alternatives and similar repositories for fms-dgt
Users who are interested in fms-dgt are comparing it to the libraries listed below.
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆211 Updated last week
- An open source benchmarking framework for IT automation ☆249 Updated last week
- ☆316 Updated last year
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models ☆75 Updated 2 years ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆578 Updated last year
- ☆50 Updated 2 weeks ago
- ☆390 Updated last week
- Collection of evals for Inspect AI ☆289 Updated last week
- ☆165 Updated last year
- ☆116 Updated last year
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆191 Updated 8 months ago
- Aligning AI With Shared Human Values (ICLR 2021) ☆304 Updated 2 years ago
- The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning" ☆363 Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆154 Updated 2 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆230 Updated last year
- Discovering Data-driven Hypotheses in the Wild ☆118 Updated 5 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources. ☆294 Updated last month
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization ☆39 Updated 8 months ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023] ☆78 Updated last year
- Large language model and dataset for natural language to first-order logic translation ☆73 Updated 2 years ago
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD) ☆25 Updated 2 months ago
- ☆52 Updated 8 months ago
- A benchmark that challenges language models to code solutions for scientific problems ☆154 Updated last week
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆122 Updated last year
- ACPBench: Reasoning about Action, Change, and Planning ☆30 Updated 2 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆223 Updated last year
- A Unified Approach to Evaluate and Compare Explainable AI methods ☆14 Updated last year
- ☆241 Updated last year
- ☆25 Updated 5 months ago
- A `Neural = Symbolic` framework for sound and complete weighted real-value logic ☆293 Updated last week