foundation-model-stack / fms-dgt
Synthetic Data Generation for Foundation Models
☆21 Updated 2 months ago
Alternatives and similar repositories for fms-dgt
Users who are interested in fms-dgt are comparing it to the libraries listed below.
- Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆212 Updated 2 weeks ago
- ACPBench: Reasoning about Action, Change, and Planning ☆32 Updated 2 months ago
- The AI Steerability 360 toolkit is an extensible library for general purpose steering of LLMs. ☆76 Updated 2 weeks ago
- Collection of evals for Inspect AI ☆357 Updated this week
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environment ☆140 Updated 9 months ago
- ☆328 Updated last year
- Large language model and dataset for natural language to first-order logic translation ☆74 Updated 2 years ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources. ☆318 Updated 2 weeks ago
- A benchmark that challenges language models to code solutions for scientific problems ☆169 Updated last week
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD) ☆27 Updated 4 months ago
- ☆117 Updated last year
- ☆69 Updated 3 weeks ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆240 Updated last year
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models ☆75 Updated 2 years ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆186 Updated last year
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la… ☆94 Updated 2 months ago
- ☆44 Updated 2 years ago
- ☆52 Updated 10 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆170 Updated 5 months ago
- Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆56 Updated this week
- ☆432 Updated this week
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆130 Updated 4 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆224 Updated last month
- [NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents ☆63 Updated this week
- ☆26 Updated last week
- The prime repository for state-of-the-art Multilingual Question Answering research and development. ☆740 Updated 4 months ago
- A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers ☆132 Updated 3 weeks ago
- ☆165 Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆227 Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆124 Updated last year