Synthetic Text Dataset Generation for LLM projects
☆58May 27, 2026Updated last week
Alternatives and similar repositories for datafast
Users that are interested in datafast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 7 months ago
- ☆28Feb 11, 2026Updated 3 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆65Feb 6, 2025Updated last year
- synthetic data for ml☆25Jan 30, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆24Jun 5, 2025Updated last year
- Centralize and streamline ML/AI lifecycle observability and compliance processes.☆12Apr 21, 2026Updated last month
- A curated list of materials on AI guardrails☆55Jun 3, 2025Updated last year
- SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)☆27Nov 3, 2025Updated 7 months ago
- Extract Molecular SMILES embeddings from language models pre-trained with various objectives architectures.☆19Nov 9, 2023Updated 2 years ago
- ☆161Dec 2, 2024Updated last year
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 8 months ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 5 months ago
- Evals that meet you where you are. For AI that's grounded.☆68Mar 21, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The OS AI engineering and monitoring agent. 🦸♀️ Oversight and compliance copilot for trustworthy AI.☆46Jul 6, 2025Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆31Nov 18, 2025Updated 6 months ago
- ☆12Mar 4, 2025Updated last year
- ☆11Sep 27, 2024Updated last year
- ☆10Nov 12, 2024Updated last year
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated 2 years ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆101Apr 9, 2026Updated 2 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Build datasets using natural language☆574Sep 19, 2025Updated 8 months ago
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- ☆44Jan 30, 2026Updated 4 months ago
- ☆10Dec 3, 2024Updated last year
- ☆15May 12, 2025Updated last year
- A collection of Tiptap extensions, versioned and released independently.☆27May 3, 2026Updated last month
- ☆14May 26, 2026Updated 2 weeks ago
- an experimental implementation of Burrow's delta in Python 3☆12Jun 6, 2017Updated 9 years ago
- Python code for implementing embeddings in the Wasserstein space of elliptical distributions☆11Jul 22, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Demonstrate using MCP with Pydantic AI framework☆14Mar 14, 2025Updated last year
- ☆22Jan 13, 2025Updated last year
- Generate HTML forms from Pydantic models for your FastHTML application☆45Apr 2, 2026Updated 2 months ago
- ☆14Dec 1, 2025Updated 6 months ago
- 🐴🐘 Data on Members of the 116th U.S. Congress☆10Dec 11, 2019Updated 6 years ago
- eTaPR☆17May 16, 2023Updated 3 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆13Aug 2, 2022Updated 3 years ago