Synthetic Text Dataset Generation for LLM projects
☆58Mar 26, 2026Updated 2 weeks ago
Alternatives and similar repositories for datafast
Users that are interested in datafast are comparing it to the libraries listed below.
- ☆14Mar 9, 2023Updated 3 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 5 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 5 months ago
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs.☆56Nov 24, 2025Updated 4 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆64Feb 6, 2025Updated last year
- synthetic data for ml☆25Jan 30, 2025Updated last year
- ☆23Jun 5, 2025Updated 10 months ago
- Centralize and streamline ML/AI lifecycle observability and compliance processes.☆12Feb 12, 2025Updated last year
- A curated list of materials on AI guardrails☆49Jun 3, 2025Updated 10 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆84Feb 10, 2026Updated last month
- SynthTextEval: A Toolkit for Generating and Evaluating Synthetic Data For High-Stakes Domains (EMNLP 2025 System Demonstration)☆26Nov 3, 2025Updated 5 months ago
- Official website for the TRON (Token Reduced Object Notation) format☆38Nov 29, 2025Updated 4 months ago
- ☆162Dec 2, 2024Updated last year
- Evals that meet you where you are. For AI that's grounded.☆55Mar 21, 2026Updated 2 weeks ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 3 months ago
- The OS AI engineering and monitoring agent. 🦸‍♀️ Oversight and compliance copilot for trustworthy AI.☆46Jul 6, 2025Updated 9 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆30Nov 18, 2025Updated 4 months ago
- ☆12Mar 4, 2025Updated last year
- ☆11Sep 27, 2024Updated last year
- Plug-and-play document AI with zero-shot models.☆125Feb 16, 2026Updated last month
- ☆10Nov 12, 2024Updated last year
- Feature Selection using Simulated Annealing☆11Aug 10, 2022Updated 3 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆11Sep 16, 2024Updated last year
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated last year
- Bypass browser bot detection in langchain tools☆18Feb 10, 2026Updated 2 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…☆97Updated this week
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- Build datasets using natural language☆573Sep 19, 2025Updated 6 months ago
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Sep 5, 2024Updated last year
- Large language models for document ranking.☆72Jan 13, 2026Updated 2 months ago
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 7 months ago
- AdFit Web SDK for Publisher☆15Jul 6, 2023Updated 2 years ago
- ☆17Feb 18, 2026Updated last month
- 360M model running in the browser on WebGPU☆23Aug 20, 2024Updated last year
- ☆12Jul 8, 2021Updated 4 years ago
- A Tiptap extension for adding embedded content with Iframely.☆16Nov 18, 2025Updated 4 months ago
- an experimental implementation of Burrows' Delta in Python 3☆12Jun 6, 2017Updated 8 years ago