[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
☆35Dec 5, 2025Updated 5 months ago
Alternatives and similar repositories for CRAFT
Users that are interested in CRAFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 31, 2025Updated 6 months ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated last month
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 3 weeks ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Mar 25, 2026Updated last month
- Upstash Vector Python SDK☆18Oct 21, 2025Updated 6 months ago
- Python client for TeraChem Cloud☆13Jun 19, 2025Updated 10 months ago
- Python re-implementation of szabo.f☆10Jul 30, 2015Updated 10 years ago
- Create Vector Store from Scratch in pure Python.☆14Dec 15, 2023Updated 2 years ago
- A simple github actions script to build a llamafile and uploads to huggingface☆17Jan 11, 2024Updated 2 years ago
- ☆26Feb 19, 2023Updated 3 years ago
- ☆23Apr 10, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Stanford CS224W: Machine Learning with Graphs (GNN)☆12Sep 6, 2022Updated 3 years ago
- Electronegativity equilibration model for atomic partial charges☆24Mar 9, 2026Updated last month
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- Toolkit containing implementations of GPU-accelerated approximate kernel models and efficient atomic representations. Yields accurate mod…☆14May 16, 2024Updated last year
- ☆25Jun 4, 2024Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆24May 6, 2025Updated 11 months ago
- Data Science for Materials - Collection of Open Educational Resources☆17Jun 18, 2025Updated 10 months ago
- ☆15Jan 10, 2022Updated 4 years ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Aug 3, 2023Updated 2 years ago
- ☆13Oct 31, 2024Updated last year
- ☆17Jun 18, 2016Updated 9 years ago
- A fully featured ASE calculator for xTB☆24Oct 21, 2024Updated last year
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- ☆13Sep 10, 2025Updated 7 months ago
- 毕业设计:互联网新闻热点抽取系统☆10May 21, 2022Updated 3 years ago
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Oct 22, 2024Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 7 months ago
- Workflow for CONNectivity preserving Geometry Optimization☆11Sep 2, 2021Updated 4 years ago
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- ☆14Oct 31, 2016Updated 9 years ago
- w3act is an annotation and curation tool for building web archive collections☆21Jan 30, 2024Updated 2 years ago
- Exploration of automated dataset selection approaches at large scales.☆54Mar 4, 2025Updated last year