[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
☆35Dec 5, 2025Updated 7 months ago
Alternatives and similar repositories for CRAFT
Users that are interested in CRAFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 3 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- Code for training a language model reaction predictor. (To accompany our paper on the OOD evaluation of reaction predictors).☆12Jan 13, 2025Updated last year
- ☆13Aug 13, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Mar 25, 2026Updated 3 months ago
- Upstash Vector Python SDK☆18Oct 21, 2025Updated 8 months ago
- Python client for TeraChem Cloud☆13Jun 19, 2025Updated last year
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- Create Vector Store from Scratch in pure Python.☆13Dec 15, 2023Updated 2 years ago
- A simple github actions script to build a llamafile and uploads to huggingface☆17Jan 11, 2024Updated 2 years ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Stanford CS224W: Machine Learning with Graphs (GNN)☆12Sep 6, 2022Updated 3 years ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- ☆25Jun 4, 2024Updated 2 years ago
- ☆16Mar 2, 2024Updated 2 years ago
- Research repository to the publication: Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molec…☆14Apr 2, 2024Updated 2 years ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆26May 6, 2025Updated last year
- ☆17Aug 28, 2023Updated 2 years ago
- When Reasoning Meets Its Laws☆37Jan 2, 2026Updated 6 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- ☆13Oct 31, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification, EMNLP-Findings 2020.☆18Aug 27, 2021Updated 4 years ago
- An implementation of Seq2seq chatbot.☆17Jun 20, 2018Updated 8 years ago
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated last year
- A fully featured ASE calculator for xTB☆25Oct 21, 2024Updated last year
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆16Jun 3, 2023Updated 3 years ago
- ☆13Sep 10, 2025Updated 9 months ago
- ☆11Mar 22, 2024Updated 2 years ago
- Encoding chemistry to interpret crystallographic data☆28Jun 10, 2026Updated 3 weeks ago
- Weighted Ensemble Data Analysis and Plotting☆27Dec 11, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆19Jan 9, 2025Updated last year
- Algoritma ve Programlama Haftalık uygulama Saati Föyleri☆19Oct 9, 2022Updated 3 years ago
- ☆10Oct 22, 2024Updated last year
- Hückel model + JAX☆14Oct 13, 2022Updated 3 years ago
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 9 months ago
- ☆14Oct 31, 2016Updated 9 years ago
- ALAS: Autonomous Learning Agent System☆18Aug 14, 2025Updated 10 months ago