[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
☆34Dec 5, 2025Updated 3 months ago
Alternatives and similar repositories for CRAFT
Users that are interested in CRAFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AI-powered third-arm prosthesis. 🦾 LeRobot Worldwide Hackathon 2025 (🏆 13ᵗʰ place)☆25Sep 25, 2025Updated 6 months ago
- Code for training a language model reaction predictor. (To accompany our paper on the OOD evaluation of reaction predictors).☆12Jan 13, 2025Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 2 months ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Nov 4, 2025Updated 4 months ago
- Create Vector Store from Scratch in pure Python.☆14Dec 15, 2023Updated 2 years ago
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- A simple github actions script to build a llamafile and uploads to huggingface☆17Jan 11, 2024Updated 2 years ago
- Stanford CS224W: Machine Learning with Graphs (GNN)☆11Sep 6, 2022Updated 3 years ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- ☆25Jun 4, 2024Updated last year
- A new Turkish Dependency Treebank in UD style☆15Aug 17, 2020Updated 5 years ago
- 🖖 图谱式笔记系统,旨在提高个人笔记的使用率!☆12Jan 17, 2021Updated 5 years ago
- Data Science for Materials - Collection of Open Educational Resources☆16Jun 18, 2025Updated 9 months ago
- Distributed system for scaling quantum chemistry computations☆19Oct 15, 2025Updated 5 months ago
- Fine Tuning Model for different NLP task☆15Jan 22, 2023Updated 3 years ago
- Robust and Memory Efficient Event Detection and Tracking in Large News Feeds☆13Oct 15, 2021Updated 4 years ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated last year
- ☆13Oct 31, 2024Updated last year
- A fully featured ASE calculator for xTB☆24Oct 21, 2024Updated last year
- ☆13Sep 10, 2025Updated 6 months ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)☆18Jan 9, 2025Updated last year
- Weighted Ensemble Data Analysis and Plotting☆26Dec 11, 2025Updated 3 months ago
- Algoritma ve Programlama Haftalık uygulama Saati Föyleri☆19Oct 9, 2022Updated 3 years ago
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- ☆10Oct 22, 2024Updated last year
- ☆10Jun 10, 2016Updated 9 years ago
- Workflow for CONNectivity preserving Geometry Optimization☆11Sep 2, 2021Updated 4 years ago
- ALAS: Autonomous Learning Agent System☆15Aug 14, 2025Updated 7 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Robust NN MD simulator☆21Aug 3, 2023Updated 2 years ago
- A repository to organize materials from the AI4LAM Teach and Learning Working Group☆14May 5, 2023Updated 2 years ago
- Explore cutting-edge Redis capabilities for Vector Similarity Search, Hybrid Search (Vector Similarity + Meta Search), Semantic Caching, …☆16Jan 21, 2024Updated 2 years ago
- This project implements a web-based chatbot using open source technology☆20May 1, 2024Updated last year
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- JADE-NAMD: An package for the on-the-fly nonadiabatic molecular dynamics simulation☆21Jun 1, 2021Updated 4 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Jul 1, 2025Updated 8 months ago