[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
☆34Dec 5, 2025Updated 5 months ago
Alternatives and similar repositories for CRAFT
Users that are interested in CRAFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 2 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- ☆12Aug 13, 2024Updated last year
- This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…☆14Jan 2, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- This is a deep learning method for identification of viral contigs with short length from metagenomic data.☆10Dec 20, 2021Updated 4 years ago
- BertSNR: an interpretable deep learning framework for single nucleotide resolution identification of transcription factor binding sites b…☆13May 27, 2025Updated 11 months ago
- Source code for the DeepViral paper☆12Mar 17, 2021Updated 5 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- ☆12May 31, 2024Updated last year
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- Create Vector Store from Scratch in pure Python.☆14Dec 15, 2023Updated 2 years ago
- A simple github actions script to build a llamafile and uploads to huggingface☆17Jan 11, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Stanford CS224W: Machine Learning with Graphs (GNN)☆12Sep 6, 2022Updated 3 years ago
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- ☆21Apr 29, 2026Updated 3 weeks ago
- Phage virion protein classifier☆14Nov 6, 2024Updated last year
- LLM KV Cache compression - K+V dual compression, 73-99% VRAM savings, zero accuracy loss☆55Mar 30, 2026Updated last month
- ☆25Jun 4, 2024Updated last year
- A new Turkish Dependency Treebank in UD style☆16Aug 17, 2020Updated 5 years ago
- ☆16Mar 2, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Research repository to the publication: Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molec…☆14Apr 2, 2024Updated 2 years ago
- 🖖 图谱式笔记系统,旨在提高个人笔记的使用率!☆12Jan 17, 2021Updated 5 years ago
- PhaTYP: Predicting lifestyle for bacteriophages using BERT☆18Nov 6, 2024Updated last year
- G-K BertDTA is a research tool facilitating drug-target interaction prediction. Leveraging multi-modal data integration and advanced mach…☆13Nov 22, 2023Updated 2 years ago
- Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs…☆28May 15, 2026Updated last week
- Gradio Demo for ComfyDeploy☆56Aug 10, 2024Updated last year
- ☆16Aug 28, 2023Updated 2 years ago
- Fine Tuning Model for different NLP task☆15Jan 22, 2023Updated 3 years ago
- ☆14Apr 10, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Mar 18, 2023Updated 3 years ago
- Boğaziçi University Annotation Tool for Dependency Parsing☆14Oct 31, 2024Updated last year
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Apr 4, 2024Updated 2 years ago
- 嵌入数据仓库,向量存储,向量相似度搜索引擎,向量知识库☆12Apr 24, 2024Updated 2 years ago
- An implementation of Seq2seq chatbot.☆17Jun 20, 2018Updated 7 years ago
- Pairwise machine learning models for phage-host interaction prediction☆21Jun 23, 2024Updated last year
- ☆13Sep 10, 2025Updated 8 months ago