evol augment any dataset online
☆61Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for evol-dataset
Users that are interested in evol-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open Source WizardCoder Dataset☆166Jul 12, 2023Updated 2 years ago
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- distill chatGPT coding ability into small model (1b)☆31Sep 7, 2023Updated 2 years ago
- A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.☆64Oct 21, 2024Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆28Apr 21, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- ☆34Mar 21, 2026Updated last month
- ☆86Jun 13, 2023Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"☆16Nov 5, 2024Updated last year
- paraphase sentence☆11Aug 22, 2025Updated 8 months ago
- Code related to the ELM neuron.☆15Feb 27, 2024Updated 2 years ago
- ☆285Apr 25, 2023Updated 3 years ago
- LLM training in simple, raw C/CUDA☆18May 6, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28Aug 30, 2023Updated 2 years ago
- ☆20May 12, 2022Updated 3 years ago
- ☆10Apr 11, 2022Updated 4 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆30Oct 23, 2025Updated 6 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆411May 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Run evaluation on LLMs using human-eval benchmark☆430Sep 12, 2023Updated 2 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆17Sep 2, 2024Updated last year
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Jun 15, 2024Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆234Oct 31, 2024Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆307Oct 24, 2024Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆22Jul 24, 2023Updated 2 years ago
- Seq2seq Type Inference using Static Analysis and CodeT5☆32Jul 9, 2023Updated 2 years ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆22Apr 7, 2026Updated last month
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆48Nov 10, 2025Updated 5 months ago
- TheDeepChecker: Dynamic Debugger for Neural Networks Training Programs☆10Nov 2, 2022Updated 3 years ago
- Hugging Face Jobs☆20Jul 11, 2025Updated 9 months ago
- [Bioinformatics 2022] Cross-Modality and Self-Supervised Protein Embedding for Compound-Protein Affinity and Contact Prediction☆16Jun 6, 2024Updated last year
- Codebase for EnterpriseOps-Gym from ServiceNow☆83Apr 30, 2026Updated last week
- ☆74Apr 2, 2024Updated 2 years ago