evol augment any dataset online
☆61Aug 3, 2023Updated 2 years ago
Alternatives and similar repositories for evol-dataset
Users that are interested in evol-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open Source WizardCoder Dataset☆167Jul 12, 2023Updated 2 years ago
- A repository to perform self-instruct with a model on HF Hub☆32Sep 29, 2023Updated 2 years ago
- distill chatGPT coding ability into small model (1b)☆31Sep 7, 2023Updated 2 years ago
- Accepted by Transactions on Machine Learning Research (TMLR)☆135Oct 5, 2024Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆28Apr 21, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆34Mar 21, 2026Updated 2 months ago
- ☆86May 15, 2026Updated last month
- paraphase sentence☆11Aug 22, 2025Updated 9 months ago
- ☆286Apr 25, 2023Updated 3 years ago
- LLM training in simple, raw C/CUDA☆18May 6, 2024Updated 2 years ago
- ☆28Aug 30, 2023Updated 2 years ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Apr 14, 2026Updated 2 months ago
- ☆10Apr 11, 2022Updated 4 years ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- chatgpt written in c++☆14Jan 5, 2023Updated 3 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆31Oct 23, 2025Updated 7 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆412May 17, 2024Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆429Sep 12, 2023Updated 2 years ago
- Source Code Data Augmentation for Deep Learning: A Survey.☆66Jun 15, 2024Updated 2 years ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆23Apr 7, 2026Updated 2 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆49Jun 2, 2026Updated 2 weeks ago
- [Bioinformatics 2022] Cross-Modality and Self-Supervised Protein Embedding for Compound-Protein Affinity and Contact Prediction☆16Jun 6, 2024Updated 2 years ago
- UPDATE: All future changes will be pushed to https://github.com/HICAI-ZJU/PromptProtein☆15Apr 23, 2023Updated 3 years ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆96Jun 3, 2026Updated 2 weeks ago
- ☆74Apr 2, 2024Updated 2 years ago
- ☆20Mar 4, 2022Updated 4 years ago
- MCP Atlas☆97Jun 10, 2026Updated last week
- ☆17Jan 30, 2023Updated 3 years ago
- ☆22Dec 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Playground for Kotlin Flows and Channels☆30Oct 6, 2020Updated 5 years ago
- Chrome Extension for visualizing browsing history☆11Sep 6, 2023Updated 2 years ago
- A multi-programming language benchmark for LLMs☆307Apr 12, 2026Updated 2 months ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆74Jun 25, 2024Updated last year
- ☆16Aug 16, 2023Updated 2 years ago
- Template for assignment 2 of SUSTech CS209, 23 spring semester.☆10Apr 18, 2023Updated 3 years ago
- ☆20Dec 14, 2024Updated last year