The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"
☆134Jul 10, 2024Updated last year
Alternatives and similar repositories for LLM-data-aug-survey
Users that are interested in LLM-data-aug-survey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rust implementation of Surya☆66Mar 1, 2025Updated last year
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated 2 years ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 8 months ago
- ☆487Sep 25, 2024Updated last year
- https://www.shoufachen.com/Awesome-Diffusion-Transformers/☆148Mar 6, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Track and Collaborate on ML & AI Experiments.☆44Mar 10, 2025Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Aug 2, 2024Updated last year
- 完全依靠ChatGPT生成数据微调的西式翻译腔聊天风格中文大模型☆21Apr 1, 2024Updated last year
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Jan 5, 2026Updated 2 months ago
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆110Jul 29, 2025Updated 7 months ago
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025)☆35Jul 21, 2025Updated 8 months ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆32Apr 15, 2025Updated 11 months ago
- ☆20Apr 8, 2025Updated 11 months ago
- LLM全栈优质资源汇总☆694Jul 15, 2025Updated 8 months ago
- blade-chest model for matchup and comparison prediction☆14Jul 10, 2016Updated 9 years ago
- Efficient vector database for hundred millions of embeddings.☆213May 17, 2024Updated last year
- ☆119Dec 18, 2024Updated last year
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- Code for the EMNLP2020 long paper "Lifelong Language Knowledge Distillation" https://arxiv.org/abs/2010.02123☆12Jul 13, 2021Updated 4 years ago
- Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings☆10Jun 6, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [Paper][WWW2025] OntoTune: Ontology-Driven Self-training for Aligning Large Language Models☆25Jul 21, 2025Updated 8 months ago
- ☆12Mar 9, 2024Updated 2 years ago
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- ☆171May 27, 2024Updated last year
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models.☆99Oct 8, 2024Updated last year
- Code for the paper Neural Pipeline for Zero-Shot Data-to-Text Generation☆16Aug 26, 2024Updated last year
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆190Oct 28, 2024Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 9 months ago
- ☆21May 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".☆1,591Jun 3, 2025Updated 9 months ago
- [NeurIPS 2025] Official codebase for T2MIR: Mixture-of-Experts Meets In-Context Reinforcement Learning.☆31Oct 26, 2025Updated 5 months ago
- [Sci. Rep. 2025] Revisiting model scaling with a U-net benchmark for 3D medical image segmentation☆18Aug 21, 2025Updated 7 months ago
- ☆38Jun 16, 2024Updated last year
- 基于ReAct手搓一个Agent Demo☆171Jun 28, 2025Updated 9 months ago
- Summarize existing representative LLMs text datasets.☆1,447Mar 11, 2026Updated 2 weeks ago
- minitools☆104Jul 25, 2013Updated 12 years ago