数据合成工具,简单高效的合成不同业务场景的大模型训练数据
☆42Jan 2, 2025Updated last year
Alternatives and similar repositories for JoyDataForge
Users that are interested in JoyDataForge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JoyType: A Robust Design for Multilingual Visual Text Creation☆39Sep 21, 2025Updated 6 months ago
- [ACL 2024] Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation☆10May 26, 2024Updated last year
- Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning☆18Sep 26, 2025Updated 6 months ago
- ☆20Feb 2, 2025Updated last year
- SysBench: Can Large Language Models Follow System Messages?☆40Sep 4, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation☆48Jan 28, 2026Updated last month
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- ☆54Sep 11, 2024Updated last year
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆48Jul 10, 2025Updated 8 months ago
- SlotRefine: A Fast Non-Autoregressive Model forJoint Intent Detection and Slot Filling☆48Apr 27, 2021Updated 4 years ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆30Feb 6, 2026Updated last month
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- 增加了indextts2的简单的界面与api调用方式☆26Oct 27, 2025Updated 5 months ago
- LLM graph-RAG SQL generator for large databases with poor documentation☆19Sep 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 基于cn-clip模型封装的本地图片搜索工具☆11Jul 6, 2023Updated 2 years ago
- Ilya Sutskever 推荐的30篇Deep learning 必读论文 (中英文对照翻译版)☆13Dec 18, 2024Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- ☆13Aug 11, 2024Updated last year
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- A Keras implementation of the AAAI21 paper "a lightweight neural model for biomedical entity linking"☆53Jul 24, 2022Updated 3 years ago
- Mini-Projects using Cutting-Edge AI Frameworks☆15Oct 26, 2025Updated 5 months ago
- 一个开源的多模态 AI 搜索项目,结合 大语言模型(LLM)+ 多源搜索引擎 + 多 Agent 架构,打造新一代的智能问答式搜索体验☆15Mar 26, 2025Updated last year
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- lightgbm☆14Jun 21, 2022Updated 3 years ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆53Jun 24, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 3 years ago
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated last year
- ☆63May 10, 2025Updated 10 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆102Sep 14, 2024Updated last year
- 利用大模型LLM对中文文本、图片以及pdf中的非结构化文本内容进行分析,并提取主-谓-宾(SPO)三元组的知识形式,以及将这些关系可视化为知识图谱。The large LLM model is used to analyze the unstructured text co…☆26Apr 16, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Real-time multi-language unit test generation tool via LSP☆35Mar 20, 2026Updated last week
- KPI time-series analysis using deep neural networks☆13Feb 28, 2019Updated 7 years ago
- agentic data generation(under refactor!!!)☆196Mar 14, 2026Updated last week
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Pytorch implementation of Tree Preference Optimization (TPO) (Accepted by ICLR'25)☆26Apr 24, 2025Updated 11 months ago
- 简单问答系统优化版本☆12Dec 10, 2019Updated 6 years ago