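Inspired by self-instruct: beyond general-purpose LLMs, small LLMs for vertical domains can deliver customized results, using GPT to generate questions and answers as training data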
☆18May 12, 2023Updated 2 years ago
Alternatives and similar repositories for domain-self-instruct
Users that are interested in domain-self-instruct are comparing it to the libraries listed below.
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Train your own GPT2!☆14Apr 11, 2023Updated 3 years ago
- Code for KDD 2025 paper "FreRA: A Frequency-Refined Augmentation for Contrastive Learning on Time Series Classification"☆32Jun 20, 2025Updated 10 months ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Accelerating GOT-OCRv2 with VLLM☆10Nov 15, 2024Updated last year
- ☆11May 2, 2023Updated 3 years ago
- A script that automatically asks ChatGPT questions.☆63May 20, 2023Updated 2 years ago
- natural language processing☆16Jan 14, 2019Updated 7 years ago
- The official dataset of paper "Goal-Oriented Prompt Attack and Safety Evaluation for LLMs".☆21Feb 5, 2024Updated 2 years ago
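- Engineering implementations of the three major boosting algorithms (XGBoost, LightGBM, CatBoost): principles, implementations, and a summary of common interview questions, plus a backup of other insightful machine learning and deep learning articles☆12Jul 7, 2021Updated 4 years ago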
- Distilling Task-Specific Knowledge from BERT into Simple Neural Networks.☆15Aug 28, 2020Updated 5 years ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- ☆22May 22, 2024Updated last year
- Chinese dialogue data cleaning☆32Nov 8, 2022Updated 3 years ago
- Generate dialogue datasets from documents using LLMs such as ChatGLM2 or ChatGPT☆163Oct 25, 2023Updated 2 years ago
- This paper proposes a safety evaluation benchmark for Chinese LLMs based on ERNIE Bot (文心一言), covering 8 typical safety scenarios and 6 types of instruction attack. It also proposes a safety evaluation framework and process that uses test prompts from manual writing and collected open-source data, combining human intervention with the strong evaluation ability of an LLM acting as a "co-evaluator".☆34Sep 1, 2023Updated 2 years ago
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆19Oct 24, 2024Updated last year
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 4 years ago
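- Knowledge bases, large language models, medical knowledge base construction, and LLM-based knowledge bases☆29Jun 13, 2023Updated 2 years ago
- Event extraction with BERT: [CLS] is used for event classification and the last-layer vectors for sequence labeling, with both tasks trained jointly.☆13Jun 7, 2021Updated 4 years ago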
- ☆31Aug 28, 2024Updated last year
- ☆32Feb 23, 2025Updated last year
- Code for RECENT☆13Dec 18, 2022Updated 3 years ago
- An easy-to-use library and command-line tool for TTS☆15May 3, 2025Updated last year
- A reproduction of the paper "Distilling Task-Specific Knowledge from BERT into Simple Neural Networks"☆16Jun 13, 2021Updated 4 years ago
- 2025.01: A multimodal large model built from scratch and named Reyes (睿视; R for 睿, eyes for 眼). Reyes has 8B parameters, uses InternViT-300M-448px-V2_5 as its vision encoder and Qwen2.5-7B-Instruct on the language model side. Reyes also uses a two…☆33Feb 10, 2026Updated 2 months ago
- 👽 Large model-based knowledge base Q&A.☆30May 21, 2023Updated 2 years ago
- PyTorch training and inference framework for Baidu's UIE extraction model☆12Nov 20, 2024Updated last year
- Chinese NLP dataset☆18Nov 6, 2020Updated 5 years ago
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the ability to interpret the meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆27Jul 23, 2024Updated last year
- Accelerate vector generation using an ONNX model☆18Jan 23, 2024Updated 2 years ago
- Code to break Llama Guard☆32Dec 7, 2023Updated 2 years ago
- Source code for AAAI 2021 paper "A Supervised Multi-Head Self-Attention Network for Nested Named Entity Recognition"☆16Jun 16, 2021Updated 4 years ago
- Chinese sentence embedding generation based on SimCSE☆16Jun 8, 2022Updated 3 years ago
- Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks☆32Jul 9, 2024Updated last year
- [ACL 2024] Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization☆29Jul 9, 2024Updated last year
- [EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models☆28Dec 13, 2023Updated 2 years ago
- ☆30Feb 16, 2024Updated 2 years ago