Instruction Tuning data generation uses LLM in a specific scenario.
☆23May 2, 2024Updated last year
Alternatives and similar repositories for SFT_data_generation
Users that are interested in SFT_data_generation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆17Feb 7, 2019Updated 7 years ago
- ☆24Oct 14, 2024Updated last year
- CNRec Data Associated with Content based News Recommendation via Shortest Entity Distance over Knowledge Graph☆10Feb 26, 2019Updated 7 years ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆17Oct 12, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Dec 27, 2024Updated last year
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆29Apr 2, 2025Updated 11 months ago
- Extracting terms from text using XLM-R for token and sequence classification☆16Apr 18, 2022Updated 3 years ago
- chinese NLP dataset☆18Nov 6, 2020Updated 5 years ago
- Code repo for EMNLP 2023 paper "Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models"☆23Nov 13, 2023Updated 2 years ago
- ☆13Feb 16, 2023Updated 3 years ago
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- Code for paper: PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models, IEEE ICASSP 2024. Demo//124.220.228.133:11107☆20Aug 10, 2024Updated last year
- [EMNLP 2022] Official Pytorch implementation for "Tiny-NewsRec: Efficient and Effective PLM-based News Recommendation"☆18Sep 18, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- Drug repositioning with adaptive graph convolutional networks☆18Sep 22, 2024Updated last year
- The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…☆11Nov 28, 2023Updated 2 years ago
- ☆12May 19, 2024Updated last year
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"☆25Nov 10, 2024Updated last year
- ☆33Dec 17, 2025Updated 3 months ago
- Simple python interface to be used with crisp_controllers.☆33Feb 24, 2026Updated last month
- AI Emoji Argue Agent 🚀 基于LangChain的开源表情包斗图Agent☆28May 30, 2024Updated last year
- ☆31Feb 23, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Structured Binary Neural Networks for Image Recognition☆18Nov 18, 2021Updated 4 years ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆26Jun 27, 2024Updated last year
- 自己阅读的多模态对话系统论文(及部分笔记)汇总☆22Jan 5, 2023Updated 3 years ago
- The ACL RD-TEC 2.0: A corpus of annotated terms in context from domain of computational linguistics☆23Oct 7, 2016Updated 9 years ago
- Pytorch implementations of the BNN, XNOR-Net and BiReal-Net☆15Aug 20, 2020Updated 5 years ago
- 基于BERT和知识图谱的中文电子病例医学命名实体识别☆18Jun 4, 2021Updated 4 years ago
- ☆17Dec 1, 2023Updated 2 years ago
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, …☆24Apr 8, 2022Updated 3 years ago
- ☆21Oct 1, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code to break Llama Guard☆32Dec 7, 2023Updated 2 years ago
- Official implementation of NeurIPS 2023 paper, "NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA".☆21Dec 6, 2023Updated 2 years ago
- RT from How far is Language Model from 100 medical NER☆11Dec 17, 2024Updated last year
- 📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.☆27Aug 21, 2018Updated 7 years ago
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆35Oct 19, 2023Updated 2 years ago
- 这是一个一键让小参数大模型进行角色扮演的项目,从数据构成和训练都包含在这项目中☆25Mar 31, 2024Updated last year
- ☆17May 16, 2022Updated 3 years ago