muyu42 / DataS
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
☆55Updated 6 months ago
Related projects: ⓘ
- 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力☆145Updated 2 months ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆15Updated 2 months ago
- 📕 DDmkTCCorpus: Diachronic Danmaku Text Comments Corpus (历时弹幕语料库)☆21Updated 8 months ago
- Corpus and Enhanced Pre-trained Models for EMNLP 2023 Findings Long Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenar…☆36Updated 10 months ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆31Updated 6 months ago
- 即迅语音识别服务,支持语音识别(ASR)、语音合成(TTS)、声纹识别(VPR)等功能,适配国产化arm操作系统,支持CPU快速语音识别☆102Updated 2 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆47Updated 3 months ago
- ☆32Updated 2 months ago
- ☆87Updated 7 months ago
- 通过RPN with FPN以及CRNN进行车牌检测和识别☆32Updated 5 months ago
- Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆28Updated this week
- Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory.☆46Updated last month
- Chat-Style-Bot是一个聊天风格模仿大语言模型,通过分析和学习微信聊天记录,可模仿你的说话风格(口头禅等),并可接入微信和你的朋友们自动聊天。Chat-Style-Bot is a chat style imitating llm. By analyzing an…☆77Updated last month
- ☆32Updated 3 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆42Updated 3 weeks ago
- NLP自学仓库☆31Updated 2 months ago
- ☆127Updated this week
- WordGPT是一款可以结合个人知识库或联网查询资料快速生成高质量论文、简历、博客、新闻稿、产品描述、故事、邮件、剧本、诗歌、工作汇报,及思维导图、文章配图等内容,同时可以进行各种语言的翻译,还能根据文本生成PPT的的工具。☆58Updated last month
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆27Updated 9 months ago
- A股历史复盘☆32Updated last year
- 接地气的大模型工程,争取成为一本大模型实战百科全书☆17Updated 11 months ago
- linkedin, seek job information crawler☆104Updated 3 weeks ago
- ☆32Updated 2 months ago
- ☆42Updated 7 months ago
- AutoAnys is an innovative, open-source Robotic Process Automation (RPA) platform designed to revolutionize the automation landscape. Buil…☆94Updated 2 months ago
- The fastest QA system-简单高效的基于TF-IDF的中文问答系统☆33Updated 2 years ago
- EffiBench: Benchmarking the Efficiency of Automatically Generated Code☆50Updated last month
- This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …☆47Updated 9 months ago
- 📚 Chinese Historical Documents Assistant(CHDA) 中国历史文献推荐小助手☆26Updated 6 months ago
- The Buddhist Scripture Explanation API is an AI-powered service designed to provide insightful explanations for passages from key Buddhis…☆89Updated 3 weeks ago