muyu42 / DataS
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
☆44Updated last year
Alternatives and similar repositories for DataS:
Users that are interested in DataS are comparing it to the libraries listed below
- ☆65Updated 5 months ago
- 检索增强生成的核心逻辑,因关键技术保密,无法提供图向量的提取生成和检索逻辑☆33Updated 5 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆39Updated 2 months ago
- Lightweight C++ LLM agent framework☆28Updated this week
- Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations☆28Updated last week
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated 4 months ago
- 通过RPN with FPN以及CRNN进行车牌检测和识别☆26Updated 3 months ago
- Corpus and Enhanced Pre-trained Models for EMNLP 2023 Findings Long Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenar…☆30Updated last year
- 📚 Chinese Historical Documents Assistant(CHDA) 中国历史文献推荐小助手☆24Updated last year
- 📕 DDmkTCCorpus: Diachronic Danmaku Text Comments Corpus (历时弹幕语料库)☆15Updated last year
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆58Updated 7 months ago
- ☆5Updated last month
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆37Updated last month
- ☆45Updated 9 months ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated last year
- Training and evaluation code of EGTLM model.☆23Updated 10 months ago
- Face Verification and Liveness Control☆33Updated 6 years ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆31Updated 10 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated 3 weeks ago
- 在线体验工具已经开放!☆28Updated last year
- Ein multimodaler, multi-intelligenter Entwicklungsrahmen☆44Updated 3 months ago
- 即迅语音识别服务,支持语音识别(ASR)、语音合成(TTS)、声纹识别(VPR)等功能,适配国产化arm操作系统,支持CPU快速语音识别☆76Updated 9 months ago
- Awesome-MCP Servers & Clients & Funny things☆24Updated last month
- NLP自学仓库☆24Updated 9 months ago
- ☆51Updated last year
- ☆19Updated 9 months ago
- WordGPT是一款可以结合个人知识库或联网查询资料快速生成高质量论文、简历、博客、新闻稿、产品描述、故事、邮件、剧本、诗歌、工作汇报,及思维导图、文章配图等内容,同时可以进行各种语言的翻译,还能根据文本生成PPT的的工具。☆48Updated 8 months ago
- ☆36Updated last year
- ☆107Updated 2 months ago
- Dynamic Topic Segmentation in Dialogues: Enhancing Boundaries with Topic-Aware Propagation☆41Updated 4 months ago