muyu42 / DataS
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
☆41Updated 10 months ago
Alternatives and similar repositories for DataS:
Users that are interested in DataS are comparing it to the libraries listed below
- ☆62Updated 2 months ago
- A collection of papers related to knowledge fusion☆53Updated 3 months ago
- Corpus and Enhanced Pre-trained Models for EMNLP 2023 Findings Long Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenar…☆28Updated last year
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆20Updated last month
- ☆108Updated 8 months ago
- 即迅语音识别服务,支持语音识别(ASR)、语音合成(TTS)、声纹识别(VPR)等功能,适配国产化arm操作系统,支持CPU快速语音识别☆70Updated 6 months ago
- A curated list of resources on graph-based retrieval-augmented generation (GraphRAG) for customized large language models.☆55Updated this week
- 通过RPN with FPN以及CRNN进行车牌检测和识别☆26Updated this week
- 📕 DDmkTCCorpus: Diachronic Danmaku Text Comments Corpus (历时弹幕语料库)☆15Updated last year
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆39Updated last month
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆24Updated 10 months ago
- 本项目展示了如何利用 GPT 自动化检索仓库内的文件(如 PDF、XLS、Word 等)并完成多模态任务。可将家庭摄像头的视频帧送入仓库,可以自动化判断家庭是否危险的事情(利用大模型对世界的理解力)。☆54Updated 5 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆29Updated 7 months ago
- A Contextual RAG Bot Framework☆79Updated 2 months ago
- ☆74Updated 2 months ago
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆39Updated last month
- ☆84Updated 3 weeks ago
- The fastest QA system-简单高效的基于TF-IDF的中文问答系统☆28Updated 2 years ago
- Ein multimodaler, multi-intelligenter Entwicklungsrahmen☆45Updated 2 weeks ago
- ☆44Updated last year
- Training and evaluation code of EGTLM model.☆22Updated 7 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆29Updated 4 months ago
- The Buddhist Scripture Explanation API is an AI-powered service designed to provide insightful explanations for passages from key Buddhis…☆60Updated 4 months ago
- ☆67Updated 11 months ago
- Empower Your Model with Longer and Better Context Comprehention☆40Updated last year
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆72Updated 10 months ago
- LoRA fine-tuning Mistral-7b-v2 on PR Task☆18Updated 6 months ago
- ☆43Updated 6 months ago
- ☆53Updated 3 months ago
- ☆24Updated 7 months ago