muyu42 / DataSLinks
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
☆43Updated last year
Alternatives and similar repositories for DataS
Users that are interested in DataS are comparing it to the libraries listed below
Sorting:
- ☆65Updated 7 months ago
- Corpus and Enhanced Pre-trained Models for EMNLP 2023 Findings Long Paper: "Multi-Stage Pre-training Enhanced by ChatGPT for Multi-Scenar…☆30Updated last year
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆22Updated 5 months ago
- [COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs☆36Updated 3 months ago
- 通过RPN with FPN以及CRNN进行车牌检测和识别☆26Updated 4 months ago
- DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆54Updated last week
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆38Updated 3 months ago
- NLP自学仓库☆24Updated 11 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆31Updated last year
- ☆26Updated last year
- ☆28Updated 2 weeks ago
- The fastest QA system-简单高效的基于TF-IDF的中文问答系统☆28Updated 3 years ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆25Updated last year
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆78Updated last year
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated 2 months ago
- The implementation for ACL 2023 paper "Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularizatio…☆16Updated last year
- mobile predict☆25Updated 6 months ago
- Please visit our demonstration website for interactive demonstrations☆29Updated 8 months ago
- Training and evaluation code of EGTLM model.☆23Updated last year
- ☆43Updated last year
- The code and data for our work accepted by EMNLP2024: "VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Mode…☆16Updated 8 months ago
- 低代码核心组件:数据模型的实现☆58Updated last year
- 使用donut多模态模型,身份证识别,对身份证做端对端识别,无需中间处理,识别率达到商用☆18Updated 10 months ago
- ☆46Updated 11 months ago
- React + NextJS + AppRouter路由模式开发的音乐WebApp项目☆18Updated 7 months ago
- Awesome-MCP Servers & Clients & Funny things☆24Updated 2 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- 即迅语音识别服务,支持语音识别(ASR)、语音合成(TTS)、声纹识别(VPR)等功能,适配国产化arm操作系统,支持CPU快速语音识别☆75Updated 10 months ago
- 检索增强生成的核心逻辑,因关键技术保密,无法提供图向量的提取生成和检索逻辑☆33Updated 6 months ago
- This is a traditional Chinese-based demographic dictionary search system that is free and open-source. 這是一個基於繁體中文的人口學詞典檢索系統,該系統是免費且開放的。☆39Updated 2 months ago