muyu42 / DataSLinks

本项目旨在结合以往研究人员的代表性工作，从多个维度评估sft数据，并自动化过滤sft数据。

☆43

Alternatives and similar repositories for DataS

Users that are interested in DataS are comparing it to the libraries listed below

Sorting:

BonnieZbw / CT2CQA
☆65Updated 9 months ago
niuchenglei / rankextor
High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…
☆78Updated last year
YecanLee / Adaptive-Contrastive-Search
[EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…
☆40Updated 5 months ago
bird-bench / BIRD-Interact
[BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.
☆111Updated 3 weeks ago
Robot2050 / AttenHScore
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering
☆43Updated 2 months ago
gao-xiao-bai / StrategyLLM
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
☆22Updated 7 months ago
ffengc / boost-search-engine
This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …
☆58Updated 11 months ago
anonymous11811 / OmniCam
☆80Updated last month
Haijian06 / EartAgent
Ein multimodaler, multi-intelligenter Entwicklungsrahmen
☆45Updated 2 months ago
ddm3114 / CRNN
通过RPN with FPN以及CRNN进行车牌检测和识别
☆26Updated 6 months ago
FRENKIE-CHIANG / DanmakuTPPBench
DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding
☆70Updated 2 months ago
1Hun0ter1 / MGCF-Net
MGCF-Net for Phishing URLs Detection
☆51Updated 2 months ago
garlic-byte / RL-LLM
强化学习-大语言模型
☆65Updated last month
jxh4945777 / OpenHHEA
An open-source highly heterogeneous entity alignment (HHEA) toolkit.
☆31Updated last year
dvlab-research / Logits-Based-Finetuning
Official Code of Logits-Based-Finetuning
☆87Updated last month
huangjch526 / IFAST_official
☆43Updated 2 years ago
HanyangZhong / Situational_Planning_datasets
[ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning
☆33Updated 4 months ago
nmhjklnm / HTIA-mobile-predict
mobile predict
☆25Updated 8 months ago
slanorgcn / CHDA
📚 Chinese Historical Documents Assistant(CHDA) 中国历史文献推荐小助手
☆23Updated last year
bird-bench / livesqlbench
☆100Updated 3 weeks ago
keating666 / yzcbbs
A Knowledge Base on Pre-made Dishes
☆106Updated last month
OfficeAIWork / WordGPT
WordGPT是一款可以结合个人知识库或联网查询资料快速生成高质量论文、简历、博客、新闻稿、产品描述、故事、邮件、剧本、诗歌、工作汇报，及思维导图、文章配图等内容，同时可以进行各种语言的翻译，还能根据文本生成PPT的的工具。
☆51Updated 11 months ago
PunyGoood / DCS
☆5Updated 4 months ago
EEE1even / NLP-Learning
NLP自学仓库
☆24Updated last year
konmor / konmorReportServer
☆80Updated last month
insmess / insmess-speech
即迅语音识别服务，支持语音识别（ASR）、语音合成（TTS）、声纹识别（VPR）等功能，适配国产化arm操作系统，支持CPU快速语音识别
☆75Updated last year
4real3000 / EasyJudge
[COLING Demos 2025] an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs
☆36Updated 5 months ago
MetaASI / CS-Eval
Concise Evaluation Benchmark for Large Language Models
☆26Updated last week
selmiss / EGTLM
Training and evaluation code of EGTLM model.
☆24Updated last year
AaronZ345 / ISDrama
Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting
☆118Updated last week