muyu42 / DataS
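This project aims to build on representative work by previous researchers to evaluate SFT data along multiple dimensions and to filter SFT data automatically.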
☆44Updated last year
Alternatives and similar repositories for DataS
Users who are interested in DataS are comparing it to the libraries listed below
- ☆62Updated last year
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆75Updated last year
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated last month
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 10 months ago
- License plate detection and recognition using RPN with FPN and CRNN☆26Updated 11 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- [NeurIPS 2025] DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding☆73Updated 3 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆42Updated last month
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆59Updated last year
- Concise Evaluation Benchmark for Large Language Models☆25Updated 5 months ago
- MGCF-Net for Phishing URLs Detection☆50Updated 7 months ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆73Updated last month
- Down-to-earth large-model engineering, aiming to become a practical encyclopedia of large models☆18Updated 2 years ago
- Please visit our demonstration website for interactive demonstrations☆33Updated last year
- This script monitors the remaining traffic of VMs on Vultr, DigitalOcean, and Linode. If the remaining traffic is zero, it shuts down the…☆33Updated last year
- ☆73Updated last year
- Training and evaluation code of EGTLM model.☆22Updated last year
- Fuzzy matching library for Chinese/English pinyin and characters☆37Updated 5 months ago
- Multi-Attentional Deepfake Detection☆22Updated last year
- 🤖 AI-powered academic proposal & experiment design generator | AI-based tool for generating academic thesis proposals and experiment designs - Support multi-AI models, research gap discovery,…☆52Updated 4 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated 9 months ago
- Reinforcement Learning - Large Language Models☆68Updated 6 months ago
- NLP self-study repository☆24Updated last year
- mobile predict☆25Updated last year
- A Contextual RAG Bot Framework☆82Updated last year
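- WordGPT is a tool that can draw on a personal knowledge base or online search to quickly generate high-quality papers, résumés, blog posts, press releases, product descriptions, stories, emails, scripts, poems, and work reports, as well as mind maps, article illustrations, and other content; it can also translate between languages and generate PPT slides from text.☆52Updated last year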
- Official Code of Logits-Based-Finetuning☆91Updated 6 months ago
- ☆57Updated last year
- API open platform☆24Updated last year