listentm/CROWDSELECT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/listentm/CROWDSELECT)

listentm / CROWDSELECT

We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datasets

☆20

Alternatives and similar repositories for CROWDSELECT

Users that are interested in CROWDSELECT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SAP-archive / cross-language-detection-artifacts
View on GitHub
This repository complements our paper by offering the training dataset, the best-performing models utilized in our real-world experiment,…
☆22Mar 7, 2025Updated last year
hust-open-atom-club / pwn.hust.college
View on GitHub
Deploy and customize our own pwn.college - pwn.hust.college
☆59Updated this week
vuminhduc796 / GPTVoiceTasker
View on GitHub
☆12Apr 2, 2024Updated 2 years ago
wata-orz / fvs
View on GitHub
feedback vertex set solver
☆11Nov 1, 2018Updated 7 years ago
pku-liang / ISAMORE
View on GitHub
☆18Feb 25, 2026Updated 5 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
bearbro / kg
View on GitHub
爬取同花顺的股票（A股）信息
☆10Nov 5, 2021Updated 4 years ago
pku-liang / aps-mlir
View on GitHub
APS: An open-source toolchain towards agile processor specialization based on MLIR
☆19Jan 17, 2026Updated 6 months ago
yllhwa / HUST-Lab-OS
View on GitHub
华中科技大学网络空间安全学院 2020 级计算机操作系统课程设计
☆10Mar 27, 2023Updated 3 years ago
sangHa0411 / Llama-Instruction-Tuning
View on GitHub
☆10Dec 28, 2023Updated 2 years ago
iamalbert / torch-word-emb
View on GitHub
load word embeddings to Torch.Tensor
☆14May 12, 2016Updated 10 years ago
2003pro / TAGCOS
View on GitHub
This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆13Jul 21, 2024Updated 2 years ago
wubinyi / Convolutional-Neural-Network-Accelerator
View on GitHub
Deep learning accelerator for convolutional layer (convolution operation) and fully-connected layer(matrix-multiplication).
☆20Nov 18, 2018Updated 7 years ago
eujhwang / personalized-llms
View on GitHub
personalized-llms with allen institute
☆13Jun 22, 2023Updated 3 years ago
zhanhl316 / SocialDial
View on GitHub
SocialDial: A Benchmark for Socially-Aware Dialogue Systems (SIGIR'23)
☆16Aug 4, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
XiangLi1999 / AutoBencher
View on GitHub
☆33Jul 11, 2024Updated 2 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
zouharvi / subset2evaluate
View on GitHub
Find informative examples to efficiently (human)-evaluate NLG models.
☆17Apr 22, 2026Updated 3 months ago
cisnlp / semi-markov-crf
View on GitHub
Code for paper "Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging"
☆16May 31, 2019Updated 7 years ago
jonathanherzig / semantic-parsing-annotation
View on GitHub
Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"
☆20Oct 5, 2020Updated 5 years ago
dair-iitd / FloNet
View on GitHub
Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"
☆14Oct 10, 2022Updated 3 years ago
OFA-Sys / DiverseEvol
View on GitHub
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆88Dec 14, 2023Updated 2 years ago
wac81 / Chinese-Sentiment
View on GitHub
A Chinese sentiment analyze lib with Python
☆15Dec 17, 2021Updated 4 years ago
kaist-ina / Trinity-AE
View on GitHub
Source code for Trinity(ASPLOS 2026)
☆25Apr 24, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yrf1 / LLM-MassiveMulticultureNormsKnowledge-NCLB
View on GitHub
☆20Mar 12, 2025Updated last year
fannix / Chinese-Sentiment-Lexicon
View on GitHub
A large Chinese sentiment lexicon consist of 8000 words
☆24Oct 31, 2012Updated 13 years ago
ShaoqLin / DiscoSG
View on GitHub
[EMNLP 2025 Outstanding Paper Award] Official repo for DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph …
☆22Nov 16, 2025Updated 8 months ago
tonnetonne814 / PL-Bert-VITS2
View on GitHub
VITS2 using Phoneme-Level Japanese BERT
☆14Dec 17, 2023Updated 2 years ago
leokhoa / Open-DocLLM
View on GitHub
☆16Apr 3, 2024Updated 2 years ago
yueyu1030 / Patron
View on GitHub
[ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Pr…
☆24Jun 1, 2024Updated 2 years ago
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
tianyi-lab / Superfiltering
View on GitHub
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189Jun 25, 2025Updated last year
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WisdomShell / ADG
View on GitHub
[ACL'26 Main Conference] Instruction Data Selection via Answer Divergence
☆22Apr 14, 2026Updated 3 months ago
litagin02 / laughter-collector
View on GitHub
大量の音声データから笑い声部分を集めるやつ
☆14May 23, 2024Updated 2 years ago
ZifanL / TSDS
View on GitHub
Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…
☆19Dec 25, 2024Updated last year
isEmmanuelOlowe / llm-cost-estimator
View on GitHub
Estimating hardware and cloud costs of LLMs and transformer projects
☆22Apr 1, 2026Updated 3 months ago
2003pro / ScaleBiO
View on GitHub
This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
☆25Jul 30, 2024Updated last year
Cohere-Labs-Community / iterative-data-selection
View on GitHub
☆30Nov 5, 2024Updated last year
SALT-NLP / CoAnnotating
View on GitHub
This is the official repository for "CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data An…
☆24Oct 26, 2023Updated 2 years ago