alycialee / beyond-scale-language-data-diversity
☆13Aug 11, 2024Updated last year
Alternatives and similar repositories for beyond-scale-language-data-diversity
Users that are interested in beyond-scale-language-data-diversity are comparing it to the libraries listed below
- SysBench: Can Large Language Models Follow System Messages?☆38Sep 4, 2024Updated last year
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆47Aug 26, 2024Updated last year
- Real-time multi-language unit test generation tool via LSP☆31Updated this week
- Uses OpenCV to parse app interfaces and describe their layout and controls as a tree structure☆11Jan 6, 2020Updated 6 years ago
- https://demo-web.reflex.run☆12Apr 25, 2024Updated last year
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- Long Context Research☆26Jan 26, 2026Updated 2 weeks ago
- Machine Learning written in TypeScript (to replace learn4js)☆11Apr 11, 2018Updated 7 years ago
- A collection of packages for chess analysis grouped under a @chess-tools scope.☆10Nov 4, 2018Updated 7 years ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Implementation for paper "Link Prediction on Heterophilic Graphs via Disentangled Representation Learning"☆13Aug 26, 2022Updated 3 years ago
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- ☆11Oct 2, 2024Updated last year
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- Python Bindings to the Lean Theorem Prover http://leanprover.github.io/☆13Sep 12, 2017Updated 8 years ago
- 📊 A simple command-line utility for querying and monitoring GPU status☆14Aug 3, 2023Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- The TacTok automated Coq proof script synthesis tool☆17Jan 9, 2024Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Oct 7, 2025Updated 4 months ago
- Mizar Mathematical Library☆16Mar 17, 2012Updated 13 years ago
- Python tool for automatically posting and retweeting on Twitter☆13Dec 18, 2024Updated last year
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- Membership Inference Attack against Graph Neural Networks☆12Nov 9, 2022Updated 3 years ago
- An implementation of loopy belief propagation on a Bayesian Network (BN)☆11Feb 25, 2015Updated 10 years ago
- A detailed tutorial on AllenNLP, based on my own view.☆10Mar 7, 2020Updated 5 years ago
- Zero-Shot and Few-Shot methods for NER in biomedical domain☆18Jul 3, 2023Updated 2 years ago
- ☆15Jul 2, 2020Updated 5 years ago
- Text classifier based on BERT and a Bayesian neural network, which can train on small sets of labeled texts and express doubt about its decisions.☆14Mar 24, 2023Updated 2 years ago
- Chinese medical corpus☆14Jul 2, 2021Updated 4 years ago
- ☆17Mar 3, 2025Updated 11 months ago
- Maps: Python's missing mappings☆13Nov 29, 2017Updated 8 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated last year
- ☆14Feb 26, 2024Updated last year
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆13Aug 8, 2025Updated 6 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated 2 weeks ago
- Beijing Language and Culture University: annotation guidelines for Chinese semantic dependency parsing☆16Dec 16, 2020Updated 5 years ago
- ☆16Apr 6, 2025Updated 10 months ago
- Semantic role labeling based on deep learning, implemented in TensorFlow☆16Aug 20, 2018Updated 7 years ago
- ☆17Nov 1, 2025Updated 3 months ago