Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific and task-specific training data to improve LLM finetuning and instruction tuning.
☆17Dec 25, 2024Updated last year
Alternatives and similar repositories for TSDS
Users that are interested in TSDS are comparing it to the libraries listed below
Sorting:
- Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport"☆28Dec 12, 2024Updated last year
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆47Aug 4, 2025Updated 7 months ago
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- ☆41Sep 21, 2023Updated 2 years ago
- ☆87Dec 29, 2023Updated 2 years ago
- Py implementation of incremental Density-based spatial clustering of applications with noise☆12Jul 1, 2018Updated 7 years ago
- 3D object detection based on pointpillar and fcos☆10Aug 28, 2019Updated 6 years ago
- opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.☆11Mar 27, 2021Updated 4 years ago
- A novel incremental hierarchical clustering algorithm (KDD 22)☆10Aug 31, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- ☆11Aug 15, 2020Updated 5 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"☆12Jun 11, 2025Updated 8 months ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- Height map to normal map converter for Unity☆12Mar 8, 2018Updated 7 years ago
- Implementation for NeurIPS 2024 paper "SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models" (ht…☆14Dec 23, 2024Updated last year
- code for the NAACL 2021 paper Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention by Microsoft S…☆12Apr 21, 2023Updated 2 years ago
- nodeppt-template-default☆12Jan 31, 2019Updated 7 years ago
- 日志增量聚类算法,用于日志异常检测☆12Aug 20, 2022Updated 3 years ago
- We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datas…☆20May 20, 2025Updated 9 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…☆13May 19, 2025Updated 9 months ago
- Autonomous Theorem Prover for First Order Predicate Logic☆12Jun 29, 2020Updated 5 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- Contextual Vision Transformers for Robust Representation Learning☆15Oct 19, 2023Updated 2 years ago
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- ☆13Jul 6, 2023Updated 2 years ago
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated last month
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Jun 15, 2023Updated 2 years ago
- A package containing utils for the PyTorch version of the Tapas algorithm.☆11Apr 29, 2021Updated 4 years ago
- Artistic Neural Style Transfer Software for DIY Stylized images and videos creations.☆12Jul 29, 2021Updated 4 years ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"☆15Feb 8, 2024Updated 2 years ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆183Jul 23, 2025Updated 7 months ago
- 完整的原版transformer程序,complete origin transformer program☆16Mar 5, 2025Updated 11 months ago
- The evaluation code for the paper "MoreHopQA: More Than Multi-hop Reasoning"☆14Jun 21, 2024Updated last year
- This is the github to open source benchmark AdvancedIF, see LAMA L1387358RCRO☆29Nov 26, 2025Updated 3 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL☆10Jul 20, 2018Updated 7 years ago