Improving large language models with concept-aware fine-tuning (CAFT)
☆29 · Jan 31, 2026 · Updated last month
Alternatives and similar repositories for caft-llm
Users that are interested in caft-llm are comparing it to the libraries listed below.
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp… ☆57 · Feb 7, 2026 · Updated last month
- Official code for the TheWebConf 2022 paper "Compact Graph Structure Learning via Mutual Information Compression" ☆24 · Mar 17, 2024 · Updated 2 years ago
- Implementation of Implicit Graphon Neural Representation ☆13 · Sep 1, 2023 · Updated 2 years ago
- ☆11 · Feb 22, 2023 · Updated 3 years ago
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a… ☆18 · Dec 25, 2024 · Updated last year
- [ICLR 2023] Learnable Randomness Injection (LRI) for interpretable Geometric Deep Learning. ☆25 · Jul 18, 2023 · Updated 2 years ago
- Official Code for "Rethinking Diffusion Model in High Dimension" ☆24 · May 20, 2025 · Updated 10 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆31 · Apr 8, 2024 · Updated last year
- Official PyTorch implementation of the ICML 2025 paper "TAROT: Targeted Data Selection via Optimal Transport" ☆28 · Dec 12, 2024 · Updated last year
- SmartCLIP: A training method to improve CLIP with both short and long texts ☆40 · Jun 18, 2025 · Updated 9 months ago
- Transformer, Evolved Transformer Model ☆10 · Jul 6, 2019 · Updated 6 years ago
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data" ☆10 · Apr 10, 2020 · Updated 5 years ago
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning… ☆48 · Aug 4, 2025 · Updated 7 months ago
- ☆13 · Jan 22, 2025 · Updated last year
- Code for the competition "CCKS 2020: Entity Linking for Chinese Short Text", see https://www.biendata.xyz/competition/ccks_2020_el/ ☆14 · Dec 1, 2020 · Updated 5 years ago
- ☆12 · Oct 29, 2022 · Updated 3 years ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models ☆113 · Sep 27, 2025 · Updated 5 months ago
- A structured, open-source taxonomy for classifying open source software projects. ☆31 · Updated this week
- Official implementation of "Physics-Informed Long-Sequence Forecasting From Multi-Resolution Spatiotemporal Data". ☆10 · Dec 12, 2022 · Updated 3 years ago
- CIKM 23 Oral - HoLe: Homophily-enhanced Structure Learning for Graph Clustering ☆10 · Feb 29, 2024 · Updated 2 years ago
- Jdit is a research-processing-oriented framework based on PyTorch. ☆30 · Apr 27, 2021 · Updated 4 years ago
- Code for the paper "DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints" ☆26 · Feb 4, 2025 · Updated last year
- Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023] ☆31 · May 7, 2024 · Updated last year
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting" ☆16 · Feb 17, 2025 · Updated last year
- A list of research resources that I've appreciated. ☆12 · Dec 10, 2019 · Updated 6 years ago
- [SIGIR 2023] This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Form… ☆18 · Jul 31, 2024 · Updated last year
- Repo for the ACL 2021 Findings paper "Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text Classification" ☆15 · Mar 24, 2022 · Updated 4 years ago
- [NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models ☆24 · Oct 29, 2025 · Updated 4 months ago
- MISO: Learning Multiple Initial Solutions to Optimization Problems ☆16 · Nov 8, 2024 · Updated last year
- Paper List for Dialogue and Interactive Systems ☆15 · Jun 5, 2020 · Updated 5 years ago
- Time Series Forecasting with Dynamic Graph Modeling ☆15 · Aug 31, 2025 · Updated 6 months ago
- ☆59 · Sep 23, 2024 · Updated last year
- CVPR2026 ☆25 · Sep 18, 2025 · Updated 6 months ago
- Repository for "Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale" ☆14 · Apr 30, 2024 · Updated last year
- 🏆🏅 Repository for the GEB team's winning solutions in the IEEE Hybrid Energy Forecasting and Trading Competition (HEFTCom). ☆28 · Oct 4, 2025 · Updated 5 months ago
- Official implementation of "COExpander: Adaptive Solution Expansion for Combinatorial Optimization". ☆21 · Jun 28, 2025 · Updated 8 months ago
- Fine-grained attention in hierarchical transformers for tabular time-series. ☆12 · Dec 24, 2024 · Updated last year
- Bayesian model reduction for probabilistic machine learning ☆11 · Jul 3, 2025 · Updated 8 months ago
- softmax, crf, biaffine, globalpointer, efficient globalpointer, ricon ☆17 · Oct 11, 2022 · Updated 3 years ago