Improving large language models with concept-aware fine-tuning (CAFT)
☆29Jan 31, 2026Updated 4 months ago
Alternatives and similar repositories for caft-llm
Users that are interested in caft-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official reponsitory for "S^2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting"☆57Jul 17, 2024Updated last year
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆140May 31, 2026Updated 2 weeks ago
- Official Code: TheWebConf 2022 Compact Graph Structure Learning via Mutual Information Compression☆24Mar 17, 2024Updated 2 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…☆19Dec 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of the paper "A New Perspective on the Effects of Spectrum in Graph Neural Networks"☆17Jun 17, 2022Updated 3 years ago
- This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".☆14Aug 20, 2021Updated 4 years ago
- [ICLR 2023] Learnable Randomness Injection (LRI) for interpretable Geometric Deep Learning.☆25Jul 18, 2023Updated 2 years ago
- Official Code for "Rethinking Diffusion Model in High Dimension"☆25May 20, 2025Updated last year
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch☆15Jun 21, 2023Updated 2 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆32Apr 8, 2024Updated 2 years ago
- Learning Graphons via Structured Gromov-Wasserstein Barycenters☆23Dec 13, 2020Updated 5 years ago
- Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport"☆30Dec 12, 2024Updated last year
- SmartCLIP: A training method to improve CLIP with both short and long texts☆42Jun 18, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 11 months ago
- Transformer, Evolved Transformer Model☆10Jul 6, 2019Updated 6 years ago
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 6 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 11 months ago
- ☆14Jan 22, 2025Updated last year
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆51Aug 4, 2025Updated 10 months ago
- ☆14Nov 26, 2022Updated 3 years ago
- Codes for GReTo accepted by ICLR2023☆12Mar 12, 2023Updated 3 years ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆118Sep 27, 2025Updated 8 months ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆16May 23, 2025Updated last year
- Official implement of 'Advancing Graph Convolutional Networks via General Spectral Wavelets'☆34Jun 22, 2025Updated 11 months ago
- Jdit is a research processing oriented framework based on pytorch. The docs is here!☆30Apr 27, 2021Updated 5 years ago
- code for paper "DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints"☆26Feb 4, 2025Updated last year
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting"☆18Feb 17, 2025Updated last year
- Official PyTorch implementation for Frequency Domain-based Dataset Distillation [NeurIPS 2023]☆31May 7, 2024Updated 2 years ago
- [SIGIR 2023] This is the official PyTorch implementation for the paper: "EulerNet: Adaptive Feature Interaction Learning via Euler’s Form…☆18Jul 31, 2024Updated last year
- [NeurIPS 2025] L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models☆28May 8, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MISO: Learning Multiple Initial Solutions to Optimization Problems☆17Nov 8, 2024Updated last year
- Paper List for Dialogue and Interactive Systems☆15Jun 5, 2020Updated 6 years ago
- CIKM 23 Oral - HoLe: Homophily-enhanced Structure Learning for Graph Clustering☆11Feb 29, 2024Updated 2 years ago
- CVPR2026☆32Sep 18, 2025Updated 8 months ago
- 🏆🏅 Repository for the GEB team's winning solutions in the IEEE Hybrid Energy Forecasting and Trading Competition (HEFTCom).☆28Oct 4, 2025Updated 8 months ago
- Official implementation of "COExpander: Adaptive Solution Expansion for Combinatorial Optimization".☆24Jun 28, 2025Updated 11 months ago
- Fourier Spatial-Temporal Network for Multivariate Time Series Forecasting☆11Jan 1, 2023Updated 3 years ago