Improving large language models with concept-aware fine-tuning (CAFT)
☆29Jan 31, 2026Updated 2 months ago
Alternatives and similar repositories for caft-llm
Users that are interested in caft-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR24] code for LSN☆10Oct 28, 2024Updated last year
- A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Sp…☆74Feb 7, 2026Updated 2 months ago
- Official reponsitory for "S^2IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting"☆55Jul 17, 2024Updated last year
- Official Code: TheWebConf 2022 Compact Graph Structure Learning via Mutual Information Compression☆24Mar 17, 2024Updated 2 years ago
- Pytorch implementation of RED-SDS (NeurIPS 2021).☆20Oct 27, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of Implicit Graphon Neural Representation☆14Sep 1, 2023Updated 2 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- Multi-GPU supported kmeans clustering for cluser-clip☆15Jun 3, 2024Updated last year
- Code for the paper "Symbiotic Adversarial Learning for Attribute-based Person Search", ECCV2020☆21Aug 1, 2021Updated 4 years ago
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…☆18Dec 25, 2024Updated last year
- Implementation of the paper "A New Perspective on the Effects of Spectrum in Graph Neural Networks"☆17Jun 17, 2022Updated 3 years ago
- An autoencoder using a convolutional neural network with Tensorflow☆11Dec 6, 2017Updated 8 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".☆14Aug 20, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2023] Learnable Randomness Injection (LRI) for interpretable Geometric Deep Learning.☆25Jul 18, 2023Updated 2 years ago
- Official Code for "Rethinking Diffusion Model in High Dimension"☆24May 20, 2025Updated 10 months ago
- Explore, Establish, Exploit: Red Teaming Language Models from Scratch☆14Jun 21, 2023Updated 2 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12May 10, 2021Updated 4 years ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated 2 years ago
- macOS Dynamic Island for AI coding agents. Monitor, approve, and jump to Claude Code sessions from the notch.☆259Updated this week
- Learning Graphons via Structured Gromov-Wasserstein Barycenters☆23Dec 13, 2020Updated 5 years ago
- This repository is the official implementation of NeurIPS 2025 Paper "Dual Data Alignment Makes AI-Generated Image Detector Easier Genera…☆110Mar 16, 2026Updated 3 weeks ago
- Official pytorch implementation of ICML2025 "TAROT: Targeted Data Selection via Optimal Transport"☆28Dec 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SmartCLIP: A training method to improve CLIP with both short and long texts☆43Jun 18, 2025Updated 9 months ago
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 9 months ago
- Tensorflow ResNet implementation on cifar10☆13Aug 10, 2017Updated 8 years ago
- Transformer, Evolved Transformer Model☆10Jul 6, 2019Updated 6 years ago
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 6 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 9 months ago
- ☆14Jan 22, 2025Updated last year
- Code for ACL 2025 Main paper "Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning…☆49Aug 4, 2025Updated 8 months ago
- AIML based chatbot☆20Aug 22, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the competition "CCKS 2020: 面向中文短文本的实体链指任务" , see https://www.biendata.xyz/competition/ccks_2020_el/☆14Dec 1, 2020Updated 5 years ago
- ☆14Nov 26, 2022Updated 3 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- Codes for GReTo accepted by ICLR2023☆12Mar 12, 2023Updated 3 years ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated last year
- A structured, open-source taxonomy for classifying open source software projects.☆32Updated this week
- Personal site☆64Apr 19, 2025Updated 11 months ago