Token-level adaptation of LoRA matrices for downstream task generalization.
☆15Apr 14, 2024Updated 2 years ago
Alternatives and similar repositories for LoRA-TLE
Users that are interested in LoRA-TLE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- [SIGIR'24] The official implementation code of MOELoRA.☆192Jul 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jun 18, 2024Updated last year
- A data preprocessor for the Quranic Treebank using neural networks. Divides longer verses into smaller chunks.☆12Jul 4, 2023Updated 2 years ago
- Image Diffusion block merging technique applied to transformers based Language Models.☆56May 8, 2023Updated 2 years ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Jun 20, 2023Updated 2 years ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 2 months ago
- direct preference optimization with only 1 model copy :)☆14Oct 2, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jun 1, 2023Updated 2 years ago
- Sample app for a Python API using FastAPI and neomodel☆12Jul 1, 2024Updated last year
- Python code which creates a semantic search bot over any available corpus☆17May 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆17Jan 30, 2024Updated 2 years ago
- ☆21Oct 6, 2023Updated 2 years ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- Code repository for the c-BTM paper☆109Sep 26, 2023Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Dec 22, 2023Updated 2 years ago
- Repository containing code for the paper "Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation", publi…☆12Nov 12, 2020Updated 5 years ago
- [ACM Computing Survey 2025] Recent Advances of Foundation Language Models-based Continual Learning: A Survey☆26Oct 6, 2025Updated 6 months ago
- ☆11Oct 3, 2021Updated 4 years ago
- Train transformer language models with reinforcement learning.☆20Dec 26, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- ☆23Dec 22, 2023Updated 2 years ago
- T2NER: Transformers based Transfer Learning Framework for Named Entity Recognition (EACL 2021)☆11Sep 24, 2022Updated 3 years ago
- The official implementation of the EMNLP 2023 paper "Paraphrase Types for Generation and Detection"☆12Oct 20, 2024Updated last year
- ☆415Nov 2, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/jacana☆37Aug 19, 2015Updated 10 years ago
- Code for LAMOL: LAnguage MOdeling for Lifelong Language Learning☆95Aug 28, 2020Updated 5 years ago
- ☆35Aug 23, 2023Updated 2 years ago
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Nov 15, 2020Updated 5 years ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 10 months ago
- Personal collection of inspirational notes☆26Updated this week
- Must-read Papers on Large Language Model (LLM) Continual Learning☆149Nov 14, 2023Updated 2 years ago
- Small repository for my video on LoRA☆16May 14, 2023Updated 2 years ago
- Attack AlphaZero Go agents (NeurIPS 2022)☆22Dec 3, 2022Updated 3 years ago
- Learning Latent Semantic Annotations for Grounding Natural Language to Structured Data☆13Jan 28, 2019Updated 7 years ago