Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input
☆1,280 · Jun 8, 2025 · Updated last year
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below.
- Hypernetworks that update LLMs to remember factual information ☆745 · May 25, 2026 · Updated 3 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,215 · Jan 30, 2025 · Updated last year
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,287 · Aug 16, 2025 · Updated 10 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆2,102 · Aug 13, 2025 · Updated 10 months ago
- Implementation of SOAR ☆52 · Sep 17, 2025 · Updated 9 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆362 · Jun 23, 2025 · Updated 11 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,811 · May 26, 2026 · Updated 3 weeks ago
- Synthetic data curation for post-training and structured data extraction ☆1,686 · Jun 11, 2026 · Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,950 · Dec 29, 2025 · Updated 5 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆56 · Aug 6, 2025 · Updated 10 months ago
- Go ahead and axolotl questions ☆12,061 · Updated this week
- Fast State-of-the-Art Static Embeddings ☆2,127 · Jun 6, 2026 · Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,795 · May 28, 2026 · Updated 3 weeks ago
- ☆85 · Sep 5, 2025 · Updated 9 months ago
- Create Custom LLMs ☆1,851 · Apr 24, 2026 · Updated last month
- Self-Adapting Language Models ☆1,779 · Aug 1, 2025 · Updated 10 months ago
- Merliot Device Hub ☆165 · Jun 11, 2025 · Updated last year
- Tools for merging pretrained large language models. ☆7,154 · Updated this week
- DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference ☆638 · Nov 24, 2025 · Updated 6 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,425 · Nov 29, 2024 · Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆463 · Aug 26, 2025 · Updated 9 months ago
- Structured Outputs ☆13,964 · May 18, 2026 · Updated last month
- Agentic RL Training at Scale ☆1,483 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆35,064 · Updated this week
- A Python, Windows-friendly version of Claude Code for AI coding ☆32 · Feb 28, 2025 · Updated last year
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆1,083 · Apr 15, 2026 · Updated 2 months ago
- The Context Layer for unstructured data: typed, versioned datasets over S3, GCS, Azure ☆2,782 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,251 · Jun 8, 2026 · Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… ☆5,101 · Updated this week
- ☆160 · Apr 17, 2025 · Updated last year
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… ☆9,971 · Jun 12, 2026 · Updated last week
- ☆278 · Mar 6, 2025 · Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆127 · Jan 10, 2026 · Updated 5 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,363 · Jan 29, 2025 · Updated last year
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. ☆66,620 · Updated this week
- Fully open reproduction of DeepSeek-R1 ☆26,295 · Apr 2, 2026 · Updated 2 months ago
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea… ☆5,686 · Aug 20, 2025 · Updated 9 months ago
- A system for agentic LLM-powered data processing and ETL ☆3,835 · Updated this week
- ☆478 · Nov 25, 2025 · Updated 6 months ago