Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆1,253 · Jun 8, 2025 · Updated 10 months ago
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below.
- Hypernetworks that update LLMs to remember factual information ☆689 · Mar 2, 2026 · Updated last month
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,208 · Jan 30, 2025 · Updated last year
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,267 · Aug 16, 2025 · Updated 8 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,998 · Aug 13, 2025 · Updated 8 months ago
- Implementation of SOAR ☆51 · Sep 17, 2025 · Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆361 · Jun 23, 2025 · Updated 9 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,705 · Apr 2, 2026 · Updated 2 weeks ago
- Synthetic data curation for post-training and structured data extraction ☆1,663 · Mar 28, 2026 · Updated 3 weeks ago
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,825 · Dec 29, 2025 · Updated 3 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆56 · Aug 6, 2025 · Updated 8 months ago
- Go ahead and axolotl questions ☆11,688 · Updated this week
- Fast State-of-the-Art Static Embeddings ☆2,024 · Apr 10, 2026 · Updated last week
- ☆85 · Sep 5, 2025 · Updated 7 months ago
- Create Custom LLMs ☆1,828 · Nov 8, 2025 · Updated 5 months ago
- Self-Adapting Language Models ☆1,740 · Aug 1, 2025 · Updated 8 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,753 · May 21, 2025 · Updated 10 months ago
- Merliot Device Hub ☆166 · Jun 11, 2025 · Updated 10 months ago
- Tools for merging pretrained large language models. ☆6,973 · Mar 15, 2026 · Updated last month
- AI-powered Pokemon gameplay agent with headless emulation, REST API, and live dashboard. Works with any LLM. ☆71 · Mar 10, 2026 · Updated last month
- DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference ☆618 · Nov 24, 2025 · Updated 4 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,414 · Nov 29, 2024 · Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆461 · Aug 26, 2025 · Updated 7 months ago
- Agentic RL Training at Scale ☆1,292 · Updated this week
- Structured Outputs ☆13,657 · Mar 26, 2026 · Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models ☆33,649 · Updated this week
- A Python, Windows-friendly version of Claude Code for AI coding ☆32 · Feb 28, 2025 · Updated last year
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆1,048 · Updated this week
- Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images ☆2,732 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,178 · Updated this week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste… ☆4,887 · Updated this week
- ☆160 · Apr 17, 2025 · Updated last year
- ☆272 · Mar 6, 2025 · Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆124 · Jan 10, 2026 · Updated 3 months ago
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally. ☆61,312 · Updated this week
- Large Concept Models: Language modeling in a sentence representation space ☆2,348 · Jan 29, 2025 · Updated last year
- Fully open reproduction of DeepSeek-R1 ☆25,991 · Apr 2, 2026 · Updated 2 weeks ago
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea… ☆5,491 · Aug 20, 2025 · Updated 7 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… ☆9,157 · Apr 12, 2026 · Updated last week
- ☆471 · Nov 25, 2025 · Updated 4 months ago