Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
☆1,228Jun 8, 2025Updated 9 months ago
Alternatives and similar repositories for text-to-lora
Users that are interested in text-to-lora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,196Jan 30, 2025Updated last year
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,254Aug 16, 2025Updated 7 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,965Aug 13, 2025Updated 7 months ago
- Implementation of SOAR☆51Sep 17, 2025Updated 6 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆361Jun 23, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Everything about the SmolLM and SmolVLM family of models☆3,675Jan 13, 2026Updated 2 months ago
- Synthetic data curation for post-training and structured data extraction☆1,650Mar 18, 2026Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,804Dec 29, 2025Updated 2 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Aug 6, 2025Updated 7 months ago
- Go ahead and axolotl questions☆11,508Updated this week
- Fast State-of-the-Art Static Embeddings☆2,017Mar 12, 2026Updated 2 weeks ago
- ☆85Sep 5, 2025Updated 6 months ago
- Self-Adapting Language Models☆1,728Aug 1, 2025Updated 7 months ago
- Create Custom LLMs☆1,820Nov 8, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,739May 21, 2025Updated 10 months ago
- Merliot Device Hub☆166Jun 11, 2025Updated 9 months ago
- Tools for merging pretrained large language models.☆6,895Mar 15, 2026Updated last week
- DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference☆616Nov 24, 2025Updated 4 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,405Nov 29, 2024Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆462Aug 26, 2025Updated 7 months ago
- Structured Outputs☆13,588Mar 21, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week
- The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search☆2,305Dec 19, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Async RL Training at Scale☆1,176Updated this week
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆1,034Mar 11, 2026Updated 2 weeks ago
- A Python, Windows-friendly version of Claude Code for AI coding☆26Feb 28, 2025Updated last year
- Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images☆2,730Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,131Mar 16, 2026Updated last week
- The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU cluste…☆4,829Mar 20, 2026Updated last week
- ☆160Apr 17, 2025Updated 11 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆120Jan 10, 2026Updated 2 months ago
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.☆57,673Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆271Mar 6, 2025Updated last year
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…☆5,405Aug 20, 2025Updated 7 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,342Jan 29, 2025Updated last year
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,050Mar 21, 2026Updated last week
- Fully open reproduction of DeepSeek-R1☆25,968Nov 24, 2025Updated 4 months ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,123Jul 7, 2025Updated 8 months ago
- ☆470Nov 25, 2025Updated 4 months ago