Agora-Lab-AI / The-DistillerLinks
Generate High Quality textual or multi-modal datasets with Agents
☆17Updated last year
Alternatives and similar repositories for The-Distiller
Users that are interested in The-Distiller are comparing it to the libraries listed below
Sorting:
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%☆16Updated last year
- The Next Generation Multi-Modality Superintelligence☆70Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆26Updated last year
- ☆22Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 4 months ago
- ☆82Updated last year
- ☆37Updated 2 years ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Finetune any model on HF in less than 30 seconds☆57Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆12Updated 6 months ago
- ☆11Updated 10 months ago
- entropix style sampling + GUI☆26Updated 7 months ago
- ☆19Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 8 months ago
- ☆64Updated 2 months ago
- ☆63Updated 8 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- ☆49Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- ☆41Updated 5 months ago
- Advanced Reasoning Benchmark Dataset for LLMs☆45Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated last week
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Memoria is a human-inspired memory architecture for neural networks.☆73Updated 7 months ago