SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,378Updated 2 months ago
Related projects: ⓘ
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆2,817Updated 2 weeks ago
- Training LLMs with QLoRA + FSDP☆1,382Updated last week
- Reaching LLaMA2 Performance with 0.1M Dollars☆955Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,396Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,309Updated 5 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆679Updated 3 weeks ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆958Updated 4 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,451Updated last month
- Chat language model that can use tools and interpret the results☆1,358Updated this week
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,698Updated 7 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆657Updated 5 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆663Updated 3 weeks ago
- Open-source tool to visualise your RAG 🔮☆1,059Updated 6 months ago
- Automated Design of Agentic Systems☆846Updated 3 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,080Updated this week
- ☆870Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,306Updated 5 months ago
- Efficient Retrieval Augmentation and Generation Framework☆1,255Updated last week
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,059Updated 3 months ago
- ☆640Updated this week
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆619Updated last month
- ☆1,517Updated last week
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆871Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆1,935Updated last week
- Tools for merging pretrained large language models.☆4,501Updated this week
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆847Updated 2 weeks ago
- ☆449Updated 5 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆1,747Updated 3 months ago
- Evaluate your LLM's response with Prometheus and GPT4 💯☆745Updated last week
- Scale LLM Engine public repository☆770Updated this week