SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,662Updated 9 months ago
Alternatives and similar repositories for LLMCompiler:
Users that are interested in LLMCompiler are comparing it to the libraries listed below
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,396Updated 2 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,824Updated 8 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,044Updated 10 months ago
- Efficient Retrieval Augmentation and Generation Framework☆1,515Updated 3 months ago
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,262Updated last month
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,378Updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars☆982Updated 8 months ago
- Training LLMs with QLoRA + FSDP☆1,470Updated 5 months ago
- ☆953Updated 2 months ago
- ☆855Updated 7 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆386Updated 7 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,817Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆683Updated 8 months ago
- Open-source tool to visualise your RAG 🔮☆1,121Updated 3 months ago
- Automatically evaluate your LLMs in Google Colab☆615Updated 11 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,146Updated 11 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,034Updated last month
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,774Updated last month
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,425Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,031Updated this week
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆693Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,642Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,360Updated this week
- Code and Data for Tau-Bench☆437Updated 3 months ago
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆766Updated last month
- An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.☆1,644Updated 7 months ago
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.☆2,002Updated 3 weeks ago
- ☆526Updated 7 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,002Updated 2 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,503Updated 2 months ago