SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,680Updated 10 months ago
Alternatives and similar repositories for LLMCompiler
Users that are interested in LLMCompiler are comparing it to the libraries listed below
Sorting:
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,381Updated last year
- Training LLMs with QLoRA + FSDP☆1,477Updated 6 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,557Updated 3 months ago
- Chat language model that can use tools and interpret the results☆1,556Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,442Updated 3 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,433Updated last year
- A library for advanced large language model reasoning☆2,122Updated last month
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,800Updated 2 months ago
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,362Updated 5 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆945Updated 6 months ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,484Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,076Updated 11 months ago
- ☆873Updated 8 months ago
- List of language agents based on paper "Cognitive Architectures for Language Agents"☆946Updated 4 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,854Updated 8 months ago
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more☆591Updated last week
- ☆1,019Updated 4 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,152Updated last year
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,411Updated last week
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆683Updated 8 months ago
- AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:☆2,155Updated this week
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,067Updated last year
- [NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments☆1,847Updated this week
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,823Updated last year
- Open-source tool to visualise your RAG 🔮☆1,128Updated 4 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,254Updated last week
- Reaching LLaMA2 Performance with 0.1M Dollars☆980Updated 9 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆397Updated 8 months ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆754Updated 9 months ago
- A unified evaluation framework for large language models☆2,609Updated 2 weeks ago