apple / ml-ferret
☆8,561 Updated 3 months ago
Alternatives and similar repositories for ml-ferret
Users who are interested in ml-ferret are comparing it to the libraries listed below.
- MLX: An array framework for Apple silicon ☆18,837 Updated this week
- Examples in the MLX framework ☆6,833 Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆13,255 Updated 4 months ago
- Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud. ☆10,522 Updated this week
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,333 Updated 6 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆17,093 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆36,895 Updated this week
- Go ahead and axolotl questions ☆8,484 Updated this week
- Official inference library for Mistral models ☆9,921 Updated 2 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆21,326 Updated 5 months ago
- the AI-native open-source embedding database ☆17,476 Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆11,759 Updated this week
- ☆3,871 Updated 10 months ago
- CoreNet: A library for training deep neural networks ☆6,999 Updated 3 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,104 Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆16,133 Updated this week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆6,782 Updated 6 months ago
- Modeling, training, eval, and inference code for OLMo ☆5,132 Updated this week
- Large Language Model Text Generation Inference ☆9,710 Updated this week
- Python bindings for llama.cpp ☆8,567 Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,171 Updated 9 months ago
- High-speed Large Language Model Serving for Local Deployment ☆8,074 Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,473 Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut… ☆38,922 Updated this week
- Inference code for CodeLlama models ☆16,184 Updated 5 months ago
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory ☆24,152 Updated this week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory. ☆14,359 Updated this week
- ☆4,056 Updated 8 months ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents ☆5,438 Updated 4 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,403 Updated 5 months ago