avilum / yallaLinks
A tiny LLM Agent with minimal dependencies, focused on local inference.
☆56Updated last year
Alternatives and similar repositories for yalla
Users that are interested in yalla are comparing it to the libraries listed below
Sorting:
- function calling-based LLM agents☆289Updated last year
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆348Updated 10 months ago
- A fast batching API to serve LLM models☆188Updated last year
- A Lightweight Library for AI Observability☆251Updated 8 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated last year
- Function Calling Benchmark & Testing☆92Updated last year
- The easiest, and fastest way to run AI-generated Python code safely☆338Updated 11 months ago
- ☆124Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- ☆471Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆92Updated last year
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆76Updated last year
- OpenTelemetry Instrumentation for AI Observability☆700Updated this week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆190Updated last year
- One click templates for inferencing Language Models☆218Updated 3 months ago
- Tutorial for building LLM router☆233Updated last year
- ☆133Updated 6 months ago
- ☆65Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 2 months ago
- Fast parallel LLM inference for MLX☆225Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆262Updated last year
- A Lossless Compression Library for AI pipelines☆285Updated 4 months ago
- Client-side toolkit for using large language models, including where self-hosted☆112Updated this week
- Domain Adapted Language Modeling Toolkit - E2E RAG☆329Updated last year
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- Synthetic Data for LLM Fine-Tuning☆119Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated last year
- A simple Python sandbox for helpful LLM data agents☆288Updated last year
- SynthLang is a hyper-efficient prompt language designed to optimize interactions with Large Language Models (LLMs) like GPT-4o by leverag…☆221Updated 7 months ago