ray-project / llm-numbers
Numbers every LLM developer should know
☆4,191 Updated last year
Alternatives and similar repositories for llm-numbers:
Users that are interested in llm-numbers are comparing it to the libraries listed below
- Large Language Model Text Generation Inference ☆9,905 Updated this week
- A language for constraint-guided and efficient LLM programming. ☆3,868 Updated 9 months ago
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,653 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,322 Updated 9 months ago
- Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning … ☆4,466 Updated 4 months ago
- A guidance language for controlling large language models. ☆19,918 Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆11,909 Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,468 Updated 6 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆3,896 Updated this week
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,358 Updated 7 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆10,993 Updated this week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models. ☆4,684 Updated 3 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆6,831 Updated 8 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset ☆7,462 Updated last year
- ☆3,306 Updated last year
- Structured Text Generation ☆11,109 Updated this week
- An awesome & curated list of best LLMOps tools for developers ☆4,613 Updated last month
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,508 Updated 6 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,043 Updated 6 months ago
- Go ahead and axolotl questions ☆8,928 Updated this week
- Adding guardrails to large language models. ☆4,671 Updated last week
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain ☆3,471 Updated last year
- Aligning pretrained language models with instruction data generated by themselves. ☆4,314 Updated last year
- Simple UI for LLM Model Finetuning ☆2,059 Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach… ☆4,957 Updated last week
- Robust recipes to align language models with human and AI preferences ☆5,072 Updated 4 months ago
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 15+ clouds). Get unified execution, cost savings, and high GPU availability v… ☆7,556 Updated this week
- Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc. ☆2,071 Updated last month
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆2,816 Updated 7 months ago
- Accessible large language models via k-bit quantization for PyTorch. ☆6,818 Updated this week