Snowflake-Labs / snowflake-arctic
☆539Updated 7 months ago
Alternatives and similar repositories for snowflake-arctic:
Users that are interested in snowflake-arctic are comparing it to the libraries listed below
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆338Updated 9 months ago
- ☆842Updated 6 months ago
- ☆704Updated 2 weeks ago
- ☆451Updated last year
- Serving multiple LoRA finetuned LLM as one☆1,042Updated 10 months ago
- AI for all: Build the large graph of the language models☆263Updated 9 months ago
- ☆376Updated 2 months ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆1,139Updated this week
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 8 months ago
- Evaluate the accuracy of LLM generated outputs☆644Updated last month
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆313Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,329Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆656Updated 2 months ago
- LLMPerf is a library for validating and benchmarking LLMs☆845Updated 3 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,358Updated this week
- A Lightweight Library for AI Observability☆238Updated last month
- ☆586Updated 2 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆299Updated last year
- Self-host LLMs with vLLM and BentoML☆97Updated this week
- [NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which r…☆951Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆893Updated 2 weeks ago
- ☆438Updated 5 months ago
- TAG-Bench: A benchmark for table-augmented generation (TAG)☆712Updated 2 weeks ago
- Minimalistic large language model 3D-parallelism training☆1,737Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆236Updated this week
- AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.☆813Updated last month
- An Open Source Toolkit For LLM Distillation☆554Updated 2 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆726Updated this week
- Beating the GAIA benchmark with Transformers Agents. 🚀☆103Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆477Updated 2 weeks ago