argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
β4,490Updated this week
Alternatives and similar repositories for argilla
Users that are interested in argilla are comparing it to the libraries listed below
Sorting:
- Efficient few-shot learning with Sentence Transformersβ2,479Updated last month
- π¦ Integrating LLMs into structured NLP pipelinesβ1,245Updated 4 months ago
- AI Observability & Evaluationβ5,697Updated this week
- Adding guardrails to large language models.β4,950Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-β¦β3,446Updated this week
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)β3,386Updated last month
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.β11,190Updated this week
- General technology for enabling AI capabilities w/ LLMs and MLLMsβ3,989Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMsβ2,976Updated last week
- A Bulletproof Way to Generate Structured JSON from Language Modelsβ4,724Updated last year
- π€ Evaluate: A library for easily evaluating machine learning models and datasets.β2,212Updated 4 months ago
- A blazing fast inference solution for text embeddings modelsβ3,543Updated 2 weeks ago
- A language for constraint-guided and efficient LLM programming.β3,935Updated 11 months ago
- Toolkit for creating, sharing and using natural language prompts.β2,848Updated last year
- MTEB: Massive Text Embedding Benchmarkβ2,513Updated this week
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Enginβ¦β3,702Updated 3 months ago
- Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining theβ¦β2,032Updated 9 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,700Updated last week
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddingsβ1,949Updated 4 months ago
- π€ AutoTrain Advancedβ4,393Updated 3 months ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,795Updated 2 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroβ¦β2,849Updated 9 months ago
- Robust recipes to align language models with human and AI preferencesβ5,180Updated 2 weeks ago
- An awesome & curated list of best LLMOps tools for developersβ4,866Updated last week
- Evaluation and Tracking for LLM Experimentsβ2,501Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.β2,589Updated last week
- Top2Vec learns jointly embedded topic, document and word vectors.β3,043Updated 6 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024β2,008Updated this week
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.β1,827Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpaliβ2,164Updated this week