neuml / annotateaiLinks
π Automatically annotate papers using LLMs
β321Updated last month
Alternatives and similar repositories for annotateai
Users that are interested in annotateai are comparing it to the libraries listed below
Sorting:
- π€ Benchmark Large Language Models Reliably On Your Dataβ318Updated this week
- Generate large synthetic data using an LLMβ418Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β292Updated this week
- Build datasets using natural languageβ483Updated 3 weeks ago
- A flexible, adaptive classification system for dynamic text classificationβ195Updated 3 weeks ago
- Structured information extraction from documentsβ315Updated 8 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β422Updated last year
- Fast Semantic Text Deduplication & Filteringβ671Updated last week
- An agentic AI application that allows you to chat with your papers and gather also information from papers on ArXiv and on PubMedβ134Updated 2 weeks ago
- Automatically evaluate your LLMs in Google Colabβ631Updated last year
- An open-source tool for seamless migration from other LLMs to Llama, and for general prompt optimization.β360Updated this week
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engineβ461Updated 4 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β789Updated 4 months ago
- A Lightweight Library for AI Observabilityβ243Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β172Updated 8 months ago
- Automate computer tasks in Pythonβ312Updated this week
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.β305Updated 2 months ago
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.β251Updated last month
- CodeScientist: An automated scientific discovery system for code-based experimentsβ263Updated 2 months ago
- awesome synthetic (text) datasetsβ281Updated 7 months ago
- β98Updated 6 months ago
- β188Updated 2 months ago
- Tool for generating high quality Synthetic datasetsβ896Updated this week
- β210Updated 11 months ago
- An Awesome list of curated DSPy resources.β326Updated 3 months ago
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detectionβ598Updated last week
- FastAPI wrapper around DSPyβ242Updated last year
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.β¦β323Updated 2 months ago
- Late Interaction Models Training & Retrievalβ395Updated this week
- Toolkit for attaching, training, saving and loading of new heads for transformer modelsβ279Updated 2 months ago