institutional / institutional-books-1-pipelineLinks
The Institutional Data Initiative's pipeline for analyzing, refining, and publishing the Institutional Books 1.0 collection.
☆46Updated last week
Alternatives and similar repositories for institutional-books-1-pipeline
Users that are interested in institutional-books-1-pipeline are comparing it to the libraries listed below
Sorting:
- Transformer GPU VRAM estimator☆67Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 3 months ago
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆39Updated last week
- Python library to use Pleias-RAG models☆67Updated 6 months ago
- Code for collecting, processing, and preparing datasets for the Common Pile☆243Updated 2 months ago
- Pivotal Token Search☆131Updated 4 months ago
- Granite 3.1 Language Models☆131Updated 5 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- lossily compress representation vectors using product quantization☆59Updated last month
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆217Updated last week
- ☆29Updated 2 years ago
- First token cutoff sampling inference example☆32Updated last year
- An introduction to DSPy☆32Updated 3 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated last year
- Japanese / English Bilingual LLM☆27Updated last week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ☆23Updated 10 months ago
- The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academ…☆56Updated 5 months ago
- Flask app for article abstract and listing pages☆175Updated last week
- ☆115Updated 10 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated 11 months ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- ☆25Updated last week
- Your buddy in the (L)LM space.☆64Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆62Updated 2 months ago
- Flow Chart Image-to-Code Generation☆34Updated 2 years ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆58Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 4 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆57Updated 8 months ago