Hugging Face and Pyserini interoperability
☆19May 18, 2023Updated 3 years ago
Alternatives and similar repositories for gaia
Users that are interested in gaia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Aug 5, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Script for downloading GitHub.☆13Sep 24, 2020Updated 5 years ago
- Pytorch Datasets for Easy-To-Hard☆30Jan 9, 2025Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆46Feb 11, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Developing tools to automatically analyze datasets☆74Apr 3, 2026Updated last month
- ☆11Oct 11, 2023Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- Pile Deduplication Code☆18May 15, 2023Updated 3 years ago
- Using Artificial Intelligence to Augment Human Intelligence☆19May 22, 2018Updated 8 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆30Nov 25, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Transformer LIbrary Docker Stacks☆14Feb 4, 2022Updated 4 years ago
- Yet Another SEquence Tagger☆10Dec 8, 2022Updated 3 years ago
- ☆23Jan 27, 2025Updated last year
- Fast structured perceptron sequential labeler☆15Dec 8, 2015Updated 10 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- ☆10Oct 28, 2020Updated 5 years ago
- ☆10Feb 2, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cellular Automata - Pokemon Type Battle Simulation☆11Oct 26, 2024Updated last year
- A command-line clone of the excellent macOS program LaTeXiT.☆15Nov 13, 2017Updated 8 years ago
- ☆12Apr 13, 2023Updated 3 years ago
- ☆33Jul 17, 2023Updated 2 years ago
- User-friendly viewer for Parquet files☆11May 8, 2026Updated 2 weeks ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆19Dec 4, 2023Updated 2 years ago
- Differentiable Tree Machine☆14Nov 3, 2023Updated 2 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆27Feb 14, 2023Updated 3 years ago
- Dataset for Conversation Semantic Role Labeling☆11Aug 26, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This project collects methods that enhance the comparison between AMR graphs.☆11Jun 15, 2023Updated 2 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆43Nov 10, 2025Updated 6 months ago
- ☆10Jan 12, 2024Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆97Feb 9, 2023Updated 3 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English☆17Oct 20, 2020Updated 5 years ago
- Directed masked autoencoders☆15Mar 25, 2026Updated last month
- FlexiTokens☆23Dec 27, 2025Updated 4 months ago