allenai / bffView external linksLinks
☆38Apr 17, 2024Updated last year
Alternatives and similar repositories for bff
Users that are interested in bff are comparing it to the libraries listed below
Sorting:
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- ☆15Apr 26, 2024Updated last year
- Map (deep learning) model weights between different model implementations.☆19Jan 30, 2025Updated last year
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Jan 16, 2024Updated 2 years ago
- Natural language detection, Java bindings for CLD2☆14Sep 26, 2025Updated 4 months ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- Multimodal extreme classification☆20May 1, 2024Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- Package and scripts used to build a dataset of Wikipedia articles in Markdown.☆20Sep 11, 2023Updated 2 years ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- ☆44Nov 17, 2024Updated last year
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- Scaling Data-Constrained Language Models☆341Jun 28, 2025Updated 7 months ago
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆27Sep 27, 2024Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- ☆29Jul 17, 2023Updated 2 years ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆56Feb 28, 2023Updated 2 years ago
- Codebase for Inference-Time Policy Adapters☆25Nov 3, 2023Updated 2 years ago
- ☆26Feb 27, 2022Updated 3 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- ☆12Jan 17, 2026Updated 3 weeks ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Pytorch Seq2Seq framework☆27Jan 22, 2026Updated 3 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,885Updated this week
- The data and implementation for the experiments in the paper "Flows: Building Blocks of Reasoning and Collaborating AI".☆31Feb 12, 2024Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆34Jun 2, 2021Updated 4 years ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning☆33Jun 2, 2023Updated 2 years ago
- ☆32Jun 17, 2024Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Apr 1, 2025Updated 10 months ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆35Feb 5, 2026Updated last week
- The Multitask Long Document Benchmark☆42Nov 2, 2022Updated 3 years ago
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Pequenos projetos e testes simples em linguagem Python.☆11Jan 28, 2018Updated 8 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 6 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆562Dec 28, 2024Updated last year
- an unofficial code for augment-XY-CUT in XYLayoutLM☆30Jul 12, 2022Updated 3 years ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆93Jul 25, 2023Updated 2 years ago