A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
☆116 · Mar 17, 2023 · Updated 3 years ago
Alternatives and similar repositories for presto
Users that are interested in presto are comparing it to the libraries listed below.
- A curated list of resources dedicated to NLP (papers, blogs, notes, etc.) ☆13 · Nov 30, 2019 · Updated 6 years ago
- ☆40 · Mar 25, 2023 · Updated 3 years ago
- ☆19 · May 6, 2023 · Updated 3 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆40 · Dec 27, 2022 · Updated 3 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 · Apr 21, 2022 · Updated 4 years ago
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation. ☆37 · Oct 15, 2024 · Updated last year
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP ☆11 · Jan 29, 2024 · Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 5 years ago
- NTREX -- News Test References for MT Evaluation ☆87 · Jun 5, 2024 · Updated 2 years ago
- ☆81 · Mar 24, 2025 · Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 · Apr 29, 2023 · Updated 3 years ago
- ☆15 · Oct 31, 2023 · Updated 2 years ago
- ☆54 · Jan 18, 2023 · Updated 3 years ago
- Entailment self-training ☆27 · May 30, 2023 · Updated 3 years ago
- Full finetuning of large language models without large memory requirements ☆94 · Sep 22, 2025 · Updated 8 months ago
- Materials for "Natural Language Processing for Multilingual Task-Oriented Dialogue" Tutorial at ACL 2022 ☆14 · May 21, 2022 · Updated 4 years ago
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Mar 31, 2024 · Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆97 · Feb 9, 2023 · Updated 3 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E… ☆30 · Feb 8, 2023 · Updated 3 years ago
- Official Repository for Efficient Linear-Time Attention Transformers. ☆18 · Jun 2, 2024 · Updated 2 years ago
- ☆10 · Oct 6, 2015 · Updated 10 years ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Apr 4, 2023 · Updated 3 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark" ☆20 · Jan 19, 2024 · Updated 2 years ago
- code for the paper "Zero-Shot Text Classification with Self-Training" for EMNLP 2022 ☆50 · Sep 17, 2025 · Updated 9 months ago
- ☆58 · Nov 17, 2021 · Updated 4 years ago
- Statistics and Accepted paper list of ACL 2020 with arXiv link ☆23 · May 30, 2020 · Updated 6 years ago
- TBC ☆28 · Nov 2, 2022 · Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Jun 21, 2023 · Updated 2 years ago
- The original Backpack Language Model implementation, a fork of FlashAttention ☆71 · May 29, 2023 · Updated 3 years ago
- ☆19 · Apr 21, 2026 · Updated last month
- Apps that run on modal.com ☆13 · Sep 14, 2025 · Updated 9 months ago
- Inference code for LLaMA 2 models ☆30 · Jul 7, 2024 · Updated last year
- Language Quantized AutoEncoders ☆114 · Feb 7, 2023 · Updated 3 years ago
- Implementation for https://arxiv.org/abs/2005.00652 ☆27 · Dec 8, 2022 · Updated 3 years ago
- ☆152 · Jun 2, 2023 · Updated 3 years ago
- For the rlhf learning environment of Koreans ☆25 · Sep 25, 2023 · Updated 2 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks ☆12 · Feb 21, 2020 · Updated 6 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing. ☆126 · Apr 10, 2026 · Updated 2 months ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆28 · Apr 21, 2023 · Updated 3 years ago