ayaka14732 / TrAVis
TrAVis: Visualise BERT attention in your browser
☆56Updated last year
Alternatives and similar repositories for TrAVis:
Users that are interested in TrAVis are comparing it to the libraries listed below
- JAX implementation of the bart-base model☆30Updated last year
- ROUGE for multilingual Summarization☆23Updated 3 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- A GPT-based generative LM for combined text and math formulas, leveraging tree-based formula encoding.☆33Updated last year
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆86Updated 11 months ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆58Updated last year
- ☆27Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective☆62Updated 2 years ago
- Code for text generation papers searches on ArXiv, with very manual jekyll site creation.☆39Updated 2 weeks ago
- ☆36Updated last year
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…☆46Updated last month
- zero-vocab or low-vocab embeddings☆18Updated 2 years ago
- ☆23Updated 2 years ago
- A library for computing diverse text characteristics and using them to analyze data sets and models with ease.☆40Updated 2 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 4 months ago
- a large scientific paraphrase dataset for longer paraphrase generation☆38Updated 2 years ago
- Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Lo…☆39Updated last year
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated last year
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆51Updated last year
- A simple semantic search engine for scientific papers.☆27Updated last year
- 💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.☆45Updated last year
- Reasoning by Communicating with Agents☆24Updated 3 months ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 8 months ago
- ☆34Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆45Updated last year
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆70Updated 2 years ago
- ☆21Updated 3 years ago