esborisova / Awesome-Table-Understanding-Datasets
Curated list of awesome datasets for various table understanding tasks
☆18 · Sep 5, 2025 · Updated 5 months ago
Alternatives and similar repositories for Awesome-Table-Understanding-Datasets
Users who are interested in Awesome-Table-Understanding-Datasets are comparing it to the repositories listed below
- Search the biomedical literature for protein interactions and protein associations ☆11 · Nov 24, 2023 · Updated 2 years ago
- Code for Handling Divergent Reference Texts when Evaluating Table-to-Text Generation (Dhingra et al. 2019) ☆30 · Apr 6, 2021 · Updated 4 years ago
- ☆27 · Jul 29, 2023 · Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆13 · Jan 1, 2025 · Updated last year
- Analysis of the Pegida Facebook corpus ☆10 · Jan 31, 2015 · Updated 11 years ago
- Crawler based on a modified browser to detect online tracking. ☆11 · Jul 19, 2023 · Updated 2 years ago
- Utilities to gather software metrics from tools (SONAR, etc.) and store them in ElasticSearch for later display using Kibana. ☆11 · Dec 31, 2017 · Updated 8 years ago
- A repository for resources relating to NLP in the Balochi language ☆19 · Jun 3, 2023 · Updated 2 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022 ☆13 · Oct 20, 2022 · Updated 3 years ago
- (Accepted by EMNLP 2022 main long) Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding ☆14 · Oct 29, 2022 · Updated 3 years ago
- Go through the list of accepted papers for ICLR in the terminal and add them to your reading list. ☆13 · Jan 30, 2021 · Updated 5 years ago
- Persian Datasets including: Wikipedia, Twitter, Hamshahri, Hellokish, NSURL'19, Peyma, Text_mining.ir ☆11 · Oct 6, 2023 · Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging (AACL'22) ☆11 · Aug 25, 2023 · Updated 2 years ago
- ☆11 · Jul 11, 2023 · Updated 2 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021) ☆12 · Oct 25, 2021 · Updated 4 years ago
- A fast implementation of BM25 ☆10 · Sep 15, 2022 · Updated 3 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24 ☆13 · Mar 2, 2024 · Updated last year
- This repo consists of my implementation of DocFormerV2 ☆11 · Mar 31, 2024 · Updated last year
- ☆13 · Feb 2, 2026 · Updated 2 weeks ago
- ☆12 · Jun 25, 2018 · Updated 7 years ago
- Code associated with the project http://predimportance.mit.edu/ ☆12 · Aug 7, 2020 · Updated 5 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22) ☆13 · Jun 17, 2022 · Updated 3 years ago
- SQL and Bash scripts to import the official Stack Overflow data dump and the SOTorrent data set, to retrieve Stack Overflow references fro… ☆15 · Sep 14, 2025 · Updated 5 months ago
- A repository dedicated to learning about ChatGPT training techniques and related knowledge. Contains study notes, code snippets, and reso… ☆12 · Dec 14, 2024 · Updated last year
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages ☆11 · Feb 6, 2024 · Updated 2 years ago
- Scripts for building a geo-located web corpus using Common Crawl data ☆11 · Jan 18, 2026 · Updated last month
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment ☆11 · Apr 6, 2025 · Updated 10 months ago
- ☆12 · Mar 5, 2021 · Updated 4 years ago
- Collects agile metrics from Jira, Bitbucket, and SonarQube and sends them to the Elastic Stack to visualize in Kibana ☆11 · Nov 15, 2022 · Updated 3 years ago
- ☆14 · Aug 6, 2021 · Updated 4 years ago
- This repository contains the dataset and the source code for the EMNLP 2019 paper "A Neural Citation Count Prediction Model based on Peer… ☆10 · Oct 8, 2021 · Updated 4 years ago
- A template primarily for PhD theses but also suitable for Bachelor's or Master's theses ☆11 · Nov 10, 2021 · Updated 4 years ago
- A simple neural network ☆11 · Dec 20, 2018 · Updated 7 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling ☆10 · May 29, 2021 · Updated 4 years ago
- Code repository accompanying the CHI 2021 Paper titled "Adapting User Interfaces with Model-based Reinforcement Learning" ☆16 · Oct 18, 2021 · Updated 4 years ago
- Converts brat standoff format to JSONL format ☆13 · Jan 29, 2022 · Updated 4 years ago
- MCP Server to make searching openrouter easy ☆19 · Feb 7, 2026 · Updated last week
- 🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction. ☆17 · Aug 13, 2025 · Updated 6 months ago