sebastian-hofstaetter / teaching
Open-Source Information Retrieval Courses @ TU Wien
☆586Updated last year
Related projects: ⓘ
- ⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍☆439Updated 2 months ago
- Provides a common interface to many IR ranking datasets.☆314Updated last month
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆408Updated this week
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆259Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆321Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆1,552Updated last month
- docTTTTTquery document expansion model☆350Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆338Updated 8 months ago
- A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).☆638Updated 8 months ago
- A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.☆334Updated 9 months ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆741Updated 4 months ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆283Updated 11 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.☆1,633Updated this week
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆315Updated last year
- Neural information retrieval / Semantic search / Bi-encoders☆166Updated last year
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆292Updated last year
- ☆158Updated 3 years ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆479Updated 3 weeks ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆918Updated 2 weeks ago
- Active Learning for Text Classification in Python☆548Updated this week
- Build Text Rerankers with Deep Language Models☆245Updated 6 months ago
- SGPT: GPT Sentence Embeddings for Semantic Search☆838Updated 7 months ago
- A simple toolkit to process TREC files in Python.☆166Updated 3 weeks ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,192Updated 8 months ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆770Updated 4 months ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆373Updated last year
- Clustering sentence embeddings to extract message intent☆166Updated 2 years ago
- SpanMarker for Named Entity Recognition☆384Updated last month
- SPECTER: Document-level Representation Learning using Citation-informed Transformers☆508Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆251Updated 4 months ago