ilyalasy / DOM-LM
Unofficial PyTorch implementation of the DOM-LM paper.
☆33 Updated 2 years ago
Alternatives and similar repositories for DOM-LM:
Users interested in DOM-LM are comparing it to the libraries listed below:
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval ☆47 Updated 2 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web ☆38 Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 6 months ago
- ☆67 Updated 4 months ago
- Python API for https://vespa.ai, the open big data serving engine ☆121 Updated this week
- A Python library to chunk/group your texts based on semantic similarity. ☆96 Updated 9 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆53 Updated 5 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆49 Updated 6 months ago
- ☆62 Updated 9 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆67 Updated last year
- Vespa application making an index of the CORD-19 dataset. ☆39 Updated 3 months ago
- CLIR version of ColBERT ☆68 Updated last month
- Common crawl extractor ☆75 Updated 11 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 Updated last year
- Completion After Prompt Probability. Make your LLM make a choice ☆76 Updated 5 months ago
- ☆27 Updated 3 months ago
- ☆23 Updated 3 weeks ago
- ☆41 Updated 4 months ago
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing ☆52 Updated 6 months ago
- ☆12 Updated 5 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆108 Updated last week
- Pre-train Static Word Embeddings ☆56 Updated last week
- Repository for deepdoctection tutorial notebooks ☆44 Updated 4 months ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages ☆19 Updated 2 years ago
- ReLM is a Regular Expression engine for Language Models ☆103 Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆128 Updated last year
- ☆38 Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 Updated 11 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆155 Updated last year
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆123 Updated 6 months ago