ilyalasy / DOM-LMLinks
Unofficial Pytorch implementation of Dom-LM paper.
☆33Updated 2 years ago
Alternatives and similar repositories for DOM-LM
Users that are interested in DOM-LM are comparing it to the libraries listed below
Sorting:
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Updated last year
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆50Updated 3 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆68Updated 2 years ago
- SAIL: Search Augmented Instruction Learning☆158Updated 6 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository☆71Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆103Updated last year
- Common crawl extractor☆84Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- A Python library to chunk/group your texts based on semantic similarity.☆103Updated last year
- Reward Model framework for LLM RLHF☆62Updated 2 years ago
- ☆185Updated 2 years ago
- ☆83Updated 3 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆192Updated 7 months ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆58Updated 10 months ago
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Updated 6 months ago
- Efficient few-shot learning with cross-encoders.☆62Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 5 months ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- StAtutory Reasoning Assessment☆15Updated 3 years ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆99Updated 4 months ago
- We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in …☆54Updated 2 years ago
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆251Updated 2 years ago