Spico197 / paper-hero
💪 A toolkit to help search for papers from aclanthology, arXiv and dblp.
★45 · Updated 2 years ago
Alternatives and similar repositories for paper-hero
Users that are interested in paper-hero are comparing it to the libraries listed below
- ★35 · Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ★31 · Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ★30 · Updated 2 weeks ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ★74 · Updated last year
- An extension of the Transformers library to include a T5ForSequenceClassification class. ★38 · Updated 2 years ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ★35 · Updated last year
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification ★41 · Updated 2 years ago
- ★51 · Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ★32 · Updated last year
- On Transferability of Prompt Tuning for Natural Language Processing ★99 · Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ★39 · Updated 3 years ago
- Few-shot Learning with Auxiliary Data ★28 · Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ★21 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ★48 · Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ★94 · Updated 2 years ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling ★37 · Updated last year
- Official repo of the AAAI 2024 paper "Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization" ★13 · Updated last year
- Transformers at any scale ★41 · Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ★43 · Updated last year
- ★12 · Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection. ★20 · Updated last year
- Large-scale query-focused multi-document Summarization dataset ★10 · Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ★82 · Updated 3 years ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ★37 · Updated 3 months ago
- Embedding Recycling for Language models ★38 · Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ★21 · Updated 4 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ★34 · Updated last year
- Utilities for Training Very Large Models ★58 · Updated 8 months ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ★81 · Updated last year
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods. ★22 · Updated last year