DefSent: Sentence Embeddings using Definition Sentences
☆23Aug 5, 2021Updated 4 years ago
Alternatives and similar repositories for defsent
Users that are interested in defsent are comparing it to the libraries listed below
Sorting:
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- Learning to Describe Unknown Phrases with Local and Global Contexts☆21Jun 21, 2022Updated 3 years ago
- ☆10Aug 21, 2021Updated 4 years ago
- You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…☆17May 2, 2021Updated 4 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- ☆17May 31, 2023Updated 2 years ago
- Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)☆20Jun 17, 2025Updated 9 months ago
- This is web browser for studying.☆14Nov 3, 2021Updated 4 years ago
- Flexible evaluation tool for language models☆58Updated this week
- microCMS × Next.js × Jamstack☆14Nov 3, 2025Updated 4 months ago
- ☆19May 23, 2024Updated last year
- 🍹 blog 🍹☆30Mar 13, 2026Updated last week
- ☆10Sep 14, 2022Updated 3 years ago
- ☆57Jun 3, 2023Updated 2 years ago
- ☆14Dec 11, 2022Updated 3 years ago
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- ☆12Nov 9, 2018Updated 7 years ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆18Jan 13, 2025Updated last year
- To be readable without enhancing english power.☆10Jul 22, 2020Updated 5 years ago
- Kyoto University Web Document Leads Corpus☆83Dec 18, 2023Updated 2 years ago
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- Implementation of "Neural Word Embedding as Implicit Matrix Factorization"☆14Mar 17, 2022Updated 4 years ago
- A collection of various NLP datasets, mainly Indonesia-related languages.☆15Apr 23, 2022Updated 3 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated last year
- PythonとCythonで出来てる日本語形態素解析エンジン🚧☆13Dec 4, 2019Updated 6 years ago
- ☆15Dec 2, 2022Updated 3 years ago
- Code for the paper "Critical Thinking for Language Models"☆12Jun 1, 2021Updated 4 years ago
- [WIP] Twitter Client Library written in Rust☆51May 6, 2021Updated 4 years ago
- script to evaluate pre-trained Japanese word2vec model on Japanese similarity dataset☆12Nov 4, 2024Updated last year
- A process-based communication control system for containers.☆17Feb 10, 2022Updated 4 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- 日本語テキストに対する wikification のためのソフトウェア☆17Mar 14, 2017Updated 9 years ago
- Multimodal retrieval in art with context embeddings.☆11Jan 5, 2022Updated 4 years ago
- Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".☆15Oct 14, 2022Updated 3 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆87Mar 16, 2026Updated last week
- ☆15Mar 15, 2022Updated 4 years ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- VisualMRC: Machine Reading Comprehension on Document Images (AAAI2021)☆57Mar 31, 2025Updated 11 months ago