Discovering Universal Geometry in Embeddings with ICA (Published in EMNLP 2023)
☆21Jun 17, 2025Updated 9 months ago
Alternatives and similar repositories for Universal-Geometry-with-ICA
Users that are interested in Universal-Geometry-with-ICA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆17Jul 1, 2021Updated 4 years ago
- script to evaluate pre-trained Japanese word2vec model on Japanese similarity dataset☆12Nov 4, 2024Updated last year
- DefSent: Sentence Embeddings using Definition Sentences☆23Aug 5, 2021Updated 4 years ago
- LaTeX document class for the proceedings of ANLP☆21Oct 28, 2025Updated 5 months ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Embedding language models in probability space via log-likelihood vectors☆16Oct 25, 2025Updated 5 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆126Updated this week
- ☆19Dec 6, 2024Updated last year
- ☆17May 31, 2023Updated 2 years ago
- A soft and fast pattern matcher for billion-scale corpora.☆75Feb 26, 2025Updated last year
- Flexible evaluation tool for language models☆59Apr 6, 2026Updated last week
- Tokyo Metropolitan University Paraphrase Corpus (TMUP)☆11Jun 12, 2017Updated 8 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)☆89Mar 16, 2026Updated 3 weeks ago
- 日本語CLIPモデル☆13Sep 15, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆24Dec 15, 2023Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated 2 years ago
- Fast and Multi-aspect Mining of Complex Time-stamped Event Streams (WWW'23)☆13Jan 27, 2025Updated last year
- Exploring Japanese SimCSE☆69Oct 31, 2023Updated 2 years ago
- NAISTの入試で提出した小論文☆34Jan 27, 2023Updated 3 years ago
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆20Jan 13, 2025Updated last year
- SQL linter tool for BigQuery GoogleSQL (formerly known as StandardSQL).☆17Mar 29, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- To be readable without enhancing english power.☆10Jul 22, 2020Updated 5 years ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆112May 14, 2025Updated 11 months ago
- Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.☆89Nov 3, 2023Updated 2 years ago
- a discord/slack/misskey bot that instantly creates custom emojis with commands (text emojis, AI illustration emojis) / emotion in that m…☆13Apr 25, 2024Updated last year
- Rotated Word Vector Representations and their Interpretability (EMNLP 2017)☆18Jul 13, 2019Updated 6 years ago
- ☆19Mar 12, 2026Updated last month
- Arguments parser with class for Python, inspired by StructOpt☆62Sep 17, 2023Updated 2 years ago
- ☆16Jan 3, 2025Updated last year
- ☆23Sep 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simple implementation of SimCSE☆78Oct 31, 2022Updated 3 years ago
- JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット☆43Sep 9, 2025Updated 7 months ago
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆11Nov 14, 2023Updated 2 years ago
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset☆13Sep 13, 2021Updated 4 years ago
- ☆15Mar 15, 2022Updated 4 years ago
- Kyoto University Web Document Leads Corpus☆83Dec 18, 2023Updated 2 years ago
- Tutorials for PyTorch Geometric(PyG)☆20Jan 6, 2020Updated 6 years ago