SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)
☆16Jul 27, 2024Updated last year
Alternatives and similar repositories for SCT
Users who are interested in SCT are comparing it to the libraries listed below.
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Findings of EMNLP 2022).☆22Sep 13, 2023Updated 2 years ago
- The implementation of CL-ReLKT (NAACL-2022)☆14Aug 31, 2022Updated 3 years ago
- ☆14Dec 13, 2023Updated 2 years ago
- A comprehensive evaluation framework for the SEA region☆27Apr 20, 2026Updated last month
- Official Code for "Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai"☆22May 9, 2025Updated last year
- Benchmark for Thai sentence representation☆136May 27, 2025Updated 11 months ago
- [ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Apr 6, 2025Updated last year
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Sep 3, 2024Updated last year
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 11 months ago
- ☆21Dec 30, 2022Updated 3 years ago
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Apr 27, 2022Updated 4 years ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆37Feb 21, 2026Updated 3 months ago
- Embedding Representation for Indonesian Sentences!☆25Aug 14, 2024Updated last year
- ☆13Nov 19, 2023Updated 2 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa…☆26Nov 10, 2021Updated 4 years ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- Python library for visualizing string edit distance☆10Oct 15, 2021Updated 4 years ago
- Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation (ACL 2021 Findings).☆30Feb 2, 2024Updated 2 years ago
- WangChanGLM 🐘 - The Multilingual Instruction-Following Model☆95Nov 22, 2023Updated 2 years ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- ☆12Dec 7, 2022Updated 3 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆69Aug 6, 2025Updated 9 months ago
- desktop application for viewing and analyzing Claude Code CLI session logs.☆29Aug 16, 2025Updated 9 months ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "Enhancing LLM’s Cognition via Structurization"☆25Aug 5, 2025Updated 9 months ago
- KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual speakers such as Thai, English, and others.☆48Aug 25, 2022Updated 3 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- ☆12Dec 14, 2020Updated 5 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- Language Models as Hierarchy Encoders☆42Jan 6, 2026Updated 4 months ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆12Feb 19, 2024Updated 2 years ago
- Example chats from running BioMCP for demonstrations☆17Jun 20, 2025Updated 11 months ago
- ☆10Oct 2, 2024Updated last year
- ☆24May 6, 2026Updated 2 weeks ago
- ☆40Feb 1, 2023Updated 3 years ago