MEXMA: Token-level objectives improve sentence representations
☆43Jan 6, 2025Updated last year
Alternatives and similar repositories for mexma
Users that are interested in mexma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆34Dec 2, 2025Updated 6 months ago
- ☆14Jul 7, 2024Updated last year
- Code for the paper "Watermarking Makes Language Models Radioactive"☆23Oct 25, 2024Updated last year
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ…☆26Jan 9, 2026Updated 5 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Training code for Sparse Autoencoders on Embedding models☆39May 9, 2026Updated last month
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 10 months ago
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆63Jan 26, 2025Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆372Apr 13, 2026Updated 2 months ago
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆930Oct 28, 2024Updated last year
- ☆41Jan 29, 2026Updated 4 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- Korean-MTEB☆86May 12, 2026Updated last month
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆13Apr 29, 2024Updated 2 years ago
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 7 months ago
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆221Apr 14, 2026Updated 2 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆40Sep 20, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Performs benchmarking on two Korean datasets with minimal time and effort.☆45Jan 22, 2026Updated 4 months ago
- The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 10 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 10 months ago
- ☆16Mar 3, 2024Updated 2 years ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆38Aug 27, 2025Updated 9 months ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- TrustMark - Universal Watermarking for Arbitrary Resolution Images☆113Apr 30, 2026Updated last month
- ☆34Feb 27, 2024Updated 2 years ago
- Training hybrid models for dummies.☆31Nov 1, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Apr 18, 2025Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆377Dec 12, 2024Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,761Jul 18, 2025Updated 10 months ago
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆32Apr 14, 2026Updated 2 months ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 10 months ago
- Computationally friendly hyper-parameter search with DP-SGD☆26Jan 7, 2025Updated last year
- 대학생을 위한 IT 스펙 저장소 PRE:FOLIO 클라이언트☆10Jul 19, 2023Updated 2 years ago