MEXMA: Token-level objectives improve sentence representations
☆43Jan 6, 2025Updated last year
Alternatives and similar repositories for mexma
Users that are interested in mexma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆31Dec 2, 2025Updated 4 months ago
- ☆14Jul 7, 2024Updated last year
- Code for the paper "Watermarking Makes Language Models Radioactive"☆22Oct 25, 2024Updated last year
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ…☆26Jan 9, 2026Updated 3 months ago
- Training code for Sparse Autoencoders on Embedding models☆39Apr 5, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- AutoRAG example about benchmarking Korean embeddings.☆44Oct 2, 2024Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 8 months ago
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 5 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆60Jan 26, 2025Updated last year
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆18Feb 13, 2026Updated 2 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆364Apr 8, 2026Updated last week
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆42Jan 29, 2026Updated 2 months ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 6 months ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 5 months ago
- ☆43Apr 22, 2025Updated 11 months ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆210Apr 4, 2026Updated last week
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 2 months ago
- The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 8 months ago
- ☆16Mar 3, 2024Updated 2 years ago
- ☆19May 16, 2024Updated last year
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 7 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 10 months ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Apr 8, 2026Updated last week
- TrustMark - Universal Watermarking for Arbitrary Resolution Images☆99Updated this week
- ☆34Feb 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A simple implementation of DP-RAG☆17Mar 17, 2025Updated last year
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 5 months ago
- ☆18Apr 18, 2025Updated 11 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆375Dec 12, 2024Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,755Jul 18, 2025Updated 8 months ago
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Apr 7, 2026Updated last week
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 8 months ago