MEXMA: Token-level objectives improve sentence representations
☆43Jan 6, 2025Updated last year
Alternatives and similar repositories for mexma
Users that are interested in mexma are comparing it to the libraries listed below.
- Make running benchmarks simple yet maintainable again. Currently supports only Korean cross-encoders.☆32Dec 2, 2025Updated 5 months ago
- ☆14Jul 7, 2024Updated last year
- Code for the paper "Watermarking Makes Language Models Radioactive"☆22Oct 25, 2024Updated last year
- This repository provides a comprehensive benchmark for evaluating the performance of neural watermarking techniques. The benchmark includ…☆26Jan 9, 2026Updated 3 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 5 months ago
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆23Aug 2, 2025Updated 9 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆368Apr 13, 2026Updated 3 weeks ago
- Coord: A Unified Interface for All Models☆18Feb 2, 2026Updated 3 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆930Oct 28, 2024Updated last year
- ☆42Jan 29, 2026Updated 3 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 7 months ago
- Korean-MTEB☆83Apr 16, 2026Updated 2 weeks ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated 2 years ago
- Load any CLIP model with a standardized interface☆22Oct 20, 2025Updated 6 months ago
- Mixture of LoRA Experts☆10Apr 7, 2024Updated 2 years ago
- KURE: an embedding model specialized for Korean retrieval, developed at Korea University☆213Apr 14, 2026Updated 3 weeks ago
- Notebook that provides an overview of several text summarization techniques☆11Mar 22, 2019Updated 7 years ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆46Jan 22, 2026Updated 3 months ago
- The inverted index exchange format defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 8 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 9 months ago
- ☆16Mar 3, 2024Updated 2 years ago
- ☆19May 16, 2024Updated last year
- A hackable, simple, and research-friendly GRPO training framework with high-speed weight synchronization in a multi-node environment.☆37Aug 27, 2025Updated 8 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 11 months ago
- Korean datasets available on Hugging Face☆36Oct 10, 2024Updated last year
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆39Apr 22, 2026Updated last week
- TrustMark - Universal Watermarking for Arbitrary Resolution Images☆105Apr 9, 2026Updated 3 weeks ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- ☆34Feb 27, 2024Updated 2 years ago
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 6 months ago
- ☆19Apr 18, 2025Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆375Dec 12, 2024Updated last year
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆31Apr 14, 2026Updated 3 weeks ago
- The unofficial CLI of Amazon S3 Vectors (Preview) in Rust☆17Jul 19, 2025Updated 9 months ago
- Benchmarking library for RAG☆268Mar 11, 2026Updated last month
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago