MEXMA: Token-level objectives improve sentence representations
☆43Jan 6, 2025Updated last year
Alternatives and similar repositories for mexma
Users that are interested in mexma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.☆33Dec 2, 2025Updated 5 months ago
- ☆14Jul 7, 2024Updated last year
- Training code for Sparse Autoencoders on Embedding models☆39May 9, 2026Updated 2 weeks ago
- AutoRAG example about benchmarking Korean embeddings.☆45Oct 2, 2024Updated last year
- Korean Sentence Embedding Model Performance Benchmark for RAG☆50Jan 27, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆63Jan 26, 2025Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆372Apr 13, 2026Updated last month
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆930Oct 28, 2024Updated last year
- ☆42Jan 29, 2026Updated 3 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 8 months ago
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆13Apr 29, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 7 months ago
- ☆45Apr 22, 2025Updated last year
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델☆217Apr 14, 2026Updated last month
- The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative☆11Aug 6, 2025Updated 9 months ago
- ☆19May 16, 2024Updated 2 years ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 8 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 11 months ago
- huggingface에 있는 한국어 데이터 세트☆36Oct 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆34Feb 27, 2024Updated 2 years ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- A simple implementation of DP-RAG☆17Mar 17, 2025Updated last year
- ☆19Apr 18, 2025Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆376Dec 12, 2024Updated last year
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆32Apr 14, 2026Updated last month
- Benchmarking library for RAG☆272Mar 11, 2026Updated 2 months ago
- Computationally friendly hyper-parameter search with DP-SGD☆26Jan 7, 2025Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 대학생을 위한 IT 스펙 저장소 PRE:FOLIO 클라이언트☆10Jul 19, 2023Updated 2 years ago
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆38Apr 13, 2026Updated last month
- ☆26Feb 11, 2025Updated last year
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 7 months ago
- Solar vs GLM vs Phi☆102Jan 2, 2026Updated 4 months ago
- Codebase for generation-time and post-hoc text watermarking, as well as watermark radioactivity detection.☆61Updated this week