This code sets up a simple yet robust FastAPI server that handles asynchronous requests for embedding generation and reranking with the BAAI M3 multilingual model.
☆72 · May 8, 2024 · Updated last year
Alternatives and similar repositories for baai_m3_simple_server
Users that are interested in baai_m3_simple_server are comparing it to the libraries listed below
- *high-load* benchmarking tool ☆16 · Mar 2, 2026 · Updated last week
- CPython deep-dive study group ☆16 · Jul 13, 2024 · Updated last year
- Hugging Face RoBERTa with Flash Attention 2 ☆24 · Sep 14, 2025 · Updated 5 months ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode… ☆28 · Sep 5, 2024 · Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,703 · Feb 5, 2026 · Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆159 · Jul 14, 2025 · Updated 7 months ago
- Online shopping site with basic PHP concepts using Bootstrap, PHP, and MySQL ☆12 · Jan 6, 2015 · Updated 11 years ago
- fine-tuning tutorial ☆18 · Feb 20, 2026 · Updated 2 weeks ago
- ☆10 · Apr 15, 2021 · Updated 4 years ago
- User-friendly viewer for Parquet files ☆10 · Updated this week
- ☆11 · Dec 6, 2023 · Updated 2 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging ☆11 · Feb 14, 2026 · Updated 3 weeks ago
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification ☆21 · Jan 29, 2026 · Updated last month
- ☆18 · Oct 9, 2018 · Updated 7 years ago
- ☆10 · Jan 9, 2024 · Updated 2 years ago
- A blazing fast inference solution for text embeddings models ☆4,553 · Feb 25, 2026 · Updated last week
- A file-backed dictionary for Python ☆12 · Aug 15, 2022 · Updated 3 years ago
- Less-Resilient MapReduce for Go ☆10 · Feb 15, 2023 · Updated 3 years ago
- A limechat theme based on the solarized color scheme by Ethan Schoonover ☆28 · Jun 28, 2011 · Updated 14 years ago
- Pure Go MPEG-1 Audio library ☆11 · Nov 20, 2020 · Updated 5 years ago
- Naver oauth2 passport login ☆12 · Mar 11, 2021 · Updated 4 years ago
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases. ☆13 · Jul 12, 2025 · Updated 7 months ago
- A website that makes phone number verification easy on the 비즈엠 (BizM) development server. ☆10 · Feb 27, 2023 · Updated 3 years ago
- Collection of shortest path algorithms (Dijkstra, A*, Bellman-Ford, All pair SP, DFS, BFS, and own) that converge to the most cost-effecti… ☆10 · May 5, 2019 · Updated 6 years ago
- Today I learned / daily learning log (let's become developers with solid fundamentals) ☆12 · Jun 6, 2020 · Updated 5 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- ☆10 · Oct 21, 2022 · Updated 3 years ago
- 📒 Use a terminal inside Markdown documents! ☆10 · Dec 22, 2019 · Updated 6 years ago
- LLMPerf is a library for validating and benchmarking LLMs ☆11 · Aug 13, 2024 · Updated last year
- Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech! ☆10 · Aug 29, 2018 · Updated 7 years ago
- Language Server Indexing Format (LSIF) generator for .net ☆10 · Apr 22, 2022 · Updated 3 years ago
- 🛰️ Assets for Station ☆13 · Aug 18, 2024 · Updated last year
- Few-shot text classification with meta learning and BERT ☆11 · Jun 14, 2021 · Updated 4 years ago
- Fast text chunking algorithms for Python ☆12 · Oct 7, 2020 · Updated 5 years ago
- ☆12 · Mar 25, 2024 · Updated last year
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value d… ☆14 · May 19, 2016 · Updated 9 years ago
- Label shift estimation for transfer difficulty with Familiarity. ☆10 · Feb 4, 2025 · Updated last year
- On November 21, 1972, Kim Doo-han collapsed due to high blood pressure, an orange disease. ☆11 · Jun 27, 2022 · Updated 3 years ago
- uvx is now uvenv ☆15 · Dec 4, 2024 · Updated last year