jina-ai / inference-client
β12Updated last year
Alternatives and similar repositories for inference-client:
Users that are interested in inference-client are comparing it to the libraries listed below
- Python API for authentication, resource management with Hubbleβ19Updated last year
- LLM finetuningβ42Updated last year
- π Unstructured Data Connectors for Haystack 2.0β16Updated last year
- Create and host retrieval plugins for ChatGPT in one clickβ67Updated last year
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIPβ50Updated 2 years ago
- Neural search engine for discovering semantically similar Python repositories on GitHubβ28Updated last year
- A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQLβ29Updated 2 years ago
- H&M Fashion Image similarity search with Weaviate and DocArrayβ43Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 4 months ago
- π¦ XβLLM: Simple & Cutting Edge LLM Finetuningβ11Updated last year
- Using short models to classify long textsβ21Updated 2 years ago
- The collection of bulding blocks building fine-tunable metric learning modelsβ32Updated last month
- Example of Alpaca-LoRA with llama index.β31Updated 2 years ago
- Simple Indexerβ13Updated 2 years ago
- Explore the use of DSPy for extracting features from PDFs πβ39Updated last year
- Integrate an LLM copilot within your Keras model development workflowβ28Updated last year
- Adversarial Training and SFT for Bot Safety Modelsβ39Updated 2 years ago
- ChatBot App built using LangChain and Lightning AIβ18Updated 2 years ago
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creβ¦β19Updated 6 months ago
- Sentence Embedding as a Serviceβ15Updated last year
- Reasoning by Communicating with Agentsβ28Updated last week
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Modelsβ21Updated 5 months ago
- Developing tools to automatically analyze datasetsβ73Updated 6 months ago
- π€ Trade any tensors over the networkβ30Updated last year
- **ARCHIVED** Filesystem interface to π€ Hubβ58Updated 2 years ago
- Rust bindings for CTranslate2β14Updated last year
- BIG: Back In the Game of Creative AIβ27Updated 2 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.β16Updated 2 weeks ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ35Updated last year