infinigence / InfiniWebSearchLinks
A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.
☆39Updated 10 months ago
Alternatives and similar repositories for InfiniWebSearch
Users that are interested in InfiniWebSearch are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆138Updated 10 months ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limi…☆29Updated 2 years ago
- ☆234Updated last year
- Imitate OpenAI with Local Models☆88Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆140Updated last year
- ☆164Updated last year
- Alpaca Chinese Dataset -- 中文指令微调数据集☆216Updated last year
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆264Updated last year
- 首个llama2 13b 中文版模型 (Base + 中文对话SFT,实现流畅多轮人机自然语言交互)☆91Updated 2 years ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆114Updated last week
- gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。☆216Updated this week
- 探索 LLM 在法律行业的应用潜力☆91Updated 10 months ago
- code for piccolo embedding model from SenseTime☆140Updated last year
- 中文基于满血DeepSeek-R1蒸馏数据集☆62Updated 8 months ago
- 中文原生检索增强生成测评基准☆123Updated last year
- 部署你自己的OpenAI api🤩, 基于flask, transformers (使用 Baichuan2-13B-Chat-4bits 模型, 可以运行在单张Tesla T4显卡) ,实现了OpenAI中Chat, Models和Completions接口,包含流式响…☆96Updated last year
- 360zhinao☆290Updated 5 months ago
- Baichuan2代码的逐行解析版本,适合小白☆214Updated 2 years ago
- ☆337Updated last week
- qwen models finetuning☆105Updated 7 months ago
- 旨在对当前主流LLM进行一个直观、具体、标准的评测☆94Updated 2 years ago
- llama inference for tencentpretrain☆99Updated 2 years ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆35Updated last year
- ☆241Updated 8 months ago
- GLM Series Edge Models☆149Updated 4 months ago
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆211Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆68Updated last year
- 文本去重☆76Updated last year
- [ACL2025 demo track] ROGRAG: A Robustly Optimized GraphRAG Framework☆175Updated last month