The Roadmap for LLMs
☆85Aug 1, 2023Updated 2 years ago
Alternatives and similar repositories for llm
Users that are interested in llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- Large-scale exact string matching tool☆17Mar 7, 2025Updated last year
- Official code repository for PHYBench.☆24May 16, 2025Updated last year
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆30Aug 19, 2025Updated 9 months ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆524Jun 2, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Solving Competition Geometry Problems in Lean☆38Aug 26, 2025Updated 9 months ago
- A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUs☆10Oct 17, 2022Updated 3 years ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 5 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆90Mar 24, 2024Updated 2 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆24Nov 18, 2023Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆16Oct 1, 2020Updated 5 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- ☆308Apr 15, 2026Updated 2 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- ☆14Jun 20, 2022Updated 3 years ago
- Python script to download conference paper automatically☆16Sep 10, 2024Updated last year
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Apr 7, 2024Updated 2 years ago
- The codebase of Cola DLM☆217Jun 6, 2026Updated last week
- EasyTTS是一个便捷的工具,旨在方便地使用第三方API服务来调用OpenAI的文本转语音(TTS)功能。 EasyTTS允许用户输入文本,并选 择不同的模型、音色、格式来生成音频文件。☆10Nov 26, 2023Updated 2 years ago
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- ☆11Sep 4, 2019Updated 6 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆68Jan 26, 2026Updated 4 months ago
- ☆13Jun 16, 2021Updated 5 years ago
- Use to store public paper and organize them.☆18Feb 26, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- 非结构化商业文本信息中隐私信息识别比赛代码仓库☆22Jan 11, 2024Updated 2 years ago
- 旨在对当前主流LLM进行一个直观、具体、标准的评测☆94Jun 20, 2023Updated 2 years ago
- CCL 2023 汉语学习者文本纠错评测☆32Jul 12, 2023Updated 2 years ago
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆12,169Mar 11, 2025Updated last year
- ☆126Feb 10, 2024Updated 2 years ago
- ☆196Feb 6, 2025Updated last year