TrelisResearch / code-llama-32k
Run code-llama with 50k tokens using flash attention and better transformer
☆12Updated 2 years ago
Alternatives and similar repositories for code-llama-32k
Users that are interested in code-llama-32k are comparing it to the libraries listed below
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆84Updated last year
- Data extraction with LLM on CPU☆85Updated 2 years ago
- Continuously learning web-browsing AI agent that extends the Voyager architecture.☆40Updated 6 months ago
- Using LlamaIndex with Ray for productionizing LLM applications☆71Updated 2 years ago
- Run GPU inference and training jobs on serverless infrastructure that scales with you.☆102Updated last year
- A voice-enabled chatbot application built using 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and …☆195Updated 2 years ago
- Python client library for improving your LLM app accuracy☆97Updated 10 months ago
- Beginner-friendly repository for launching your first LLM API with Python, LangChain and FastAPI, using local models or the OpenAI API.☆103Updated 2 years ago
- ☆48Updated 2 years ago
- Large Language Model (LLM) Inference API and Chatbot☆127Updated last year
- Weaviate Podcast MCP☆59Updated last week
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Open Source Embeddings Optimisation and Eval Framework for RAG/LLM Applications. Documentations at https://docs.vectorboard.ai/introducti…☆50Updated 2 years ago
- ☆138Updated 2 years ago
- Natural Language Interfaces Powered by LLMs☆96Updated last year
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDFs and QA on…☆48Updated last year
- ☆80Updated 2 years ago
- Repository of the code base for the KT Generation process that we worked on at the Google Cloud and Searce GenAI Hackathon.☆76Updated 2 years ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated 2 years ago
- Develop, evaluate and monitor LLM applications at scale☆98Updated last year
- This is a template retrieval repo to create a Flask API server using LangChain with Cohere embeddings and Qdrant Vector Database☆78Updated 2 years ago
- ☆223Updated 2 years ago
- Function Calling Mistral 7B. Learn how to make function calls for open source LLMs.☆48Updated last year
- An example application built with LangChain CLI and LangServe☆79Updated last year
- ☆60Updated 2 years ago
- Data extraction with LLM on CPU☆112Updated last year
- Building a chatbot powered with a RAG pipeline to read, summarize and quote the most relevant papers related to the user query.☆166Updated last year
- Experimental agent using OpenAI Functions☆79Updated 2 years ago
- ☆103Updated 2 years ago
- A starter app to build AI powered chat bots with Astra DB and LlamaIndex☆74Updated last year