kyegomez / Falcon
A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations with bitsandbytes, PEFT, GPTQ, assisted generation, RoPE scaling support, and rich generation parameters.
☆13 Updated 7 months ago
Related projects
Alternatives and complementary repositories for Falcon
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆11 Updated 9 months ago
- The Swarm Ecosystem ☆13 Updated 3 months ago
- ☆14 Updated 7 months ago
- Using multiple LLMs for ensemble forecasting ☆16 Updated 9 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆27 Updated this week
- A forest of autonomous agents. ☆18 Updated this week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- Finetune any model on HF in less than 30 seconds ☆56 Updated this week
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆22 Updated 9 months ago
- A Data Source for Reasoning Embodied Agents ☆19 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆19 Updated 9 months ago
- Zeta implementation of a reusable, plug-and-play feedforward from the paper "Exponentially Faster Language Modeling" ☆15 Updated this week
- ☆11 Updated 3 weeks ago
- ☆38 Updated this week
- Seamless Voice Interactions with LLMs ☆11 Updated last year
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks ☆15 Updated this week
- PegasusX: The Future of Multimodal Embeddings 🦄 🦄 ☆15 Updated 3 weeks ago
- Modified Beam Search with periodic restart ☆12 Updated last month
- Generate high-quality textual or multi-modal datasets with Agents ☆17 Updated last year
- The Next Generation Multi-Modality Superintelligence ☆70 Updated 2 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others! ☆19 Updated this week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated this week
- Fast approximate inference on a single GPU with sparsity-aware offloading ☆38 Updated 10 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆41 Updated last month
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆11 Updated 2 months ago
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆14 Updated last month
- ☆28 Updated 2 weeks ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆14 Updated 7 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆25 Updated last year
- ☆16 Updated 9 months ago