kyegomez / OmniByteFormerLinks

OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing traditional tokenization or specific data-type encodings.

☆14

Alternatives and similar repositories for OmniByteFormer

Users that are interested in OmniByteFormer are comparing it to the libraries listed below

Sorting:

The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆17Updated 2 months ago
The-Swarm-Corporation / agentverse
Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!
☆16Updated last week
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated 3 weeks ago
The-Swarm-Corporation / Mamba-R1
Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…
☆25Updated 2 months ago
The-Swarm-Corporation / OmniParse
Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …
☆20Updated 2 months ago
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Updated last year
ZackBradshaw / ikigAI
☆14Updated last year
kyegomez / TinyGPTV
Simple Implementation of TinyGPTV in super simple Zeta lego blocks
☆16Updated last year
kyegomez / HSSS
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…
☆14Updated last year
kyegomez / forest-of-thoughts
A forest of autonomous agents.
☆19Updated 11 months ago
kyegomez / CogNetX
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…
☆19Updated last week
Agora-Lab-AI / OmegaViT
OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…
☆14Updated last week
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Updated last year
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆57Updated 7 months ago
superagi / Veagle
Enhancement in Multimodal Representation Learning.
☆40Updated last year
kyegomez / SimpleMamba
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
☆39Updated last year
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]
☆25Updated 2 months ago
kyegomez / Tiktokx
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…
☆13Updated 2 years ago
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated last year
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆46Updated 3 months ago
13331112522 / v-rag
Visual RAG using less than 300 lines of code.
☆29Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆35Updated last year
facebookresearch / NeuralMemory
A Data Source for Reasoning Embodied Agents
☆19Updated 2 years ago
OpenMOSS / Lorsa
☆29Updated last month
shulin16 / MMInA
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆47Updated 10 months ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆19Updated this week
The-Swarm-Corporation / Brainwave
Brainwave is a state-of-the-art neural decoder that transforms electroencephalogram (EEG) and brain signals into multimodal outputs inclu…
☆14Updated 2 months ago
kyegomez / Qwen-VL
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆12Updated last year
kyegomez / Falcon
A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…
☆12Updated last year
kyegomez / TTL
Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"
☆25Updated last week