Inference Server for Llama, Encoding, Everything

A packaged inference server endpoint, with Hugging Face encoders, NLP models, and more.

Update 2025.1.30

This repository no longer works and is largely outdated. However, it currently serves as a demo of error tracking.

Why bundle everything in one?

For fast multi-cluster deployments.

License

MIT