Bundled inference server (drop-in replacement) for Hugging Face, optimized for monitoring and now serving as an example for error tracking.

West-Computing-Club/Inference-Server-ET

Inference Server for Llama, Encoding, everything.

A packaged inference server endpoint with Hugging Face encoders, NLP models, and more.

Update 2025.1.30

This repository no longer works and is largely outdated. It currently serves only as a demo of error tracking.

Why bundle everything in one?

For fast multi-cluster deployments.

License

MIT
