
Add CentML as an inference provider #1394


Open · wants to merge 16 commits into main

Conversation


@V2arK V2arK commented Apr 28, 2025

✨ Add CentML serverless support to huggingface.js/inference

What’s in this PR

| Area | Change |
| --- | --- |
| README | Adds CentML to the supported-provider tables and links to the partner-model list. |
| Provider wiring | `src/lib/getProviderHelper.ts` registers CentML for `conversational`; `src/providers/centml.ts` adds the new task classes (sketched below). |
| Docs | Links to https://huggingface.co/api/partners/centml/models. |
| Misc | Uses the current CentML base URL https://api.centml.com. |
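
For context, the registration follows the existing provider map pattern in `src/lib/getProviderHelper.ts`; a minimal sketch only (types and the other providers' entries are omitted here, and the exact shape is in the diff):

```ts
// src/lib/getProviderHelper.ts (sketch). Assumes the existing
// provider-name -> task -> helper map that other providers already use.
import { CentMLConversationalTask } from "../providers/centml";

export const PROVIDERS = {
	// ...existing providers (fal-ai, replicate, together, ...) unchanged...
	centml: {
		// This PR registers CentML for the `conversational` task only.
		conversational: new CentMLConversationalTask(),
	},
};
```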

Why

* CentML just opened public serverless endpoints; supporting it keeps huggingface.js a one-stop client for all HF partners.
* CentML routes are OpenAI-shaped, so we extend the existing base classes (see the sketch after this list).
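
Roughly, the new provider class has this shape; a minimal sketch assuming the shared BaseConversationalTask helper that other OpenAI-compatible providers extend (the constructor signature shown is an assumption, see `src/providers/centml.ts` in the diff for the exact code):

```ts
// src/providers/centml.ts (sketch). Assumes BaseConversationalTask exposes a
// (provider, baseUrl) constructor like the other OpenAI-compatible helpers do.
import { BaseConversationalTask } from "./providerHelper";

export class CentMLConversationalTask extends BaseConversationalTask {
	constructor() {
		// Base URL from this PR; the chat-completions route and payload handling
		// are inherited from the shared OpenAI-compatible base class.
		super("centml", "https://api.centml.com");
	}
}
```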

How I tested

Enabled the CentML tests in packages/inference/test/InferenceClient.spec.ts, then ran:

pnpm test -- \
  --reporter verbose \
  test/InferenceClient.spec.ts -t "CentML"

> @huggingface/[email protected] test /Users/honglin/centML/huggingface.js/packages/inference
> vitest run --config vitest.config.mts "--reporter" "verbose" "test/InferenceClient.spec.ts" "-t" "CentML"


 RUN  v0.34.6 /Users/honglin/centML/huggingface.js/packages/inference

stderr | unknown test
Set HF_TOKEN in the env to run the tests for better rate limits

 ✓ test/InferenceClient.spec.ts (100) 4226ms
   ✓ InferenceClient (100) 4225ms
     ↓ backward compatibility (1) [skipped]
       ↓ works with old HfInference name [skipped]
     ↓ HF Inference (48) [skipped]
       ↓ throws error if model does not exist [skipped]
       ↓ fillMask [skipped]
       ↓ works without model [skipped]
       ↓ summarization [skipped]
       ↓ questionAnswering [skipped]
       ↓ tableQuestionAnswering [skipped]
       ↓ documentQuestionAnswering [skipped]
       ↓ documentQuestionAnswering with non-array output [skipped]
       ↓ visualQuestionAnswering [skipped]
       ↓ textClassification [skipped]
       ↓ textGeneration - gpt2 [skipped]
       ↓ textGeneration - openai-community/gpt2 [skipped]
       ↓ textGenerationStream - meta-llama/Llama-3.2-3B [skipped]
       ↓ textGenerationStream - catch error [skipped]
       ↓ textGenerationStream - Abort [skipped]
       ↓ tokenClassification [skipped]
       ↓ translation [skipped]
       ↓ zeroShotClassification [skipped]
       ↓ sentenceSimilarity [skipped]
       ↓ FeatureExtraction [skipped]
       ↓ FeatureExtraction - auto-compatibility sentence similarity [skipped]
       ↓ FeatureExtraction - facebook/bart-base [skipped]
       ↓ FeatureExtraction - facebook/bart-base, list input [skipped]
       ↓ automaticSpeechRecognition [skipped]
       ↓ audioClassification [skipped]
       ↓ audioToAudio [skipped]
       ↓ textToSpeech [skipped]
       ↓ imageClassification [skipped]
       ↓ zeroShotImageClassification [skipped]
       ↓ objectDetection [skipped]
       ↓ imageSegmentation [skipped]
       ↓ imageToImage [skipped]
       ↓ imageToImage blob data [skipped]
       ↓ textToImage [skipped]
       ↓ textToImage with parameters [skipped]
       ↓ imageToText [skipped]
       ↓ request - openai-community/gpt2 [skipped]
       ↓ tabularRegression [skipped]
       ↓ tabularClassification [skipped]
       ↓ endpoint - makes request to specified endpoint [skipped]
       ↓ chatCompletion modelId - OpenAI Specs [skipped]
       ↓ chatCompletionStream modelId - OpenAI Specs [skipped]
       ↓ chatCompletionStream modelId Fail - OpenAI Specs [skipped]
       ↓ chatCompletion - OpenAI Specs [skipped]
       ↓ chatCompletionStream - OpenAI Specs [skipped]
       ↓ custom mistral - OpenAI Specs [skipped]
       ↓ custom openai - OpenAI Specs [skipped]
       ↓ OpenAI client side routing - model should have provider as prefix [skipped]
     ↓ Fal AI (4) [skipped]
       ↓ textToImage - black-forest-labs/FLUX.1-schnell [skipped]
       ↓ textToImage - SD LoRAs [skipped]
       ↓ textToImage - Flux LoRAs [skipped]
       ↓ automaticSpeechRecognition - openai/whisper-large-v3 [skipped]
     ↓ Featherless (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textGeneration [skipped]
     ↓ Replicate (10) [skipped]
       ↓ textToImage canonical - black-forest-labs/FLUX.1-schnell [skipped]
       ↓ textToImage canonical - black-forest-labs/FLUX.1-dev [skipped]
       ↓ textToImage canonical - stabilityai/stable-diffusion-3.5-large-turbo [skipped]
       ↓ textToImage versioned - ByteDance/SDXL-Lightning [skipped]
       ↓ textToImage versioned - ByteDance/Hyper-SD [skipped]
       ↓ textToImage versioned - playgroundai/playground-v2.5-1024px-aesthetic [skipped]
       ↓ textToImage versioned - stabilityai/stable-diffusion-xl-base-1.0 [skipped]
       ↓ textToSpeech versioned [skipped]
       ↓ textToSpeech OuteTTS -  usually Cold [skipped]
       ↓ textToSpeech Kokoro [skipped]
     ↓ SambaNova (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ featureExtraction [skipped]
     ↓ Together (4) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
       ↓ textGeneration [skipped]
     ↓ Nebius (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
     ↓ 3rd party providers (1) [skipped]
       ↓ chatCompletion - fails with unsupported model [skipped]
     ↓ Fireworks (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Hyperbolic (4) [skipped]
       ↓ chatCompletion - hyperbolic [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
       ↓ textGeneration [skipped]
     ↓ Novita (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Black Forest Labs (2) [skipped]
       ↓ textToImage [skipped]
       ↓ textToImage URL [skipped]
     ↓ Cohere (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Cerebras (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ↓ Nscale (3) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
       ↓ textToImage [skipped]
     ↓ Groq (2) [skipped]
       ↓ chatCompletion [skipped]
       ↓ chatCompletion stream [skipped]
     ✓ CentML (4) 4225ms
       ✓ chat completions (4) 4225ms
         ✓ basic chat completion 1047ms
         ✓ chat completion with multiple messages 970ms
         ✓ chat completion with parameters 1555ms
         ✓ chat completion stream 653ms

 Test Files  1 passed (1)
      Tests  4 passed | 96 skipped (100)
   Start at  01:20:35
   Duration  4.72s (transform 254ms, setup 9ms, collect 297ms, tests 4.23s, environment 0ms, prepare 54ms)
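
With this in place, CentML can be called like any other conversational provider from the client; a minimal usage sketch (the model ID below is illustrative, any model listed at https://huggingface.co/api/partners/centml/models should work):

```ts
import { InferenceClient } from "@huggingface/inference";

const client = new InferenceClient(process.env.HF_TOKEN);

// Illustrative model ID; pick any CentML-served model from
// https://huggingface.co/api/partners/centml/models.
const out = await client.chatCompletion({
	provider: "centml",
	model: "meta-llama/Llama-3.2-3B-Instruct",
	messages: [{ role: "user", content: "Hello from huggingface.js!" }],
});

console.log(out.choices[0].message);
```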

Thanks for reviewing! 🙏

@V2arK V2arK marked this pull request as draft April 28, 2025 17:09
@julien-c julien-c changed the title from "Add CentML as an inference provide" to "Add CentML as an inference provider" Apr 28, 2025
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@V2arK V2arK marked this pull request as ready for review April 29, 2025 07:24
@hanouticelina hanouticelina (Contributor) left a comment

Looks good, thank you @V2arK for the contribution!
I tested the PR (using the free credits you offer for new users) and it works as expected ✅

@V2arK V2arK commented May 4, 2025

Is there anything else I need to do to get it merged?

@V2arK V2arK requested a review from julien-c May 5, 2025 20:03