Skip to content

Commit 75db74a

Browse files
committed
fix: updates all old inference URLS to api.aws.us-east-1.cerebrium.ai
1 parent 95e8d39 commit 75db74a

28 files changed

+33
-33
lines changed

cerebrium/container-images/custom-web-servers.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -53,7 +53,7 @@ The configuration requires three key parameters:
5353
<Info>
5454
For ASGI applications like FastAPI, include the appropriate server package
5555
(like `uvicorn`) in your dependencies. After deployment, your endpoints become
56-
available at `https://api.cortex.cerebrium.ai/v4/{project - id}/{app - name}
56+
available at `https://api.aws.us-east-1.cerebrium.ai/v4/{project - id}/{app - name}
5757
/your/endpoint`.
5858
</Info>
5959

cerebrium/container-images/defining-container-images.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -282,7 +282,7 @@ vllm = "latest"
282282
- Code is mounted in `/cortex`—adjust paths accordingly.
283283
- The port in your entrypoint must match the `port` parameter.
284284
- Install any required server packages (uvicorn, gunicorn, etc.) via pip dependencies.
285-
- All endpoints will be available at `https://api.cortex.cerebrium.ai/v4/{project-id}/{app-name}/your/endpoint`.
285+
- All endpoints will be available at `https://api.aws.us-east-1.cerebrium.ai/v4/{project-id}/{app-name}/your/endpoint`.
286286

287287
Deploy as normal with `cerebrium deploy -y`—the system automatically detects and handles custom runtime configuration.
288288

cerebrium/endpoints/async.mdx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ responsibility, while you as the developer are responsible for ensuring that dat
1010
You can enable your function to execute asynchronously by adding the `async` query parameter to your request and setting it to `true`. This would look something like this:
1111

1212
```bash
13-
curl -X POST https://api.cortex.cerebrium.ai/v4/<YOUR-PROJECT-ID>/<YOUR-APP>/run?async=true'\
13+
curl -X POST https://api.aws.us-east-1.cerebrium.ai/v4/<YOUR-PROJECT-ID>/<YOUR-APP>/run?async=true'\
1414
-H 'Content-Type: application/json'\
1515
-H 'Authorization: Bearer <YOUR-JWT-TOKEN>\
1616
--data '{"param": "hello world"}'
@@ -47,7 +47,7 @@ async execution with a specified `webhookEndpoint`, to have Cerebrium automatica
4747
the function response once it has returned:
4848

4949
```bash
50-
curl -X POST <https://api.cortex.cerebrium.ai/v4/><YOUR-PROJECT-ID>/<YOUR-APP>/run?async=true&webhookEndpoint=https%3A%2F%2Fwebhook.site%2F'\
50+
curl -X POST <https://api.aws.us-east-1.cerebrium.ai/v4/><YOUR-PROJECT-ID>/<YOUR-APP>/run?async=true&webhookEndpoint=https%3A%2F%2Fwebhook.site%2F'\
5151
-H 'Content-Type: application/json'\
5252
-H 'Authorization: Bearer <YOUR-JWT-TOKEN>\
5353
--data '{"param": "hello world"}'

cerebrium/endpoints/inference-api.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ By default, all functions on Cerebrium are accessible via authenticated POST req
88
The POST request follows the structure below, where `{function}` is the name of the function you want to invoke. For example, in this case, the function `predict()` from `main.py` is being called.
99

1010
```bash
11-
curl --location --request POST 'https://api.cortex.cerebrium.ai/v4/p-xxxxx/{app-name}/{function}' \
11+
curl --location --request POST 'https://api.aws.us-east-1.cerebrium.ai/v4/p-xxxxx/{app-name}/{function}' \
1212
--header 'Authorization: Bearer <JWT_TOKEN>' \
1313
--header 'Content-Type: application/json' \
1414
--data '{

cerebrium/endpoints/openai-compatible-endpoints.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ from openai import OpenAI
5151

5252
client = OpenAI(
5353
# This is the default and can be omitted
54-
base_url="https://api.cortex.cerebrium.ai/v4/p-xxxxx/1-openai-compatible-endpoint/run", ##This is the name of the function you are calling
54+
base_url="https://api.aws.us-east-1.cerebrium.ai/v4/p-xxxxx/1-openai-compatible-endpoint/run", ##This is the name of the function you are calling
5555
api_key="<CEREBRIUM_JWT_TOKEN>",
5656
)
5757

cerebrium/endpoints/streaming.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ Once you deploy this code snippet and hit the stream endpoint, you will see the
2424
You can do this as follows:
2525

2626
```bash
27-
curl -X POST https://api.cortex.cerebrium.ai/v4/<YOUR-PROJECT-ID>/2-streaming-endpoint/run \
27+
curl -X POST https://api.aws.us-east-1.cerebrium.ai/v4/<YOUR-PROJECT-ID>/2-streaming-endpoint/run \
2828
-H 'Content-Type: application/json'\
2929
-H 'Accept: text/event-stream\
3030
-H 'Authorization: Bearer <YOUR-JWT-TOKEN>\

cerebrium/endpoints/webhook.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ This allows you to use webhooks in your architecture. To achieve this, we can si
88
query parameter to any API call:
99

1010
```bash
11-
curl -X POST https://api.cortex.cerebrium.ai/v4/<YOUR-PROJECT-ID>/<YOUR-APP>/run?webhookEndpoint=https%3A%2F%2Fwebhook.site%2F'\
11+
curl -X POST https://api.aws.us-east-1.cerebrium.ai/v4/<YOUR-PROJECT-ID>/<YOUR-APP>/run?webhookEndpoint=https%3A%2F%2Fwebhook.site%2F'\
1212
-H 'Content-Type: application/json'\
1313
-H 'Authorization: Bearer <YOUR-JWT-TOKEN>\
1414
--data '{"param": "hello world"}'

cerebrium/endpoints/websockets.mdx

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ Explanation:
3535
You can test your WebSocket endpoint using websocat, a command-line utility for connecting to WebSocket servers:
3636

3737
```bash
38-
websocat wss://api.cortex.cerebrium.ai/v4/<your-project-id>/<your-app-name>/<your-websocket-function-name>
38+
websocat wss://api.aws.us-east-1.cerebrium.ai/v4/<your-project-id>/<your-app-name>/<your-websocket-function-name>
3939
```
4040

4141
## Implementing the WebSocket Endpoint
@@ -62,7 +62,7 @@ Client-side Implementation: When connecting from a client app, ensure you handle
6262
```javascript
6363
// Example using JavaScript in a browser
6464
const socket = new WebSocket(
65-
"wss://api.cortex.cerebrium.ai/v4/<your-project-id>/<your-app-name>/<your-websocket-function-name>",
65+
"wss://api.aws.us-east-1.cerebrium.ai/v4/<your-project-id>/<your-app-name>/<your-websocket-function-name>",
6666
);
6767

6868
socket.onopen = function (event) {

cerebrium/getting-started/introduction.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ cerebrium deploy
6262

6363
This will turn the function into a callable endpoint that accepts json parameters (prompt) and can scale to 1000s of requests automatically!
6464

65-
Once deployed, an app becomes callable through a POST endpoint `https://api.cortex.cerebrium.ai/v4/{project-id}/{app-name}/{function-name}` and takes a json parameter, prompt
65+
Once deployed, an app becomes callable through a POST endpoint `https://api.aws.us-east-1.cerebrium.ai/v4/{project-id}/{app-name}/{function-name}` and takes a json parameter, prompt
6666

6767
Great! You made it! Join our Community [Discord](https://discord.gg/ATj6USmeE2) for support and updates.
6868

cerebrium/integrations/vercel.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ Once you have followed the example and deployed the app, you should have an outp
4343

4444
```javascript
4545
fetch(
46-
"https://api.cortex.cerebrium.ai/v4/p-<YOUR PROJECT ID>/mistral-vllm/predict",
46+
"https://api.aws.us-east-1.cerebrium.ai/v4/p-<YOUR PROJECT ID>/mistral-vllm/predict",
4747
{
4848
method: "POST",
4949
headers: {

0 commit comments

Comments
 (0)