# Llama3 8B Instruct

## 📖 Introduction

Meta developed and released the Meta [Llama 3](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) family of large language models (LLMs), a collection of pretrained and instruction-tuned generative text models in 8B and 70B sizes. The Llama 3 instruction-tuned models are optimized for dialogue use cases and outperform many of the available open-source chat models on common industry benchmarks.

| Task Type                                                | Description                                                                                |
| -------------------------------------------------------- | ------------------------------------------------------------------------------------------ |
| [Chat](https://www.instill.tech/docs/model/ai-task#chat) | A task to generate conversational-style text output based on single- or multi-modal input. |

## 🔄 Compatibility Matrix

To ensure smooth integration, please refer to the compatibility matrix below. It outlines the compatible versions of the model, [`instill-core`](https://github.com/instill-ai/instill-core), and the [`python-sdk`](https://github.com/instill-ai/python-sdk).

| Model Version | Instill-Core Version | Python-SDK Version |
| ------------- | -------------------- | ------------------ |
| v0.1.0        | >v0.39.0-beta        | >0.11.0            |

> **Note:** Always ensure that you are using compatible versions to avoid unexpected issues.

## 🚀 Preparation

Follow [this](../README.md) guide to get your custom model up and running! Before you do, read through the following sections so that you have all the necessary files ready.

#### Install Python SDK

Install the compatible [`python-sdk`](https://github.com/instill-ai/python-sdk) version according to the compatibility matrix:

```bash
pip install instill-sdk=={version}
```
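The `>` constraints in the compatibility matrix are strict lower bounds. If you want to check a version string against them programmatically, a small helper like the following can do it (a hypothetical sketch, not part of the SDK):

```python
def is_compatible(installed: str, minimum: str) -> bool:
    """Return True if `installed` is strictly greater than `minimum`.

    Handles a leading "v" and pre-release suffixes such as "-beta"
    by comparing only the numeric, dot-separated part of the version.
    """
    def to_tuple(version: str) -> tuple:
        return tuple(int(p) for p in version.lstrip("v").split("-")[0].split("."))
    return to_tuple(installed) > to_tuple(minimum)

# Per the compatibility matrix: python-sdk must be > 0.11.0
print(is_compatible("0.12.0", "0.11.0"))  # True
print(is_compatible("0.11.0", "0.11.0"))  # False
```

Note that this simple tuple comparison ignores pre-release ordering; for anything beyond a quick check, a full version-parsing library is the safer choice.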

#### Get model weights

To download the fine-tuned model weights, please execute the following command:

```bash
git clone https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
```
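After cloning, you can sanity-check that the weights landed on disk before building the image. The snippet below is a minimal sketch; the file names are an assumption based on a typical Hugging Face checkpoint layout and may differ for your revision of the repository:

```python
from pathlib import Path

def missing_files(repo: Path, expected: list[str]) -> list[str]:
    """Return the expected files that are absent from the cloned repo."""
    return [name for name in expected if not (repo / name).exists()]

# Hypothetical checklist: adjust the path and file names to your clone
expected = ["config.json", "tokenizer.json", "generation_config.json"]
print(missing_files(Path("Meta-Llama-3-8B-Instruct"), expected))
```

An empty list means all the expected files are present; anything else suggests the clone is incomplete (large weight files in particular require Git LFS to download fully).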

## Test model image

After you've built the model image, and before pushing it to any **Instill Core** instance, you can check that the model runs locally with the following command:

```bash
instill run instill-ai/llama3-8b-instruct -g -i '{"prompt": "hows life?"}'
```

The input payload must strictly follow the format below:

```json
{
  "prompt": "..."
}
```
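If you are scripting the test, the same payload can be built and serialized in Python before handing it to the CLI (a minimal sketch; only the `prompt` field comes from this guide):

```python
import json

# The payload carries a single "prompt" field, as described above
payload = {"prompt": "hows life?"}
serialized = json.dumps(payload)
print(serialized)  # {"prompt": "hows life?"}

# `serialized` is exactly the string passed to `instill run` via the -i flag
```

Serializing with `json.dumps` rather than hand-writing the string avoids quoting and escaping mistakes when the prompt itself contains quotes.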

A successful run returns output similar to the following:

```bash
2024-09-11 01:48:37,795.795 INFO [Instill] Starting model image...
2024-09-11 01:48:48,785.785 INFO [Instill] Deploying model...
2024-09-11 01:49:28,613.613 INFO [Instill] Running inference...
2024-09-11 01:49:33,350.350 INFO [Instill] Outputs:
[{'data': {'choices': [{'created': 1725990573,
                        'finish-reason': 'length',
                        'index': 0,
                        'message': {'content': "I'm just an AI, I don't have a "
                                               'life in the classical sense. I '
                                               'exist solely to assist and '
                                               'communicate with users like '
                                               "you. I don't have emotions, "
                                               'experiences, or personal '
                                               "relationships. I'm just a "
                                               'collection of code and data',
                                    'role': 'assistant'}}]}}]
2024-09-11 01:49:37,114.114 INFO [Instill] Done
```
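The `Outputs:` line is a printed list of task results. Once you have that structure in Python, the assistant's reply sits a few levels deep; the snippet below is a sketch that assumes the same nesting shown in the log, with hyphenated keys such as `finish-reason` (the quoted content is abbreviated here):

```python
# Abbreviated copy of the structure printed on the "Outputs:" line above
outputs = [{"data": {"choices": [{
    "created": 1725990573,
    "finish-reason": "length",
    "index": 0,
    "message": {"content": "I'm just an AI, I don't have a life "
                           "in the classical sense.",
                "role": "assistant"},
}]}}]

# Drill into the first choice of the first task result
reply = outputs[0]["data"]["choices"][0]["message"]["content"]
print(reply)
```

Because the keys contain hyphens, they must be accessed with subscript notation (`choice["finish-reason"]`) rather than attribute access.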

Here is the list of flags supported by the `instill run` command:

- `-t`, `--tag`: tag for the model image; defaults to `latest`
- `-g`, `--gpu`: pass the host GPU through into the container; use this only when `gpu` is enabled in the model config
- `-i`, `--input`: input in JSON format

---

Happy Modeling! 💡