Skip to content

Commit 63406dc

Browse files
feng-intelpre-commit-ci[bot]
andauthoredSep 11, 2024··
Yaml: add comments to specify gaudi device ids. (#753)
Signed-off-by: fengding <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
1 parent ff6f841 commit 63406dc

File tree

3 files changed

+10
-3
lines changed

3 files changed

+10
-3
lines changed
 

‎ChatQnA/docker_compose/intel/hpu/gaudi/README.md

+6-1
Original file line numberDiff line numberDiff line change
@@ -243,8 +243,13 @@ export BACKEND_SERVICE_ENDPOINT="http://${host_ip}:8888/v1/chatqna"
243243
export DATAPREP_SERVICE_ENDPOINT="http://${host_ip}:6007/v1/dataprep"
244244
export DATAPREP_GET_FILE_ENDPOINT="http://${host_ip}:6007/v1/dataprep/get_file"
245245
export DATAPREP_DELETE_FILE_ENDPOINT="http://${host_ip}:6007/v1/dataprep/delete_file"
246+
247+
export llm_service_devices=all
248+
export tei_embedding_devices=all
246249
```
247250

251+
To specify the device ids, "llm_service_devices" and "tei_embedding_devices"` can be set as "0,1,2,3" alike. More info in [gaudi docs](https://docs.habana.ai/en/latest/Orchestration/Multiple_Tenants_on_HPU/Multiple_Dockers_each_with_Single_Workload.html).
252+
248253
If guardrails microservice is enabled in the pipeline, the below environment variables are necessary to be set.
249254
250255
```bash
@@ -254,7 +259,7 @@ export SAFETY_GUARD_ENDPOINT="http://${host_ip}:8088"
254259
export GUARDRAIL_SERVICE_HOST_IP=${host_ip}
255260
```
256261

257-
Note: Please replace with `host_ip` with you external IP address, do **NOT** use localhost.
262+
Note: Please replace `host_ip` with your external IP address, do **NOT** use localhost.
258263

259264
### Start all the services Docker Containers
260265

‎ChatQnA/docker_compose/intel/hpu/gaudi/compose.yaml

+2-2
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ services:
3939
no_proxy: ${no_proxy}
4040
http_proxy: ${http_proxy}
4141
https_proxy: ${https_proxy}
42-
HABANA_VISIBLE_DEVICES: all
42+
HABANA_VISIBLE_DEVICES: ${tei_embedding_devices}
4343
OMPI_MCA_btl_vader_single_copy_mechanism: none
4444
MAX_WARMUP_SEQUENCE_LENGTH: 512
4545
INIT_HCCL_ON_ACQUIRE: 0
@@ -121,7 +121,7 @@ services:
121121
HF_TOKEN: ${HUGGINGFACEHUB_API_TOKEN}
122122
HF_HUB_DISABLE_PROGRESS_BARS: 1
123123
HF_HUB_ENABLE_HF_TRANSFER: 0
124-
HABANA_VISIBLE_DEVICES: all
124+
HABANA_VISIBLE_DEVICES: ${llm_service_devices}
125125
OMPI_MCA_btl_vader_single_copy_mechanism: none
126126
runtime: habana
127127
cap_add:

‎ChatQnA/tests/test_compose_on_gaudi.sh

+2
Original file line numberDiff line numberDiff line change
@@ -50,6 +50,8 @@ function start_services() {
5050
export DATAPREP_SERVICE_ENDPOINT="http://${ip_address}:6007/v1/dataprep"
5151
export DATAPREP_GET_FILE_ENDPOINT="http://${ip_address}:6008/v1/dataprep/get_file"
5252
export DATAPREP_DELETE_FILE_ENDPOINT="http://${ip_address}:6009/v1/dataprep/delete_file"
53+
export llm_service_devices=all
54+
export tei_embedding_devices=all
5355

5456
sed -i "s/backend_address/$ip_address/g" $WORKPATH/ui/svelte/.env
5557

0 commit comments

Comments
 (0)
Please sign in to comment.