gcloud container ai profiles manifests create --accelerator-type=ACCELERATOR_TYPE --model=MODEL --model-server=MODEL_SERVER [--model-bucket-uri=MODEL_BUCKET_URI] [--model-server-version=MODEL_SERVER_VERSION] [--namespace=NAMESPACE] [--output=OUTPUT; default="all"] [--output-path=OUTPUT_PATH] [--serving-stack=SERVING_STACK] [--serving-stack-version=SERVING_STACK_VERSION] [--target-itl-milliseconds=TARGET_ITL_MILLISECONDS] [--target-ntpot-milliseconds=TARGET_NTPOT_MILLISECONDS] [--target-ttft-milliseconds=TARGET_TTFT_MILLISECONDS] [--use-case=USE_CASE] [GCLOUD_WIDE_FLAG …]
gcloud alpha container ai profiles model-and-server-combinations
list. To get supported accelerators with their performance metrics, run
gcloud alpha container ai profiles accelerators list.
--accelerator-type=ACCELERATOR_TYPE--model=MODEL--model-server=MODEL_SERVER--model-bucket-uri=MODEL_BUCKET_URI--model-server-version=MODEL_SERVER_VERSION--namespace=NAMESPACE--output=OUTPUT; default="all"OUTPUT must be
one of: manifest, comments, all.
--output-path=OUTPUT_PATH--serving-stack=SERVING_STACK--serving-stack-version=SERVING_STACK_VERSION--target-itl-milliseconds=TARGET_ITL_MILLISECONDS--target-ntpot-milliseconds=TARGET_NTPOT_MILLISECONDS--target-ttft-milliseconds=TARGET_TTFT_MILLISECONDS--use-case=USE_CASE--access-token-file,
--account, --billing-project,
--configuration,
--flags-file,
--flatten, --format, --help, --impersonate-service-account,
--log-http,
--project, --quiet, --trace-token, --user-output-enabled,
--verbosity.
Run $ gcloud help for details.
gcloud alpha container ai profiles manifests create
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2026-05-27 UTC.