AI on GKE

This repository contains assets related to AI/ML workloads on Google Kubernetes Engine (GKE).

Architecture

Defaults:

Creates a new VPC & subnet (can be disabled)
Creates Private Clusters with external endpoints disabled
Registers the cluster with fleet in current project
Solution uses Anthos Connect Gateway to connect to private clusters

Getting Started

0. Deployment Options

There are 3 options for using this repository. You can use any of these.

Deploy with Terraform commands
Deploy with Cloudshell
Deploy with Infrastructure Manager

1. Deploy with Terraform commands

Create Infrastructure (optional)

Platform module can be used to create a GKE cluster. Please check README file

cd platform

Update platform.auto.tfvars with project_id, cluster_name and other required variables

terraform init
terraform apply

Deploy Workloads

The repository supports the following workload deployments

Jupyterhub
Ray Clusters
Triton Inference Server
(TODO) Feast feature store

** TODO: add readme files for individual workload modules & link it here **

cd ../workloads

Update workloads.auto.tfvars with project_id, workloads to install flag and other required variables

terraform init
terraform apply

Delete the deployment

cd workloads
terraform destroy

cd platform
terraform destroy

Accessing Deployments

Lorum Ipsum

2. Deploy with Cloudshell

Deploy the platform & workloads using cloud shell. Follow the tutorial side pannel for instructions.

3. Deploy with Infrastructure Manager

PROJECT_ID=<your-project-id>
SERVICE_ACCOUNT_NAME=<sa-name>

gcloud  infra-manager deployments apply projects/$PROJECT_ID/locations/us-central1/deployments/aiongke-deployment \
    --service-account=projects/$PROJECT_ID/serviceAccounts/$SERVICE_ACCOUNT_NAME@$PROJECT_ID.iam.gserviceaccount.com \
    --git-source-repo=https://github.com/umeshkumhar/ai-on-gke \
    --git-source-directory=platform \
    --git-source-ref=main \
    --input-values=project_id=$PROJECT_ID,cluster_name=ml-cluster0

Check the status of deployment

gcloud infra-manager deployments describe projects/$PROJECT_ID/locations/us-central1/deployments/aiongke-deployment

Important Note

The use of the assets contained in this repository is subject to compliance with Google's AI Principles

Licensing

See LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 195 Commits
iam		iam
modules		modules
platform		platform
workloads		workloads
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
cloudbuild.yaml		cloudbuild.yaml
cloudbuild_delete.yaml		cloudbuild_delete.yaml
tutorial.md		tutorial.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI on GKE

Architecture

Getting Started

0. Deployment Options

1. Deploy with Terraform commands

Create Infrastructure (optional)

Deploy Workloads

Delete the deployment

Accessing Deployments

2. Deploy with Cloudshell

3. Deploy with Infrastructure Manager

Important Note

Licensing

About

Releases

Packages

Languages

License

genai-llm/platform

Folders and files

Latest commit

History

Repository files navigation

AI on GKE

Architecture

Getting Started

0. Deployment Options

1. Deploy with Terraform commands

Create Infrastructure (optional)

Deploy Workloads

Delete the deployment

Accessing Deployments

2. Deploy with Cloudshell

3. Deploy with Infrastructure Manager

Important Note

Licensing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages