
AI on GKE

This repository contains assets related to AI/ML workloads on Google Kubernetes Engine (GKE).

Architecture

Defaults:

  • Creates a new VPC & subnet (can be disabled)
  • Creates Private Clusters with external endpoints disabled
  • Registers the cluster with a fleet in the current project
  • Uses Anthos Connect Gateway to connect to private clusters
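Because the clusters have no external endpoint, kubectl has to reach them through the Connect Gateway. A minimal sketch of fetching credentials that way is shown here. The function name `connect_to_cluster` is hypothetical, and the membership name is an assumption: it is whatever name the cluster was registered under in the fleet, which may or may not match the cluster name. Regional memberships may also need a `--location` flag.

```shell
# Sketch: fetch kubeconfig credentials through the Connect Gateway.
# Arguments (both are assumptions, not values defined by this repository):
#   $1  project that hosts the fleet
#   $2  fleet membership name of the cluster
connect_to_cluster() {
  project_id="$1"
  membership="$2"
  gcloud container fleet memberships get-credentials "$membership" \
    --project "$project_id"
}
```

After calling, for example, `connect_to_cluster <your-project-id> <membership-name>`, a command such as `kubectl get nodes` should go through the gateway rather than a public endpoint.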

Getting Started

0. Deployment Options

There are three ways to deploy from this repository. Choose any one of them.

  1. Deploy with Terraform commands
  2. Deploy with Cloud Shell
  3. Deploy with Infrastructure Manager

1. Deploy with Terraform commands

Create Infrastructure (optional)

The platform module can be used to create a GKE cluster. See its README file for details.

cd platform

Update platform.auto.tfvars with project_id, cluster_name and other required variables
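Before applying, it can help to confirm that the variables were actually set. The following is a minimal sketch of such a pre-flight check. The helper name `check_tfvars` is hypothetical, and it only looks for uncommented `name = ...` assignments; it does not validate the values.

```shell
# Sketch: verify that a tfvars file assigns each named variable.
# Usage: check_tfvars FILE NAME [NAME...]
# Returns 0 if every name has an uncommented assignment, 1 otherwise.
check_tfvars() {
  file="$1"
  shift
  missing=0
  for name in "$@"; do
    # Match lines such as:  project_id = "..."  (commented lines do not match)
    if ! grep -Eq "^[[:space:]]*${name}[[:space:]]*=" "$file"; then
      echo "missing variable: ${name}" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

For example, `check_tfvars platform.auto.tfvars project_id cluster_name && terraform init && terraform apply` would run Terraform only when both variables are present.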

terraform init
terraform apply

Deploy Workloads

The repository supports the following workload deployments:

  • Jupyterhub
  • Ray Clusters
  • Triton Inference Server
  • (TODO) Feast feature store

** TODO: add readme files for individual workload modules & link it here **

cd ../workloads

Update workloads.auto.tfvars with project_id, the flags that select which workloads to install, and other required variables

terraform init
terraform apply

Delete the deployment

cd workloads
terraform destroy

cd ../platform
terraform destroy
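The order matters here: the workloads run on the cluster that the platform module creates, so they are destroyed first. A minimal sketch that wraps both steps is shown below. The function name `destroy_all` and its `repo_root` argument (the path to a checkout of this repository) are hypothetical.

```shell
# Sketch: destroy workloads first, then the platform they run on.
# $1 is assumed to be the path to a checkout of this repository.
destroy_all() {
  repo_root="$1"
  for module in workloads platform; do
    # Run in a subshell so the caller's working directory is unchanged;
    # stop at the first failure so the platform is not destroyed while
    # workloads are still deployed.
    (cd "$repo_root/$module" && terraform destroy) || return 1
  done
}
```

Calling `destroy_all .` from the repository root runs `terraform destroy` in each directory in turn, with Terraform's usual confirmation prompt.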

Accessing Deployments

(TODO) Document how to access the deployed workloads

2. Deploy with Cloud Shell

Deploy the platform & workloads using Cloud Shell. Follow the tutorial side panel for instructions.

Deploy using Cloud Shell

3. Deploy with Infrastructure Manager

PROJECT_ID=<your-project-id>
SERVICE_ACCOUNT_NAME=<sa-name>
gcloud infra-manager deployments apply projects/$PROJECT_ID/locations/us-central1/deployments/aiongke-deployment \
    --service-account=projects/$PROJECT_ID/serviceAccounts/$SERVICE_ACCOUNT_NAME@$PROJECT_ID.iam.gserviceaccount.com \
    --git-source-repo=https://github.com/umeshkumhar/ai-on-gke \
    --git-source-directory=platform \
    --git-source-ref=main \
    --input-values=project_id=$PROJECT_ID,cluster_name=ml-cluster0

Check the status of deployment

gcloud infra-manager deployments describe projects/$PROJECT_ID/locations/us-central1/deployments/aiongke-deployment
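The describe command prints the full deployment resource. When only the lifecycle state is of interest, gcloud's global `--format` flag can narrow the output. The sketch below assumes the resource exposes a `state` field (for example `CREATING`, `ACTIVE`, or `FAILED`); the helper names `deployment_name` and `deployment_state` are hypothetical.

```shell
# Sketch: build the full Infrastructure Manager deployment resource name.
# Usage: deployment_name PROJECT_ID LOCATION DEPLOYMENT_ID
deployment_name() {
  printf 'projects/%s/locations/%s/deployments/%s\n' "$1" "$2" "$3"
}

# Sketch: print only the deployment's state.
# Assumes the describe output has a top-level "state" field.
deployment_state() {
  gcloud infra-manager deployments describe "$(deployment_name "$@")" \
    --format='value(state)'
}
```

For the deployment created above, this would be called as `deployment_state "$PROJECT_ID" us-central1 aiongke-deployment`.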

Important Note

The use of the assets contained in this repository is subject to compliance with Google's AI Principles.

Licensing
