
Guide to Build and Deploy a Scalable Machine Learning App with Streamlit, Docker, and GKE

Jul 06, 2025 By Alison Perry

Creating a machine learning app is one thing, but getting it to run smoothly across different platforms is a whole other task. Most developers struggle not with the model itself, but with the setup around it—UI, containers, and cloud services. That’s where Streamlit, Docker, and GKE come in. Together, they help turn your code into a usable app that others can access, test, and rely on. In this guide, you’ll learn how to move from a working ML script to a full app that runs on Google Cloud.

How to Build and Deploy an ML App Using Streamlit, Docker, and GKE

Start With Streamlit to Build the UI

Most ML scripts are fine in notebooks or command lines, but if you want others to interact with them, you need a proper interface. Streamlit is lightweight, easy to set up, and doesn’t ask for much. If you can write a few Python functions, you’re good to go.

Set up your app

Make sure Python is installed, then create a virtual environment and install Streamlit.

bash

python -m venv .venv
source .venv/bin/activate   # on Windows: .venv\Scripts\activate
pip install streamlit

Create a new file, say app.py, and write a small app:

python

import streamlit as st
import joblib

# Load the trained model once when the script starts
model = joblib.load('model.pkl')

st.title("Prediction App")

user_input = st.number_input("Enter a value")

if st.button("Predict"):
    # scikit-learn models expect a 2D array: one row, one feature
    result = model.predict([[user_input]])
    st.write(f"Prediction: {result[0]}")

Place your trained model (model.pkl) in the same folder. Run the app:

bash

streamlit run app.py

That’s your basic app. Clean layout, no fluff, and fully functional.

Wrap It in Docker for Portability

Running the app locally is fine, but you want it to work the same way for everyone. That’s what Docker is for. It gives you a consistent environment, no matter the machine.

Write a Dockerfile

Here’s a simple one:

# Use an official Python runtime as a base image
FROM python:3.10-slim

# Set environment variables
ENV MODEL_PATH=/app/model.pkl

# Set working directory
WORKDIR /app

# Copy current directory contents into the container
COPY . /app

# Install dependencies (scikit-learn is needed to unpickle a model trained with it)
RUN pip install --no-cache-dir streamlit joblib scikit-learn

# Expose the port Streamlit runs on
EXPOSE 8501

# Run the app
CMD ["streamlit", "run", "app.py"]

Build and test the container

bash

docker build -t ml-app .
docker run -p 8501:8501 ml-app

Now, your app runs in a container. Open your browser and go to http://localhost:8501 to confirm everything’s working.

Handle Model Files and Environment Variables

When running the app locally, you can load your model from a .pkl file sitting right next to your script. But things change once you move to containers and the cloud. It's better to keep model files in a consistent path and avoid hardcoding anything. You can use environment variables to set paths or API keys, then pass them into the container. In Docker, it looks like this:

dockerfile

ENV MODEL_PATH=/app/model.pkl

And in your code:

python

import os
import joblib

# Fall back to a local model.pkl when the variable isn't set
model_path = os.getenv("MODEL_PATH", "model.pkl")
model = joblib.load(model_path)

On GKE, you can add these variables to your deployment file under 'env'. If your model is big or updates often, store it in a Google Cloud Storage bucket and download it when the container starts.
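If you take the Cloud Storage route, a small startup helper keeps the rest of app.py unchanged. Here's a minimal sketch using the google-cloud-storage client; the bucket and object names are placeholders, and it assumes the pod's service account has read access to the bucket:

python

import os
from google.cloud import storage  # pip install google-cloud-storage

def fetch_model(bucket_name: str, blob_name: str, dest: str) -> str:
    """Download the model from GCS unless it's already on disk."""
    if not os.path.exists(dest):
        client = storage.Client()  # uses the pod's service account credentials on GKE
        client.bucket(bucket_name).blob(blob_name).download_to_filename(dest)
    return dest

# Bucket and object names here are placeholders -- swap in your own
model_path = fetch_model("my-ml-models", "model.pkl", os.getenv("MODEL_PATH", "/app/model.pkl"))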

Optimize Docker Image Size

Streamlit apps don't need heavy images. If your container is large, builds will be slow and deployments will take longer. Start with the base image: python:3.10-slim is already compact, and python:3.10-alpine is smaller still, though some Python packages need extra build tools to compile on Alpine. Also, remove anything that's not needed: no test files, no unused assets.

Multi-stage builds are also worth looking into. You can install and compile everything in one stage, then copy only the final build into a slim image. This keeps your image clean and lean, which makes a difference when deploying multiple versions or running updates.
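Here's a rough sketch of that pattern, reusing the package list from the earlier Dockerfile; treat it as a starting point rather than a drop-in replacement:

dockerfile

# Stage 1: install dependencies into an isolated prefix
FROM python:3.10-slim AS builder
RUN pip install --no-cache-dir --prefix=/install streamlit joblib scikit-learn

# Stage 2: start clean and copy in only the installed packages and app files
FROM python:3.10-slim
WORKDIR /app
COPY --from=builder /install /usr/local
COPY app.py model.pkl ./
EXPOSE 8501
CMD ["streamlit", "run", "app.py"]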

Push Your Image to Google Container Registry

The next step is moving the Docker image to the cloud. Google Container Registry (GCR) lets you store your Docker images so they can be used on GKE. (Google has been phasing Container Registry out in favor of Artifact Registry; the same tag-and-push flow works with an Artifact Registry path like [REGION]-docker.pkg.dev/[PROJECT_ID]/[REPO]/ml-app.)

Tag and push your image

First, authenticate:

bash

gcloud auth login
gcloud config set project [PROJECT_ID]
gcloud auth configure-docker

Then tag your image:

bash

docker tag ml-app gcr.io/[PROJECT_ID]/ml-app

And push it:

bash

docker push gcr.io/[PROJECT_ID]/ml-app

Now the image is in GCR, ready for Kubernetes to use.

Deploy With GKE

Now it’s time to get your app running on Google Kubernetes Engine (GKE). This makes it scalable and accessible from anywhere.

Set up the cluster

bash

# add --zone or --region if you haven't set a default in gcloud config
gcloud container clusters create ml-cluster --num-nodes=1
gcloud container clusters get-credentials ml-cluster

Create a deployment and a service

Create a file called deployment.yaml:

yaml

apiVersion: apps/v1
kind: Deployment
metadata:
  name: ml-app-deployment
spec:
  replicas: 1
  selector:
    matchLabels:
      app: ml-app
  template:
    metadata:
      labels:
        app: ml-app
    spec:
      containers:
        - name: ml-app
          image: gcr.io/[PROJECT_ID]/ml-app
          ports:
            - containerPort: 8501
          env:
            - name: MODEL_PATH
              value: "/app/model.pkl"

And another file called service.yaml:

yaml

apiVersion: v1
kind: Service
metadata:
  name: ml-app-service
spec:
  type: LoadBalancer
  selector:
    app: ml-app
  ports:
    - protocol: TCP
      port: 80
      targetPort: 8501

To apply these:

bash

kubectl apply -f deployment.yaml
kubectl apply -f service.yaml
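Once both are applied, GKE provisions a load balancer and assigns an external IP, which can take a minute or two. You can watch for it with:

bash

kubectl get service ml-app-service
# When the EXTERNAL-IP column fills in, open http://[EXTERNAL-IP] in a browser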

Monitor and Log the App in Production

Once your app is live, don’t just leave it running and hope for the best. GKE supports built-in logging and monitoring through Google Cloud’s operations suite. All logs from your app—print statements, errors, usage—can show up in the Cloud Logging dashboard.

You can also track memory, CPU usage, and request count. If your app crashes or becomes slow, this is where you’ll catch it. Just make sure your app writes to standard output, and Google Cloud will handle the rest. No need to add fancy logging tools if all you need is visibility into what’s happening.
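In practice, that just means pointing Python's built-in logging module at stdout. A minimal setup you could add near the top of app.py, assuming you want timestamped log lines:

python

import logging
import sys

# Log to stdout so Cloud Logging picks it up automatically on GKE
logging.basicConfig(
    stream=sys.stdout,
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(message)s",
)

logging.info("App started")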

Final Thoughts

Putting together Streamlit, Docker, and GKE can look like a long road, but each step has a purpose. Streamlit builds your UI without getting in the way. Docker wraps it all in a clean box. GKE scales it so anyone can use it. Together, they help turn your ML script into a shareable tool that doesn’t break when it leaves your laptop.