Serverless Spring Boot with Knative on Kubernetes
Modern applications demand scalability, efficiency, and flexibility, especially when handling unpredictable workloads. Traditionally, Kubernetes offers robust orchestration for containerized applications, but it doesn’t inherently provide serverless capabilities like automatic scaling down to zero when an application is idle. This is where Knative comes into play.
In this article, we’ll explore how Knative enables serverless auto-scaling for Spring Boot microservices running on Kubernetes, helping teams optimize resource usage while supporting event-driven architectures.
1. Understanding Knative and Serverless Java
1.1 What is Knative?
Knative is an open-source Kubernetes-based platform designed for building, deploying, and managing serverless applications. It abstracts away much of the complexity involved in running serverless workloads on Kubernetes, providing features like:
- Automatic scaling (including scaling to zero when idle)
- Event-driven execution
- Traffic management for revisions
- Built-in observability
By integrating Knative Serving and Knative Eventing, we can create Spring Boot microservices that scale dynamically based on HTTP requests or cloud events, making applications cost-efficient and responsive.
1.2 Why Use Knative for Spring Boot Microservices?
Spring Boot is widely used for developing microservices, but traditional deployments in Kubernetes can be resource-intensive. Knative enables Spring Boot applications to scale up during high demand and scale down to zero when idle, reducing infrastructure costs.
Key benefits of using Knative for Spring Boot:
- Efficient resource utilization – No need to keep idle instances running.
- Simplified deployment – Knative abstracts Kubernetes complexities.
- Event-driven capabilities – Seamless integration with Kafka, CloudEvents, and other event sources.
- Automatic traffic routing – Canary releases and blue-green deployments become easier.
2. Deploying a Spring Boot Application with Knative
Let’s walk through deploying a Spring Boot microservice in a Knative-enabled Kubernetes cluster and configure auto-scaling.
Step 1: Prepare Your Kubernetes Cluster
Ensure you have a Kubernetes cluster ready and install Knative Serving and Knative Eventing:
1 2 | kubectl apply -f https: //github .com /knative/serving/releases/latest/download/serving-crds .yaml kubectl apply -f https: //github .com /knative/serving/releases/latest/download/serving-core .yaml |
If you’re using Kourier as the networking layer:
1 2 3 4 5 | kubectl apply -f https: //github .com /knative/net-kourier/releases/latest/download/kourier .yaml kubectl patch configmap /config-network \ --namespace knative-serving \ -- type merge \ --patch '{"data":{"ingress-class":"kourier.ingress.networking.knative.dev"}}' |
Step 2: Create a Simple Spring Boot Application
For demonstration, we’ll deploy a basic Spring Boot REST API. If you don’t have a Spring Boot application ready, create one using Spring Initializr with the dependencies Spring Web and Spring Boot Actuator.
Example HelloController.java
:
1 2 3 4 5 6 7 8 | @RestController public class HelloController { @GetMapping ( "/" ) public String hello() { return "Hello from Knative!" ; } } |
Step 3: Containerize and Push the Application
Build and push the Docker image:
1 2 | docker build -t your-dockerhub-username /spring-boot-knative . docker push your-dockerhub-username /spring-boot-knative } |
Step 4: Deploy the Spring Boot Service to Knative
Create a Knative Service (KService) YAML file to define the deployment:
01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 | apiVersion: serving.knative.dev /v1 kind: Service metadata: name: spring-boot-knative spec: template: spec: containers: - image: your-dockerhub-username /spring-boot-knative env : - name: SPRING_PROFILES_ACTIVE value: "prod" resources: limits: memory: "512Mi" cpu: "500m" |
Apply the Knative service:
1 | kubectl apply -f spring-boot-knative.yaml |
Step 5: Access Your Deployed Service
Knative automatically assigns a URL to your service. Retrieve it using:
1 | kubectl get kservice spring-boot-knative |
You can now access your Spring Boot app via the assigned URL.
3. Enabling Auto-Scaling in Knative
Knative’s automatic scaling (scale-to-zero) works based on incoming HTTP requests. To fine-tune the scaling behavior, modify the autoscaling.knative.dev
annotations:
1 2 3 4 5 6 7 8 | apiVersion: serving.knative.dev /v1 kind: Service metadata: name: spring-boot-knative annotations: autoscaling.knative.dev /minScale : "1" autoscaling.knative.dev /maxScale : "10" autoscaling.knative.dev /target : "5" |
This configuration ensures:
- At least 1 pod is always running.
- The service scales up to 10 instances when demand increases.
- Each instance handles up to 5 requests per second before scaling up.
Apply the updated configuration:
1 | kubectl apply -f spring-boot-knative.yaml |
4. Handling Event-Driven Workloads with Knative Eventing
Knative Eventing allows services to respond to Kafka events, HTTP events, and cloud events.
Example: Subscribing to a Kafka topic:
01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 | apiVersion: sources.knative.dev /v1alpha1 kind: KafkaSource metadata: name: kafka- source spec: consumerGroup: my-group bootstrapServers: - my-kafka-broker:9092 topics: - my-topic sink: ref: apiVersion: serving.knative.dev /v1 kind: Service name: spring-boot-knative |
This configuration enables event-driven execution in Spring Boot based on Kafka messages.
5. Conclusion
Knative provides serverless scaling, event-driven execution, and automatic traffic routing for Spring Boot applications running in Kubernetes. By leveraging Knative, developers can build efficient, cost-effective microservices that scale dynamically based on real-time traffic and events.
Further Reading
- Knative Official Documentation
- Spring Boot & Kubernetes Guide
- Knative Eventing
- Autoscaling with Knative
With Knative + Spring Boot, serverless Java becomes a reality, offering seamless scalability and cost optimization for modern cloud-native applications. 🚀