Enterprise Java

Building a Semantic Search System with Spring AI and PGVector

Semantic search enhances traditional keyword-based search by understanding the meaning and context of queries rather than relying on exact keyword matches. This approach is useful for applications such as recommendation systems, document retrieval, intelligent chatbots, and enterprise knowledge management.

In this article, we will implement semantic search using Spring AI and PGVector, an extension for PostgreSQL that enables vector-based similarity search. We will integrate Ollama, an open-source AI model framework, to generate embeddings that represent the semantic meaning of text.

1. Adding Dependencies

To set up a Spring Boot project with Spring AI, PGVector, and Ollama, we will generate the project using Spring Initializr.

spring ai pgvector semantic search - dependencies

After adding the required dependencies, your pom.xml should look like this:

01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-ollama-spring-boot-starter</artifactId>
</dependency>
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-pgvector-store-spring-boot-starter</artifactId>
</dependency>
 
<dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-docker-compose</artifactId>
    <scope>runtime</scope>
    <optional>true</optional>
</dependency>
<dependency>
    <groupId>org.springframework.ai</groupId>
    <artifactId>spring-ai-spring-boot-docker-compose</artifactId>
    <scope>runtime</scope>
    <optional>true</optional>
</dependency>

Each dependency in the pom.xml file serves a specific purpose in our application. Here’s what each one does:

  • spring-ai-ollama-spring-boot-starter: Integrates Ollama and Spring Boot for generating vector embeddings of text. This converts textual data, such as movie descriptions, into numerical vector representations, which are then stored in PGVector for similarity search.
  • spring-ai-pgvector-store-spring-boot-starter: Provides support for storing and retrieving vector embeddings in PostgreSQL using PGVector.
  • spring-boot-docker-compose: Enables Spring Boot to automatically manage Docker Compose. It enables Spring Boot applications to automatically start required Docker containers locally, ensuring PostgreSQL and Ollama are available without manual intervention.
  • spring-ai-spring-boot-docker-compose: A helper dependency for managing Spring AI-related services with Docker Compose. It works alongside spring-boot-docker-compose to ensure that AI models and vector databases are started and connected properly within a Dockerized environment.

2. Setting Up the Environment

To get started, we need to configure our environment using Docker. Below is the docker-compose.yml file, which sets up PostgreSQL with PGVector and Ollama:

01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
services:
  postgres:
    image: pgvector/pgvector:pg17
    environment:
      POSTGRES_DB: movie_vector_db
      POSTGRES_USER: postgres
      POSTGRES_PASSWORD: postgres
    ports:
      - "5434:5432"
    healthcheck:
      test: [ "CMD-SHELL", "pg_isready -U postgres" ]
      interval: 10s
      timeout: 5s
      retries: 5
 
  ollama:
    image: ollama/ollama:latest
    ports:
      - "11432:11434"
    volumes:
      - ollama_data:/root/.ollama
 
volumes:
  ollama_data:

The PostgreSQL service uses the pgvector/pgvector:pg17 image, which includes PostgreSQL 17 with PGVector support for storing and searching vector embeddings. It initializes a database named movie_vector_db, setting postgres as both the username and password. Additionally, it maps port 5432 inside the container to 5434 on the host machine.

The Ollama service is used to generate text embeddings and maps port 11434 inside the container to 11432 on the host machine for external access. It also utilizes a named volume (ollama_data) to designate where to persist AI models (Ollama is using an AI embedding model internally to generate embeddings).

3. Application Properties Configuration

We need to configure Spring Boot to interact with PGVector and Ollama. Below is the application.properties file:

01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
spring.application.name=spring-ai-pgvector-movie-search
 
spring.ai.ollama.init.pull-model-strategy=when_missing
spring.ai.ollama.init.chat.include=true
spring.ai.ollama.embedding.options.model=nomic-embed-text
 
spring.ai.vectorstore.pgvector.initialize-schema=true
spring.ai.vectorstore.pgvector.dimensions=768
spring.ai.vectorstore.pgvector.index-type=hnsw
 
spring.docker.compose.file=docker-compose.yml
spring.docker.compose.enabled=true
 
spring.datasource.url=jdbc:postgresql://localhost:5434/movie_vector_db
spring.datasource.username=postgres
spring.datasource.password=postgres
spring.datasource.driver-class-name=org.postgresql.Driver
 
spring.jpa.database-platform=org.hibernate.dialect.PostgreSQLDialect

This application.properties file configures the application by defining properties for Ollama AI, PGVector, database connectivity, and Docker Compose integration.

  • pull-model-strategy=when_missing: Ensures that Ollama will download and use an AI model only if it is not already available, preventing unnecessary downloads.
  • chat.include=true: Enables Ollama’s chat capability, allowing it to process conversational queries.
  • embedding.options.model=nomic-embed-text: Specifies the embedding AI model Ollama will use (in our case, nomic-embed-text), to convert text into vector representations for semantic search.
  • initialize-schema=true: Automatically initializes the PGVector schema in PostgreSQL if it doesn’t exist.
  • dimensions=768: Sets the embedding vector dimension size to 768, aligning with the AI model’s output (in our case, nomic-embed-text).
  • index-type=hnsw: Uses Hierarchical Navigable Small World (HNSW) indexing for faster vector similarity search.
  • spring.docker.compose.file=docker-compose.yml: Specifies the Docker Compose file to be used when starting the application.
  • spring.docker.compose.enabled=true: Enables automatic service management, ensuring PostgreSQL and Ollama containers start when the application runs.
  • spring.datasource.url=jdbc:postgresql://localhost:5434/movie_vector_db: Configures the application to connect to the PostgreSQL database (movie_vector_db) running on port 5434.
  • spring.datasource.username=postgres & spring.datasource.password=postgres: Provides authentication credentials for database access.
  • spring.datasource.driver-class-name=org.postgresql.Driver: Specifies the PostgreSQL JDBC driver required for database connectivity.
  • spring.jpa.database-platform=org.hibernate.dialect.PostgreSQLDialect: Instructs Hibernate to use PostgreSQL-specific SQL dialect, optimizing database queries for PostgreSQL.

4. Creating the Movie Entity

Below is a simple record class for movies:

1
2
3
public record Movie(String title, String director, String description) {
 
}

Each movie has a title, director, and description. This will be converted into embeddings for similarity search.

5. Storing Movies in PGVector

To perform a semantic search, we must first store movies as embeddings in PGVector.

01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
@Component
public class MovieStoragePipeline {
 
    private final VectorStore vectorStore;
 
    public MovieStoragePipeline(VectorStore vectorStore) {
        this.vectorStore = vectorStore;
    }
 
    @PostConstruct
    void run() {
        var movies = List.of(
                new Movie("Inception", "Christopher Nolan",
                        "A thief who enters the dreams of others to steal their secrets is given a chance to erase his criminal record."),
                new Movie("Interstellar", "Christopher Nolan",
                        "A group of astronauts travels through a wormhole in search of a new home for humanity."),
                new Movie("The Social Network", "David Fincher",
                        "The story of Mark Zuckerberg and the creation of Facebook, a revolutionary social media platform."),
                new Movie("Parasite", "Bong Joon-ho",
                        "A poor family infiltrates a wealthy household, leading to unexpected consequences."),
                new Movie("Blade Runner", "Ridley Scott",
                        "Set in a dystopian future, a blade runner is tasked with hunting down bioengineered beings known as replicants."),
                new Movie("High-Rise", "Ben Wheatley",
                        "Residents of a luxury tower block descend into chaos as class divisions and societal breakdowns emerge."),
                new Movie("Children of Men", "Alfonso Cuarón",
                        "In a world where humans have become infertile, a disillusioned bureaucrat becomes the unlikely champion in the fight for humanity's survival."),
                new Movie("The Hunger Games", "Gary Ross",
                        "In a dystopian future, a young girl volunteers to participate in a televised battle to the death.")
        );
 
        List<Document> documents = movies.stream()
                .map(movie -> new Document(movie.toString()))
                .toList();
 
        vectorStore.add(documents);
    }
}

The MovieStoragePipeline class is responsible for storing movie data in PGVector for semantic search. It initializes a list of Movie objects, each containing a title, director, and description. Using the @PostConstruct annotation, the run() method executes automatically after the class is instantiated, ensuring that the movies are added to the vector database at application startup. Each Movie object is converted into a Document and then stored in the VectorStore, allowing efficient similarity searches based on movie descriptions.

6. Implementing the Semantic Search Controller

The Semantic Search Controller below handles user queries by retrieving similar movie descriptions from PGVector using vector-based similarity search. It leverages semantic understanding, allowing users to find relevant movies even if their query does not contain exact title matches. The controller processes search requests, converts the query into an embedding, and retrieves the most similar stored vectors, making the search experience more intuitive.

01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
@RestController
@RequestMapping("/movies")
public class MovieSearchController {
 
    private final VectorStore vectorStore;
 
    public MovieSearchController(VectorStore vectorStore) {
        this.vectorStore = vectorStore;
    }
 
    @PostMapping("/search")
    List<String> findSimilarMovies(@RequestBody String query) {
        return vectorStore.similaritySearch(SearchRequest.builder()
                .query(query)
                .topK(3)
                .build())
                .stream()
                .map(Document::getText)
                .toList();
    }
}

The MovieSearchController class exposes an endpoint (POST /movies/search) that takes a textual query from the user, converts it into an embedding, and retrieves the top three most similar movie descriptions. The vectorStore performs the similarity search, returning matching documents based on vector proximity rather than exact keyword matches.

7. Running the Application

The application is now ready to run with all configurations and components in place. When started, Spring Boot automatically launches the required Docker containers (PostgreSQL with PGVector and Ollama) and initializes the vector store. The MovieStoragePipeline class preloads a set of movies into PGVector, ensuring that the database has meaningful data for similarity searches.

Once the application is running, we can send POST requests to the /movies/search endpoint with a query like this:

1
curl -X POST --data "dystopian future" http://localhost:8080/movies/search

Example Response:

1
["Movie[title=The Hunger Games, director=Gary Ross, description=In a dystopian future, a young girl volunteers to participate in a televised battle to the death.]","Movie[title=Blade Runner, director=Ridley Scott, description=Set in a dystopian future, a blade runner is tasked with hunting down bioengineered beings known as replicants.]","Movie[title=The Social Network, director=David Fincher, description=The story of Mark Zuckerberg and the creation of Facebook, a revolutionary social media platform.]"]

This response demonstrates the application’s ability to perform semantic searches, retrieving movies that conceptually match the query rather than relying solely on exact keyword matches.

8. Conclusion

In this article, we explored the integration of Spring AI with PGVector and Ollama to build a document query system. We began by setting up the necessary environment, including configuring PostgreSQL with the PGVector extension and integrating the Ollama AI service. We then demonstrated how to implement semantic search capabilities, enabling the application to process natural language queries and retrieve relevant documents effectively.

9. Download the Source Code

This article covered semantic search using Spring AI and PGVector.

Download
You can download the full source code of this example here: spring ai pgvector semantic search
Do you want to know how to develop your skillset to become a Java Rockstar?
Subscribe to our newsletter to start Rocking right now!
To get you started we give you our best selling eBooks for FREE!
1. JPA Mini Book
2. JVM Troubleshooting Guide
3. JUnit Tutorial for Unit Testing
4. Java Annotations Tutorial
5. Java Interview Questions
6. Spring Interview Questions
7. Android UI Design
and many more ....
I agree to the Terms and Privacy Policy

Omozegie Aziegbe

Omos Aziegbe is a technical writer and web/application developer with a BSc in Computer Science and Software Engineering from the University of Bedfordshire. Specializing in Java enterprise applications with the Jakarta EE framework, Omos also works with HTML5, CSS, and JavaScript for web development. As a freelance web developer, Omos combines technical expertise with research and writing on topics such as software engineering, programming, web application development, computer science, and technology.
Subscribe
Notify of
guest


This site uses Akismet to reduce spam. Learn how your comment data is processed.

0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Back to top button