Integrating Ollama with DeepSeek-R1 in Spring Boot
Are you looking to leverage the power of Ollama and DeepSeek-R1 in your Spring Boot application? This post will walk you through the entire process, from understanding what Ollama is to implementing a seamless integration. Let’s get started!
What is Ollama?
Ollama is a powerful tool designed to simplify the deployment and management of large language models (LLMs) locally. It provides an easy-to-use API for interacting with models like DeepSeek-R1, making it an excellent choice for developers who want to integrate AI capabilities into their applications without relying on external cloud services.
With Ollama, you can:
- Run open LLMs such as DeepSeek-R1 entirely on your own machine, so no data leaves your infrastructure
- Pull, run, and remove models with a simple command-line interface
- Call those models over a local HTTP API (on port 11434 by default) from any application
Why Integrate Ollama with DeepSeek-R1?
DeepSeek-R1 is a state-of-the-art language model that offers high performance and flexibility. By integrating it with Ollama in your Spring Boot application, you can:
- Keep prompts and responses on your own infrastructure instead of sending them to an external cloud API
- Avoid per-request usage costs, since inference runs on hardware you control
- Choose a model size that fits your resources, from the lightweight deepseek-r1:1.5b distill upwards
Step 1: Install Ollama
To get started, you’ll need to install Ollama on your system. Run the following command in your terminal:
curl -fsSL https://ollama.com/install.sh | sh
Successful Installation Output:
>>> Cleaning up old version
>>> Installing ollama to /usr/local
>>> Downloading Linux amd64 bundle
>>> Creating ollama user...
>>> Adding ollama user to groups...
>>> Creating ollama systemd service...
Created symlink /etc/systemd/system/default.target.wants/ollama.service
>>> Nvidia GPU detected
>>> API available at 127.0.0.1:11434
Once installed, Ollama runs as a local service and its API is available at http://localhost:11434.
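Next, pull the DeepSeek-R1 model so it is available locally (the 1.5b tag matches the model configured in the next step):
ollama pull deepseek-r1:1.5b
You can confirm the download with ollama list.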
Step 2: Application Configuration
Next, configure your Spring Boot application by updating the application.yml file:
spring:
  application:
    name: demo-deepseek-r1.ollama

# Server configuration
server:
  port: 8080
  error:
    include-message: always

# Ollama configuration
ollama:
  endpoint: http://localhost:11434/api/generate
  model: deepseek-r1:1.5b
  timeout:
    connect: 30000
    read: 60000
This configuration sets up the Ollama endpoint, model, and timeout settings for your application.
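If you prefer type-safe binding over injecting individual values with @Value (the approach used in the service below), these settings can also be mapped onto a record. This is an optional sketch; the name OllamaProperties is illustrative, and you register it with @EnableConfigurationProperties(OllamaProperties.class) or @ConfigurationPropertiesScan on the application class:

import org.springframework.boot.context.properties.ConfigurationProperties;

@ConfigurationProperties(prefix = "ollama")
public record OllamaProperties(
    String endpoint,
    String model,
    Timeout timeout
) {
    public record Timeout(long connect, long read) {}
}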
Step 3: Core Implementation
Records
Create the following records to handle requests and responses:
// OllamaRequest.java
import com.fasterxml.jackson.annotation.JsonInclude;

@JsonInclude(JsonInclude.Include.NON_NULL)
public record OllamaRequest(
    String model,
    String prompt,
    boolean stream
) {}

// OllamaResponse.java
import com.fasterxml.jackson.annotation.JsonIgnoreProperties;

@JsonIgnoreProperties(ignoreUnknown = true)
public record OllamaResponse(
    String model,
    String response,
    String created_at,
    boolean done
) {}
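These records mirror the JSON exchanged with Ollama's /api/generate endpoint. For a non-streaming call, the request and response bodies look roughly like the following (the values shown are illustrative; Ollama also returns extra metric fields such as durations and token counts, which @JsonIgnoreProperties(ignoreUnknown = true) tells Jackson to skip):

// Request
{
  "model": "deepseek-r1:1.5b",
  "prompt": "Explain AI in simple terms",
  "stream": false
}

// Response (abbreviated)
{
  "model": "deepseek-r1:1.5b",
  "created_at": "2025-02-01T12:00:00Z",
  "response": "...",
  "done": true
}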
Service Layer
Implement the OllamaService to interact with the Ollama API:
import java.time.Duration;

import org.springframework.beans.factory.annotation.Value;
import org.springframework.boot.web.client.RestTemplateBuilder;
import org.springframework.http.HttpEntity;
import org.springframework.http.HttpHeaders;
import org.springframework.http.HttpMethod;
import org.springframework.http.MediaType;
import org.springframework.http.ResponseEntity;
import org.springframework.stereotype.Service;
import org.springframework.web.client.RestClientException;
import org.springframework.web.client.RestTemplate;

@Service
public class OllamaService {

    private final RestTemplate restTemplate;
    private final String endpoint;
    private final String model;

    public OllamaService(RestTemplateBuilder restTemplateBuilder,
                         @Value("${ollama.endpoint}") String endpoint,
                         @Value("${ollama.model}") String model,
                         @Value("${ollama.timeout.connect}") long connectTimeout,
                         @Value("${ollama.timeout.read}") long readTimeout) {
        this.endpoint = endpoint;
        this.model = model;
        // Apply the configured connect/read timeouts to the underlying RestTemplate
        this.restTemplate = restTemplateBuilder
                .setConnectTimeout(Duration.ofMillis(connectTimeout))
                .setReadTimeout(Duration.ofMillis(readTimeout))
                .build();
    }

    public String generateResponse(String prompt) {
        try {
            // Build a non-streaming request for the configured model
            OllamaRequest request = new OllamaRequest(model, prompt, false);
            HttpHeaders headers = new HttpHeaders();
            headers.setContentType(MediaType.APPLICATION_JSON);
            ResponseEntity<OllamaResponse> response = restTemplate.exchange(
                    endpoint,
                    HttpMethod.POST,
                    new HttpEntity<>(request, headers),
                    OllamaResponse.class
            );
            if (response.getStatusCode().is2xxSuccessful() && response.getBody() != null) {
                return response.getBody().response() != null
                        ? response.getBody().response()
                        : "Received empty response from model";
            }
            return "Ollama API returned status: " + response.getStatusCode();
        } catch (RestClientException e) {
            return "Error communicating with Ollama: " + e.getMessage();
        }
    }
}
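If you want to smoke-test the service before wiring up the REST layer, one option is to temporarily register a CommandLineRunner in your application class. This is a minimal sketch (it assumes Ollama is running locally and the model has already been pulled):

import org.springframework.boot.CommandLineRunner;
import org.springframework.context.annotation.Bean;

@Bean
CommandLineRunner ollamaSmokeTest(OllamaService ollamaService) {
    // Prints the model's answer at startup; remove once the controller is in place
    return args -> System.out.println(ollamaService.generateResponse("Explain AI in simple terms"));
}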
REST Controller
Create a REST controller to expose the chat endpoint:
import org.springframework.http.ResponseEntity;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RestController;

@RestController
@RequestMapping("/api/chat")
public class ChatController {

    private final OllamaService ollamaService;

    public ChatController(OllamaService ollamaService) {
        this.ollamaService = ollamaService;
    }

    @PostMapping
    public ResponseEntity<String> chat(@RequestBody String prompt) {
        // Reject empty prompts before calling the model
        if (prompt == null || prompt.isBlank()) {
            return ResponseEntity.badRequest().body("Prompt cannot be empty");
        }
        String response = ollamaService.generateResponse(prompt);
        return ResponseEntity.ok(response);
    }
}
Model Version Compatibility
Here’s a quick reference: DeepSeek-R1 is published on Ollama as a family of distilled models in several parameter sizes (1.5b, 7b, 8b, 14b, 32b, and 70b, plus the full 671b model), and larger variants need correspondingly more RAM or VRAM. Check official model availability and tags in the Ollama model library at ollama.com/library/deepseek-r1.
Testing the Integration
To test the integration, use the following curl command or Postman:
curl -X POST -H "Content-Type: text/plain" -d "Explain AI in simple terms" http://localhost:8080/api/chat
Output
Source Code
The complete source code for this example is available on GitHub.
For Docker users (as shared in a reader comment), Ollama can also be run together with the Open WebUI front end via Docker Compose, using a compose.yml along these lines:

services:
  webui:
    image: ghcr.io/open-webui/open-webui:main
    ports:
      - 3000:8080/tcp
    volumes:
      - open-webui:/app/backend/data
    extra_hosts:
      - "host.docker.internal:host-gateway"
    depends_on:
      - ollama
  ollama:
    image: ollama/ollama
    expose:
      - 11434/tcp
    ports:
      - 11434:11434/tcp
    healthcheck:
      test: ollama --version || exit 1
    volumes:
      - ollama:/root/.ollama

volumes:
  ollama:
  open-webui:

Then pull whichever model you want inside the container:

docker compose exec ollama ollama pull deepseek-r1:1.5b