Mastering High-Performance Java Applications: Lessons from the Trenches
Timir Patel
Entrepreneur & Technology Leader | SaaS, AI/ML, Java & AWS Expert | Open to Senior Tech Leadership Roles | Ex-Hitachi | IIMA Alumnus
Building high-performance, reliable Java applications is not an easy task, especially in large-scale systems where performance bottlenecks can cause substantial issues. In our journey of designing and optimizing enterprise-grade Java applications, we encountered several challenges and developed strategies that turned into game changers. These strategies helped us improve reliability, boost performance, and handle unexpected traffic spikes efficiently.
Here, I will share some of the most impactful techniques we implemented to ensure high availability, scalability, and performance.
1. Using StreamingResponseBody with Buffering and Lazy Exception Handling
When handling large files such as video or image streams, sending the entire payload at once isn't efficient. Instead, we adopted StreamingResponseBody, which allows streaming the content to the client as it's being read.
We combined this with buffering and lazy exception handling to minimize performance overheads and avoid broken pipes when clients disconnect prematurely. Here's a simplified example:
@RestController
public class MediaController {

    private static final Logger log = LoggerFactory.getLogger(MediaController.class);

    @GetMapping("/media/{id}")
    public StreamingResponseBody streamMedia(@PathVariable String id) {
        return outputStream -> {
            try (BufferedInputStream inputStream =
                     new BufferedInputStream(new FileInputStream(getMediaFile(id)))) {
                byte[] buffer = new byte[1024];
                int bytesRead;
                while ((bytesRead = inputStream.read(buffer)) != -1) {
                    outputStream.write(buffer, 0, bytesRead);
                    outputStream.flush();
                }
            } catch (IOException e) {
                // Client disconnects surface here as IOExceptions; log and let
                // the stream terminate gracefully instead of propagating.
                log.warn("Stream interrupted", e);
            }
        };
    }

    // getMediaFile(id) resolves the id to a File; implementation omitted.
}
Here, BufferedInputStream ensures efficient reading of the file in chunks, while exceptions are handled lazily, allowing the stream to terminate gracefully even if the client closes the connection prematurely.
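The chunked-copy loop at the heart of that controller can be exercised on its own, outside Spring. Here is a minimal sketch using in-memory streams; the 1 KB buffer mirrors the controller above, though in practice larger buffers (8-64 KB) are common:

```java
import java.io.*;

public class ChunkedCopyDemo {
    // Copies the input to the output in fixed-size chunks, flushing after
    // each write, exactly like the streaming controller body above.
    static void copyInChunks(InputStream in, OutputStream out) throws IOException {
        byte[] buffer = new byte[1024];
        int bytesRead;
        while ((bytesRead = in.read(buffer)) != -1) {
            out.write(buffer, 0, bytesRead);
            out.flush();
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] payload = new byte[5000]; // deliberately larger than one chunk
        for (int i = 0; i < payload.length; i++) payload[i] = (byte) i;

        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        copyInChunks(new BufferedInputStream(new ByteArrayInputStream(payload)), sink);
        System.out.println("copied " + sink.size() + " bytes");
    }
}
```

Because the client receives data as it is read, memory usage stays flat regardless of file size.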
2. Dynamic Setup of Executor Services Based on Available CPU and Memory
Executor services handle asynchronous tasks effectively, but configuring them with static, hand-picked values can lead to under- or over-provisioning. Instead, we size the pool dynamically from the available system resources.
@Bean
public ExecutorService dynamicThreadPool() {
    int corePoolSize = Runtime.getRuntime().availableProcessors();
    int maxPoolSize = corePoolSize * 2;
    // A bounded queue matters here: with an unbounded LinkedBlockingQueue,
    // the pool never grows beyond corePoolSize and maxPoolSize is ignored.
    return new ThreadPoolExecutor(corePoolSize, maxPoolSize,
            60, TimeUnit.SECONDS, new LinkedBlockingQueue<>(1000));
}
This ensures that the application makes the best use of available CPUs while not exhausting system resources.
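The same sizing rule can be tried in plain Java without Spring. A small sketch (the queue capacity of 100 is an arbitrary illustration):

```java
import java.util.concurrent.*;

public class DynamicPoolDemo {
    static ThreadPoolExecutor buildPool() {
        int corePoolSize = Runtime.getRuntime().availableProcessors();
        int maxPoolSize = corePoolSize * 2;
        // Bounded queue so the pool can actually grow toward maxPoolSize under load.
        return new ThreadPoolExecutor(corePoolSize, maxPoolSize,
                60, TimeUnit.SECONDS, new LinkedBlockingQueue<>(100));
    }

    public static void main(String[] args) throws Exception {
        ThreadPoolExecutor pool = buildPool();
        Future<Integer> result = pool.submit(() -> 21 * 2);
        System.out.println("core=" + pool.getCorePoolSize()
                + " max=" + pool.getMaximumPoolSize()
                + " result=" + result.get());
        pool.shutdown();
    }
}
```

On a 4-core machine this builds a pool with 4 core threads and a cap of 8.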
3. Choosing the Right Garbage Collector
Choosing the right garbage collector (GC) can make or break your application's performance. Initially, we experimented with G1GC but switched to ZGC for its low-latency capabilities, particularly when handling large heaps and concurrent tasks.
Generational ZGC (-XX:+ZGenerational, available from JDK 21) allowed us to reduce pause times and increase throughput. One caveat: ZGC's colored-pointer design does not support compressed oops, so -XX:+UseCompressedOops has no effect once ZGC is selected.
To configure this in your Dockerfile, use:
JAVA_OPTS="-XX:+UseZGC -XX:+ZGenerational"
ZGC performs garbage collection concurrently, so it doesn't block application threads, making it ideal for applications where low latency is crucial.
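Whichever collector you pick, it is worth verifying at runtime that the flags actually took effect. A small sketch using the standard management API (the bean names printed depend on the JVM and the selected collector, e.g. G1 vs. ZGC cycle beans):

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;

public class GcCheck {
    public static void main(String[] args) {
        // Each registered bean corresponds to a collector the JVM is running;
        // the names reveal which GC is actually active.
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            System.out.println(gc.getName() + ": " + gc.getCollectionCount() + " collections");
        }
    }
}
```

Pairing this with -Xlog:gc in the JVM options gives a quick sanity check on pause behavior in real traffic.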
4. Implementing Rate-Limiting to Protect Against Traffic Spikes
Handling sudden spikes in traffic can overwhelm your system, leading to downtime. We implemented a rate-limiting solution to throttle requests when necessary, ensuring that our services remained responsive even during traffic surges.
We used Bucket4j, a Java rate-limiting library that allows us to set dynamic limits based on system conditions.
@Bean
public Bucket rateLimitBucket() {
    // Allow bursts of up to 10 requests, refilled at 10 tokens per second
    Bandwidth limit = Bandwidth.classic(10, Refill.intervally(10, Duration.ofSeconds(1)));
    return Bucket.builder().addLimit(limit).build();
}

@Autowired
private Bucket rateLimitBucket; // the bean defined above

@GetMapping("/media")
public ResponseEntity<?> getMedia(@RequestParam String id) {
    if (rateLimitBucket.tryConsume(1)) {
        // Process the request
        return ResponseEntity.ok().build();
    } else {
        return ResponseEntity.status(HttpStatus.TOO_MANY_REQUESTS).build();
    }
}
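For readers curious about what a library like Bucket4j is doing under the hood, here is a dependency-free sketch of the token-bucket idea. It is a deliberately simplified illustration (Bucket4j itself is lock-free and far more featureful), not production code:

```java
public class TokenBucket {
    private final long capacity;
    private final long refillTokens;
    private final long refillPeriodNanos;
    private double tokens;
    private long lastRefillNanos;

    public TokenBucket(long capacity, long refillTokens, long refillPeriodNanos) {
        this.capacity = capacity;
        this.refillTokens = refillTokens;
        this.refillPeriodNanos = refillPeriodNanos;
        this.tokens = capacity;           // start full: allows an initial burst
        this.lastRefillNanos = System.nanoTime();
    }

    // Spends a token if one is available; otherwise the caller should reject
    // the request (e.g. with HTTP 429).
    public synchronized boolean tryConsume() {
        refill();
        if (tokens >= 1) {
            tokens -= 1;
            return true;
        }
        return false;
    }

    // Adds tokens proportional to elapsed time, capped at the bucket capacity.
    private void refill() {
        long now = System.nanoTime();
        double elapsedPeriods = (now - lastRefillNanos) / (double) refillPeriodNanos;
        tokens = Math.min(capacity, tokens + elapsedPeriods * refillTokens);
        lastRefillNanos = now;
    }

    public static void main(String[] args) {
        TokenBucket bucket = new TokenBucket(10, 10, 1_000_000_000L);
        int granted = 0;
        for (int i = 0; i < 15; i++) {
            if (bucket.tryConsume()) granted++;
        }
        // Roughly the burst capacity (10) of the 15 attempts is granted.
        System.out.println("granted " + granted + " of 15");
    }
}
```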
5. Entry Prevention Based on Available Resources
When CPU or memory usage is critically high, accepting new requests might result in degraded performance or crashes. To mitigate this, we built a dynamic mechanism to monitor system resource usage and prevent further entries when resources were insufficient.
We combined a custom health check service that monitors system load with a guard at the controller layer; in production, the same check is better placed in a HandlerInterceptor or servlet filter so it covers every endpoint.
@Component
public class SystemHealthMonitor {

    public boolean canAcceptNewRequests() {
        // getSystemLoadAverage() is scaled by CPU count (and returns -1 where
        // unsupported), so compare it against 75% of the available processors.
        double cpuLoad = ManagementFactory.getOperatingSystemMXBean().getSystemLoadAverage();
        boolean cpuOk = cpuLoad < 0 || cpuLoad < Runtime.getRuntime().availableProcessors() * 0.75;
        // freeMemory() only covers the committed heap; add the headroom up to -Xmx.
        Runtime rt = Runtime.getRuntime();
        long availableHeap = rt.maxMemory() - (rt.totalMemory() - rt.freeMemory());
        return cpuOk && availableHeap > 500L * 1024 * 1024; // custom thresholds
    }
}
@RestController
public class RequestController {

    @Autowired
    private SystemHealthMonitor healthMonitor;

    @GetMapping("/process")
    public ResponseEntity<?> processRequest() {
        if (!healthMonitor.canAcceptNewRequests()) {
            return ResponseEntity.status(HttpStatus.SERVICE_UNAVAILABLE)
                    .body("System under high load, try again later.");
        }
        // Proceed with processing
        return ResponseEntity.ok().build();
    }
}
By dynamically checking system health before processing requests, we avoided overloading the system, providing a more stable service.
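The memory side of this check can be verified in isolation. A small sketch showing why maxMemory() matters: freeMemory() alone understates headroom whenever the heap has not yet grown to -Xmx:

```java
public class HeapHeadroom {
    // Available heap = unused part of the committed heap, plus the room the
    // heap can still grow into before hitting -Xmx.
    static long availableHeapBytes() {
        Runtime rt = Runtime.getRuntime();
        return rt.maxMemory() - (rt.totalMemory() - rt.freeMemory());
    }

    public static void main(String[] args) {
        System.out.printf("available heap: %d MiB%n",
                availableHeapBytes() / (1024 * 1024));
    }
}
```

Thresholds like the 500 MB above are workload-specific; tune them against observed peak usage rather than guessing.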
6. Dynamic Memory Assignment for Docker Containers
Finally, memory management in Docker containers can be tricky. Assigning fixed memory sizes can lead to resource wastage. Instead, we dynamically adjust memory allocation using the environment variables within Docker.
In our Dockerfile or entry script:
export JAVA_OPTS="-Xms${HEAP_SIZE:-256m} -Xmx${HEAP_SIZE:-512m}"
This sizes the heap from an environment variable set at deploy time, with sensible defaults when none is provided.
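A related option on JDK 10+ is to let the JVM derive the heap from the container's cgroup memory limit rather than a hard-coded -Xmx. A sketch, with percentages that are illustrative only and should be tuned per workload:

```shell
# Size the heap as a fraction of the container's memory limit;
# the JVM reads the cgroup limit itself, so no -Xmx is needed.
export JAVA_OPTS="-XX:InitialRAMPercentage=50.0 -XX:MaxRAMPercentage=75.0"
echo "$JAVA_OPTS"
```

This keeps the heap proportional to whatever limit the orchestrator assigns, which avoids re-tuning the flags every time the container size changes.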
Conclusion
Building and maintaining high-performance Java applications requires a thoughtful approach. Through techniques like streaming with buffering, dynamic resource management, intelligent garbage collection, and system health monitoring, we can build systems that remain responsive even under high load and adverse conditions.
Each of these strategies comes from real-world experience in dealing with performance bottlenecks and reliability challenges. By following these best practices, you can optimize your own applications and avoid common pitfalls.
Thank you for reading. I'd appreciate your comments and feedback, and I'm happy to answer any questions via DM as well.
Good day!
- Timir Patel