登录查看更多内容

Garbage-First Garbage Collector

Amit Rai

?? Technical Lead | Cybersecurity & Innovation | Homomorphic Encryption| Researcher | AI & Security | Data Privacy | Innovation & PoC

发布日期: 2017年2月3日

The Garbage-First (G1) garbage collector is a server-style garbage collector, targeted for multiprocessor machines with large memories. It attempts to meet garbage collection (GC) pause time goals with high probability while achieving high throughput. Whole-heap operations, such as global marking, are performed concurrently with the application threads. This prevents interruptions proportional to heap or live-data size.

The G1 collector achieves high performance and pause time goals through several techniques.

The heap is partitioned into a set of equally sized heap regions, each a contiguous range of virtual memory. G1 performs a concurrent global marking phase to determine the liveness of objects throughout the heap. After the marking phase completes, G1 knows which regions are mostly empty. It collects these regions first, which often yields a large amount of free space. This is why this method of garbage collection is called Garbage-First. As the name suggests, G1 concentrates its collection and compaction activity on the areas of the heap that are likely to be full of reclaimable objects, that is, garbage. G1 uses a pause prediction model to meet a user-defined pause time target and selects the number of regions to collect based on the specified pause time target.

G1 copies objects from one or more regions of the heap to a single region on the heap, and in the process both compacts and frees up memory. This evacuation is performed in parallel on multiprocessors to decrease pause times and increase throughput. Thus, with each garbage collection, G1 continuously works to reduce fragmentation. This is beyond the capability of both of the previous methods. CMS (Concurrent Mark Sweep) garbage collection does not do compaction. Parallel compaction performs only whole-heap compaction, which results in considerable pause times.

It is important to note that G1 is not a real-time collector. It meets the set pause time target with high probability but not absolute certainty. Based on data from previous collections, G1 estimates how many regions can be collected within the target time. Thus, the collector has a reasonably accurate model of the cost of collecting the regions, and it uses this model to determine which and how many regions to collect while staying within the pause time target.

The first focus of G1 is to provide a solution for users running applications that require large heaps with limited GC latency. This means heap sizes of around 6 GB or larger, and a stable and predictable pause time below 0.5 seconds.

Applications running today with either the CMS or the with parallel compaction would benefit from switching to G1 if the application has one or more of the following traits.

More than 50% of the Java heap is occupied with live data.
The rate of object allocation rate or promotion varies significantly.
The application is experiencing undesired long garbage collection or compaction pauses (longer than 0.5 to 1 second).

G1 is planned as the long-term replacement for the Concurrent Mark-Sweep Collector (CMS). Comparing G1 with CMS reveals differences that make G1 a better solution. One difference is that G1 is a compacting collector. Also, G1 offers more predictable garbage collection pauses than the CMS collector, and allows users to specify desired pause targets.

As with CMS, G1 is designed for applications that require shorter GC pauses.

要查看或添加评论，请登录

Amit Rai的更多文章

Difference between array and arraylist #java

2018年10月19日

Difference between array and arraylist #java

I. Size: Array in Java is fixed in size.
Java Stream API for Bulk Data Operations on Collections

2017年12月3日

Java Stream API for Bulk Data Operations on Collections

A new java.util.
Nashorn

2017年9月22日

Nashorn

Nashorn: A Next-Generation JavaScript Engine for the JVM by Julien Ponge Scenarios for using Oracle Nashorn as a…
CLUSTER

2017年8月13日

CLUSTER

A single instance of Node.js runs in a single thread.
Parallel Operations

2017年2月19日

Parallel Operations

With the addition of Lambda expressions to arrays operations, Java introduced a key concept into the language of…
RxJS

2017年2月18日

RxJS

What is RxJS? RxJS or Reactive Extensions for JavaScript is a library for transforming, composing, and querying streams…
HTTP/2 vs HTTP 1.x

2017年2月16日

HTTP/2 vs HTTP 1.x

Let’s take a look at the key differences compared to HTTP 1.x and what issue each improvement is addressing: HTTP/2 is…
Concurrent Accumulators

2017年2月12日

Concurrent Accumulators

One of the most common scenarios in concurrent programming is updating of numeric counters accessed by multiple…
Exception Propagation

2017年1月30日

Exception Propagation

After a method throws an exception, the runtime system searches the call stack for a method that contains a block of…
Microbenchmarks

2017年1月24日

Microbenchmarks

Microbenchmarks are coming The Java Microbenchmarking Harness (JMH) by Alexey Shipilev is taking the next step in its…

See all articles

Garbage-First Garbage Collector

Amit Rai

?? Technical Lead | Cybersecurity & Innovation | Homomorphic Encryption| Researcher | AI & Security | Data Privacy | Innovation & PoC

Amit Rai的更多文章

社区洞察

其他会员也浏览了

9 Features a Good Cold Chain Data Logger Should Have

Viriato MoD - Partial Coverage: Solving challenges using microscopy in medium-term timetabling

Thread Safety in .NET: lock, Semaphore and Mutex

System architecture milestone of 5G Phase 1 is achieved (3GPP)

The Fine Line Between Synchronization and Performance

“An energy-aware scheduling algorithm for big data applications in Spark. Cluster Computing" [Hongjian et al., 2020]: Extended Abstract

kube-controller-manager Controllers: A Glossary

Open Footprint on Fabric Part 2 – Activity Data Models for OPC UA

Kubernetes persistent volume, persistent volume clam, and storage class

Amit Rai的更多文章

Difference between array and arraylist #java

Java Stream API for Bulk Data Operations on Collections

Nashorn

CLUSTER

Parallel Operations

RxJS

HTTP/2 vs HTTP 1.x

Concurrent Accumulators

Exception Propagation

Microbenchmarks

社区洞察

其他会员也浏览了

9 Features a Good Cold Chain Data Logger Should Have

Viriato MoD - Partial Coverage: Solving challenges using microscopy in medium-term timetabling

Thread Safety in .NET: lock, Semaphore and Mutex

System architecture milestone of 5G Phase 1 is achieved (3GPP)

The Fine Line Between Synchronization and Performance

“An energy-aware scheduling algorithm for big data applications in Spark. Cluster Computing" [Hongjian et al., 2020]: Extended Abstract

kube-controller-manager Controllers: A Glossary

Open Footprint on Fabric Part 2 – Activity Data Models for OPC UA

Kubernetes persistent volume, persistent volume clam, and storage class