ç™»å½•æŸ¥çœ‹æ›´å¤šå†…å®¹

Vectorization Part 2 â€“ Why and What?

Rohan Douglas

Founder and CEO at Quantifi Inc

å‘å¸ƒæ—¥æœŸ: 2017å¹´6æœˆ23æ—¥

New challenges in the financial markets driven by changes in market structure and regulations and accounting rules like Basel III, EMIR, Dodd Frank, MiFID II, Solvency II, IFRS 13, IRFS 9, and FRTB have increased demand for higher performance risk and analytics. Problems like XVA require orders of magnitude more calculations for accurate results. This demand for higher performance has put a focus on how to get the most out of the latest generation of hardware.

This is the second in a series of blogs on Vectorization which is a key tool for dramatically improving the performance of code running on modern CPUs. Vectorization is the process of converting an algorithm from operating on a single value at a time to operating on a set of values at one time. Modern CPUs provide direct support for vector operations where a single instruction is applied to multiple data (SIMD).

In my last blog I cover how CPUs have evolved and how software must leverage both Threading and Vectorization to get the highest performance possible from the latest generation of processors.

In this blog I cover the why and what of Vectorization.

Why Vectorize

Vectorization is the process of converting an algorithm from operating on a single value at a time to operating on a set of values (vector) at one time.

Modern CPUs provide direct support for vector operations where a single instruction is applied to multiple data (SIMD). For example a CPU with a 512 bit register could hold 16 32-bit single precision doubles and do a single calculation 16 times faster than executing a single instruction at a time. Combine this with threading and multi-core CPUs leads to orders of magnitude performance gains.

The following is code to add two vectors.

for (i = 0; i < 4; i++)

 c[i] = a[i] + b[i];

In a serial calculation, the individual vector (array) elements are added in sequence. The additional register space in modern CPUs is unused.

In a vectorized calculation, all elements of the vector (array) can be added in one calculation step.

What kind of problem is vectorizable?

Not all code can take advantage of vectorization. The problem set must be amenable to a vectorized solution. Vectorization works best on problems that require the same simple operation to be performed on each element in a data set. So, first of all, look for a loop. The prototypical example is used above - the addition of each element in an array.

for (i = 0; i < count; i++)

 c[i] = a[i] + b[i];

But many other primitive operators can also be vectorized. The kinds of matrix transformation seen in linear algebra are usually a good candidate for vectorization. The good news is that the Finance domain provides many problem sets that are suitable.

Issues that impact Vectorizing your code

There are a range of issues that can impact the effectiveness of vectorisation. Some of the more common ones include:

1. Loop Dependencies (Avoid read-after-write)

for (i = 1; i < end; i++)

 f[i] = f[i-1] + b[i-1];

2. Indirect Memory Access (Use loop index directly. Seek unit loop stride)

for (i = 0; i < end; i++)   

  c[idxC[i]] = a[i] + b[i];

3. Non â€˜Straight lineâ€™ code (function calls, conditions, unknown loop count)

for (i = 0; i < CalcEnd(); i++)

{                              

 if (DoJump())

   i += CalcJump(); 

  c[i] = a[i] + b[i];

}

Resources

Vectorization, Kirill Rogozhin, Intel, March 2017

Vectorization of Performance Dies for the Latest AVX SIMD, Kevin Oâ€™Leary, Intel, Aug 2016

A Guide to Vectorization with Intel? C++ Compilers, Intel, Nov 2010

Vectorization Codebook, Intel, Sep 2015

The Free Lunch Is Over - A Fundamental Turn Toward Concurrency in Software, Herb Sutter, March 2005

Recipe: Using Binomial Option Pricing Code as Representative Pricing Derivative Method, Shuo-li, Intel, June 2016

è¦æŸ¥çœ‹æˆ–æ·»åŠ è¯„è®ºï¼Œè¯·ç™»å½•

Rohan Douglasçš„æ›´å¤šæ–‡ç«

What Trends are Impacting Asset Managers and Asset Owners?

2019å¹´6æœˆ4æ—¥

What Trends are Impacting Asset Managers and Asset Owners?

Forward-looking investment management firms are searching for ways to outperform their peers. The firms that we seeâ€¦

1 æ¡è¯„è®º
Vectorization Part 4 - Application to CVA Aggregation

2017å¹´7æœˆ26æ—¥

Vectorization Part 4 - Application to CVA Aggregation

New challenges in the financial markets driven by changes in market structure and regulations and accounting rules likeâ€¦

1 æ¡è¯„è®º
Vectorization Part 3 - Implemention

2017å¹´7æœˆ6æ—¥

Vectorization Part 3 - Implemention

New challenges in the financial markets driven by changes in market structure and regulations and accounting rules likeâ€¦

8 æ¡è¯„è®º
Vectorization Part 1 â€“ The Rise of Parallelism

2017å¹´6æœˆ15æ—¥

Vectorization Part 1 â€“ The Rise of Parallelism

New challenges in the financial markets driven by changes in market structure and regulations and accounting rules likeâ€¦

12 æ¡è¯„è®º
Last chance to register for FRTB webinar

2017å¹´1æœˆ25æ—¥

Last chance to register for FRTB webinar

Last chance to register for Quantifi & Kauri Solutions' webinar on FRTB for Wen 25th January, 2017 at 10am EST / 3pmâ€¦
What can we learn from history?

2016å¹´11æœˆ11æ—¥

What can we learn from history?

I recently finished an excellent book about the history of Rome that covers a 1,000 years of it's history from anâ€¦

4 æ¡è¯„è®º
Quantifi London Conference, October 4th

2016å¹´9æœˆ6æ—¥

Quantifi London Conference, October 4th

Quantifi is pleased to announce that registration is now OPEN for our 2016 Annual Conference! Join senior practitionersâ€¦
Quantifi New York Conference, 29th October

2015å¹´10æœˆ28æ—¥

Quantifi New York Conference, 29th October

Please join us for our 2nd annual risk conference next week - The Dynamics Driving OTC Markets. We have a great speakerâ€¦
Please help me with Team for Kids in the NYC Marathon

2015å¹´9æœˆ28æ—¥

Please help me with Team for Kids in the NYC Marathon

This year Iâ€™ve decided to take the plunge and run the New York City Marathon. Having not run much before this year, itâ€¦
British Heart Foundation London to Brighton bike ride

2015å¹´5æœˆ24æ—¥

British Heart Foundation London to Brighton bike ride

We're taking part in the British Heart Foundation London to Brighton bike ride on Sunday June 21st. Spare a thought forâ€¦

See all articles

Vectorization Part 2 â€“ Why and What?

Rohan Douglas

Founder and CEO at Quantifi Inc

Why Vectorize

What kind of problem is vectorizable?

Issues that impact Vectorizing your code

Resources

Rohan Douglasçš„æ›´å¤šæ–‡ç«

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Ingenious Ways Developers Use typeof

A Survey of Computing Paradigms - From Literature to Machines

Introduction to Error Detection and Correction #2: Reed-Solomon

SPSC Queue Part 2: Going Atomic

?? Optimizing JVM with G1 Garbage Collector (G1GC)

Ai-Thinker's latest offline voice module factory firmware tutorial: Is the SDK open source?

Cache

Can you provide an explanation of how a computer stores, fetches, and executes instructions? What is the underlying model for this process?

exFAT2-IP: CPU-Free File System with Two-User for NVMe

A step-by-step guide to install Intel Advisor and analyze a sample application and find out where Vectorization matters the most

Why Vectorize

What kind of problem is vectorizable?

Issues that impact Vectorizing your code

Resources

Rohan Douglasçš„æ›´å¤šæ–‡ç«

What Trends are Impacting Asset Managers and Asset Owners?

Vectorization Part 4 - Application to CVA Aggregation

Vectorization Part 3 - Implemention

Vectorization Part 1 â€“ The Rise of Parallelism

Last chance to register for FRTB webinar

What can we learn from history?

Quantifi London Conference, October 4th

Quantifi New York Conference, 29th October

Please help me with Team for Kids in the NYC Marathon

British Heart Foundation London to Brighton bike ride

ç¤¾åŒºæ´žå¯Ÿ

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†

Ingenious Ways Developers Use typeof

A Survey of Computing Paradigms - From Literature to Machines

Introduction to Error Detection and Correction #2: Reed-Solomon

SPSC Queue Part 2: Going Atomic

?? Optimizing JVM with G1 Garbage Collector (G1GC)

Ai-Thinker's latest offline voice module factory firmware tutorial: Is the SDK open source?

Cache

Can you provide an explanation of how a computer stores, fetches, and executes instructions? What is the underlying model for this process?

exFAT2-IP: CPU-Free File System with Two-User for NVMe

A step-by-step guide to install Intel Advisor and analyze a sample application and find out where Vectorization matters the most

å…¶ä»–ä¼šå‘˜ä¹Ÿæµè§ˆäº†