Is the Software a Nightmare for #Xilinx #Versal ACAP?
Yousef B. Bedoustani, PhD. Eng.
Principal FPGA & Hardware Engineer
Xilinx has finally shipped Versal, “The World’s First ACAP,” or Adaptive Compute Acceleration Platform.
The important question is: are the software platforms and algorithms ready for the Versal technology? In other words, how can software architectures partition (let alone optimize) the elements of mathematical algorithms among the Scalar Engines (Arm Cortex-A72 and Cortex-R5 cores), the Adaptable Engine (FPGA fabric), and the Intelligent Engines (AI and DSP engines) inside the Versal ACAP?
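To make that partitioning question concrete, here is a minimal sketch in plain C++ of the kind of placement decision involved. Nothing below is a Xilinx or Vitis API; the stage traits and the placement heuristic are assumptions for illustration only.

```cpp
// Hypothetical sketch: partitioning an algorithm across Versal engine types.
// This only models the decision the article describes; it is not the
// Xilinx/Vitis programming model.
#include <iostream>
#include <string>
#include <vector>

enum class Engine { Scalar, Adaptable, Intelligent };

// Coarse traits a partitioner might inspect for each algorithm stage.
struct Stage {
    std::string name;
    bool controlHeavy;     // branching, OS calls, sequential logic
    bool bitLevelCustom;   // irregular, bit-level, custom datapaths
    bool denseVectorMath;  // regular SIMD/MAC-style math (FIR, GEMM, FFT)
};

// A naive heuristic: dense math -> AI/DSP engines, custom datapaths ->
// FPGA fabric, everything else -> the Arm cores.
Engine place(const Stage& s) {
    if (s.denseVectorMath) return Engine::Intelligent;
    if (s.bitLevelCustom)  return Engine::Adaptable;
    return Engine::Scalar;
}

const char* engineName(Engine e) {
    switch (e) {
        case Engine::Scalar:    return "Scalar (Arm A72/R5)";
        case Engine::Adaptable: return "Adaptable (FPGA fabric)";
        default:                return "Intelligent (AI/DSP)";
    }
}

int main() {
    // A made-up radio pipeline, purely for illustration.
    std::vector<Stage> pipeline = {
        {"protocol control", true,  false, false},
        {"packet framing",   false, true,  false},
        {"channel FIR",      false, false, true },
        {"beamforming GEMM", false, false, true },
    };
    for (const auto& s : pipeline)
        std::cout << s.name << " -> " << engineName(place(s)) << '\n';
}
```

Real partitioning is, of course, far harder than this heuristic suggests: the traits interact, and the data-movement cost between engines can dominate the choice.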
Is the high-speed NoC (Network on Chip) interconnect efficient enough for low-latency, high-bandwidth data transfer between the engines?
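To give that question some numbers, here is a back-of-envelope sketch of when inter-engine transfer dominates compute. The latency, bandwidth, and kernel-time figures are made-up assumptions, not Versal specifications.

```cpp
// Back-of-envelope model: when does moving data between engines over the
// NoC dominate the compute it feeds? All numbers are illustrative
// assumptions, not Versal specifications.
#include <cstdio>

int main() {
    const double latency_s  = 1e-6;   // assumed one-way NoC latency: 1 us
    const double bandwidth  = 100e9;  // assumed NoC bandwidth: 100 GB/s
    const double compute_s  = 5e-6;   // assumed kernel time on an AI engine

    for (double bytes : {4e3, 4e5, 4e7}) {
        double transfer_s = latency_s + bytes / bandwidth;
        std::printf("%8.0f KB: transfer %7.2f us vs compute %4.2f us -> %s\n",
                    bytes / 1e3, transfer_s * 1e6, compute_s * 1e6,
                    transfer_s > compute_s ? "NoC-bound" : "compute-bound");
    }
}
```

Even with these optimistic numbers, a few tens of megabytes per kernel invocation makes the pipeline NoC-bound, which is exactly why the partitioning question above matters.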
We know that parallel/manycore processing has faced the same issues. Many applications running on parallel/manycore processors are still inefficient due to the nature of their mathematical algorithms.
Processor-design experience shows that hardware must be designed and optimized around the nature of the mathematical algorithms it will execute. Moreover, existing software platforms and libraries should be considered during the design of new hardware. We know that developing new software platforms for new hardware is time-consuming and expensive. The workaround currently used is to run existing software methods and libraries on new hardware for which they are not necessarily efficient, a fruitless circle!
Is the Software a Nightmare for Xilinx Versal ACAP? Only time will tell.
Area Sales Manager at MACOM Technologies, Driving Business Growth and Engineering Solutions that Enable Customers to Deploy Differentiated Products and Services Focused on Leading-Edge Technologies and Applications
5y One more point. To quote you here: "We know that developing new software platforms for new hardware is time-consuming and expensive." You bet your marbles it is, but it's a necessity more so now than ever. My point earlier about silicon development costs comes courtesy of Joel Hruska at ExtremeTech, which does seem to align with industry opinion: https://www.extremetech.com/computing/272096-3nm-process-node
Accelerating innovation: solving problems with high-performance compute
5y Time has already told that building a compute fabric devoid of application requirements will always yield sub-optimal results. FPGAs deliver only 1% of the performance the silicon is capable of, and if your algorithm doesn't fit the DSP block architecture, you end up with noncompetitive price/performance points compared to GPUs, TPUs, or even DSPs. History is full of suckers who fell for the marketing ploy of being sold peak performance, only to realize that real performance is nowhere near the marketing numbers.
SMTS Product Development Engineer at AMD
5y In a month there will be an event called the Xilinx Developer Forum: https://www.xilinx.com/products/design-tools/developer-forum.html A lot of announcements will be made, and we will be able to say a lot more about the devices and the tools.
SMTS Product Development Engineer at AMD
5y You'll have a lot of answers by the end of the year. Stay tuned!
Founder, Principal Consultant
5y Parallel processing on architectures such as Versal will not be easy. However, there is no other way to sustain the demand for compute power. Software folks like known-to-work libraries, but it will take cross-discipline efforts to optimise algorithms on such architectures. Just remember what the IBM team did with a 256-GPU cluster for ML training. Similar breakthrough performance is enabled by Versal.