Year of the Compiler

Year of the Compiler

By Jeremy Fowers

I’ve had an eye-opening month and I have three stories to share. Groq recently kicked off the release of our “early adopter” SDK, marking Groq Compiler as the primary means of programming GroqChip? accelerators.

Now, I’ve built my career on kernel optimization – meticulously coding important workloads specifically for targeted hardware . Sometimes Verilog has felt a little too high level for my needs. While I’m happy to attend events here like Groq-a-thon - an all-day Groq hackathon - to try out the early adopter SDK, I’m also confident my habits and experience will send me back to my kernel optimizing ways the next morning.

No alt text provided for this image

But change is in the air. First, we get a drop of over 100 LSTM-based models from a customer. Before I can do so much as load up VS Code, my teammate Lev has kicked off a distributed cluster to run Groq Compiler on the entire drop.?

Not only do some compiled programs beat my hand-coded benchmark, but the whole set comes in at an average of 16x speedup over Nvidia A100 and redefines what is possible within the customer’s latency requirement. No kernel engineering needed here.

The next week, another customer sends us a model that mixes LSTM and Transformer layers. I could code this by hand, but I give Groq Compiler first crack at the problem. After a little slicing and dicing of the ONNX file – this compiler is good but not perfect yet – I am looking at a result that offers over 100x speedup compared to the reference implementation.?

No alt text provided for this image

Finally, my teammate Chetan asks me if we should try supporting his favorite Transformer, ELECTRA. I’ve never heard of it before, and nobody at Groq had worked on it yet, but I may as well take a look. ELECTRA is an improved version of BERT that changes up the hyperparameters and adds a projection layer at the top. These differences sound benign, but it could still take considerable effort to adapt an optimized handwritten BERT into optimized ELECTRA.

So I put it through Groq Compiler, and it just worked. Same performance advantage as our BERT.

This week a friend asked me if I’m worried about my skill set being obsolete. I told him no, hung up the phone, and went merrily back to learning PyTorch.

Interesting in seeing the Groq Compiler in action? Reach out to?[email protected] ?to learn how you can participate in the early adopter program.

Read more Groq insights here .

Header image credit: Photo by?Behnam Norouzi ?on?Unsplash

Image one credit: Photo by?Yuichi Kageyama ?on?Unsplash ?

Image two credit: Photo by?charlesdeluvio ?on?Unsplash

Igor Tolstopalov

President | Founder at Flash - Revolutionizing BNPL for Wholesalers | Innovating Products People Love

2 年

Ramakrishnan quite interesting !

回复
Jonathan Ross

groq :: ceo + founder

2 年

Want to learn about how our compiler works, come to bit.ly/groqday2 this Thursday, or our Meet and Geek in Mountain View, Thursday at 7PM. See you there!

要查看或添加评论,请登录

社区洞察

其他会员也浏览了