登录查看更多内容

‘Must-Read’ AI Papers Suggested by Experts - Pt 2

Nikita Johnson

Founder | Coach | Entrepreneur

发布日期: 2020年8月14日

+ 关注

Due to the overwhelming response to our previous expert paper suggestion blog, we had to do another. We asked some of our expert community the papers they would suggest everybody read when working in the field.

Haven't seen the first blog? You can read the recommendations of Andrew Ng, Jeff Clune, Myriam Cote and more here.

--------

Alexia Jolicoeur-Martineau, PhD Researcher, MILA

f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization - Sebastian Nowozin et al.

https://arxiv.org/pdf/1711.04894.pdf

Alexia suggested this paper as it explains how many classifiers can be thought of as estimating an f-divergence. Thus, GANs can be interpreted as estimating and minimizing a divergence. This paper from Microsoft Research clearly maps the experiments undertaken, methods and related work to support. Read this paper here.

Sobolev GAN - Youssef Mroueh et al.

https://arxiv.org/pdf/1711.04894.pdf

This paper shows how the gradient norm penalty (used in the very popular WGAN-GP) can be thought of as constraining the discriminator to have its gradient in a unit-ball. The paper is very mathematical and complicated, but the key message is that we can apply a wide variety of constraints to the discriminator/critic. These constraints help prevent the discriminator from becoming too strong. I recommend focusing on Table 1, which shows the various different constraints that can be used. I have come back many times to this paper just to look at Table 1. You can read this paper here.

Jane Wang, Senior Research Scientist, DeepMind

To be honest, I don't believe in singling out any one paper as being more important than the rest, since I think all papers build on each other, and we should acknowledge science as a collaborative effort. I will say that there are some papers I've enjoyed reading more than others, and that I've learned from, but others might have different experiences, based on their interest and background. That said, I've enjoyed reading the following:

Where Do Rewards Come From? - Satinder Singh et al.

https://all.cs.umass.edu/pubs/2009/singh_l_b_09.pdf

This paper advances a general computational framework for reward that places it in an evolutionary context, formulating a notion of an optimal reward function given a fitness function and some distribution of environments. Novel results from computational experiments show how traditional notions of extrinsically and intrinsically motivated behaviors may emerge from such optimal reward functions. You can read this paper here.

Building machines that learn and think like people - Brenden Lake et al

https://www.cambridge.org/core/journals/behavioral-and-brain-sciences/article/building-machines-that-learn-and-think-like-people/A9535B1D745A0377E16C590E14B94993

This paper reviews progress in cognitive science suggesting that truly human-like learning and thinking machines will have to reach beyond current engineering trends in both what they learn and how they learn it. Specifically, we argue that these machines should (1) build causal models of the world that support explanation and understanding, rather than merely solving pattern recognition problems; (2) ground learning in intuitive theories of physics and psychology to support and enrich the knowledge that is learned; and (3) harness compositionality and learning-to-learn to rapidly acquire and generalize knowledge to new tasks and situations. Read more on this paper here.

Jekaterina Novikova, Director of Machine Learning, WinterLight Labs

Attention Is All You Need - Ashish Vaswani et al

https://arxiv.org/abs/1706.03762

Novel large neural language models like BERT or GPT-2/3 were developed soon after NLP scientists realized in 2017 that "Attention is All You Need". The exciting results produced by these models caught the attention of not just ML/NLP researchers but also the general public. For example, GPT-2 caused almost mass hysteria in 2019 as a model that is "too dangerous to be public" as it can potentially generate fake news indistinguishable from real news articles. The GPT-3, which was only released several weeks ago, has already been called "the biggest thing since bitcoin". You can read this paper here.

Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data - Emily M. Bender et al.

https://www.aclweb.org/anthology/2020.acl-main.463.pdf

To outweigh the hype, I would recommend everyone to read a great paper that was presented and recognized as the best theme paper at the ACL conference in the beginning of July 2020 - "Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data". In the paper, the authors argue that while the existing models, such as BERT or GPT, are undoubtedly useful, they are not even close to human-analogous understanding of language and its meaning. The authors explain that understanding happens when one is able to recover the communicative intent of what was said. As such, it is impossible to learn and understand language if language is not associated with some real-life interaction, or in other words - "meaning cannot be learned from form alone". This is why even very large and complex language models can only learn a "reflection" of meaning but not meaning itself. Read more on the paper here.

Eric Charton, Senior Director, AI Science, National Bank of Canada

The Computational Limits of Deep Learning - Johnson et al

https://arxiv.org/abs/2007.05558

This recent paper from MIT and IBM Watson Lab is a meta-analysis of DL publications highlighting the correlation between increase of computational consumption to train DL models and performances evolution. It also states the fact that performances progress is slowing as computation capacities increase. You can read more on this paper here.

Survey on deep learning with class imbalance. Journal of Big Data, 6(1), 27.

https://link.springer.com/article/10.1186/s40537-019-0192-5

This suggestion is an exhaustive paper about how the class imbalance problem (present in many industrial applications like credit modelling, fraud detection or medical like cancer detection) is handled by DL algorithms. The survey concludes with a discussion that highlights various gaps in deep learning from class imbalanced data and open multiple tracks for future research. You can read more on this paper here.

Anirudh Koul, Machine Learning Lead, NASA

The one silver lining of 2020 will be the revolution of self-supervision, aka pretraining without labels, and then fine-tuning for a downstream task with limited labels. The state of the art metrics have been shattered more times than the months so far this year. PIRL, SimCLR, InfoMin, MOCO, MOCOv2, BYOL, SwAV, SimCLRv2 are just a few well-known names from this year (some serious FOMO), and that is by June 2020. To admire the current state of the art, pretraining on ImageNet without labels, and then fine-tuning with 1% labels, SimCLRv2 models are able to achieve 92.3% Top-5 accuracy on ImageNet dataset. Yup, with just 1% labels. This has huge practical applications on datasets with way more data than labels (think medical, satellite, etc).

A Simple Framework for Contrastive Learning of Visual Representations - Ting Chen et al

https://www.aclweb.org/anthology/2020.acl-main.463.pdf

Great papers don't just have exceptional results and rigorous experimentation, they also convey their key thoughts in a simple manner. And luckily, SimCLR has simple in its very name, putting it among the first papers worth reading in the area of contrastive learning. Among many learnings, it shows the critical role of data augmentation strategies during contrastive learning specific to your dataset domain to obtain a better representation of images. I expect many papers and tools inspired by SimCLR to come up in the future, addressing X-Rays, MRIs, audio, satellite imagery, and more.

CONTINUE READING ON THE RE?WORK BLOG HERE.

Interested in hearing more from our AI industry experts? We are hosting live digital talks with over 50 experts in just three weeks time. See more on this here.

Bogdan Grigorescu

Sr Tech Lead | Engineering | Automation

4 年

A classic is missing from the list... https://phil415.pbworks.com/f/TuringComputing.pdf

2 次回应

要查看或添加评论，请登录

Nikita Johnson的更多文章

Experts' advice and insights into the past, present, and future of AI

2022年7月22日

Experts' advice and insights into the past, present, and future of AI

Did you miss this month's RE?WORK Newsletter? Check out the highlights below: 5 Tips for a Successful AI Project 85% of…

1 条评论
The 13 ‘Must-Read’ AI Papers of 2021

2021年12月8日

The 13 ‘Must-Read’ AI Papers of 2021

As we approach the end of 2021, we wanted to share 13 of the most important AI papers of the year, as selected by the…

2 条评论
Interview with Inioluwa Deborah Raji, Forbes Tech 30 Under 30 Pick

2021年2月10日

Interview with Inioluwa Deborah Raji, Forbes Tech 30 Under 30 Pick

Inioluwa Deborah Raji has been in the news quite a lot recently, mainly due to her inclusion in the Forbes 30 Under 30…
AI for Peace - Interview with Branka Panic

2021年1月12日

AI for Peace - Interview with Branka Panic

An interview with Branka Panic, Founder and CEO of AI for Peace, a not-for-profit organisation focused on studying and…
The AI Roundup - Top 15 Blogs of 2020

2021年1月5日

The AI Roundup - Top 15 Blogs of 2020

It's been a busy year, even if interrupted slightly by you know what..
AI Experts Predict 2021 Trends

2020年12月15日

AI Experts Predict 2021 Trends

2020 hasn't quite gone to plan for many of us, but here's to 2021! We asked some of our AI expert community what they…

1 条评论
30 Influential Women Advancing AI in 2020

2020年12月8日

30 Influential Women Advancing AI in 2020

Let's be honest, 2020 hasn't been the best year, seemingly, on the surface at least, bringing the world to a halt. That…

21 条评论
20+ Pieces Of Advice From AI Experts To Those Starting Out In The Field

2020年11月17日

20+ Pieces Of Advice From AI Experts To Those Starting Out In The Field

Following on from our previous expert-led series, we asked our community of AI experts what advice they would give to…

3 条评论
Experts Predict The Next AI Hub

2020年10月1日

Experts Predict The Next AI Hub

Having now asked our experts their must-read AI papers and their roadblock in AI forecast, we wanted to finish our…
Women in AI & Engineering - 100+ Pioneer Podcasts

2020年6月23日

Women in AI & Engineering - 100+ Pioneer Podcasts

In celebration of International Women in Engineering Day (#INWED20), we have collated over 100 podcasts featuring Women…

3 条评论

See all articles

‘Must-Read’ AI Papers Suggested by Experts - Pt 2

Nikita Johnson

Founder | Coach | Entrepreneur

Due to the overwhelming response to our previous expert paper suggestion blog, we had to do another. We asked some of our expert community the papers they would suggest everybody read when working in the field.

Alexia Jolicoeur-Martineau, PhD Researcher, MILA

f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization - Sebastian Nowozin et al.

Sobolev GAN - Youssef Mroueh et al.

Jane Wang, Senior Research Scientist, DeepMind

Where Do Rewards Come From? - Satinder Singh et al.

Building machines that learn and think like people - Brenden Lake et al

Jekaterina Novikova, Director of Machine Learning, WinterLight Labs

Attention Is All You Need - Ashish Vaswani et al

Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data - Emily M. Bender et al.

Eric Charton, Senior Director, AI Science, National Bank of Canada

The Computational Limits of Deep Learning - Johnson et al

Survey on deep learning with class imbalance. Journal of Big Data, 6(1), 27.

Anirudh Koul, Machine Learning Lead, NASA

A Simple Framework for Contrastive Learning of Visual Representations - Ting Chen et al

Nikita Johnson的更多文章

社区洞察

其他会员也浏览了

Best of last 30 days intelligence

AI/ML Digest | Issue 37

Stability AI DeepFloyd 4.3b Text To Image Model Review and Full How To Use On Kaggle (free account) Tutorial

The Quest for Truly 'Reasoning' AI: Beyond LLMs

Here are 11 Super ?? Cool AI Research Papers ALONG with SUMMARY from CMU (2024)

The Hitchhiker's Guide to Artificial Intelligence

Simple, Elegant, Convincing, and Wrong: The fallacy of ‘Explainable AI’ and how to fix it, part 4

Why we should (not) fear AI

XAI : The WHY

AI Leading the Charge Towards a Unified Theory through Interdisciplinary Convergence

Due to the overwhelming response to our previous expert paper suggestion blog, we had to do another. We asked some of our expert community the papers they would suggest everybody read when working in the field.

Alexia Jolicoeur-Martineau, PhD Researcher, MILA

f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization - Sebastian Nowozin et al.

Sobolev GAN - Youssef Mroueh et al.

Jane Wang, Senior Research Scientist, DeepMind

Where Do Rewards Come From? - Satinder Singh et al.

Building machines that learn and think like people - Brenden Lake et al

Jekaterina Novikova, Director of Machine Learning, WinterLight Labs

Attention Is All You Need - Ashish Vaswani et al

Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data - Emily M. Bender et al.

Eric Charton, Senior Director, AI Science, National Bank of Canada

The Computational Limits of Deep Learning - Johnson et al

Survey on deep learning with class imbalance. Journal of Big Data, 6(1), 27.

Anirudh Koul, Machine Learning Lead, NASA

A Simple Framework for Contrastive Learning of Visual Representations - Ting Chen et al

Nikita Johnson的更多文章

Experts' advice and insights into the past, present, and future of AI

The 13 ‘Must-Read’ AI Papers of 2021

Interview with Inioluwa Deborah Raji, Forbes Tech 30 Under 30 Pick

AI for Peace - Interview with Branka Panic

The AI Roundup - Top 15 Blogs of 2020

AI Experts Predict 2021 Trends

30 Influential Women Advancing AI in 2020

20+ Pieces Of Advice From AI Experts To Those Starting Out In The Field

Experts Predict The Next AI Hub

Women in AI & Engineering - 100+ Pioneer Podcasts

社区洞察

其他会员也浏览了

Best of last 30 days intelligence

AI/ML Digest | Issue 37

Stability AI DeepFloyd 4.3b Text To Image Model Review and Full How To Use On Kaggle (free account) Tutorial

The Quest for Truly 'Reasoning' AI: Beyond LLMs

Here are 11 Super ?? Cool AI Research Papers ALONG with SUMMARY from CMU (2024)

The Hitchhiker's Guide to Artificial Intelligence

Simple, Elegant, Convincing, and Wrong: The fallacy of ‘Explainable AI’ and how to fix it, part 4

Why we should (not) fear AI

XAI : The WHY

AI Leading the Charge Towards a Unified Theory through Interdisciplinary Convergence