登录查看更多内容

5 More arXiv Deep Learning Papers, Explained

Gregory Piatetsky-Shapiro

Part-time philosopher, Retired, Data Scientist, KDD and KDnuggets Founder, was LinkedIn Top Voice on Data Science & Analytics. Currently helping Ukrainian refugees in MA.

发布日期: 2016年1月11日

arXiv, maintained by Cornell University, is a popular open access academic paper preprint repository. It is an outlet for cutting edge research in numerous scientific fields, including machine learning. Mirroring the current general trend in academia, much of the recent posted machine learning research is deep learning related.

Hugo Larochelle, PhD, is a Université de Sherbrooke machine learning professor (on leave), Twitter research scientist, noted neural network researcher, and deep learning aficionado. Since late summer 2015, he has been drafting and publicly sharing notes on arXiv machine learning papers that he has taken an interest in.

A previous KDnuggets article outlined and explained a selection of 5 arXiv machine learning papers that Hugo has read and shared notes on. In an effort to help us better understand new research, this article will present and summarize 5 additional arXiv papers, and will share excerpts from Hugo's notes in order to provide some additional perspective and critique. Links to all original papers, abstracts, and explanatory notes are also included. It is hoped that having top deep learning papers explained by a noted expert in the field will make some of the more complex aspects of the science more approachable.

1. Infinite Dimensional Word Embeddings

Authors: Eric Nalisnick, Sachin Ravi
Date posted to arXiv: 17 Nov 2015

Abstract (excerpt):
We describe a method for learning word embeddings with stochastic dimensionality. Our Infinite Skip-Gram (iSG) model specifies an energy-based joint distribution over a word vector, a context vector, and their dimensionality. By employing the same techniques used to make the Infinite Restricted Boltzmann Machine (Cote & Larochelle, 2015) tractable, we define vector dimensionality over a countably infinite domain, allowing vectors to grow as needed during training.

Hugo's Two Cents (excerpt):

This is a quite original use of our "infinite dimensions" trick we introduced in the iRBM. It wasn't entirely "plug and play" either, and the authors had to be smart in the approximations they proposed for training the iSG.

The qualitative results showing how the conditional on the number of dimensions contain information about polysemy are really neat! One assumption behind distributed word embeddings is that they should be able to represent the multiple meanings of words using different dimensions, so it's nice to see that this is exactly what is being learned here.

I think the only thing missing in this paper are comparisons with regular skipgram and perhaps other word embeddings methods on a specific task or on a word similarity task. In v2 of this paper, the authors do mention they are working on such results, so I'm looking forward to seeing those!

Read the full post on KDnuggets 5 More arXiv Deep Learning Papers, Explained

https://www.kdnuggets.com/2016/01/more-arxiv-deep-learning-papers-explained.html

要查看或添加评论，请登录

Gregory Piatetsky-Shapiro的更多文章

KDnuggets: Personal History and Nuggets of Experience

2021年12月4日

KDnuggets: Personal History and Nuggets of Experience

Dear Readers, I have big news! After 40+ years of working full time, including 35+ years of data mining/KDD/data…

160 条评论
Which Data Science Skills are core and which are hot/emerging ones?

2019年9月17日

Which Data Science Skills are core and which are hot/emerging ones?

The latest KDnuggets Poll asked 1. Which skills / knowledge areas do you currently have (at the level you can use in…

30 条评论
Gainers, Losers, and Trends in Gartner 2019 Magic Quadrant for Data Science and Machine Learning Platforms

2019年2月11日

Gainers, Losers, and Trends in Gartner 2019 Magic Quadrant for Data Science and Machine Learning Platforms

For the first time in several years the name of this highly anticipated Gartner MQ for Data Science and Machine…

10 条评论
AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

2018年12月4日

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

As in the past, we bring you a roundup of predictions and analysis from experts. We have asked What were the main…

6 条评论
How Important is that Machine Learning Model be Understandable?

2018年11月19日

How Important is that Machine Learning Model be Understandable?

The previous KDnuggets Poll asked When building Machine Learning / Data Science models in 2018, how often was it…

10 条评论
Anticipating the next move in data science – my interview with Thomson Reuters

2018年11月18日

Anticipating the next move in data science – my interview with Thomson Reuters

Thomson Reuters has a series, AI experts, where they interview thought leaders from different areas - including…

11 条评论
Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

2018年10月31日

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

The latest KDnuggets Poll asked: What was the largest dataset you analyzed / data mined? This poll received 1108 votes,…

5 条评论
How many Data Scientists are there and is there a shortage?

2018年9月19日

How many Data Scientists are there and is there a shortage?

(this blog was jointly written with Preet Gandhi, NYU) The 2011 McKinsey report on Big Data said that “The United…

8 条评论
Why Germany did not defeat Brazil in the final, or Data Science lessons from the World Cup

2018年7月30日

Why Germany did not defeat Brazil in the final, or Data Science lessons from the World Cup

This article is based on a KDnuggets blog jointly written with Dan Clark. The 2018 World Cup is over, with France…

45 条评论
SuperDataScience Podcast: Insights from the Founder of KDnuggets

2018年7月23日

SuperDataScience Podcast: Insights from the Founder of KDnuggets

I recently appeared on Super DataScience Podcast, where I had an interesting conversation with SDS Founder Kirill…

4 条评论

See all articles

5 More arXiv Deep Learning Papers, Explained

Gregory Piatetsky-Shapiro

Part-time philosopher, Retired, Data Scientist, KDD and KDnuggets Founder, was LinkedIn Top Voice on Data Science & Analytics. Currently helping Ukrainian refugees in MA.

Gregory Piatetsky-Shapiro的更多文章

社区洞察

其他会员也浏览了

Top 5 arXiv Deep Learning Papers, Explained

Which are the Top Deep Learning Algorithms?

Understanding Deep Neural Networks Training Course

Training, Validation & Accuracy in PyTorch

Configure Deep Learning Architecture

The 5 Deep Learning Frameworks Every Serious Machine Learner Should Be Familiar With

Training, Validation & Accuracy in PyTorch

Deep Learning Guide: Introduction to Implementing Neural Networks using TensorFlow in Python

Deep Learning Summit Toronto: What to expect

Intro to Machine Learning ? Introduction to Machine Learning: A Historical Perspective

Gregory Piatetsky-Shapiro的更多文章

KDnuggets: Personal History and Nuggets of Experience

Which Data Science Skills are core and which are hot/emerging ones?

Gainers, Losers, and Trends in Gartner 2019 Magic Quadrant for Data Science and Machine Learning Platforms

AI, Data Science, Analytics Main Developments in 2018 and Key Trends for 2019

How Important is that Machine Learning Model be Understandable?

Anticipating the next move in data science – my interview with Thomson Reuters

Amazing consistency: Largest Dataset Analyzed / Data Mined – Poll Results and Trends

How many Data Scientists are there and is there a shortage?

Why Germany did not defeat Brazil in the final, or Data Science lessons from the World Cup

SuperDataScience Podcast: Insights from the Founder of KDnuggets

社区洞察

其他会员也浏览了

Top 5 arXiv Deep Learning Papers, Explained

Which are the Top Deep Learning Algorithms?

Understanding Deep Neural Networks Training Course

Training, Validation & Accuracy in PyTorch

Configure Deep Learning Architecture

The 5 Deep Learning Frameworks Every Serious Machine Learner Should Be Familiar With

Training, Validation & Accuracy in PyTorch

Deep Learning Guide: Introduction to Implementing Neural Networks using TensorFlow in Python

Deep Learning Summit Toronto: What to expect

Intro to Machine Learning ? Introduction to Machine Learning: A Historical Perspective