Focusing on the inherent structure of sensor data
How to develop efficient algorithms that exploit this structure?
Sensor data has structure. Paying attention to that structure helps us identify domain-specific priors that can be imparted to novel data-driven architectures. Some examples include shift and distortion invariance in images, shift invariance in sensor data, and spectral stability in both images and time-series data. Models need to preserve the symmetry, invariance, and equivariance of the data being studied.
Here are some examples of structure in different types of data:
To understand this better, let’s divide the underlying structure into the following two categories:
1. Euclidean structure:
Euclidean structure is characterized by linear spaces or grid-like structures that adhere to the principles of Euclidean geometry. Examples of such data include audio, images, 1-D sensor data, text, etc.
2. Non-Euclidean structure:
Data with non-Euclidean structure live in spaces that do not conform to the axioms of Euclidean geometry and have features such as curved geometry, the lack of a global coordinate system, and variable distance measures. Graphs and manifolds are two primary examples of data with non-Euclidean geometry.
We will look at three specific examples of how the structure (symmetry, invariance, and equivariance) of data has been used to design network architectures:
1. CNNs:
“Convolutional Networks combine three architectural ideas to ensure some degree of shift, scale, and distortion invariance: local receptive fields, shared weights (or weight replication), and spatial or temporal subsampling.
An interesting property of convolutional layers is that if the input image is shifted, the feature map output will be shifted by the same amount, but will be left unchanged otherwise. This property is at the basis of the robustness of convolutional networks to shifts and distortions of the input.
Once a feature has been detected, its exact location becomes less important. Only its approximate position relative to other features is relevant. Not only is the precise position of each of those features irrelevant for identifying the pattern, it is potentially harmful because the positions are likely to vary for different instances of the character. A simple way to reduce the precision with which the position of distinctive features is encoded in a feature map is to reduce the spatial resolution of the feature map. This can be achieved with so-called subsampling layers, which perform a local averaging and a subsampling, reducing the resolution of the feature map and reducing the sensitivity of the output to shifts and distortions.” (LeCun et al., “Gradient-Based Learning Applied to Document Recognition”)
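To make these ideas concrete, here is a minimal NumPy sketch (the signal and the filter are made up for illustration, not learned): shifting the input shifts the convolutional feature map by the same amount, and local averaging plus subsampling keeps only the approximate position of the detected feature.

```python
import numpy as np

# Illustrative 1-D signal with a single "feature" and a hand-picked filter
# (a real CNN would learn the filter weights).
signal = np.zeros(32)
signal[10:14] = 1.0
kernel = np.array([1.0, -1.0, 1.0])

def conv1d(x, w):
    """'Valid' cross-correlation: the basic operation inside a conv layer."""
    n = len(x) - len(w) + 1
    return np.array([np.dot(x[i:i + len(w)], w) for i in range(n)])

def avg_pool(x, size=4):
    """Local averaging + subsampling, as in the subsampling layers above."""
    return x[:len(x) // size * size].reshape(-1, size).mean(axis=1)

# Shift equivariance: shifting the input shifts the feature map by the
# same amount (the feature sits well away from the borders here).
shifted = np.roll(signal, 5)
assert np.allclose(np.roll(conv1d(signal, kernel), 5),
                   conv1d(shifted, kernel))

# Subsampling reduces the resolution of the feature map: after pooling,
# only the approximate (bin-level) position of the feature is retained,
# and a small input shift moves activity around within the same coarse bins.
print(avg_pool(conv1d(signal, kernel)))              # feature in bins 2-3
print(avg_pool(conv1d(np.roll(signal, 2), kernel)))  # still in bins 2-3
```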
2. Defining invariant operations on graphs and manifolds:
Operations such as convolutions and translation don’t directly apply to non-Euclidean data. Let’s consider the following two examples:
A heat kernel changes its shape as its position changes on non-Euclidean domains (such as the graphs and manifolds shown below), indicating a lack of shift invariance. This means that operations like convolution are not readily applicable on non-Euclidean domains (a small numerical sketch follows these two examples).
Similarly, we need to define deformation-invariant convolutional filters for geometric CNNs. As can be seen in the figure below, a Euclidean CNN filter is not invariant to distortion: directly projecting it onto a curved surface changes its support from 2x2 per quadrant to 2x3 per quadrant.
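As a small numerical sketch of the first example (using a hand-built toy graph rather than the domains in the figures), the heat kernel on an irregular graph can be computed from the Laplacian eigendecomposition, and its profile genuinely depends on which node it is centered at:

```python
import numpy as np

# Hand-built 5-node irregular graph (a triangle with a two-node tail);
# purely illustrative, not taken from the article's figures.
A = np.array([[0, 1, 1, 0, 0],
              [1, 0, 1, 0, 0],
              [1, 1, 0, 1, 0],
              [0, 0, 1, 0, 1],
              [0, 0, 0, 1, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A               # graph Laplacian
lam, Phi = np.linalg.eigh(L)                 # eigenvalues / eigenvectors

t = 0.5
H = Phi @ np.diag(np.exp(-t * lam)) @ Phi.T  # heat kernel exp(-t L)

# Column i is the heat kernel "centered" at node i. The sorted profiles
# differ from node to node: the kernels are not shifted copies of one
# template, which is the lack of shift-invariance described above.
for i in range(A.shape[0]):
    print(i, np.round(np.sort(H[:, i])[::-1], 3))
```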
The convolution theorem, readily applicable in the Euclidean domain, needs to be modified for non-Euclidean domains because of this lack of shift invariance.
For Euclidean domain:
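In standard notation (with the hat denoting the Fourier transform), the convolution theorem states that convolution in the spatial domain becomes a pointwise product in the Fourier domain:

```latex
(f \star g)(x) = \int f(x')\, g(x - x')\, dx'
\qquad \Longleftrightarrow \qquad
\widehat{f \star g}(\omega) = \hat{f}(\omega)\, \hat{g}(\omega)
```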
For non-Euclidean domain:
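A common spectral generalization from the geometric deep learning literature replaces the Fourier basis with the matrix Φ of Laplacian eigenvectors, with ⊙ denoting the elementwise product of spectral coefficients:

```latex
f \star g \;=\; \Phi\,\big( (\Phi^{\top} g) \odot (\Phi^{\top} f) \big)
```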
Here are some notes highlighting how the key operators differ between the Euclidean and non-Euclidean settings. Go here for more details.
In Euclidean space, convolution is defined through the Fourier basis.
In non-Euclidean spaces, the eigenfunctions of the Laplacian provide a convolution-like operation via spectral analysis.
The Laplacian eigenfunctions generalize the classical Fourier basis, allowing spectral analysis to be performed on manifolds and graphs.
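As a concrete illustration of these notes, here is a small NumPy sketch of spectral filtering on a toy graph (the graph, the node signal, and the filter are made up for the example): the Laplacian eigenvectors play the role of the Fourier basis, and filtering is a pointwise product of spectral coefficients.

```python
import numpy as np

# Hand-built 5-node path graph and a made-up node signal and filter.
A = np.array([[0, 1, 0, 0, 0],
              [1, 0, 1, 0, 0],
              [0, 1, 0, 1, 0],
              [0, 0, 1, 0, 1],
              [0, 0, 0, 1, 0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A
lam, Phi = np.linalg.eigh(L)       # Laplacian eigenbasis ("Fourier" basis)

f = np.array([1.0, 0.0, 0.0, 0.0, 0.0])   # a signal living on the nodes
f_hat = Phi.T @ f                          # graph Fourier transform of f

g_hat = np.exp(-lam)                       # a low-pass filter, defined spectrally
filtered = Phi @ (g_hat * f_hat)           # pointwise product, then inverse transform

print(np.round(filtered, 3))               # a smoothed version of f
```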
3. Tangent kernels for manifold learning:
Data on manifolds naturally arise in different fields. Some examples include:
3.1 Hyperspheres: model directional data in molecular and protein biology
3.2 Hyperbolic spaces: impedance density estimation
3.3 Symmetric positive definite matrices: Diffusion tensor imaging, functional MRI, ASR
3.4 Lie groups: modeling articulated objects like the human spine
3.5 Stiefel manifolds: processing video action data
3.6 Grassmann manifolds: video-based face recognition and shape recognition in computer vision
3.7 Landmark spaces: Biological shapes
The following diagram shows the mapping between a manifold and its tangent space. You can read more here.
There are three important steps in manifold learning: map the data from the manifold to the tangent space at a chosen base point (via the logarithm map), perform the usual Euclidean computations in that linear tangent space, and map the results back onto the manifold (via the exponential map).
These translations between the manifold and its tangent space help linearize the problem, so that tools from Euclidean machine learning can then be used to analyze data with manifold-like structure.
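Here is a minimal sketch of that round trip for one concrete manifold, the unit sphere (the closed-form log/exp maps below are specific to the sphere; other manifolds use their own expressions):

```python
import numpy as np

def log_map(p, q):
    """Map a point q on the unit sphere to the tangent space at p."""
    theta = np.arccos(np.clip(p @ q, -1.0, 1.0))
    if np.isclose(theta, 0.0):
        return np.zeros_like(p)
    v = q - np.cos(theta) * p
    return theta * v / np.linalg.norm(v)

def exp_map(p, v):
    """Map a tangent vector v at p back onto the unit sphere."""
    norm_v = np.linalg.norm(v)
    if np.isclose(norm_v, 0.0):
        return p.copy()
    return np.cos(norm_v) * p + np.sin(norm_v) * v / norm_v

p = np.array([0.0, 0.0, 1.0])                    # base point (north pole)
q = np.array([1.0, 0.0, 1.0]) / np.sqrt(2.0)     # another point on the sphere

v = log_map(p, q)             # 1) lift to the linear tangent space at p
v_scaled = 0.5 * v            # 2) do Euclidean work there (here: just scale)
q_new = exp_map(p, v_scaled)  # 3) map the result back to the manifold

print(np.round(v, 3), np.round(q_new, 3),
      np.isclose(np.linalg.norm(q_new), 1.0))    # q_new stays on the sphere
```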
An easy-to-visualize example of structure in data is the Takens (delay) embedding:
As can be seen above, the structure of a gravitational wave is clearly visible in its Takens embedding, whereas no such structure is observed for pure noise.
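Here is a small sketch of the construction on synthetic signals (a chirp stands in for the structured signal; this is not the gravitational-wave data from the figure): delayed copies of the series are stacked into embedding vectors, and the structured signal stays near a low-dimensional curve while noise does not.

```python
import numpy as np

def takens_embedding(x, dim=3, delay=5):
    """Stack delayed copies of x into embedding vectors of length `dim`."""
    n = len(x) - (dim - 1) * delay
    return np.column_stack([x[i * delay:i * delay + n] for i in range(dim)])

t = np.linspace(0, 10, 2000)
chirp = np.sin(2 * np.pi * (0.5 + 0.2 * t) * t)      # frequency increases with time
noise = np.random.default_rng(0).standard_normal(len(t))

emb_chirp = takens_embedding(chirp)
emb_noise = takens_embedding(noise)

# The chirp embedding is strongly anisotropic (it hugs a low-dimensional
# curve), while the noise embedding has roughly equal spread in every
# direction, as the singular values show.
print(np.linalg.svd(emb_chirp - emb_chirp.mean(0), compute_uv=False))
print(np.linalg.svd(emb_noise - emb_noise.mean(0), compute_uv=False))
```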
From the above examples, we can see that once we have sufficient insight into the structure of the data, it becomes easier to design architectures that preserve properties such as symmetry, invariance, and equivariance. A good example of this is how the network architecture of AlphaFold 2 incorporated the physics of the structure, such as bond angles and energies, building on AlphaFold 1. A good approach for solving hard scientific-discovery problems with deep learning is to keep incorporating such priors into novel architectures.