登录查看更多内容

Finding your wifi password in 512-dimensional space

qdive

Passion for data

发布日期: 2022年9月29日

We recently participated in kaggle’s and AI Village’s capture-the-flag @ DEFCON competition, which considers different aspects of security in data science. We’d like to tell you about some of the mini-challenges that we solved and their implications. Maybe the one with the highest wow-effect was finding a wifi password in a really high-dimensional space. A mixture of data science and escape room, as we will see. The challenge text stated:

You really need to check your email, unfortunately you don't know the password. Fortunately, someone wrote it down. Unfortunately, it's written down on a low-dimensional manifold embedded in a very high-dimensional space. Check out the wifi/Embedded characters.npz file -- a list of tokens is given in the tokens key with their corresponding embeddings in the same order under the embeddings key -- and recover the password.

A little confusing, you say? No problem, let’s apply some investigative Data Science to figure it out!

We are given two items: A large 182x512 matrix…

and the following token sequence, which happens to contains 182 characters: !!""##$$%%&&''(())**++,,--..//00112233445566778899::;;<<==>>??@@AABBCCDDEEFFGGHHIIJJKKLLMMNNOOPPQQRRSSTTUUVVWWXXYYZZ[[\\]]^^__``aabbccddeeffgghhiijjkkllmmnnooppqqrrssttuuvvwwxxyyzz{{||}}~~

领英推荐

Unveiling the Art of Time Series Analysis: Choosing…

360DigiTMG 5 个月前

Datasets/ Data Sources and where to find them, ????.

Women in Data Africa 1 年前

Unlocking Insights with Anomaly Detection in Data

Handson School Of Data Science Management & Technology 1 年前

What now? Each character in the token sequence seems to correspond to a row in the matrix, but 512 dimensions is a little too large to analyze comfortably. Could we reduce the number of dimensions to something manageable? Well, yes, we can! Meet Principal Component Analysis (affectively known among data scientists as “PCA”): An algorithm that reduces a huge 182x512 matrix (i.e. 182 points with 512 dimensions each) to something with less dimensions, but as much information about the original data as possible. Something like 182x2 would be nice and easy to visualize. We can plot the results to produce the following beautiful graph:

Incredible! Now the question remains: How do we extract the password from here? Maybe after some thinking the escape room-savvy reader guessed it: Assign the i-th letter in the token sequence to the i-th point in the spiral and then read the result outwards: FLAG{TURNED}0123456789abcdefghijklmnopqrstu… and so on That’s your password!

Why is this relevant to security and data science? On the one hand, it shows how powerful dimensionality reduction can be. It transforms an intractable problem into an intuitive one. Most importantly, however, it demonstrates how to hide information in plain sight. Who needs quantum computer-proof encryption algorithms if you can hide it with a little help of statistics?

Stay tuned for more exciting data science posts!

Yacine Benyamina

Unity Developer | Data Science and AI student.

12 个月

hello is it possible to provide the code for this?

要查看或添加评论，请登录

qdive的更多文章

See all articles

Finding your wifi password in 512-dimensional space

qdive

Passion for data

领英推荐

qdive的更多文章

社区洞察

其他会员也浏览了

Industry Keynote

How data.world began

Notes on Data Compression: Part 1

Big Data: How The Amazing Insights From Video Are Changing The World

Analyze real data, data scientists!

Knowledge As Elementary Information Node Graph A.k.a. the EINGRAPH

Business Intelligence, Data Science and its Impacts on Public Security Forces

Bigdata Problem

A DREAM CALLED BIG DATA

The Forgotten Data: What Happens to the Data We Don’t Use?

领英推荐

qdive的更多文章

Using Machine Learning for pricing in B2B sales

qdive goes to space

Django vs. FastAPI

The first 100 days after returning from maternity leave - an interview with our qdiver Julia

New members of the qdive family in early 2022

qdive.io is looking to a very exciting future!

Building an ML-based search engine: Introduction

qdive Offsite Report - exciting discussions and a lot of fun

Data Science Trends - Meet the Experts

Porsche AI Coding days

社区洞察

其他会员也浏览了

Industry Keynote

How data.world began

Notes on Data Compression: Part 1

Big Data: How The Amazing Insights From Video Are Changing The World

Analyze real data, data scientists!

Knowledge As Elementary Information Node Graph A.k.a. the EINGRAPH

Business Intelligence, Data Science and its Impacts on Public Security Forces

Bigdata Problem

A DREAM CALLED BIG DATA

The Forgotten Data: What Happens to the Data We Don’t Use?