Probabilistic Nearest Neighbors: The Swiss Army Knife of GenAI
ANN (Approximate Nearest Neighbors) is at the core of fast vector search, itself central to GenAI, especially GPT and other LLMs. My new methodology, abbreviated as PANN, has many other applications: clustering, classification, measuring the similarity between two datasets (images, soundtracks, time series, and so on), tabular data synthetization (improving poor synthetizations), model evaluation, and even detecting extreme observations.
Just to give an example, you could use it to categorize time series without relying on statistical theory. Statistical models tend to be redundant, less explainable, and math-heavy, leading to definitions that are less useful to developers. PANN avoids that.
Fast and simple, PANN (for Probabilistic ANN) does not involve training or neural networks, and it is essentially math-free. Its versatility comes from four features:
The figure below shows how quickly the loss function drops at the beginning. In this case, the loss represents the average distance to the approximate nearest neighbor, obtained so far in the iterative algorithm. The X-axis represents the iteration number. Note the excellent curve fitting (in orange) to the loss function, allowing you to predict its baseline (minimum loss, or optimum) even after a small number of iterations. To see what happens if you maximize the loss instead, read the full technical document.
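To make the loss curve and its extrapolation concrete, here is a minimal sketch in plain Python. It is not the PANN implementation from the technical document. It assumes a simple random-probe search (each iteration tests one random candidate per query point and keeps the closest one found so far) and assumes the loss decays as a power law toward a baseline, loss(t) ≈ a + b·t^(-c). The function names `random_probe_loss_curve` and `fit_baseline`, the exponent grid, and the power-law form are all illustrative choices, not taken from the article.

```python
import math
import random


def random_probe_loss_curve(queries, candidates, n_iter, seed=0):
    """Random-probe approximate nearest neighbor search (illustrative only).

    At each iteration, every query point tests one randomly chosen candidate
    and keeps it if it is closer than the best candidate found so far.
    Returns the loss history: the average distance to the approximate
    nearest neighbor after each iteration.
    """
    rng = random.Random(seed)
    best = [math.inf] * len(queries)
    history = []
    for _ in range(n_iter):
        for i, q in enumerate(queries):
            d = math.dist(q, rng.choice(candidates))
            if d < best[i]:
                best[i] = d
        history.append(sum(best) / len(best))
    return history


def fit_baseline(history, exponents=None):
    """Fit loss(t) ~ a + b * t**(-c), with t = 1, 2, ..., to a loss history.

    The power-law form is an assumption. For each trial exponent c, a and b
    come from ordinary least squares; the c with the smallest squared error
    wins. The fitted a is the predicted baseline (loss as t grows large).
    """
    if exponents is None:
        exponents = [k / 20 for k in range(1, 61)]  # c = 0.05, 0.10, ..., 3.00
    n = len(history)
    mean_y = sum(history) / n
    best_fit = None
    for c in exponents:
        x = [(t + 1) ** (-c) for t in range(n)]
        mean_x = sum(x) / n
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, history))
        b = sxy / sxx
        a = mean_y - b * mean_x
        sse = sum((a + b * xi - yi) ** 2 for xi, yi in zip(x, history))
        if best_fit is None or sse < best_fit[0]:
            best_fit = (sse, a, b, c)
    _, a, b, c = best_fit
    return a, b, c


# Small demo on synthetic 2D points (made-up data, for illustration only).
rng = random.Random(42)
candidates = [(rng.random(), rng.random()) for _ in range(200)]
queries = [(rng.random(), rng.random()) for _ in range(200)]

history = random_probe_loss_curve(queries, candidates, n_iter=100)
a, b, c = fit_baseline(history[:30])  # fit on the first 30 iterations only
print(f"loss after 100 iterations: {history[-1]:.4f}")
print(f"baseline predicted from 30 iterations: {a:.4f} (exponent c = {c:.2f})")
```

Because each query only ever replaces its neighbor with a closer one, the loss history can never increase, which is why the early drop is steep and the tail is flat. How well the fitted baseline matches the true minimum depends on whether the assumed power law suits the data, so treat the predicted value as a rough estimate.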
Technologist | Digital Innovation & Management
5 months ago: Intricate graph! The loops and twists represent the complex relationships within knowledge graphs. Fascinating to see how 346.18 and 346.21 are connected.
Research And Development Engineer
5 months ago: Spoiler alert: maximizing a loss function is the same as minimizing the negative of that loss function.
Time is Everything and I Help Teach that You can take control of it, for better Mental & Physical Wellbeing - Thinking outside the Box with Business, Relationships and Pleasure.
5 months ago: Without understanding the context of the question which gives the answers/data points, all AI generative results are just garbage in the end, not able to create, just give answers at the mean.