Tutorials and Examples for the SimilaritySearch.jl package
Here you will find several examples for the SimilaritySearch.jl package.
Installation
You need a Julia installation https://julialang.org/downloads/, in particular, we recommend to use Julia SimilaritySearch
.
We present our examples just to copy and paste on the REPL but also provide some in Jupyter and Pluto notebooks; you must install IJulia
and Pluto
packages, just run the following commands in the REPL
using Pkg; Pkg.add(["IJulia", "Pluto"])
Please check their documentation:
This website was made with Quarto with the Julia engine.
Notes about multithreading
Nearest neighbor search can be computationally expensive, therefore SimilaritySearch
has multithreading support. You should want to run jupyter or julia using all available threads, that is
JULIA_NUM_THREADS=auto jupyter-lab .
or
JULIA_NUM_THREADS=auto julia
Perhaps all your threads may become your computer useless for a while, so you can replace auto
by some other more conservative number that allow you to work on the same computer.
News
- march 20th, 2025: the site is behind the
SimilaritySearch
API; I will be working on updating examples and moving the site to Quarto. - june 5th, 2023: adds news section, some installation requirements, Jupyter notebookes were updated to work with the
v0.10
and with the current julia releasev1.9
. I also moved most plots toMakie
.
Problem statement
Given a finite dataset,
At first glance, the problem is simple since it can be solved using an exhaustive evaluation of all possible results
The SearchGraph
index in the SimilaritySearch.jl
package is a competitive alternative for solving search queries that automatically tune search speed and quality and also remains very competitive in memory and construction costs. Here we show some demostrations of how using SimilaritySearch.jl
in several synthetic and real problems.