KNearestCenters

The KNearestCenters package contains classification algorithms based on prototype selection and feature mapping through kernel functions. It model selection to improve the classification performance.

KNearestCenters.Knc
KNearestCenters.KncProto
Base.rand
Base.rand
KNearestCenters.accuracy_score
KNearestCenters.change_criterion
KNearestCenters.classification_scores
KNearestCenters.epsilon_criterion
KNearestCenters.f1_score
KNearestCenters.fun_criterion
KNearestCenters.isqerror
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.kfun
KNearestCenters.pearson
KNearestCenters.precision_recall
KNearestCenters.precision_score
KNearestCenters.predict_raw
KNearestCenters.recall_score
KNearestCenters.salesman_criterion
KNearestCenters.size_criterion
KNearestCenters.softmax!
KNearestCenters.spearman
KNearestCenters.transform
SearchModels.combine
SearchModels.combine
SearchModels.mutate
StatsAPI.fit
StatsAPI.fit
StatsAPI.fit
StatsAPI.fit
StatsAPI.predict
StatsAPI.predict

KNearestCenters.Knc — Type

A nearest centroid classifier with support for kernel functions

source

KNearestCenters.KncProto — Type

A simple nearest centroid classifier

source

Base.rand — Method

rand(space::KncConfigSpace)

Creates a random KncConfig instance based on the space definition.

source

Base.rand — Method

rand(space::KncProtoConfigSpace)

Creates a random KncProtoConfig instance based on the space definition.

source

KNearestCenters.accuracy_score — Method

accuracy_score(gold, predicted)

Computes the accuracy score between the gold and the predicted sets

source

KNearestCenters.change_criterion — Function

change_criterion(tol=0.001, window=3)

Creates a fuction that stops the process whenever the maximum distance converges (averaging window far items). The tol parameter defines the tolerance range.

source

KNearestCenters.classification_scores — Method

classification_scores(gold, predicted; labelnames=nothing)

Computes several scores for the given gold-standard and predictions, namely: precision, recall, and f1 scores, for global and per-class granularity. If labelnames is given, then it is an array of label names.

source

KNearestCenters.epsilon_criterion — Method

epsilon_criterion(e)

Creates a function that evaluates the stop criterion when the distance between far items achieves the given e

source

KNearestCenters.f1_score — Method

f1_score(gold, predicted; weight=:macro)::Float64

It computes the F1 score between the gold dataset and the list of predictions predicted

It applies the desired weighting scheme for binary and multiclass problems

:macro performs a uniform weigth to each class
:weigthed the weight of each class is proportional to its population in gold
:micro returns the global F1, without distinguishing among classes

source

KNearestCenters.fun_criterion — Method

fun_criterion(fun::Function)

Creates a stop-criterion function that stops whenever the number of far items reaches $\lceil fun(|database|)\rceil$. Already defined examples:

    sqrt_criterion() = fun_criterion(sqrt)
    log2_criterion() = fun_criterion(log2)

source

KNearestCenters.isqerror — Method

isqerror(X::AbstractVector{F}, Y::AbstractVector{F}) where {F <: AbstractFloat}

Negative squared error (to be used for maximizing algorithms)

source

KNearestCenters.kfun — Method

kfun(kernel::CauchyKernel, d, σ::AbstractFloat)::Float64

Creates a Cauchy kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::DirectKernel, d, σ::AbstractFloat)::Float64

Creates a Direct kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::GaussianKernel, d, σ::AbstractFloat)::Float64

Creates a Gaussian kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::LaplacianKernel, d, σ::AbstractFloat)::Float64

Creates a Laplacian kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::ReluKernel, d, σ::AbstractFloat)::Float64

Creates a Relu kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::SigmoidKernel, d, σ::AbstractFloat)::Float64

Creates a Sigmoid kernel with the given distance function

source

KNearestCenters.kfun — Method

kfun(kernel::TanhKernel, d, σ::AbstractFloat)::Float64

Creates a Tanh kernel with the given distance function

source

KNearestCenters.pearson — Method

pearson(X::AbstractVector{F}, Y::AbstractVector{F}) where {F <: AbstractFloat}

Pearson correlation score

source

KNearestCenters.precision_recall — Method

precision_recall(gold::AbstractVector, predicted::AbstractVector)

Computes the global and per-class precision and recall values between the gold standard and the predicted set

source

KNearestCenters.precision_score — Method

precision_score(gold, predicted; weight=:macro)::Float64

It computes the precision between the gold dataset and the list of predictions predict

It applies the desired weighting scheme for binary and multiclass problems

:macro performs a uniform weigth to each class
:weigthed the weight of each class is proportional to its population in gold
:micro returns the global precision, without distinguishing among classes

source

KNearestCenters.predict_raw — Method

predict_raw(model::KnnModel, x)

Computes the correspoding vectors without any normalization (or determining the label).

source

KNearestCenters.recall_score — Method

recall_score(gold, predicted; weight=:macro)::Float64

It computes the recall between the gold dataset and the list of predictions predict

It applies the desired weighting scheme for binary and multiclass problems

:macro performs a uniform weigth to each class
:weigthed the weight of each class is proportional to its population in gold
:micro returns the global recall, without distinguishing among classes

source

KNearestCenters.salesman_criterion — Method

salesman_criterion()

It creates a function that explores the entire dataset making a full farthest first traversal approximation

source

KNearestCenters.size_criterion — Method

size_criterion(maxsize)

Creates a function that stops when the number of far items are equal or larger than the given maxsize

source

KNearestCenters.softmax! — Method

softmax!(vec::AbstractVector)

Inline computation of the softmax function on the input vector

source

KNearestCenters.spearman — Method

spearman(X::AbstractVector{F}, Y::AbstractVector{F}) where {F <: AbstractFloat}

Spearman rank correleation score

source

KNearestCenters.transform — Function

transform(nc::Knc, kernel::Function, X, normalize!::Function=softmax!)

Maps a collection of objects to the vector space defined by each center in nc; the kernel function is used measure the similarity between each $u \in X$ and each center in nc. The normalization function is applied to each vector (normalization methods needing to know the attribute's distribution can be applied on the output of transform)

source

SearchModels.combine — Method

combine(a::KncConfig, b::KncConfig)

Creates a new configuration combining the given configurations

source

SearchModels.combine — Method

combine(a::KncProtoConfig, b::KncProtoConfig)

Creates a new configuration combining the given configurations

source

SearchModels.mutate — Method

mutate(space::KncProtoConfigSpace, a::KncProtoConfig, iter)

Creates a new configuration based on a slight perturbation of a

source

StatsAPI.fit — Method

fit(config::KncConfig, X, y::CategoricalArray; verbose=true)

Creates a Knc classifier using the given configuration and data.

source

StatsAPI.fit — Method

fit(config::KncProtoConfig, X, y::CategoricalArray; verbose=true)
fit(config::KncProtoConfig,
    input_clusters::ClusteringData,
    train_X::AbstractVector,
    train_y::CategoricalArray;
    verbose=false
)

Creates a KncProto classifier using the given configuration and data.

source

StatsAPI.fit — Method

fit(::Type{KnnModel}, index::AbstractSearchIndex, meta::AbstractVecOrMat{<:Real}; k=3, weight=KnnUniformWeightKernel(), prediction=KnnSoftmaxPrediction())
fit(::Type{KnnModel}, examples::AbstractMatrix, meta::AbstractVecOrMat{<:Real}; k=3, weight=KnnUniformWeightKernel(), prediction=KnnSoftmaxPrediction(), dist=L2Distance())

Creates a new KnnModel classifier with the examples indexed by index and it associated labels

Arguments:

KnnModel: the type to dispatch the fit request
index: the search structure see SimilaritySearch.jl
examples: a matrix that will be indexed using SimilaritySearch.jl
meta: numerical associated data

Keyword arguments

k: the number of neighbors to be used.
weight: the neighbor weighting scheme.
dist: distance function to be used

source

StatsAPI.fit — Method

fit(::Type{KnnModel}, index::AbstractSearchIndex, ctx::AbstractContext, labels::CategoricalArray; k=3, weight=KnnUniformWeightKernel())

Creates a new KnnModel classifier with the examples indexed by index and it associated labels

Arguments:

KnnModel: the type to dispatch the fit request
index: the search structure see SimilaritySearch.jl
labels: Categorical array of labels

Keyword arguments

k: the number of neighbors to be used.
weight: the neighbor weighting scheme.

source

StatsAPI.predict — Method

predict(nc::KncProto, x)

Predicts the class of x using the label of the k nearest centers

source

StatsAPI.predict — Method

predict(model::KnnModel, x)

Predict based on examples using the model, a KnnModel object.

Arguments:

model: The KnnModel struct.
x: A compatible object with the exemplars given to the model while fitting.

source